Develop a Private GenAI Q&A Systems using Dolly, Spark, MongoDB & Dataworkz

Develop a Private GenAI Q&A Systems using Dolly, Spark, MongoDB & Dataworkz

:arrow_right: To RSVP - Please click on the “ ✓ RSVP ” link at the top of this event page if you plan to attend. The link should change to a green button if you are Going. You need to be signed in to access the button. Please register prior to September 13th to ensure access to the event. Have meetup.com? You can register on the event there as well.

Overview:

This talk and workshop will show you how to create and privately host an LLM to power a Q&A system like ChatGPT.

In this workshop, you will learn how to assemble a Generative AI stack to power an LLM for the purpose of building a Q&A system from data stored in your own database (ex: MongoDB Atlas). In the workshop portion of the session, you will build your own Q&A system on top of MongoDB in less than 30 minutes.

What we will cover:

  • Pre-processing of your data - Get it into a shape where it can be used by LLMs (For the purpose of the workshop, you will have access to pre-processed data).
  • Create chunks and embeddings.
  • Store chunks in MongoDB Atlas for fast retrieval and search.
  • Store embeddings in a vector database, such as MongoDB Vector Search.
  • Privately hosted LLM.
  • Use commercially available open-source models like Dolly, ChatGPT, etc. in a private VPC.

Takeaways from this session include:

  • How to assemble and operationalize a Q&A system using open-source LLMs
  • Data quality impacts the LLM response
  • How to rapidly experiment with different LLM models for your specific use case

Links to resources:

Agenda:

4:00 pm Registration, Networking & Coffee

4:15 pm Intro to Generative AI

4:45 pm Demo

5:00 pm Hands on workshop

5:30 pm Pizza and beverages

6:00 pm Wrap-up

Speakers:

  • Nikhil Smotra, CTO, Dataworkz

  • Dave Nielsen, Head of Community, MongoDB

About Nikhil

Nikhil is driven by the potential for innovation and really excited about leveraging advanced technologies such as artificial intelligence, especially LLMs and applying them to extract valuable insights from customer data. Nikhil’s robust experience working with data management at scale led him to co-found Dataworkz. His vision is to create self-service experience that brings together – data, transformation and AI applications – for users of different skill levels.

Prior to Dataworkz, Nikhil worked as SVP, Head of Data Engineering at iQor, a leader in BPO and Product Support, where he led development and management of BigData platforms. Nikhil helped launch the enterprise data initiative and built a high-performing global data engineering team. During his tenure at iQor, Nikhil also managed QeyMetrics – a Business Intelligence and Operational Analytics SaaS offering. Nikhil spent several years at Lockheed Martin(R&D) where he harnessed the potential of NoSQL technology, prior to it gaining popularity, and used it along with semantic web technologies to build a massively scalable Digital Archive with automated data preservation, curation and classification.

Nikhil is an executive alumnus of Haas School of Business, UC Berkeley (Data Science and Analytics Program) and holds a B.E in Computer Science from University of Pune, India. Nikhil also served on Advisory Board of Rutgers University’s BigData certificate program for executives from 2018-2022.

Where?

Event Type: In-Person
Location: 88 Kearny Street · San Francisco, CA

2 Likes