
Vidhai A 501(c)(3) Nonprofit Org.
Hands-on Workshop: RAG Pipeline with Data Prep Kit + Milvus + Llama
Sat, Sep 21 | Mountain View
In this hands-on workshop, we will demonstrate how to implement an end-to-end RAG pipeline using only open-source technologies.
Time & Location
Sep 21, 2024, 11:00 AM – 3:00 PM
Mountain View, 855 Maude Ave, Mountain View, CA 94043, USA
About the event
Whether you are building a RAG (Retrieval-Augmented Generation) application or fine-tuning a model, a significant portion of your time will be spent on data wrangling (cleaning, de-duplicating, removing markup, etc.). Data Prep Kit (DPK; [https://github.com/IBM/data-prep-kit](https://github.com/IBM/data-prep-kit)) can help you with data wrangling.
Noteworthy features of DPK include: de-duplicating documents (exact and fuzzy dedupe), handling both documents and code, language detection (natural languages and programming languages), malware detection, and creating embeddings.
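To give a feel for what exact de-duplication involves, here is a minimal sketch in plain Python. It is not DPK's implementation or API; the function name `exact_dedupe` and the choice to hash whitespace-stripped text with SHA-256 are assumptions made purely for illustration.

```python
import hashlib


def exact_dedupe(docs):
    """Keep the first occurrence of each document.

    Illustrative only (not DPK code): two documents count as duplicates
    when their whitespace-stripped text has the same SHA-256 digest.
    """
    seen = set()
    unique = []
    for text in docs:
        digest = hashlib.sha256(text.strip().encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(text)
    return unique


docs = ["RAG needs clean data.", "Milvus stores vectors.", "RAG needs clean data. "]
deduped = exact_dedupe(docs)
```

Fuzzy dedupe, by contrast, has to catch near-duplicates that differ by a few words, which a plain hash comparison like this one cannot do.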
The pipeline we build in the workshop uses three open-source components:
Data Prep Kit for processing documents
Milvus as the vector database
Llama 3 as the LLM
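
For readers new to RAG, the retrieve-then-prompt flow can be sketched in a few lines of plain Python. This is a toy under stated assumptions, not the workshop code: the bag-of-words `embed` function stands in for a real embedding model, the in-memory list stands in for Milvus, and the final prompt is only built, not sent to Llama 3. All function names here are hypothetical.

```python
import math
from collections import Counter


def embed(text):
    # Toy stand-in for an embedding model: a bag-of-words count vector.
    return Counter(text.lower().split())


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, chunks, k=2):
    # Stand-in for a vector database search: rank chunks by similarity.
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(query, context_chunks):
    # The retrieved chunks become the context handed to the LLM.
    context = "\n".join(f"- {c}" for c in context_chunks)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}"
    )


chunks = [
    "Milvus is a vector database",
    "Llama 3 is a large language model",
    "Data Prep Kit cleans and de-duplicates documents",
]
top = retrieve("which vector database", chunks, k=1)
prompt = build_prompt("which vector database", top)
```

In a real pipeline, each stand-in is replaced by the corresponding component: processed documents are embedded, stored and searched in the vector database, and the assembled prompt is sent to the LLM.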