Proof of concept showcasing how to continuously finetune and deploy RAG systems using Pachyderm and Determined

Continuous Retrieval-Augmented Generation (RAG) with the HPE MLOps Platform

Author: andrew.mendez@hpe.com

This is a proof of concept showing how developers can build a Retrieval-Augmented Generation (RAG) system using Pachyderm and Determined AI. Because this RAG system sits on top of an MLOps platform, developers can continuously update and redeploy the application as new data is ingested. We also provide an example of how to automatically trigger finetuning of an LLM on an instruction-tuning dataset.
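In Pachyderm terms, "continuously update as data is ingested" means each stage is a pipeline subscribed to a versioned data repo: committing new documents automatically re-runs the downstream embedding and deployment steps. A hypothetical pipeline spec for such a stage is sketched below (the repo, image, and command names are illustrative, not taken from this project):

```json
{
  "pipeline": { "name": "embed-documents" },
  "input": {
    "pfs": { "repo": "documents", "glob": "/*" }
  },
  "transform": {
    "image": "example/rag-embedder:latest",
    "cmd": ["python", "embed.py", "--in", "/pfs/documents", "--out", "/pfs/out"]
  }
}
```

Each commit to the `documents` repo triggers a new job, so the vector database downstream always reflects the latest versioned data.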

We use the following stack:

  • ChromaDB for the vector database
  • Chainlit for the user interface
  • Mistral 7B Instruct for the large language model
  • Determined for finetuning the Mistral model
  • Pachyderm for dataset versioning and pipeline orchestration
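At query time, these pieces combine in the standard RAG flow: embed the question, retrieve the nearest document chunks from the vector store, and prepend them to the LLM prompt. The dependency-free sketch below illustrates that flow under stated assumptions; the toy `embed` function stands in for a real embedding model (ChromaDB's job in this demo), and names like `retrieve_context` are illustrative, not from this repo:

```python
import math

def embed(text: str) -> list[float]:
    # Hypothetical stand-in embedding: a bag-of-letters vector.
    # In the actual demo, ChromaDB computes real embeddings.
    vec = [0.0] * 26
    for ch in text.lower():
        if ch.isalpha():
            vec[ord(ch) - ord("a")] += 1.0
    return vec

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def retrieve_context(question: str, docs: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the question (the vector DB's role)."""
    q = embed(question)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def build_prompt(question: str, context: list[str]) -> str:
    """Format retrieved chunks into a Mistral-7B-Instruct style [INST] prompt."""
    ctx = "\n".join(f"- {c}" for c in context)
    return f"[INST] Use the context to answer.\nContext:\n{ctx}\nQuestion: {question} [/INST]"

docs = [
    "Pachyderm versions datasets and orchestrates pipelines.",
    "Determined manages distributed finetuning jobs.",
    "Chainlit serves the chat user interface.",
]
question = "What orchestrates pipelines?"
prompt = build_prompt(question, retrieve_context(question, docs))
print(prompt)
```

The prompt string would then be sent to the Mistral 7B Instruct model; swapping the toy embedding for ChromaDB's collection `query` call gives the production version of the same flow.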

Prerequisites

  • This demo requires an A100 80GB GPU.
  • This demo assumes you have Pachyderm and Determined installed on top of Kubernetes. A guide showing how to install Pachyderm and Determined on Kubernetes will be provided soon.

How to Run

  • Run Deploy RAG with PDK.ipynb to deploy a RAG system using a pretrained LLM.
  • Run Finetune and Deploy RAG with PDK.ipynb to both finetune an LLM and deploy the finetuned model.
