RAGmap is a simple RAG visualization tool for exploring document chunks and queries in embedding space.
Inspired by DeepLearning.ai's short course on Advanced Retrieval for AI with Chroma and Gabriel Chua's award-winning RAGxplorer.
RAGmap supports the following features:
- Multiple vector stores
- ChromaDB
- LanceDB (NEW!)
- Multiple document formats ๐
PDF
DOCX
PPTX
- Multiple embedding models
- Amazon Bedrock โฐ๏ธ
- Hugging Face ๐ค
- OpenAI ึ (NEW!)
- Dimensionality reduction (2D and 3D)
- Natural language queries
- Advanced query augmentation
- Generated Answers (HyDE)
- Multiple Queries
- ... and more!
โ๏ธAmazonBedrockEmbeddingFunction
only works with Titan models. Feel free to upvote this PR to add support for Cohere Embed models.
Enable access to the embedding (Titan Embeddings, Cohere Embed) and text (Anthropic Claude) models via Amazon Bedrock.
For more information on how to request model access, please refer to the Amazon Bedrock User Guide (Set up > Model access)
-
Install dependencies
pip install -r requirements.txt
-
Run the application
# ChromaDB streamlit run app.py # LanceDB (NEW!) ๐งช streamlit run app_lancedb.py
-
Point your browser to http://localhost:8501
-
Run the following command to start the application
docker-compose up
-
Once the service is up and running, head over to http://localhost:8501
Example: Amazon shareholder letters
- (AWS) What is Retrieval-Augmented Generation?
- (DeepLearning.ai) Advanced Retrieval for AI with Chroma