A comprehensive open-source framework for building production-ready Retrieval-Augmented Generation (RAG) systems. This blueprint simplifies the development of RAG applications while providing full control over performance, resource usage, and evaluation capabilities.
While building or buying RAG systems has become increasingly accessible, deploying them as production-ready data products remains challenging. Our framework bridges this gap by providing a streamlined development experience with easy configuration and customization options, while maintaining complete oversight of performance and resource usage.
It comes with built-in monitoring and observability tools for better troubleshooting, integrated LLM-based metrics for evaluation, and human feedback collection capabilities. Whether you're building a lightweight knowledge base or an enterprise-grade application, this blueprint offers the flexibility and scalability needed for production deployments.
- Multiple Knowledge Base Integration: Seamless extraction from multiple data sources (Confluence, Notion, PDF)
- Broad Model Support: Choice of numerous embedding and language models
- Vector Search: Efficient similarity search using vector stores
- Interactive Chat: User-friendly interface for querying knowledge on Chainlit
- Performance Monitoring: Query and response tracking with Langfuse
- Evaluation: Comprehensive evaluation metrics using RAGAS
- Flexible Setup: Easy, configurable pipeline setup
Python • LlamaIndex • Chainlit • Langfuse • RAGAS
Notion • Confluence • PDF files
VoyageAI • OpenAI • Hugging Face
OpenAI • Any OpenAI-compatible API models
Qdrant • Chroma • PGVector
PostgreSQL • Docker
See the detailed Quickstart Setup guide.
- Extraction:
  - Fetches content from data-source pages through their respective APIs
  - Handles rate limiting and retries
  - Extracts metadata (title, creation time, URLs, etc.)
- Processing:
  - Markdown-aware chunking using LlamaIndex's MarkdownNodeParser
  - Embedding generation using the selected embedding model
  - Vector storage in Qdrant
- Retrieval & Generation:
  - Context-aware retrieval with configurable filters
  - LLM-powered response generation
  - Human feedback collection
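The rate-limit handling in the extraction step can be sketched as a small retry wrapper with exponential backoff. This is a minimal illustration, not the blueprint's actual client code; `with_retries`, `flaky_fetch`, and the backoff parameters are hypothetical, and a real connector would also inspect HTTP 429 responses and honor `Retry-After` headers:

```python
import time

def with_retries(fn, max_attempts=3, base_delay=1.0):
    """Call fn(), retrying with exponential backoff on failure."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error
            time.sleep(base_delay * 2 ** attempt)

# Hypothetical usage: a fetch that fails twice before succeeding.
calls = {"n": 0}

def flaky_fetch():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("rate limited")
    return {"title": "Page", "url": "https://example.com", "body": "..."}

page = with_retries(flaky_fetch, base_delay=0.01)
print(page["title"])  # → Page
```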
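End to end, the processing and retrieval steps amount to: split documents into chunks, embed each chunk, store the vectors, and at query time return the chunks most similar to the query embedding. A pure-Python sketch of that flow, where the toy bag-of-words `embed` and the in-memory list stand in for the real embedding model and Qdrant, and `chunk_markdown` only loosely mimics MarkdownNodeParser by splitting on headings:

```python
def embed(text):
    """Toy bag-of-words embedding over a fixed vocabulary.
    Stands in for a real embedding model (e.g. OpenAI or VoyageAI)."""
    vocab = ["setup", "install", "pip", "evaluation", "metrics", "ragas"]
    words = [w.strip("#.,?").lower() for w in text.split()]
    return [float(words.count(v)) for v in vocab]

def chunk_markdown(doc):
    """Markdown-aware chunking: start a new chunk at each heading."""
    chunks, current = [], []
    for line in doc.splitlines():
        if line.startswith("#") and current:
            chunks.append("\n".join(current))
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current))
    return chunks

# In-memory "vector store" standing in for Qdrant.
store = []
doc = "# Setup\nInstall with pip.\n# Evaluation\nRun RAGAS metrics nightly."
for chunk in chunk_markdown(doc):
    store.append((embed(chunk), chunk))

def retrieve(query, k=1):
    """Return the k chunks with the highest dot-product similarity."""
    q = embed(query)
    ranked = sorted(store, key=lambda e: -sum(a * b for a, b in zip(q, e[0])))
    return [text for _, text in ranked[:k]]

top = retrieve("How do I run evaluation metrics?")
print(top[0])  # the "# Evaluation" chunk
```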
The system includes comprehensive evaluation capabilities:
- Automated Metrics (via RAGAS):
  - Faithfulness • Answer Relevancy • Context Precision • Context Recall • Harmfulness
- Human Feedback:
  - Integrated feedback collection through Chainlit
  - Automatic dataset creation from positive feedback
  - Manual expert feedback support
- Observability:
  - Full tracing and monitoring with Langfuse
  - Separate traces for chat completion and deployment evaluation
  - Integration between Chainlit and Langfuse for comprehensive tracking
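To illustrate what the retrieval-side metrics measure, context precision and context recall can be computed from relevance judgments. RAGAS derives those judgments with an LLM; this simplified sketch assumes they are already given:

```python
def context_precision(retrieved, relevant):
    """Fraction of retrieved contexts that are actually relevant."""
    if not retrieved:
        return 0.0
    return sum(1 for c in retrieved if c in relevant) / len(retrieved)

def context_recall(retrieved, relevant):
    """Fraction of relevant contexts that were retrieved."""
    if not relevant:
        return 0.0
    return sum(1 for c in relevant if c in retrieved) / len(relevant)

retrieved = ["chunk-a", "chunk-b", "chunk-c", "chunk-d"]
relevant = {"chunk-a", "chunk-c", "chunk-e"}

p = context_precision(retrieved, relevant)  # 2 of 4 retrieved are relevant
r = context_recall(retrieved, relevant)     # 2 of 3 relevant were retrieved
print(round(p, 2), round(r, 2))  # → 0.5 0.67
```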
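The automatic dataset creation from positive feedback can be sketched as a filter over collected feedback records. The field names (`question`, `answer`, `contexts`, `rating`) are illustrative, not the blueprint's actual schema:

```python
def build_eval_dataset(feedback_records):
    """Keep only positively-rated interactions as evaluation examples."""
    return [
        {"question": r["question"], "answer": r["answer"], "contexts": r["contexts"]}
        for r in feedback_records
        if r.get("rating", 0) > 0  # positive human feedback only
    ]

records = [
    {"question": "What is RAG?", "answer": "Retrieval-Augmented Generation.",
     "contexts": ["RAG combines retrieval with generation."], "rating": 1},
    {"question": "Who wrote this?", "answer": "Unclear.",
     "contexts": [], "rating": -1},
]
dataset = build_eval_dataset(records)
print(len(dataset))  # → 1
```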
.
├── build/            # Build and deployment scripts
│   └── workstation/  # Build scripts for workstation setup
├── configurations/   # Configuration and secrets files
├── res/              # Assets
├── src/              # Source code
│   ├── augmentation/ # Retrieval and UI components
│   ├── common/       # Shared utilities
│   ├── embedding/    # Data extraction and embedding
│   └── evaluate/     # Evaluation system
└── tests/            # Unit tests
For detailed documentation on setup, configuration, and development: