Production-grade RAG systems, agentic workflows, evaluation pipelines, embeddings, and FastAPI integrations.
This repository showcases real-world LLM Engineering techniques used to build reliable, auditable AI systems.
It includes:
- Private RAG systems (LangChain + Chroma)
- Prompt engineering & optimisation
- Retriever evaluation (MRR, nDCG, keyword coverage)
- LLM-as-a-judge frameworks
- Agentic workflows (LiteLLM routing, orchestration patterns)
- Vector DB visualisation (t-SNE 2D/3D)
- FastAPI endpoints for RAG/agents
- Synthetic data generation (JSONL test sets)
All examples are production-ready and reflect patterns used in enterprise environments.
┌─────────────────────┐
│ Knowledge Base │
│ (.md, .pdf, etc.) │
└─────────┬───────────┘
│
┌─────────▼───────────┐
│ Text Splitter │
│ (chunk_size/overlap) │
└─────────┬───────────┘
│
┌─────────▼───────────┐
│ Embedder (HF/OpenAI)│
└─────────┬───────────┘
│ vectors
┌─────────▼───────────┐
│ Vector Store (Chroma)│
└─────────┬───────────┘
│ retrieve top-k
┌─────────▼───────────┐
│ Retriever │
└─────────┬───────────┘
│ context
┌─────────▼───────────┐
│ LLM (Chat Model) │
└─────────┬───────────┘
│ answer
┌─────────▼───────────┐
│ Evaluation Layer │
│ (MRR, nDCG, Judge) │
└──────────────────────┘
- End-to-end RAG built with LangChain and Chroma
- Chunking strategy
- Embedding pipelines
- Persistent vectorstore
- Retrieval + LLM pipeline
- RAG UI using Gradio
- Evaluation harness:
- Mean Reciprocal Rank
- nDCG
- Keyword coverage
- LLM-as-judge scoring
Tools to inspect embedding spaces:
- t-SNE 2D
- t-SNE 3D
- Interactive Plotly visualisation
Useful to validate semantic clustering and debug retrieval.
Includes:
- LiteLLM model routing
- Multi-model orchestration
- Task state machines
- Agent examples
Production endpoints for:
/rag/query→ RAG answers/embed→ Embedding service/retriever/eval→ Batch evaluation
- JSONL test sets
- Schema for question generation
- Synthetic data generator
- Example datasets
This repository demonstrates practical, applied LLM engineering for:
- RAG systems
- Agentic workflows
- Embedding pipelines
- Prompt evaluation
- Multi-model orchestration
- Enterprise AI architectures
It is designed to show depth, breadth, and real-world reliability techniques.
If you're hiring for LLM Engineering, RAG systems, agentic workflows, or AI automation, feel free to reach out.