Skip to main content

9 docs tagged with "rag-engineering"

View all tags

Agentic RAG

Build RAG systems that reason, iterate, and self-correct - covering Self-RAG, FLARE, ReAct tool-augmented RAG, RAPTOR, and Corrective RAG with full production implementations using the Anthropic SDK.

Document Ingestion and Chunking

Master every chunking strategy from fixed-size to semantic and structure-aware splitting. Learn how to parse PDFs, DOCX, and HTML, enrich metadata, evaluate chunk quality, and build a production-grade ingestion pipeline.

Embedding Models in Production

How to choose, deploy, and manage embedding models at scale - including versioning, caching, batching, and migration strategies for production RAG systems.

Hybrid Search and Reranking

How to combine BM25 sparse retrieval with dense vector search using Reciprocal Rank Fusion, and how to apply cross-encoder reranking for precision that neither method achieves alone.

Query Transformation and HyDE

Master query transformation techniques - HyDE, multi-query retrieval, step-back prompting, query decomposition, and routing - to solve the vocabulary mismatch problem that breaks naive RAG systems in production.

RAG Engineering - Module Overview

Build production-grade Retrieval-Augmented Generation systems - from document ingestion and chunking through hybrid search, query transformation, agentic RAG, and RAGAS evaluation.

RAG Evaluation and RAGAS

Build a continuous RAG evaluation pipeline using the RAGAS framework - faithfulness, answer relevance, context precision, and context recall - with full production implementations using the Anthropic SDK and automated regression detection.

Vector Search in Practice

How approximate nearest neighbor search works, how to choose the right vector database, and how to build production-grade retrieval pipelines that stay fast at millions of documents.

Why RAG Exists

Understand why Retrieval-Augmented Generation was invented, what problems it solves that fine-tuning and prompt stuffing cannot, and how to architect a minimal RAG pipeline from scratch.