hub / github.com/jamwithai/production-agentic-rag-course

github.com/jamwithai/production-agentic-rag-course @week7.0 sqlite

repository ↗ · DeepWiki ↗ · release week7.0 ↗

517 symbols 2,064 edges 121 files 430 documented · 83%

README

The Mother of AI Project

Phase 1 RAG Systems: arXiv Paper Curator

A Learner-Focused Journey into Production RAG Systems

Learn to build modern AI systems from the ground up through hands-on implementation

Master the most in-demand AI engineering skills: RAG (Retrieval-Augmented Generation)

📖 About This Course

This is a learner-focused project where you'll build a complete research assistant system that automatically fetches academic papers, understands their content, and answers your research questions using advanced RAG techniques.

The arXiv Paper Curator will teach you to build a production-grade RAG system using industry best practices. Unlike tutorials that jump straight to vector search, we follow the professional path: master keyword search foundations first, then enhance with vectors for hybrid retrieval.

🎯 The Professional Difference: We build RAG systems the way successful companies do - solid search foundations enhanced with AI, not AI-first approaches that ignore search fundamentals.

By the end of this course, you'll have your own AI research assistant and the deep technical skills to build production RAG systems for any domain.

🎓 What You'll Build

Week 1: Complete infrastructure with Docker, FastAPI, PostgreSQL, OpenSearch, and Airflow
Week 2: Automated data pipeline fetching and parsing academic papers from arXiv
Week 3: Production BM25 keyword search with filtering and relevance scoring
Week 4: Intelligent chunking + hybrid search combining keywords with semantic understanding
Week 5: Complete RAG pipeline with local LLM, streaming responses, and Gradio interface
Week 6: Production monitoring with Langfuse tracing and Redis caching for optimized performance
Week 7: Agentic RAG with LangGraph and Telegram Bot for intelligent reasoning and mobile access

🏗️ System Architecture Evolution

Week 7: Agentic RAG & Telegram Bot Integration

Week 7 Telegram and Agentic AI Architecture

Complete Week 7 architecture showing Telegram bot integration with the agentic RAG system

LangGraph Agentic RAG Workflow

LangGraph Agentic RAG Flow

Detailed LangGraph workflow showing decision nodes, document grading, and adaptive retrieval

Agentic RAG Workflow:

User Query → Guardrail Node → [PROCEED or OUT_OF_SCOPE]
         ↓
   Retrieve Node (attempt 1)
         ↓
   Grade Documents → [RELEVANT or INSUFFICIENT]
         ↓
   [If INSUFFICIENT] → Rewrite Query → Retrieve Node (attempt 2)
         ↓
   Generate Answer Node → Final Response with Citations

Key Innovations in Week 7: - 🤖 Intelligent Decision-Making: Agents evaluate and adapt retrieval strategies - 🔍 Document Grading: Automatic relevance assessment with semantic evaluation - 🔄 Query Rewriting: Adaptive query refinement when results are insufficient - 🛡️ Guardrails: Out-of-domain detection prevents hallucination - 📱 Mobile Access: Telegram bot for conversational AI on any device - 🔎 Transparency: Full reasoning step tracking for debugging and trust

🚀 Quick Start

📋 Prerequisites

Docker Desktop (with Docker Compose)
Python 3.12+
UV Package Manager (Install Guide)
8GB+ RAM and 20GB+ free disk space

⚡ Get Started

# 1. Clone and setup
git clone <repository-url>
cd arxiv-paper-curator

# 2. Configure environment (IMPORTANT!)
cp .env.example .env
# The .env file contains all necessary configuration for OpenSearch, 
# arXiv API, and service connections. Defaults work out of the box.
# For Week 4: Add JINA_API_KEY=your_key_here for hybrid search

# 3. Install dependencies
uv sync

# 4. Start all services
docker compose up --build -d

# 5. Verify everything works
curl http://localhost:8000/health

📚 Weekly Learning Path

Week	Topic	Blog Post	Code Release
Week 0	The Mother of AI project - 6 phases	The Mother of AI project	-
Week 1	Infrastructure Foundation	The Infrastructure That Powers RAG Systems	week1.0
Week 2	Data Ingestion Pipeline	Building Data Ingestion Pipelines for RAG	week2.0
Week 3	OpenSearch ingestion & BM25 retrieval	The Search Foundation Every RAG System Needs	week3.0
Week 4	Chunking & Hybrid Search	The Chunking Strategy That Makes Hybrid Search Work	week4.0
Week 5	Complete RAG system	The Complete RAG System	week5.0
Week 6	Production monitoring & caching	Production-ready RAG: Monitoring & Caching	week6.0
Week 7	Agentic RAG & Telegram Bot	Agentic RAG with LangGraph and Telegram	week7.0

📥 Clone a specific week's release:

# Clone a specific week's code
git clone --branch <WEEK_TAG> https://github.com/jamwithai/arxiv-paper-curator
cd arxiv-paper-curator
uv sync
docker compose down -v
docker compose up --build -d

# Replace <WEEK_TAG> with: week1.0, week2.0, etc.

📊 Access Your Services

Service	URL	Purpose
API Documentation	http://localhost:8000/docs	Interactive API testing
Gradio RAG Interface	http://localhost:7861	User-friendly chat interface
Langfuse Dashboard	http://localhost:3000	RAG pipeline monitoring & tracing
Airflow Dashboard	http://localhost:8080	Workflow management
OpenSearch Dashboards	http://localhost:5601	Hybrid search engine UI

NOTE: Check airflow/simple_auth_manager_passwords.json.generated for Airflow username and password

📚 Week 1: Infrastructure Foundation ✅

Start here! Master the infrastructure that powers modern RAG systems.

🎯 Learning Objectives

Complete infrastructure setup with Docker Compose
FastAPI development with automatic documentation and health checks
PostgreSQL database configuration and management
OpenSearch hybrid search engine setup
Ollama local LLM service configuration
Service orchestration and health monitoring
Professional development environment with code quality tools

🏗️ Architecture Overview

Week 1 Infrastructure Setup

Infrastructure Components: - FastAPI: REST endpoints with async support (Port 8000)
- PostgreSQL 16: Paper metadata storage (Port 5432) - OpenSearch 2.19: Search engine with dashboards (Ports 9200, 5601) - Apache Airflow 3.0: Workflow orchestration (Port 8080) - Ollama: Local LLM server (Port 11434)

📓 Setup Guide

# Launch the Week 1 notebook
uv run jupyter notebook notebooks/week1/week1_setup.ipynb

Completion Guide: Follow the Week 1 notebook for hands-on setup and verification steps.

📖 Deep Dive

Blog Post: The Infrastructure That Powers RAG Systems - Detailed walkthrough and production insights

📚 Week 2: Data Ingestion Pipeline ✅

Building on Week 1 infrastructure: Learn to fetch, process, and store academic papers automatically.

🎯 Learning Objectives

arXiv API integration with rate limiting and retry logic
Scientific PDF parsing using Docling
Automated data ingestion pipelines with Apache Airflow
Metadata extraction and storage workflows
Complete paper processing from API to database

🏗️ Architecture Overview

Week 2 Data Ingestion Architecture

Data Pipeline Components: - MetadataFetcher: 🎯 Main orchestrator coordinating the entire pipeline - ArxivClient: Rate-limited paper fetching with retry logic - PDFParserService: Docling-powered scientific document processing
- Airflow DAGs: Automated daily paper ingestion workflows - PostgreSQL Storage: Structured paper metadata and content

📓 Implementation Guide

# Launch the Week 2 notebook  
uv run jupyter notebook notebooks/week2/week2_arxiv_integration.ipynb

Completion Guide: Follow the Week 2 notebook for hands-on implementation and verification steps.

📖 Deep Dive

Blog Post: Building Data Ingestion Pipelines for RAG - arXiv API integration and PDF processing

📚 Week 3: Keyword Search First - The Critical Foundation

Building on Weeks 1-2 foundation: Implement the keyword search foundation that professional RAG systems rely on.

🎯 Learning Objectives

Why keyword search is essential for RAG systems (foundation first approach)
OpenSearch index management, mappings, and search optimization
BM25 algorithm and the math behind effective keyword search
Query DSL for building complex search queries with filters and boosting
Search analytics for measuring relevance and performance
Production patterns used by real companies

🏗️ Architecture Overview

Week 3 OpenSearch Flow Architecture

Search Infrastructure Components: - OpenSearch Service: src/services/opensearch/ - Professional search service implementation - Search API: src/routers/search.py - Search API endpoints with BM25 scoring - Learning Materials: notebooks/week3/ - Complete OpenSearch integration guide - Quality Metrics: Precision, recall, and relevance scoring

📓 Setup Guide

# Launch the Week 3 notebook
uv run jupyter notebook notebooks/week3/week3_opensearch.ipynb

Completion Guide: Follow the Week 3 notebook for hands-on OpenSearch setup and BM25 search implementation.

📖 Deep Dive

Blog Post: The Search Foundation Every RAG System Needs - Complete BM25 implementation with OpenSearch

📚 Week 4: Chunking & Hybrid Search - The Semantic Layer

Building on Week 3 foundation: Add the semantic layer that makes search truly intelligent.

🎯 Learning Objectives

Section-based chunking with intelligent document segmentation
Production embeddings with Jina AI integration and fallback strategies
Hybrid search mastery using RRF fusion for keyword + semantic retrieval
Unified API design with single endpoint supporting multiple search modes
Performance analysis and trade-offs between search approaches

🏗️ Architecture Overview

Week 4 Hybrid Search Architecture

Hybrid Search Infrastructure Components: - Text Chunker: src/services/indexing/text_chunker.py - Section-aware chunking with overlap strategies - Embeddings Service: src/services/embeddings/ - Production embedding pipeline with Jina AI - Hybrid Search API: src/routers/hybrid_search.py - Unified search API supporting all modes - Learning Materials: notebooks/week4/ - Complete hybrid search implementation guide

📓 Setup Guide

# Launch the Week 4 notebook
uv run jupyter notebook notebooks/week4/week4_hybrid_search.ipynb

Completion Guide: Follow the Week 4 notebook for hands-on implementation and verification steps.

📖 Deep Dive

Blog Post: The Chunking Strategy That Makes Hybrid Search Work - Production chunking and RRF fusion implementation

📚 Week 5: Complete RAG Pipeline with LLM Integration

Building on Week 4 hybrid search: Add the LLM layer that turns search into intelligent conversation

Core symbols most depended-on inside this repo

src/services/opensearch/query_builder.py

update

called by 8

src/db/interfaces/base.py

update_span

called by 7

src/services/langfuse/client.py

get_latest_query

called by 7

src/services/agents/nodes/utils.py

get_session

called by 7

src/db/interfaces/base.py

create_rag_prompt

called by 6

src/services/ollama/prompts.py

health_check

called by 6

src/services/ollama/client.py

Shape

Method 260

Class 128

Function 116

Route 13

Languages

Python100%

Modules by API surface

tests/unit/services/agents/test_models.py27 symbols

tests/unit/services/test_pdf_parser.py23 symbols

src/services/arxiv/client.py22 symbols

src/exceptions.py21 symbols

tests/unit/services/agents/test_nodes.py20 symbols

src/services/opensearch/client.py17 symbols

tests/api/routers/test_agentic_ask.py15 symbols

src/services/indexing/text_chunker.py15 symbols

tests/unit/services/test_arxiv_client.py13 symbols

src/repositories/paper.py13 symbols

src/dependencies.py13 symbols

src/config.py13 symbols

Dependencies from manifests, versioned

docling2.0.0 · 1×

httpx0.27.0 · 1×

opensearch-py2.4.0 · 1×

psycopg2-binary2.9.0 · 1×

pydantic2.0.0 · 1×

python-dateutil2.8.0 · 1×

sqlalchemy1.4.36 · 1×

Datastores touched

rag_dbDatabase · 1 repos

langfuseDatabase · 1 repos

For agents

$ claude mcp add production-agentic-rag-course \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact