Data & AI 90 days 2-3 hours/day updated 2026-06-01

RAGOps 90-Day Learning Path

Master RAGOps in 90 days: RAG pipeline architecture, chunking strategies, vector store operations, retrieval evaluation, index refresh pipelines, and production observability for RAG systems.

What RAGOps means

RAGOps is the operational discipline for Retrieval-Augmented Generation systems in production. It covers the full lifecycle of a RAG pipeline: document ingestion, chunking strategy selection, embedding management, vector store operations and index refresh, retrieval quality evaluation, and observability for retrieval and generation components. RAGOps ensures that knowledge-grounded AI systems remain accurate as underlying data changes.

Who should follow this path

AI engineers building enterprise knowledge assistants
ML engineers operating production RAG pipelines
Backend engineers integrating vector search into applications
DevOps engineers supporting AI inference infrastructure
Data engineers building document processing pipelines

Prerequisites

Working experience with LLM APIs and Python
Basic understanding of information retrieval concepts
Familiarity with Docker and cloud storage
Some knowledge of vector embeddings and cosine similarity
Basic data pipeline experience

The 90-day plan

Daily study recommendation: 2-3 hours/day, six days a week. Consistency beats intensity — block the time in your calendar like a meeting.

Days 1–15: Foundation

RAG architecture: naive RAG, advanced RAG, modular RAG patterns
Document ingestion pipelines: PDF, HTML, Markdown, structured data
Chunking strategies: fixed-size, recursive, semantic, late chunking
Embedding model selection and trade-offs
Vector database landscape: Pinecone, Weaviate, Qdrant, pgvector, Chroma

Outcome: Design and implement a document ingestion pipeline with a chosen chunking strategy and vector store.

Days 16–30: Core concepts

Retrieval strategies: top-k, MMR, threshold-based
Hybrid search: dense + sparse (BM25) fusion
Reranking with cross-encoders (Cohere Rerank, BGE Reranker)
Query expansion and hypothetical document embedding (HyDE)
Metadata filtering and faceted retrieval

Outcome: Implement hybrid search with reranking and query expansion to measurably improve retrieval quality.

Days 31–45: Tools and workflows

RAG evaluation with Ragas: faithfulness, context recall, precision
ARES and TruLens evaluation frameworks
Building a golden QA evaluation dataset
Ablation studies: chunk size vs retrieval quality
Automated regression testing for retrieval pipelines

Outcome: Build a Ragas evaluation pipeline that scores faithfulness and context recall on every pipeline change.

Days 46–60: Hands-on projects

Index refresh pipelines: incremental vs full rebuild strategies
Change detection and document update tracking
Embedding drift when switching embedding models
Multi-tenant vector store isolation patterns
Access control in vector databases

Outcome: Implement an automated index refresh pipeline with change detection and tenant isolation.

Days 61–75: Advanced practices

Production RAG observability: latency, retrieval quality, token cost
Tracing with Langfuse and Arize Phoenix for RAG steps
Failure mode analysis: retrieval misses, hallucinations, latency spikes
Scaling vector stores: sharding, replication, HNSW parameters
Cost optimization: embedding cache, query deduplication

Outcome: Deploy end-to-end RAG observability with per-trace scoring and cost attribution.

Days 76–90: Portfolio, interview & certification prep

RAGOps portfolio project: enterprise knowledge assistant
Interview preparation for AI engineer roles
RAGOps interview questions
Metrics: retrieval recall@k, faithfulness score, TTFT, cost-per-query
Emerging: multi-modal RAG, agentic RAG, graph RAG

Outcome: Ship a production-grade RAG application with full eval coverage and be ready for AI engineer interviews.

Weekly outcomes at a glance

Phase	Outcome
Days 1–15	Design and implement a document ingestion pipeline with a chosen chunking strategy and vector store.
Days 16–30	Implement hybrid search with reranking and query expansion to measurably improve retrieval quality.
Days 31–45	Build a Ragas evaluation pipeline that scores faithfulness and context recall on every pipeline change.
Days 46–60	Implement an automated index refresh pipeline with change detection and tenant isolation.
Days 61–75	Deploy end-to-end RAG observability with per-trace scoring and cost attribution.
Days 76–90	Ship a production-grade RAG application with full eval coverage and be ready for AI engineer interviews.

Tools to learn

LangChain
LlamaIndex
Pinecone
Weaviate
Qdrant
Ragas
Langfuse
Arize Phoenix
Cohere Rerank
pgvector
Hugging Face Transformers
Apache Airflow

Labs to practice

Mini projects

Build a production RAG system with hybrid search, Cohere reranking, and Ragas evaluation CI pipeline
Implement an automated index refresh pipeline tracking document changes in S3 and triggering incremental re-embedding
Create a multi-tenant RAG platform with per-tenant isolation, access control, and per-query cost tracking in Langfuse

Interview questions to prepare

What are the trade-offs between different chunking strategies for RAG?
How does hybrid search improve over pure dense retrieval?
What is HyDE (hypothetical document embedding) and when would you use it?
How do you evaluate retrieval quality in a RAG pipeline?
Explain the difference between faithfulness and context recall in Ragas.
How do you handle index updates when source documents change frequently?
What HNSW parameters affect recall and query latency in a vector store?
How would you design a multi-tenant RAG system with data isolation?

Certification suggestions

Hugging Face NLP Course Certificate — Hugging Face
Databricks Generative AI Fundamentals — Databricks
AWS Certified Machine Learning Specialty — AWS
DeepLearning.AI LLMOps Specialization — DeepLearning.AI / Coursera

Browse the full certification registry for exam details and official links.

Free resources

// instructor-led option

Prefer live, guided training with mentors and certification support? DevOpsSchool.com runs paid instructor-led programs that pair well with this free path.

Explore paid training on DevOpsSchool.com ↗