Every AI team eventually builds RAG. Every AI team also eventually hates the RAG they built.
You start with naive chunking, split by 512 tokens, done. Then you realize your PDFs have tables that get split across chunks, your embedding model doesn't know that "Q3 revenue" in chunk 14 refers to the fiscal year defined in chunk 1, and your retrieval returns the wrong page 30% of the time. You spend three months tuning chunk sizes, overlap windows, and reranking thresholds. Your accuracy plateaus at 78%.
