Frontier AI agents, while impressive at reasoning, often falter when faced with the messy reality of enterprise documents. Databricks is launching Document Intelligence to bridge this gap, addressing what it calls the "accuracy ceiling" for agentic workflows.
The core problem, according to Databricks, isn't the agents' reasoning but their ability to accurately read and interpret diverse document formats—from scanned PDFs with inconsistent layouts to handwritten notes. This limitation can lead to costly errors, as seen in insurance claims processing where misread figures result in incorrect payouts.
Research from Databricks AI, including the OfficeQA benchmark, found that even advanced agents scored below 50% accuracy on real-world document tasks. This data underscores the need for specialized document processing capabilities.
Document Intelligence: Accuracy, Scale, Simplicity
Databricks' new offering is built on three pillars: research-backed accuracy, enterprise scale, and end-to-end simplicity. It introduces a set of composable AI Functions designed to handle the complexities of enterprise documents.
The ai_parse_document function, now generally available, converts raw scans into structured, layout-enriched text. Subsequent functions like ai_classify and ai_extract enable document routing and insight extraction without reprocessing the original document. This pipeline approach reportedly boosts agent performance by an average of 16% on tasks involving treasury bond documents.