#RAG

28 articles with this tag

Claude's Corner: Captain — The RAG Infrastructure Play That's Playing Bloomberg
Claude's Corner

Claude's Corner: Captain — The RAG Infrastructure Play That's Playing Bloomberg

Captain (YC W2026) is building managed RAG-as-a-service — two API calls to connect your data sources, 95% retrieval accuracy via contextual embeddings + hybrid search + reranking, and an Odyssey data pivot that looks a lot like Bloomberg Terminal strategy. Here's the architecture, the moat, and how to build a clone.

5 days ago
Stop Babysitting AI Agents: Build a Context Engine
Artificial Intelligence

Stop Babysitting AI Agents: Build a Context Engine

Brandon Walsenuk from Unblocked discusses the critical need for context engines to empower AI agents, moving beyond simple data access to true understanding and autonomous operation.

6 days ago
Claude's Corner: Rhizome AI — The FDA Whisperer for Biotech
Claude's Corner

Claude's Corner: Rhizome AI — The FDA Whisperer for Biotech

Rhizome AI turns 44 million FDA and EMA regulatory documents into instant, citation-backed answers for life sciences teams. Here's how they built the data moat, why it works, and how you'd replicate it.

15 days ago
ElevenLabs Gives Chat Agents a Voice
Artificial Intelligence

ElevenLabs Gives Chat Agents a Voice

Luke Harries from ElevenLabs discusses the increasing importance of voice for AI chat agents, highlighting the benefits of speed, accessibility, and user experience.

23 days ago
RAG's Evolution: From Keywords to Agentic AI
Artificial Intelligence

RAG's Evolution: From Keywords to Agentic AI

Explore the evolution of Retrieval Augmented Generation (RAG) from basic keyword search to sophisticated agentic AI systems.

27 days ago
IBM Master Inventor on AI's Contextual Bottleneck
Artificial Intelligence

IBM Master Inventor on AI's Contextual Bottleneck

IBM Master Inventor Martin Keen discusses how context is the key bottleneck for AI models, outlining four pillars of context engineering: connected access, knowledge layer, precision retrieval, and runtime governance.

about 1 month ago
Claude's Corner: Compresr — The Token Accountant Your AI Stack Desperately Needs
Claude's Corner

Claude's Corner: Compresr — The Token Accountant Your AI Stack Desperately Needs

Four EPFL researchers built a PhD-backed LLM context compression API that could cut your token bill by 10x — or get eaten alive by Anthropic. Here's the technical breakdown and how to build your own.

about 1 month ago
Databricks Activates Documents with AI Agents
Technology

Databricks Activates Documents with AI Agents

Databricks introduces a multi-agent workflow using AI/BI Genie and Agent Bricks to automate document data extraction and activation.

about 1 month ago
AutoAdapt: Microsoft's LLM Adaptation Fix
AI Research

AutoAdapt: Microsoft's LLM Adaptation Fix

Microsoft's AutoAdapt framework automates LLM domain adaptation, making it faster, cheaper, and more reliable for real-world applications.

about 1 month ago
IBM's Katie McDonald on AI: ADK vs. RAG
Artificial Intelligence

IBM's Katie McDonald on AI: ADK vs. RAG

IBM's Katie McDonald explains the core differences between AI Agent Development Kits (ADK) and Retrieval Augmented Generation (RAG) and when to use each.

about 1 month ago
IBM's Dan Wiegand on AI and Mainframe Augmentation
Artificial Intelligence

IBM's Dan Wiegand on AI and Mainframe Augmentation

IBM's Dan Wiegand discusses how AI, including RAG and agents, is transforming daily productivity and enhancing mainframe operations.

about 1 month ago
pgvector: Postgres's AI Vector Power-Up
Technology

pgvector: Postgres's AI Vector Power-Up

pgvector brings vector embeddings and similarity search directly into PostgreSQL, simplifying AI apps like RAG and semantic search.

about 1 month ago
Cloudflare AI Search Simplifies Agent Development
Technology

Cloudflare AI Search Simplifies Agent Development

Cloudflare AI Search offers a simplified, plug-and-play primitive for developers to integrate robust search capabilities into AI agents.

about 2 months ago
Databricks Powers Real-Time Search
Technology

Databricks Powers Real-Time Search

Databricks unveils its platform for building real-time product search, integrating Vector Search, Lakeflow, and Lakebase for ingestion, retrieval, and operational data.

about 2 months ago
Databricks Touts Agentic Reasoning Gains
Technology

Databricks Touts Agentic Reasoning Gains

Databricks' Supervisor Agent enhances enterprise AI by integrating structured and unstructured data for complex reasoning tasks, showing significant performance gains.

about 2 months ago
IBM's Phil Nash Unveils Open-Source RAG Stack
Artificial Intelligence

IBM's Phil Nash Unveils Open-Source RAG Stack

IBM's Phil Nash introduces OpenRAG, an open-source RAG stack combining Docling, OpenSearch, and Langflow for flexible AI agent development.

about 2 months ago
WriteBack-RAG: Trainable Knowledge for RAG
AI Research

WriteBack-RAG: Trainable Knowledge for RAG

WriteBack-RAG enables trainable RAG knowledge bases by distilling relevant facts into the corpus, boosting performance universally across RAG systems.

2 months ago
Chroma's Context-1: Faster, Cheaper AI Search
Artificial Intelligence

Chroma's Context-1: Faster, Cheaper AI Search

Chroma Context-1, a 20B parameter AI model, offers frontier-level search performance at a fraction of the cost and latency, using self-editing to manage context efficiently.

2 months ago
Exa Unveils New Code Search Benchmarks
Artificial Intelligence

Exa Unveils New Code Search Benchmarks

Exa.ai releases 'WebCode', a new benchmark suite for evaluating search performance in coding agents, addressing limitations in existing tools.

2 months ago
Databricks Tackles Code Complexity for AI Assistants
Technology

Databricks Tackles Code Complexity for AI Assistants

Databricks details how AST-based chunking and MLflow evaluation improve AI assistants' understanding of complex codebases.

2 months ago
Mazda's GenAI Leap in Service Ops
Technology

Mazda's GenAI Leap in Service Ops

Mazda built a governed GenAI assistant on Databricks Lakehouse in 8 weeks to improve technical service operations, integrating RAG and Unity Catalog.

2 months ago
Databricks Unlocks Billion-Scale Vector Search
Technology

Databricks Unlocks Billion-Scale Vector Search

Databricks unveils a redesigned vector search capable of handling billions of vectors, drastically cutting costs and improving scalability.

3 months ago
Cloudflare Adds Website Crawling API
Artificial Intelligence

Cloudflare Adds Website Crawling API

Cloudflare launches a new /crawl endpoint for its Browser Rendering service, enabling automated website crawling via a single API call for developers.

3 months ago
IBM's Martin Keen on LLM Context Windows
Artificial Intelligence

IBM's Martin Keen on LLM Context Windows

IBM's Martin Keen explains how larger context windows in LLMs simplify deployments and improve reasoning by reducing reliance on complex RAG systems.

3 months ago
IBM Master Inventor Martin Keen on Agentic Storage
Artificial Intelligence

IBM Master Inventor Martin Keen on Agentic Storage

IBM Master Inventor Martin Keen explains 'Agentic Storage,' detailing how AI agents interact with diverse storage systems and the critical safety layers needed for responsible operation.

3 months ago
Databricks Reffy: From Tribal Data to AI Answers
Technology

Databricks Reffy: From Tribal Data to AI Answers

Databricks' Reffy uses AI and RAG to turn scattered customer stories into an instantly searchable knowledge base for sales and marketing.

3 months ago
BeeAI Framework: Extending LLMs with Tools, RAG, & AI Agents
AI Video

BeeAI Framework: Extending LLMs with Tools, RAG, & AI Agents

The BeeAI Framework: Orchestrating LLMs with Tools \n\n \n\n \"The landscape of AI is not just about building large language models, but also about making them ...

7 months ago
AI Video

BeeAI Framework: Extending LLMs with Tools, RAG, & AI Agents

The BeeAI Framework: Orchestrating LLMs with Tools \n\n \n\n \"The landscape of AI is not just about building large language models, but also about making them ...

7 months ago