#AI Agents

50 articles with this tag

Databricks Lakebase Branches Up Databases
Technology

Databricks Lakebase introduces copy-on-write database branching, making isolated developer environments a reality and reshaping the DBA role.

about 17 hours ago
Claude's Corner: Orthogonal — The API Economy, Rebuilt for Agents
Claude's Corner

Orthogonal gives AI agents instant access to 50+ premium APIs through a single MCP server and pay-per-call credits. No API key management, no vendor onboarding — just agents buying data the same way they call a function. Here's how it works and whether you can replicate it.

about 24 hours ago
Mercedes-Benz Korea's AI Agents
Technology

Mercedes-Benz Korea leverages Databricks to build AI agents with a unified semantic layer, ensuring trusted data insights for BI and AI applications.

1 day ago
Cursor's Auto-review Tames AI Agent Autonomy
Technology

Cursor's Auto-review feature intelligently balances AI agent autonomy with security, using contextual analysis to minimize unnecessary user interruptions.

1 day ago
Cursor's Auto-review Balances Agent Autonomy
Technology

Cursor's Auto-review feature dynamically manages AI agent autonomy, using a classifier to balance productivity with security risks and minimize user interruptions.

1 day ago
Snowflake Summit: AI Agents Enter Healthcare
Technology

Snowflake Summit 2026 highlights the move of AI agents from healthcare pilots to enterprise-wide, governed workflows, powered by new CoWork capabilities.

1 day ago
OpenAI to acquire Ona
Artificial Intelligence

OpenAI plans to acquire Ona and integrate its secure cloud execution technology into Codex for long-running AI agents.

1 day ago
WorkOS's Zack Proser on Untethered AI Productivity
Artificial Intelligence

WorkOS's Zack Proser discusses how developers can maintain productivity and well-being amidst the rapid advancement of AI coding agents, focusing on balance and intentional workflows.

2 days ago
AWS, Databricks Push AI at Summit
Technology

AWS and Databricks highlighted their deepening partnership at the Data + AI Summit 2026, showcasing advancements in generative AI and secure data access.

2 days ago
Fixing AI Bugs: Humanity's Last Big Problem?
Artificial Intelligence

Ben Hylak, CTO of Raindrop, discusses the critical challenge of fixing AI agent bugs, calling it "Humanity's Last Big Problem to Solve" and highlighting Raindrop's approach to creating self-healing AI.

2 days ago
Personalized AI Agents Now Have a Benchmark
AI Research

A new iOSWorld benchmark reveals AI agents' struggles with personalized, multi-app tasks, highlighting the need for richer context and advanced reasoning capabilities.

4 days ago
Kuba Rogut: Is RAG Dead or Evolving?
Artificial Intelligence

Kuba Rogut of Turbopuffer discusses the evolution from RAG to agentic retrieval, highlighting its benefits and practical applications in AI development.

4 days ago
Databricks Genie Tames Wild Maintenance Reports
Technology

Databricks Genie AI agents are transforming solar and wind maintenance by turning unstructured PDF reports into a queryable data layer for advanced analytics.

5 days ago
AI Agents Get Dumber With More Context, Expert Warns
Artificial Intelligence

Nupur Sharma of Qodo explains how too much context can hinder AI agents, leading to the 'lost in the middle' problem, and discusses solutions like context engines and hybrid orchestration.

5 days ago
Cloudflare's Sunil Pai & Matt Carrie on Eval++ Compute Primitive
Artificial Intelligence

Cloudflare's Sunil Pai & Matt Carrie unveil Eval++, a new compute primitive for building durable, scalable, and low-latency AI agents.

5 days ago
OpenAI's Lee Spacagna on Operationalizing AI Workflows
Artificial Intelligence

Lee Spacagna from OpenAI demonstrates how AI agents can be built and operationalized to automate tasks and multiply workforce impact in financial services.

5 days ago
AI Evals: Broken But Essential, Use Them Anyway
Artificial Intelligence

Ara Khan and Cline argue that AI evaluations, though flawed, are crucial. They outline common pitfalls and a process for iterative improvement, emphasizing honesty and nuanced assessment.

7 days ago
CrewAI: Taming AI Agent Costs
Artificial Intelligence

CrewAI outlines strategies to combat rising AI agent costs by optimizing token spend through orchestration and infrastructure controls.

7 days ago
AI Agents Running Businesses: Andon Labs on Project Vend
Artificial Intelligence

Andon Labs' Lukas Petersson and Axel Backlund discuss Project Vend, an experiment using AI agents to run a simulated vending business, exploring LLM capabilities and challenges.

9 days ago
Benchmarking AI Agents: Snorkel AI's Vincent Chen Explains
AI Research

Vincent Chen from Snorkel AI explores the art and science of benchmarking AI agents, detailing the complexities and methodologies involved in evaluation.

9 days ago
GitHub Universe Enters the Agentic Era
Technology

GitHub Universe 2026 gears up for the agentic era, focusing on practical AI integration for developers.

9 days ago
Conductor CEO on AI Agents and Workflow Optimization
Artificial Intelligence

Conductor CEO Charlie Holtz discusses how his team orchestrates AI agents, emphasizing "slot-free zones," strategic model selection, and the iterative process of building effective AI workflows.

9 days ago
Endava Bets on AI Agents for Software Delivery
Artificial Intelligence

Endava is revolutionizing software delivery by embedding OpenAI's AI agents across its entire workflow, transforming how enterprises build and deploy technology.

9 days ago
Claude Code Benchmarking: Semantic Search vs. Grep
AI Research

Turbopuffer's Kuba Rogut benchmarks semantic code retrieval on Claude Code, revealing how semantic search enhances AI agent precision and efficiency compared to grep.

10 days ago
Lassie Secures $35M Series A
Investors News

a16z leads a $35 million Series A for Lassie, an AI company automating administrative tasks for small businesses, starting with dental practices.

10 days ago
Nvidia's RTX Spark: AI Agents and the Future of PCs
Artificial Intelligence

Nvidia's new RTX Spark chip aims to redefine PCs by enabling on-device AI agents for complex tasks, promising a new era of computing power and creative potential.

12 days ago
Steven Willmott on Spec-Driven Testing for AI Agents
Artificial Intelligence

Steven Willmott of SafeIntelligence discusses spec-driven testing for AI agents, emphasizing the need for clear specifications beyond traditional datasets to ensure robustness and safety.

13 days ago
Sakana AI: Finance Agents Take Shape
Technology

Sakana AI is deploying AI agents to revolutionize financial operations, with engineers focusing on practical integration and enterprise-grade reliability.

13 days ago
Nick Nisi on Building Better AI Agents
Artificial Intelligence

Nick Nisi of WorkOS discusses how to build better AI agents by focusing on measurement, enforcement, and learning from failures.

14 days ago
Claude Code's Latest Updates
Technology

Claude Code rolls out Opus 4.8 as the default model and introduces dynamic workflows, security plugins, and performance enhancements for developers.

14 days ago
Google DeepMind Explains AI Agent Building Struggles
AI Research

Philipp Schmid from Google DeepMind explains the core challenges senior engineers face when building AI agents, contrasting traditional engineering with agentic development.

14 days ago
Neo4j's Zach Blumenfeld on AI Agents and Decision Traces
Artificial Intelligence

Neo4j's Zach Blumenfeld explains why AI agents need decision traces and how context graphs, powered by Neo4j, can provide the necessary memory and reasoning capabilities for more accurate and accountable AI.

15 days ago
Claude's Corner: Salus (YC W2026), The Bouncer Your AI Agents Desperately Need
Claude's Corner

AI agents are confidently doing the wrong thing at scale. Salus is a runtime guardrails proxy that sits between your agent and its tools, validating every action before it executes. Here's what they built, how it works, and whether you could clone it.

15 days ago
Agent vs. Traditional Observability: Braintrust's Phil Hetzel Explains
Artificial Intelligence

Phil Hetzel of Braintrust discusses the fundamental differences between traditional observability and the specialized needs of AI agent evaluation.

15 days ago
OpenAI Agents SDK: Building with Model-Native Harnesses
Artificial Intelligence

OpenAI's latest Build Hour session dives into the updated Agents SDK, showcasing new features like a Codex-style harness and enhanced sandboxing capabilities.

16 days ago
Enterprise AI Agents: The Scale-Up Playbook
Technology

Enterprise leaders are finding success in scaling AI agents by embedding governance, orchestrating complex workflows, and empowering their workforce.

16 days ago
Anthropic Debuts Claude Opus 4.8
Artificial Intelligence

Anthropic unveils Claude Opus 4.8, boosting AI performance with new features like 'effort control' and 'dynamic workflows' for complex coding.

16 days ago
Neo4j: Context Graphs for AI Agents
Artificial Intelligence

Neo4j experts Andreas Kollegger and Zaid Zaim discuss how context graphs enhance AI agents for explainable and decision-aware operations.

16 days ago
AI Agents: Building Enterprise Guardians
Cybersecurity

Onyx Security CEO Maxim Bar Kogan discusses the critical need for AI agent security and governance in enterprises, highlighting the risks and solutions.

16 days ago
Databricks Genie Sparks Media Personalization
Technology

Databricks Genie uses AI to let media execs ask complex questions of their data in natural language, speeding up personalization and product development.

16 days ago
Angus McLean on Bounded Autonomy in AI
Artificial Intelligence

Angus J. McLean of Oliver discusses 'Bounded Autonomy' in AI, exploring the shift to agentic processes in advertising and offering practical advice for building AI agents.

16 days ago
Databricks Genie: Partner AI Solutions Emerge
Technology

Databricks partners are launching industry-specific conversational AI solutions built on Databricks Genie, democratizing data access and accelerating AI-driven decisions.

16 days ago
Rust: The Ideal Language for Vibe-Coding?
Technology

Daniel Szoke from Sentry argues that Rust's strict constraints make it ideal for agentic AI coding, turning compile errors into valuable debugging feedback.

16 days ago
AI Agents Are Rewriting Commerce
AI

AI agents are rewriting the rules of commerce, forcing brands to adapt or risk becoming invisible to consumers and their digital assistants.

16 days ago
Warp Bets on Open Source with GPT-5.5
Artificial Intelligence

Warp is betting on OpenAI's GPT-5.5 to power its open-source development strategy, using AI agents for coding and humans for oversight.

16 days ago
Robinhood Embraces AI Agents for Trading, Spending
Investors News

Robinhood launches Agentic Trading and Agentic Credit Card, allowing AI agents to manage investments and spending with user-controlled safety features.

16 days ago
AI's Boring Revenue Play: Compliance
Investors News

AI is transforming compliance from a costly, manual burden into a strategic revenue driver, leveraging advanced technology to navigate complex regulations.

18 days ago
Stop Babysitting AI Agents: Build a Context Engine
Artificial Intelligence

Brandon Walsenuk from Unblocked discusses the critical need for context engines to empower AI agents, moving beyond simple data access to true understanding and autonomous operation.

18 days ago
The 4 Types of AI Agent Memory Explained
Artificial Intelligence

IBM Master Inventor Martin Keen details the four essential memory types AI agents need: working, semantic, procedural, and episodic.

18 days ago
Does GenAI Belong to Data Scientists?
Artificial Intelligence

Phil Hetzel of Braintrust discusses the evolving role of data scientists in Generative AI agent development, arguing for a collaborative, multidisciplinary approach.

19 days ago