#Braintrust
8 articles with this tag
Braintrust Cedes Coding to Codex
Braintrust is dramatically speeding up its development cycle by integrating OpenAI's Codex, turning customer requests into code previews in minutes.

Agent vs. Traditional Observability: Braintrust's Phil Hetzel Explains
Phil Hetzel of Braintrust discusses the fundamental differences between traditional observability and the specialized needs of AI agent evaluation.

Does GenAI Belong to Data Scientists?
Phil Hetzel of Braintrust discusses the evolving role of data scientists in Generative AI agent development, arguing for a collaborative, multidisciplinary approach.

4 Levels of AI Agent Maturity: Don't Build Slop
Ara Khan outlines a 4-level framework for building mature AI agents, emphasizing state machines, visualization, and cloud-native deployment to avoid "slop" and ensure scalability.

Building Better AI Agents: The Eval Platform Challenge
Phil Hetzel of Braintrust discusses the challenges and best practices for building effective evaluation platforms for AI agents, emphasizing a systems-level approach.

Braintrust CEO: Codex Speeds Up Feature Iteration
Braintrust CEO Ankur Goyal explains how their tool, Codex, enables real-time feature iteration and feedback, bypassing traditional development backlogs.

Evals Reimagined: Braintrust's Engineering Approach to AI Development
