#Braintrust

8 articles with this tag

Braintrust Cedes Coding to Codex

Braintrust is dramatically speeding up its development cycle by integrating OpenAI's Codex, turning customer requests into code previews in minutes.

10 days ago

Artificial Intelligence

Agent vs. Traditional Observability: Braintrust's Phil Hetzel Explains

Phil Hetzel of Braintrust discusses the fundamental differences between traditional observability and the specialized needs of AI agent evaluation.

11 days ago

Artificial Intelligence

Does GenAI Belong to Data Scientists?

Phil Hetzel of Braintrust discusses the evolving role of data scientists in Generative AI agent development, arguing for a collaborative, multidisciplinary approach.

14 days ago

Artificial Intelligence

4 Levels of AI Agent Maturity: Don't Build Slop

Ara Khan outlines a 4-level framework for building mature AI agents, emphasizing state machines, visualization, and cloud-native deployment to avoid "slop" and ensure scalability.

20 days ago

Artificial Intelligence

Building Better AI Agents: The Eval Platform Challenge

Phil Hetzel of Braintrust discusses the challenges and best practices for building effective evaluation platforms for AI agents, emphasizing a systems-level approach.

about 1 month ago

Artificial Intelligence

Braintrust CEO: Codex Speeds Up Feature Iteration

Braintrust CEO Ankur Goyal explains how OpenAI's Codex enables real-time feature iteration and feedback, bypassing traditional development backlogs.

about 2 months ago

AI Video

Evals Reimagined: Braintrust's Engineering Approach to AI Development

10 months ago

AI Video

Braintrust Unveils Loop, Automating AI Model Evaluation

10 months ago