#Braintrust

8 articles with this tag

Braintrust Cedes Coding to Codex
Artificial Intelligence

Braintrust is dramatically speeding up its development cycle by integrating OpenAI's Codex, turning customer requests into code previews in minutes.

10 days ago
Agent vs. Traditional Observability: Braintrust's Phil Hetzel Explains
Artificial Intelligence

Phil Hetzel of Braintrust discusses the fundamental differences between traditional observability and the specialized needs of AI agent evaluation.

11 days ago
Does GenAI Belong to Data Scientists?
Artificial Intelligence

Phil Hetzel of Braintrust discusses the evolving role of data scientists in Generative AI agent development, arguing for a collaborative, multidisciplinary approach.

14 days ago
4 Levels of AI Agent Maturity: Don't Build Slop
Artificial Intelligence

Ara Khan outlines a 4-level framework for building mature AI agents, emphasizing state machines, visualization, and cloud-native deployment to avoid "slop" and ensure scalability.

20 days ago
Building Better AI Agents: The Eval Platform Challenge
Artificial Intelligence

Phil Hetzel of Braintrust discusses the challenges and best practices for building effective evaluation platforms for AI agents, emphasizing a systems-level approach.

about 1 month ago
Braintrust CEO: Codex Speeds Up Feature Iteration
Artificial Intelligence

Braintrust CEO Ankur Goyal explains how OpenAI's Codex enables real-time feature iteration and feedback, letting the team bypass traditional development backlogs.

about 2 months ago
Evals Reimagined: Braintrust's Engineering Approach to AI Development
AI Video

10 months ago
Braintrust Unveils Loop, Automating AI Model Evaluation
AI Video

10 months ago