ChatGPT VIDEO 8 October 2025

Measuring Agents With Interactive Evaluations

Agents explore, plan, and reliably execute across diverse, long-horizon tasks—challenges that static benchmarks can't measure. Hear from Greg Kamradt, President of the ARC Prize Foundation, on how evaluating agentic performance requires i...

YouTube

Agents explore, plan, and reliably execute across diverse, long-horizon tasks—challenges that static benchmarks can't measure. Hear from Greg Kamradt, President of the ARC Prize...

Agents explore, plan, and reliably execute across diverse, long-horizon tasks—challenges that static benchmarks can't measure.

Hear from Greg Kamradt, President of the ARC Prize Foundation, on how evaluating agentic performance requires interactive evaluations.

More videos from ChatGPT

All videos

ChatGPT Futures, Class of 2026: The Next Generation of AI Leaders

ChatGPT Futures, Class of 2026: The Next Generation of AI Leaders

How Omio is building the future of conversational travel

How Omio is building the future of conversational travel

Meet the ChatGPT Futures, Class of 2026

Meet the ChatGPT Futures, Class of 2026

How Zendesk CEO Tom Eggemeier goes from Idea to Action

How Zendesk CEO Tom Eggemeier goes from Idea to Action

Gemini komt eraan