PodcastIntel
Sign in Get Started Free
Latent Space: The AI Engineer Podcast
Latent Space: The AI Engineer Podcast

⚡️The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals & Human Data

Feb 23, 2026 · 26m
AI Summary
  • OpenAI's Mia Glaese and Olivia Watkins on SWE-Bench.
  • SWE-Bench Verified is saturated and contaminated, limiting its utility.
  • The benchmark's evolution from a cleanup of the original SWE-Bench.

Guests on This Episode

More from Latent Space: The AI Engineer Podcast

View all episodes →

Get AI Summaries for Every New Episode

Subscribe to Latent Space: The AI Engineer Podcast and get AI summaries, guest tracking, and email digests delivered automatically.

Sign Up Free →