AI Fundamentals: Benchmarks 101

✨ AI Summary

A new guest-free format, inspired by Acquired.fm, that focuses on benchmarks as a subtle but critical ingredient in LLM development.
Traces the progression from basic word association (WordNet, 1985) through entity recognition (Enron, 2004) to complex reasoning benchmarks.
Argues that benchmarks mark AI progress more than raw scale does: scale alone is insufficient without measuring what models actually achieve.
