✨
AI Summary
- New format inspired by Acquired.fm without guests focusing on benchmarks as subtle but critical ingredient in LLM development
- Progression from basic word association (WordNet 1985) through entity recognition (Enron 2004) to complex reasoning benchmarks
- Benchmarks mark AI progress more than raw scale; scale alone insufficient without measuring what models actually achieve