Episodes (Page 2)
✨
METR's Joel Becker on exponential time horizon evaluations.
✨
Nathan Lambert and Sebastian Raschka discuss Anthropic distillation.
✨
Prof. Max Welling on CuspAI's materials discovery platform.
✨
Doug O'Laughlin discusses Claude Code for finance and memory shortage.
Doug O'Laughlin
✨
OpenAI's Mia Glaese and Olivia Watkins on SWE-Bench.
Mia Glaese
Olivia Watkins
✨
Martin Casado and Sarah Wang discuss venture vs. growth funding.
✨
Jeff Dean discusses shaping AI infrastructure from search to TPUs.
Jeff Dean
✨
Boltz is open-sourcing the future of drug discovery with new models.
✨
Goodfire converts mechanistic interpretability from 'peeking inside models' into production workflow via APIs and enterprise deployments, raising $150M Series B at $1.25B valuation
✨
Andrew White progresses from molecular dynamics simulations and red-teaming GPT-4 for chemistry to co-founding Future House (FRO) and Edison Scientific automating science at scale
Andrew White
✨
Yi Tay shipped Gemini Deep Think and IMO Gold, launched Reasoning and AGI team in Singapore expanding from 12 to 300+ researchers pivoting from architecture research to RL-driven reasoning
Yi Tay
✨
James Reggio led Brex's three-pillar AI strategy (corporate, operational, product AI) with SOP-driven agents outperforming overengineered RL in operations at scale
CTO James Reggio
✨
Artificial Analysis evolved from 2024 Sydney basement side project to independent gold standard for AI benchmarking trusted by developers, enterprises, and major labs
✨
LMArena raised $150M Series A at $1.7B valuation with $30M annualized consumption revenue ($2.5M MRR) after September 2025 evals product launch expansion
Anastasios Angelopoulos
✨
Kevin Wang et al. won NeurIPS 2025 Best Paper by scaling RL networks to 1,000 layers deep, defying decade-long conventional wisdom that depth fails in RL
✨
John Yang recaps SWE-bench's evolution from ignored (Oct 2023) to industry standard after Devin's launch, expanding from Django-heavy to 9 languages across 40 repos
John Yang
✨
Josh McGrath at OpenAI describes post-training evolution from 2023 PPO vs DPO debates to current RLVR era where data quality and signal trust matter more than optimization method
Josh McGrath
✨
Ashvin Nair from Cursor shipped RL breakthroughs on GPT-4o/o1/o3; reasoning team scaled from 12 to 300+ people; IOI Gold felt reachable in 2022 but only materialized when o1 shipped
Ashvin Nair
✨
Sarah Catanzaro transitions from modern data stack investing (DBT, Fivetran) to AI infrastructure frontier, warning that $100M+ seed rounds with no near-term roadmap are now standard
Sarah Catanzaro
✨
MCP evolved from local-only experiment (Thanksgiving launch) to de facto protocol adopted by OpenAI, Microsoft, Google, Block with hundreds of enterprise agents at scale