Episodes (Page 4)
✨
Noam Brown's work on solving Poker and Diplomacy demonstrates test-time compute scaling with multi-agent reasoning; System 1/2 analogy oversimplifies
Noam Brown
✨
Modular breaks CUDA monopoly through MAX framework and Mojo language, matching NVIDIA performance with AMD hardware through specialized low-level GPU programming
Chris Lattner
✨
Circuit Tracing reveals computational graphs in language models, providing mechanistic interpretability breakthrough through attribution graphs and open-source tooling released alongside Anthropic ...
Emmanuel Ameisen
✨
Solomon Hykes (Docker creator) leads Dagger, addressing agent chaos through environment isolation, standardization, and modular design enabling reproducible, trustworthy agent execution
Solomon Hykes
✨
Google's Gemini 2025 advances include real-time voice AI through Gemini Live API and native audio capabilities; millisecond-latency real-time workflows unlock new voice agent use cases
✨
CloudChef's Zippy provides industrial-scale kitchen robotics using one-shot demonstration learning, achieving Michelin-star food quality at $12/hour labor cost with a simple business model
✨
Factory.ai raised $15M Series A from Sequoia to build autonomous software engineering 'droids' handling code generation and production incident response; launched in GA after demonstrating product ...
✨
Will Brown discusses multi-turn RL for multi-hour agents and inference-time reasoning advances in Claude 4 Opus and Gemini's Deep Think
Will Brown
✨
OpenAI launches ChatGPT Codex, the first cloud-hosted Autonomous Software Engineer (A-SWE), with core developers Josh Ma and Alexander Embiricos discussing its origin from WHAM and future roadmap
Alexander Embiricos
Josh Ma
✨
Anthropic releases Claude Code, a CLI-based agent joining a four-part AI coding landscape split between IDEs (Cursor, Windsurf), vibe platforms (Bolt.new, Lovable), teammate agents (Devin), and CLI...
✨
Vector database category experienced explosive growth in 2022-2023 following ChatGPT's launch as developers built RAG applications, with Pinecone leading the charge
✨
E2B grew from a handful of developers to usage by ~50% of the Fortune 500, generating millions of sandboxes weekly as AI agents shift from chat completions to tool usage and multi-modality
Vasek Mlejnsky
✨
OpenAI launches GPT-4.1 (formerly Quasar/Optimus) with o1-level coding abilities on SWE-bench, improved instruction following, 1M token context window, and o1-level vision capabilities
✨
Dylan Patel analyzes the GPU rental market following CoreWeave's successful IPO, examining technical and financial factors driving AI compute commoditization
✨
MCP overtook OpenAPI in GitHub stars within 3 months after OpenAI and Google announced support, confirming MCP as presumptive winner of agent standard wars
✨
Unsupervised Learning and Latent Space crossover special featuring top AI minds including Noam Shazeer, Bob McGrew, Noam Brown, Dylan Patel, Percy Liang, and David Luan
✨
Dharmesh Shah (HubSpot co-founder) introduces 'hybrid teams' concept
Dharmesh Shah
✨
Snipd evolved from promising design to best-in-class podcast app with AI search/transcript features after Overcast 2.0 disappointed with no AI improvements in 3 years
✨
OpenAI released Operator and Deep Research agents in the first 2 months of 2025, now bringing agent capabilities to the API via Responses API, Web Search, Computer Use, and File Search tools
✨
Anthropic's Claude 3.7 Sonnet successfully plays Pokémon Red live on Twitch with a custom harness enabling screen vision, navigation, and game state memory
David Hershey