PodcastIntel
Sign in Get Started Free
Latent Space: The AI Engineer Podcast
Latent Space: The AI Engineer Podcast

The Shape of Compute (Chris Lattner of Modular)

Jun 13, 2025 · 1h 18m
AI Summary
  • Modular breaks CUDA monopoly through MAX framework and Mojo language, matching NVIDIA performance with AMD hardware through specialized low-level GPU programming
  • MAX provides cluster-scale inference architecture; company differentiates from VLLM/SGLang through comprehensive optimization stack rather than single inference engine
  • DeepSeek's efficiency impact and reasoning model scaling (inference-time compute) represent next frontier; leadership culture emphasizes 'elite nerds' doing deep technical work over large teams

Guests on This Episode

CL
Chris Lattner
6 podcast appearances

More from Latent Space: The AI Engineer Podcast

View all episodes →

Get AI Summaries for Every New Episode

Subscribe to Latent Space: The AI Engineer Podcast and get AI summaries, guest tracking, and email digests delivered automatically.

Sign Up Free →