State of the Art: Training >70B LLMs on 10,000 H100 clusters

✨ AI Summary

Databricks' DBRX and Imbue's 70B model outperform GPT-4o zero-shot on reasoning/coding benchmarks while using 7x less data than Llama 3 70B
Imbue releasing 11 cleaned NLP benchmarks, new code reasoning benchmark, 450k human ambiguity judgments dataset, and infrastructure scripts for bare-metal cluster training
Focus on cost-aware hyperparameter optimization and practical tools for training large models efficiently on 10,000 H100 clusters

More from Latent Space: The AI Engineer Podcast

Jul 8, 2026 · 57m

Jul 1, 2026 · 1h 48m

Jun 24, 2026 · 1h 8m

Jun 22, 2026 · 1h 6m