PodcastIntel
Latent Space: The AI Engineer Podcast

MPT-7B and The Beginning of Context=Infinity — with Jonathan Frankle and Abhinav Venigalla of MosaicML

May 20, 2023 · 1h 6m
AI Summary
  • MosaicML's MPT-7B is an open-source, commercially usable model trained on 1 trillion tokens to match LLaMA-7B quality; its StoryWriter-65k+ variant handles a 65,000-token context (with demonstrated extrapolation to 84,000 tokens at inference), far beyond GPT-3's 2,048-token window
  • Extended context enables new applications without requiring retrieval augmentation, representing a breakthrough for commercially licensed open models
  • "Context = infinity" marks a paradigm shift from narrow token windows to practical document-length processing within a single model pass
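As a rough illustration of what a document-length context window buys, the sketch below checks whether a book-length text would fit in a single forward pass. The window sizes come from the MPT-7B release and GPT-3's published limit; the ~4-characters-per-token heuristic and the sample document are illustrative assumptions, not from the episode.

```python
# Illustrative sketch (not from the episode): comparing context windows.
# Window sizes are from public model releases; the chars-per-token
# heuristic is a crude stand-in for a real tokenizer.

CONTEXT_WINDOWS = {
    "GPT-3 (davinci)": 2_048,
    "MPT-7B (base)": 2_048,
    "MPT-7B-StoryWriter-65k+": 65_536,
}

def approx_tokens(text: str) -> int:
    """Crude token estimate: ~4 characters per token for English prose."""
    return max(1, len(text) // 4)

def fits(text: str, window: int) -> bool:
    """Would this text fit in a single forward pass of the given window?"""
    return approx_tokens(text) <= window

# A book-length document: ~240k characters, roughly 60k tokens.
book = "x" * 240_000
for name, window in CONTEXT_WINDOWS.items():
    print(f"{name}: fits={fits(book, window)}")
```

Under this heuristic the book overflows a 2k-token window by an order of magnitude but fits comfortably in StoryWriter's 65k window, which is the practical difference the episode's "context = infinity" framing points at.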
