Episodes (Page 7)
Meta released Llama 3.1-405B, the largest open-source model to date, trained on 15T tokens and beating GPT-4 on benchmarks; the 8B and 70B models also received significant spec bumps
Clémentine Fourrier leads Hugging Face's Open LLM Leaderboard, which standardizes model evaluation on high-quality benchmarks with reproducible, centralized scoring, replacing lab-specific self-reports
Reka AI reached #7 on the LMSYS leaderboard with only 20 employees and $60M in funding, demonstrating that top-tier model performance no longer requires massive teams like OpenAI (600) or Google (950+ co...
Databricks' DBRX and Imbue's 70B model outperform GPT-4o zero-shot on reasoning and coding benchmarks while using 7x less data than Llama 3 70B
Raza Habib of HumanLoop hosts the High Agency podcast, flipping the interview dynamic with Shawn Wang to discuss the AI Engineer World's Fair and the relevance of the 'Rise of the AI Engineer' essay on...
James Brady (Head of Engineering) and Adam Wiggins (cofounder of Ink & Switch and Heroku) of Elicit share hiring strategies for AI engineers, defining the role as conventional engineers with LLM and pr...
Mike Conover, who led OSS models at Databricks and created Dolly, founded Brightwave as an AI research assistant for investment professionals and announced a $6M seed round led by Alessio and Decibel
Discusses code-editing benchmarks (WebArena, Sotopia), the OpenDevin agent framework, and the tensions between academic research and industry implementation of AI systems
With Aman Sanger, Graham Neubig, and Moritz Hardt
Documents the evolution of context window lengths from 84k tokens (MPT-7B) to today's 1M+ token models, covering the competitive 'Context Extension Campaigns' between frontier labs
Covers ICLR 2024's best papers on image generation, vision models, and transformer architectures, featuring insights from Durk Kingma, Christian Szegedy, and Ilya Sutskever
Explores the understudied role of adult content in driving AI innovation, from image generation to fine-tuning techniques developed in niche online communities
With Jesse Silver
Introduces WebSim.ai, WorldSim, and simulative AI as the next frontier beyond RAG, exploring how models can generate coherent multi-step simulations and alternate worlds
Traces the evolution from prompt-engineering hacks to structured-output frameworks, with Instructor providing a clean SDK solution for reliable AI outputs
Covers Elicit's approach to supervising AI research processes through structured automation and verification methods
Rapid-fire updates on the Four Wars framework with NLW, covering Sora, Suno, and the reshaping of the GPT-4-class landscape, plus open-source models (Llama 3, Mistral, Grok)
With Harrison Chase
Announces the AI Engineer World's Fair (June 25-27) as the largest AI engineer conference of 2024, with 4 tracks daily and 500+ attendees expected
With Sam Schillace
David Luan (Adept co-founder, ex-OpenAI/Google) explains why Google failed to capitalize on LLM advances despite its early leadership in AI research
Mikey Shulman (Suno CEO) discusses the evolution from text-to-speech and voice synthesis to music generation using transformer architectures
January 2024 recap covering top research trends, including notable advances in architecture and training methodologies across major labs
Soumith Chintala (Meta AI engineering lead and PyTorch creator) discusses open-source AI's importance for trustworthiness and accessibility