Latent Space: The AI Engineer Podcast

Episodes (Page 8)

Is finetuning GPT4o worth it? — with Alistair Pullen, Cosine (Genie)

Aug 22, 2024 · 1h 5m

✨ Cosine Genie achieved #1 ranking on SWE-Bench Full, Lite, and Verified using GPT-4o fine-tuning at scale on billions of tokens of synthetic data, beating all other agents including Cognition's Devin

Alistair Pullen

AI Magic: Shipping 1000s of successful products with no managers and a team of 12 — Jeremy Howard of Answer.ai

Aug 16, 2024 · 58m

✨ Jeremy Howard's Answer.AI ships 1000s of successful AI products with no managers and a team of 12, focusing on practical AI R&D aligned with GPU-poor needs

Segment Anything 2: Demo-first Model Development

Aug 7, 2024 · 1h 3m

✨ Meta's Segment Anything 2 (SAM 2) improves image segmentation accuracy while being 6x faster than SAM 1, and elegantly solves video segmentation with 3x fewer interactions than prior approaches

The Winds of AI Winter (Q2 Four Wars Recap) + ChatGPT Voice Mode Preview

Aug 2, 2024 · 1h 55m

✨ Q2 2024 AI progress analyzed through Four Wars framework: GPU-rich frontier labs (Claude 3.5, Mistral Large), GPU-rich helping GPU-poors (Llama 3.1 synthetic data, Phi 3, Gemma 2), and on-device LL...

Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI

Jul 23, 2024 · 1h 5m

✨ Meta released Llama 3.1-405B, the largest open source model, trained on 15T tokens and beating GPT-4 on benchmarks, with the 8B and 70B models also receiving significant spec bumps

Benchmarks 201: Why Leaderboards > Arenas >> LLM-as-Judge

Jul 12, 2024 · 58m

✨ Clémentine Fourrier leads HuggingFace's OpenLLM Leaderboard, which standardizes model evaluation using high-quality benchmarks with reproducible, centralized scoring to replace lab-specific reports

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka

Jul 5, 2024 · 1h 44m

✨ Reka AI achieved #7 on the LMSYS leaderboard with only 20 employees and $60M in funding, demonstrating that top-tier model performance no longer requires massive teams like OpenAI (600) or Google (950+ co...

State of the Art: Training >70B LLMs on 10,000 H100 clusters

Jun 25, 2024 · 1h 21m

✨ Databricks' DBRX and Imbue's 70B model outperform GPT-4o zero-shot on reasoning/coding benchmarks while using 7x less data than Llama 3 70B

[High Agency] AI Engineer World's Fair Preview

Jun 25, 2024 · 49m

✨ Raza Habib of HumanLoop hosts the High Agency podcast, flipping the interview dynamic with Shawn Wang to discuss the AI Engineer World's Fair and the relevance of the 'Rise of the AI Engineer' essay on...

How To Hire AI Engineers — with James Brady & Adam Wiggins of Elicit

Jun 21, 2024 · 1h 3m

✨ James Brady (Head of Engineering) and Adam Wiggins (Cofounder Ink & Switch, Heroku) from Elicit share hiring strategies for AI engineers, defining the role as conventional engineers with LLM and pr...

James Brady

How AI is eating Finance — with Mike Conover of Brightwave

Jun 11, 2024 · 54m

✨ Mike Conover, who led OSS models at Databricks and created Dolly, founded Brightwave as an AI research assistant for investment professionals and announced a $6M seed round led by Alessio and Decibel

ICLR 2024 — Best Papers & Talks (Benchmarks, Reasoning & Agents) — ft. Graham Neubig, Aman Sanger, Moritz Hardt

Jun 10, 2024 · 4h 29m

✨ Discusses agent benchmarks (WebArena, Sotopia), the OpenDevin agent framework, and tensions between academic research and industry implementation of AI systems

Aman Sanger Graham Neubig Moritz Hardt

How to train a Million Context LLM — with Mark Huang of Gradient.ai

May 30, 2024 · 57m

✨ Documents the evolution of context window lengths from 84k tokens (MPT-7B) to current 1M+ token models, covering the competitive 'Context Extension Campaigns' between frontier labs

ICLR 2024 — Best Papers & Talks (ImageGen, Vision, Transformers, State Space Models) ft. Durk Kingma, Christian Szegedy, Ilya Sutskever

May 27, 2024 · 3h 38m

✨ Covers ICLR 2024's best papers on image generation, vision models, and transformer architectures featuring insights from Durk Kingma, Christian Szegedy, and Ilya Sutskever

Christian Szegedy Durk Kingma Ilya Sutskever

Emulating Humans with NSFW Chatbots - with Jesse Silver

May 16, 2024 · 54m

✨ Explores the understudied role of adult content in driving AI innovation, from image generation to fine-tuning techniques developed in niche online communities

Jesse Silver

WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai

Apr 27, 2024 · 53m

✨ Introduces WebSim.ai, WorldSim, and simulative AI as the next frontier beyond RAG, exploring how models can generate coherent multi-step simulations and alternate worlds

High Agency Pydantic > VC Backed Frameworks — with Jason Liu of Instructor

Apr 19, 2024 · 52m

✨ Traces the evolution from prompt engineering hacks to structured output frameworks, with Instructor providing a clean SDK solution for reliable AI outputs

Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit

Apr 11, 2024 · 56m

✨ Covers Elicit's approach to supervising AI research processes through structured automation and verification methods

Latent Space Chats: NLW (Four Wars, GPT5), Josh Albrecht/Ali Rohde (TNAI), Dylan Patel/Semianalysis (Groq), Milind Naphade (Nvidia GTC), Personal AI (ft. Harrison Chase — LangFriend/LangMem)

Apr 6, 2024 · 2h 45m

✨ Rapid-fire updates on the Four Wars framework with NLW, covering Sora, Suno, the reshaping of the GPT-4 class landscape, and open source models (Llama 3, Mistral, Grok)

Harrison Chase

Presenting the AI Engineer World's Fair — with Sam Schillace, Deputy CTO of Microsoft

Mar 29, 2024 · 42m

✨ Announces the AI Engineer World's Fair (June 25-27) as the largest AI engineer conference of 2024, with 4 tracks daily and 500+ attendees expected

Sam Schillace
