Latent Space: The AI Engineer Podcast

Episodes (Page 10)

AGI is Being Achieved Incrementally (OpenAI DevDay w/ Simon Willison, Alex Volkov, Jim Fan, Raza Habib, Shreya Rajpal, Rahul Ligma, et al)

Nov 8, 2023 · 2h 22m

✨ OpenAI DevDay delivered GPT-4 Turbo with expanded context, custom GPTs with knowledge/actions, and a no-code builder

2h 22m

Beating GPT-4 with Open Source LLMs — with Michael Royzen of Phind

Nov 3, 2023 · 1h 7m

✨ Phind combines code search with LLM assistance to help developers find and implement technical solutions more effectively than traditional search

1h 7m

Powering your Copilot for Data – with Artem Keydunov of Cube.dev

Oct 26, 2023 · 38m

✨ Artem Keydunov of Cube.dev discusses text-to-SQL as a commodity application now with multiple ChatGPT plugins and competing startups entering the space

38m

The End of Finetuning — with Jeremy Howard of Fast.ai

Oct 19, 2023 · 1h 9m

✨ Jeremy Howard from Fast.ai discusses how finetuning paradigms are shifting with larger context windows and better base models reducing the need for traditional finetuning

1h 9m

Why AI Agents Don't Work (yet) - with Kanjun Qiu of Imbue

Oct 14, 2023 · 1h 5m

✨ Kanjun Qiu of Imbue (recently valued at $1B+) explains why current AI agents fail: they lack consistent reasoning, tool use reliability, and ability to handle complex multi-step tasks

1h 5m

[AIE Summit Preview #2] The AI Horcrux — Swyx on Cognitive Revolution

Oct 8, 2023 · 1h 29m

✨ Swyx discusses the AI Horcrux concept and cognitive science approaches to AI engineering, bridging software engineering and foundational AI research perspectives

1h 29m

[AIE Summit Preview #1] Swyx on Software 3.0 and the Rise of the AI Engineer

Oct 7, 2023 · 38m

✨ Swyx outlines Software 3.0 landscape: shift from traditional code to AI-native development, requiring new tools, architectures, and engineering practices

38m

RAG Is A Hack - with Jerry Liu from LlamaIndex

Oct 5, 2023 · 1h 8m

✨ Jerry Liu of LlamaIndex discusses RAG (Retrieval Augmented Generation) as a pragmatic hack addressing context window limitations of early GPT-3 for large datasets

1h 8m

Building the Foundation Model Ops Platform — with Raza Habib of Humanloop

Sep 29, 2023 · 1h 21m

✨ Raza Habib of Humanloop addresses the rite of passage for AI engineers: transitioning from demo to production requires prompt versioning, evaluation, monitoring, and finetuning infrastructure

1h 21m

Heralds of the AI Content Flippening — with Youssef Rizk of Wondercraft.ai

Sep 20, 2023 · 52m

✨ Youssef Rizk's Wondercraft.ai builds AI-first podcasting startup, producing HN Recap podcast

52m

Doing it the Hard Way: Making the AI engine and language 🔥 of the future — with Chris Lattner of Modular

Sep 14, 2023 · 1h 29m

✨ Chris Lattner of Modular announced $100M Series A for Mojo language and modular AI infrastructure, addressing fragmentation and poor software quality in AI development

1h 29m

The Point of LangChain — with Harrison Chase of LangChain

Sep 6, 2023 · 1h 0m

✨ Harrison Chase of LangChain discusses the rapid evolution from 2022 startup to $20-25M Series A, expanding from prompt templating to comprehensive LLM orchestration framework

1h 0m

RWKV: Reinventing RNNs for the Transformer Era — with Eugene Cheah of UIlicious

Aug 30, 2023 · 1h 12m

✨ Eugene Cheah discusses RWKV (Receptance Weighted Key Value) models as significant Transformer alternative, reviving RNNs for GPT-class LLMs with better scaling properties

1h 12m

Cursor.so: The AI-first Code Editor — with Aman Sanger of Anysphere

Aug 22, 2023 · 59m

✨ Aman Sanger of Anysphere created Cursor, an AI-first code editor designed to increase AI-assisted coding beyond Copilot's current 46% of VS Code usage toward 90%+

59m

The Mathematics of Training LLMs — with Quentin Anthony of Eleuther AI

Aug 16, 2023 · 50m

✨ Quentin Anthony of Eleuther AI demystifies the tacit knowledge around training LLMs efficiently, enabling non-insiders to estimate scaling laws and cost-performance tradeoffs

50m

LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML

Aug 10, 2023 · 52m

✨ Tianqi Chen of CMU/OctoML discusses MLC (Machine Learning Compilation) enabling LLMs to run on consumer hardware without GPUs

52m

[AI Breakdown] Summer AI Technical Roundup: a Latent Space x AI Breakdown crossover pod!

Aug 4, 2023 · 59m

✨ NLW and hosts discuss Code Interpreter release as potential GPT-4.5, highlighting unexpected capabilities beyond coding and challenges in AI model evaluation

59m

FlashAttention 2: making Transformers 800% faster w/o approximation - with Tri Dao of Together AI

Jul 26, 2023 · 54m

✨ Tri Dao explains FlashAttention: I/O-aware optimization reducing attention memory from O(N²) to sub-quadratic O(N) while maintaining exact computation without approximation

54m

Llama 2: The New Open LLM SOTA (ft. Nathan Lambert, Matt Bornstein, Anton Troynikov, Russell Kaplan, Whole Mars Catalog et al.)

Jul 19, 2023 · 1h 19m

✨ Llama 2 released for commercial use with 2 trillion tokens pretraining, 2x context length, and ~$20M RLHF investment, immediately becoming leading open LLM

Anton Troynikov Matt Bornstein Nathan Lambert Russell Kaplan

1h 19m

AI Fundamentals: Datasets 101

Jul 17, 2023 · 1h 0m

✨ GPT-3 trained on ~600GB of data (Wikipedia, Books, WebText, CommonCrawl), not the entire internet as commonly claimed

1h 0m

← Prev 1 2 3 … 9 10 11 12 Next →

Episodes (Page 10)

Track New Episodes & Guest Appearances