John Schulman (OpenAI Cofounder) — Reasoning, RLHF, & plan for 2027 AGI — Dwarkesh Podcast

✨ AI Summary

John Schulman explains how post-training and reinforcement learning tame base model capabilities, discussing the nature of progress toward AGI
OpenAI's plan targets 2027 for AGI; the company focuses on teaching models reasoning through RL and maintaining human oversight in the loop
Research progress shows both plateaus and remaining moats; keeping humans in the loop is critical for developing good RL researchers and advancing capabilities responsibly

More from Dwarkesh Podcast

May 15, 2026 · 2h 37m

May 8, 2026 · 2h 13m

Apr 29, 2026 · 2h 13m

Apr 15, 2026 · 1h 43m