AI Summary
- John Schulman explains how post-training and reinforcement learning tame base model capabilities, discussing the nature of progress toward AGI
- OpenAI's plan targets 2027 for AGI; the company focuses on teaching models to reason through RL while keeping humans in the loop for oversight
- Research progress shows both plateaus and remaining moats; keeping humans in the loop is critical for developing good RL researchers and advancing capabilities responsibly