✨ AI Summary
- Kevin Wang et al. won the NeurIPS 2025 Best Paper award by scaling RL networks to 1,000 layers, defying a decade of conventional wisdom that depth fails in RL
- Key insight: self-supervised RL using contrastive learning on state/action/future representations scales where value-based methods collapse; architecture matters (residual connections, layer norm)
- Scaling depth proves more parameter-efficient than scaling width (parameters grow linearly with depth but quadratically with width); the shift from regression to classification objectives enabled the breakthrough
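
The architectural ingredients and the depth-vs-width parameter arithmetic above can be sketched briefly. This is an illustrative toy (NumPy, plain MLP blocks), not the paper's actual implementation; `layer_norm`, `residual_block`, and `mlp_params` are hypothetical helper names:

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Normalize each feature vector to zero mean, unit variance.
    return (x - x.mean(-1, keepdims=True)) / (x.std(-1, keepdims=True) + eps)

def residual_block(x, W):
    # Pre-norm residual block: x + f(LN(x)). The identity skip path is
    # what lets gradients survive hundreds of stacked layers.
    return x + np.maximum(layer_norm(x) @ W, 0.0)

def mlp_params(depth, width):
    # Each hidden layer holds a width x width weight matrix plus a bias,
    # so parameter count is linear in depth but quadratic in width.
    return depth * (width * width + width)

base = mlp_params(16, 256)
deep = mlp_params(64, 256)   # 4x depth  -> exactly 4x the parameters
wide = mlp_params(16, 1024)  # 4x width  -> roughly 16x the parameters
```

Doubling depth doubles the parameter count, while doubling width roughly quadruples it, which is why very deep, moderately wide networks are the cheaper way to add capacity.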