AI Summary
- Nick Joseph, Anthropic's Head of Pre-training, reveals the infrastructure and engineering challenges behind training frontier models like Claude at massive scale
- Training involves managing thousands of GPUs, tracking down complex infrastructure bugs, and balancing compute allocation between pre-training and reinforcement learning
- The hardest problems in AI are often infrastructure problems rather than pure machine learning problems, and solving them requires deep systems engineering expertise