Andrej Karpathy

Let's reproduce GPT-2 (124M)

Jun 9, 2024
AI Summary
  • Reproduces the GPT-2 (124M) model from scratch.
  • Covers building the network, optimizing training, and setting up hyperparameters.
  • Showcases amusing model generations after training.
