AIDevStart
HomeDirectoryModelsListsComparisonsBlogLearn AI Dev
Submit Tool
AIDevStart

Empowering developers with curated AI tools across the entire stack.

Some links on this site are affiliate links. We may earn a commission at no extra cost to you. Learn more.

PrivacyTermsCookiesDisclosure

© 2026 AIDevStart. All rights reserved.

Back to Learn AI Dev
Deep Learning

Let's Reproduce GPT-2 (124M) from Scratch

2024-06-01Andrej Karpathy

About this Video

Andrej Karpathy trains GPT-2 from scratch live on screen using modern techniques. Covers FP16 training, Flash Attention, gradient clipping, cosine learning rate schedules, and deploying to cloud GPUs — real-world LLM training from beginning to end.

Original Content

This content is embedded from YouTube. All credit goes to the original creator Andrej Karpathy. Please support them by subscribing to their channel.

Visit Channel

Recommended Courses

Udemy$16.99

LLM Fine-Tuning Masterclass

Fine-tune open-source LLMs including Llama 3 and Mistral for your specific use case.

View Course

* Links may include affiliate tracking.