Learn AI Engineering

Curated video tutorials, playlists, and courses to help you master AI development.

Deep Learning

2024-02-15 • Andrej Karpathy

Attention is All You Need — Transformers Explained

An explanation of the Transformer architecture behind modern LLMs such as GPT-4, Claude, and Gemini. Covers self-attention, multi-head attention, and positional encoding.

Watch video on YouTube
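As a rough illustration of the self-attention idea this video covers, here is a minimal sketch of scaled dot-product attention in plain Python. It is not taken from the video: the function names and the toy `tokens` input are assumptions made for illustration, and the learned query/key/value projections and multiple heads of a real layer are left out.

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # The output is the attention-weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
    return outputs

# Hypothetical toy input: three 2-dimensional token vectors used directly
# as queries, keys, and values (a real layer applies learned projections).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
attended = self_attention(tokens, tokens, tokens)
```

Each row of `attended` is a blend of all three token vectors, weighted by how strongly that token's query matches each key.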

Deep Learning

2023-01-17 • Andrej Karpathy

Let's Build GPT: From Scratch, in Code

A hands-on tutorial by Andrej Karpathy. Build a GPT language model from scratch using Python and PyTorch, implementing every component: tokenization, embeddings, multi-head attention, and training. About 2 hours of pure deep learning.

Watch video on YouTube
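To give a feel for the tokenization step mentioned above, here is a minimal sketch of a character-level tokenizer in plain Python. The sample `text` and the `stoi`/`itos` names are assumptions chosen for illustration; production models generally use subword tokenizers instead.

```python
# Hypothetical sample corpus; any string works.
text = "hello world"

# The vocabulary is the sorted set of distinct characters.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}   # character -> integer id
itos = {i: ch for ch, i in stoi.items()}       # integer id -> character

def encode(s):
    """Map a string to a list of integer token ids."""
    return [stoi[c] for c in s]

def decode(ids):
    """Map a list of integer token ids back to a string."""
    return "".join(itos[i] for i in ids)

ids = encode("hello")
roundtrip = decode(ids)
```

Encoding followed by decoding returns the original string, as long as every character appears in the vocabulary.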

Deep Learning

Playlist

2022-08-17 • Andrej Karpathy

Neural Networks: Zero to Hero (Full Course)

Andrej Karpathy's complete course on building neural networks from scratch. Starts with backpropagation basics (micrograd), then builds up through MLPs, WaveNet, and full Transformer language models. One of the most thorough practical ML courses freely available online.

View playlist on YouTube

Deep Learning

2017-11-03 • 3Blue1Brown

What is Backpropagation, Really? (Visual Guide)

3Blue1Brown's crystal-clear visual explanation of backpropagation — the algorithm that makes neural network training possible. Essential for understanding how LLMs learn from data.

Watch video on YouTube
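For readers who want a concrete picture of backpropagation before watching, here is a minimal sketch for a single tanh neuron with a squared-error loss. It is an illustration only, not code from the video: the neuron, the starting values, and the learning rate are assumptions. The analytic gradient from the chain rule is compared against a finite-difference estimate.

```python
import math

def forward(w, b, x):
    # One neuron: z = w*x + b, activation a = tanh(z).
    return math.tanh(w * x + b)

def loss(w, b, x, y):
    # Squared error between the neuron's output and the target y.
    return (forward(w, b, x) - y) ** 2

def backward(w, b, x, y):
    # Chain rule: dL/dw = dL/da * da/dz * dz/dw (and dz/db = 1).
    a = forward(w, b, x)
    dL_da = 2.0 * (a - y)
    da_dz = 1.0 - a * a          # derivative of tanh(z)
    return dL_da * da_dz * x, dL_da * da_dz

# Hypothetical starting parameters and a single training example.
w, b, x, y = 0.5, -0.1, 2.0, 1.0
grad_w, grad_b = backward(w, b, x, y)

# Central finite difference as an independent check of grad_w.
eps = 1e-6
numeric_w = (loss(w + eps, b, x, y) - loss(w - eps, b, x, y)) / (2 * eps)

# One gradient-descent step with an assumed learning rate of 0.1.
lr = 0.1
new_w, new_b = w - lr * grad_w, b - lr * grad_b
loss_before = loss(w, b, x, y)
loss_after = loss(new_w, new_b, x, y)
```

The analytic and numeric gradients should agree closely, and the loss after the update step should be lower than before.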

Deep Learning

2024-06-01 • Andrej Karpathy

Let's Reproduce GPT-2 (124M) from Scratch

Andrej Karpathy reproduces GPT-2 (124M) from scratch on screen using modern techniques. Covers mixed-precision training (TF32 and bfloat16), Flash Attention, gradient clipping, cosine learning rate schedules, and training on cloud GPUs: real-world LLM training from beginning to end.

Watch video on YouTube
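Two of the techniques listed above, cosine learning rate schedules and gradient clipping, can be sketched in a few lines of plain Python. This is an illustrative sketch rather than the video's code: the linear warmup phase, the function names, and all numeric values are assumptions.

```python
import math

def cosine_lr(step, max_lr, min_lr, warmup_steps, max_steps):
    """Linear warmup followed by cosine decay from max_lr down to min_lr."""
    if step < warmup_steps:
        # Ramp up linearly so the first step is not taken at the full rate.
        return max_lr * (step + 1) / warmup_steps
    if step >= max_steps:
        return min_lr
    progress = (step - warmup_steps) / (max_steps - warmup_steps)
    coeff = 0.5 * (1.0 + math.cos(math.pi * progress))  # goes from 1 to 0
    return min_lr + coeff * (max_lr - min_lr)

def clip_by_global_norm(grads, max_norm):
    """Rescale a flat list of gradients so their L2 norm is at most max_norm."""
    norm = math.sqrt(sum(g * g for g in grads))
    if norm <= max_norm:
        return list(grads), norm
    scale = max_norm / norm
    return [g * scale for g in grads], norm

# Hypothetical schedule: 10 warmup steps, 100 steps in total.
schedule = [cosine_lr(s, 1.0, 0.1, 10, 100) for s in range(100)]

# Hypothetical gradient vector with norm 5, clipped to norm 1.
clipped, original_norm = clip_by_global_norm([3.0, 4.0], 1.0)
```

The schedule rises during warmup, peaks at `max_lr`, then decays smoothly toward `min_lr`, while clipping preserves the gradient direction and only caps its magnitude.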