Blogs
Essays, thoughts, and deeper dives into topics I find interesting.
Implementing AlphaZero
January 5, 2026
Building AlphaZero from scratch to understand Monte Carlo Tree Search and deep reinforcement learning: neural network architecture, MCTS, and self-play training.
Building GPT from Scratch
January 3, 2026
Implementing a transformer language model from scratch with multi-head self-attention, RoPE, FlashAttention, and an end-to-end training pipeline including pretraining, SFT, and DPO.