Blog Archive
Other
- February 2026 - Mini-Retirement: Or, How I Learned to Stop Grinding and Took Two Years Off
- September 2025 - The Art of Safe Policy Updates: From REINFORCE to TRPO and PPO
- September 2025 - Teaching AI to Play Hokm: A Multi-Agent Reinforcement Learning Challenge
- August 2025 - Teaching (tiny) LLMs to Play Text-Based Games Using RL (on a $300 GPU)
- July 2025 - The Beautiful Intuition Behind Diffusion Models
- July 2025 - How to Tame Your Deep RL
- December 2024 - How many words do you know?
- October 2024 - Perils and Promises of AI