Articles

This is where I share what I'm learning, building, explaining (or breaking):

Everything About Transformers

A deep dive into transformers (history + illustrations)

A hands-on guide to implementing memory-efficient attention mechanisms using CUDA kernels