Waves All the Way Down

How a transformer learns to compute modular addition, and what it reveals about generalization

April 21, 2026 · 15 min · Mark C. Kuzyk