Notes on signal processing, computing constraints, and engineering trade-offs.
How a Jenga tower taught me what's really going on inside large AI models.
Why dense computation solved the first problem, and why sparsity is the design step that follows.
Opening up the engineering notebooks.
Scaling adds capacity. It doesn't guarantee structure. What we found when we started carving.
Why learning from a single source creates limits, and why combining perspectives produces smarter, more reliable systems.
The real problem isn't how you ask. It's how the model listens.