Experimenting with bearblog for now
Back-of-the-envelope calculations to estimate a model's GPU memory requirements, plus insights into HW/SW optimizations

Exploring Locality of Reference, LMAX Disruptor & Flash Attention

A digestible high-level overview of what happens in The Die

Debugging resource leakage and optimizing server configuration
