3 followers
๐งโ๐ป Staff Engineer โ Distributed Systems, Machine Learning |๐Berlin ๐ฉ๐ช | ๐ https://venkat.eu | ๐ฌ https://twitter.com/Venkat2811
Back of the envelope calculations to estimate model's GPU memory requirements & insights into HW/SW optimizations ยท (Image Credit: HF TGI...
Exploring Locality of Reference, LMAX Disruptor & Flash Attention ยท Introduction Modern software programming languages, compilers, and frameworks...
A digestible high-level overview of what happens in The Die ยท Introduction In this article, we'll go through some fundamental low level details to...
Debugging resource leakage and optimizing server configuration ยท Intro Engineers who've built, deployed and operated backend services would've...
In my previous post, I discussed Load Balancer Engine Architecture and its features. In this post, Iโll be discussing on performance...
Itโs almost four months and it has been an amazing journey! At this point, I would like to thank my mentors Isuru Ranawaka and Kasun Indrasiri and...