Nguyen Le
@yasuonguyen01



















> about
/ build - break - understand /
- I'm interested in LLM inference. So basically I will try to learn everything around it, quantization, kernel engineering, etc.
- Beside LLM inference, I'm also interested in mathematics and some theoretical computer science stuffs such as complexity theory or programming language theory.
-
I also love to spend time with
andmy family 
.my friends
> work
VinMotion
Robotics Engineer · December 2025 - Recent
VinBigdata
AI Engineer Trainee · July 2025 - December 2025
Made some good friends along the way.
> education
VNUHCM - University of Science
Bachelor of Science in Artificial Intelligence · Oct 2021 - Oct 2025
Best investment in myself I ever made
> blog
-
Why do we need KV caching?
A visual explanation of why LLM inference caches key and value vectors, and why the trick saves compute during token-by-token generation.
May 17, 2026 9 min read