Nguyen Le
@yasuonguyen01



















> about
/ build - break - understand /
- I'm interested in LLM inference, so I'm trying to learn everything around it: quantization, kernel engineering, and so on.
- Besides LLM inference, I'm also interested in mathematics and theoretical computer science topics such as complexity theory and programming language theory.
- I also love to spend time with my family and my friends.
> work
VinMotion
Robotics Intern · December 2025 - May 2026
VinBigdata
AI Engineer Trainee · July 2025 - December 2025
Made some good friends along the way.
Gameloft Vietnam
C++ Game Developer Intern · November 2024 - March 2025
Shipped game features. Fought memory leaks. Learned why pointers matter.
Bosch Vietnam
AI Engineer Intern · August 2024 - November 2024
First time building an agentic AI pipeline.
> education
VNUHCM - University of Science
Bachelor of Science in Artificial Intelligence · Oct 2021 - Oct 2025
Best investment in myself I ever made.
> blog
- Why do we need KV caching?
A visual explanation of why LLM inference caches key and value vectors, and why the trick saves compute during token-by-token generation.
May 17, 2026 · 9 min read