A note on chapter 3 of Sutton & Barto: Finite Markov Decision Process (MDP)

December 8, 2025 · 20 min read