Table of Contents
PART 1. MDPS: MODELS AND METHODS
Chapter 1. Markov Decision Processes
1.5. Characterization of optimal policies
1.6. Optimization algorithms for MDPs
Chapter 2. Reinforcement Learning
2.2. Reinforcement learning: a global view
2.4. From Monte Carlo to temporal difference methods
2.5. Temporal difference methods
2.6. Model-based methods: learning a model
Chapter 3. Approximate Dynamic Programming
3.2. Approximate value iteration (AVI)
3.3. Approximate policy iteration (API)
3.4. Direct minimization of the Bellman residual
3.5. Towards an analysis of dynamic programming in Lp-norm
Chapter 4. Factored Markov Decision Processes
4.2. Modeling a problem with an FMDP
4.4. Perspectives and conclusion
Chapter 5. Policy-Gradient Algorithms
5.1. Reminder about the notion of gradient
5.2. Optimizing a parameterized policy with a gradient algorithm
Chapter 6. Online Resolution Techniques
Get Markov Decision Processes in Artificial Intelligence now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.