Reinforcement learning : an introduction
/ Richard S. Sutton and Andrew G. Barto
- Second edition
- 1 online resource : illustrations
- Adaptive computation and machine learning .
Includes bibliographical references and index.
Reinforcement learning Part I. Tabular solution methods Multi-armed bandits Finite Markov decision processes Dynamic programming Monte Carlo methods Temporal-difference learning n-step bootstrapping Planning and learning with tabular methods Part II. Approximate solution methods On-policy prediction with approximation On-policy control with approximation Off-policy methods with approximation Eligibility traces Policy gradient methods Part III. Looking deeper Psychology Neuroscience Applications and case studies Frontiers