TY - BOOK AU - Sutton,Richard S. AU - Barto,Andrew G. TI - Reinforcement learning: : an introduction T2 - Adaptive computation and machine learning PY - 2018///] CY - Cambridge, Massachusetts PB - MIT Press KW - Reinforcement learning KW - Electronic Books N1 - Includes bibliographical references and index; Reinforcement learning; Part I. Tabular solution methods; Multi-armed bandits; Finite Markov decision processes; Dynamic programming; Monte Carlo methods; Temporal-difference learning; n-step bootstrapping; Planning and learning with tabular methods; Part II. Approximate solution methods; On-policy prediction with approximation; On-policy control with approximation; Off-policy methods with approximation; Eligibility traces; Policy gradient methods; Part III. Looking deeper; Psychology; Neuroscience; Applications and case studies; Frontiers UR - https://www.andrew.cmu.edu/course/10-703/textbook/BartoSutton.pdf ER -