Reinforcement learning (Record no. 13885)

MARC details
000 -LEADER
fixed length control field 01842nam a22005297a 4500
003 - CONTROL NUMBER IDENTIFIER
control field OSt
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20231103135354.0
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS
fixed length control field m|||||o||d| 00| 0
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr || auc||a|a
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 210422s2018 maua||||sb||| 001 0 eng d
040 ## - CATALOGING SOURCE
Transcribing agency QCPL
100 1# - MAIN ENTRY--PERSONAL NAME
9 (RLIN) 1469
Personal name Sutton, Richard S.
Relator term author
245 10 - TITLE STATEMENT
Title Reinforcement learning
Remainder of title : an introduction
Statement of responsibility, etc. / Richard S. Sutton and Andrew G. Barto
250 ## - EDITION STATEMENT
Edition statement Second edition
264 #1 - PRODUCTION, PUBLICATION, DISTRIBUTION, MANUFACTURE, AND COPYRIGHT NOTICE
Place of production, publication, distribution, manufacture Cambridge, Massachusetts :
Name of producer, publisher, distributor, manufacturer MIT Press,
Date of production, publication, distribution, manufacture, or copyright notice [2018]
300 ## - PHYSICAL DESCRIPTION
Extent 1 online resource :
Other physical details illustrations
336 ## - CONTENT TYPE
Source rdacontent
Content type term text
337 ## - MEDIA TYPE
Source rdamedia
Media type term computer
338 ## - CARRIER TYPE
Source rdacarrier
Carrier type term online resource
490 ## - SERIES STATEMENT
Series statement Adaptive computation and machine learning
504 ## - BIBLIOGRAPHY, ETC. NOTE
Bibliography, etc Includes bibliographical references and index.
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Reinforcement learning
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Part I. Tabular solution methods
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Multi-armed bandits
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Finite Markov decision processes
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Dynamic programming
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Monte Carlo methods
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Temporal-difference learning
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note n-step bootstrapping
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Planning and learning with tabular methods
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Part II. Approximate solution methods
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note On-policy prediction with approximation
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note On-policy control with approximation
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Off-policy methods with approximation
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Eligibility traces
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Policy gradient methods
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Part III. Looking deeper
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Psychology
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Neuroscience
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Applications and case studies
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Frontiers
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Reinforcement learning
655 ## - INDEX TERM--GENRE/FORM
Genre/form data or focus term Electronic Books
700 1# - ADDED ENTRY--PERSONAL NAME
9 (RLIN) 1470
Personal name Barto, Andrew G.
Relator term author
856 ## - ELECTRONIC LOCATION AND ACCESS
Host name mit.edu
Uniform Resource Identifier https://www.andrew.cmu.edu/course/10-703/textbook/BartoSutton.pdf
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme Dewey Decimal Classification
Koha item type eBook (Free & Open Access)
Holdings
Library use only Collection code Permanent Location Current Location Date acquired Source of acquisition Koha item type
  Circulation Accessible online Accessible online 04/22/2021 Open access eBook (Free & Open Access)