000 01183nam a2200157 4500
020 _a9781788834247
040 _cNUST
082 _a006.31,LAP
100 _aLapan, Maxim
_952053
245 _aDeep Reinforcement Learning Hands-On :
_bApply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more /
_cMaxim Lapan
260 _aUK
_bPackt
_c2018
300 _axvi,523 Pages.;
505 _aWhat is Reinforcement Learning? (Page-1), OpenAI Gym (Page-25), Deep Learning with PyTorch (Page-49), The Cross-Entropy Method (Page-77), Tabular Learning and the Bellman Equation (Page-99), Deep Q-Networks(Page-119), DQN Extensions (Page-155), Stocks Trading Using RL (Page-217), Policy Gradients – An Alternative (Page-241), The Actor-Critic Method (Page-263), Asynchronous Advantage Actor-Critic (Page-283), Chatbots Training with RL (Page-303), Web Navigation (Page-351), Continuous Action Space (Page-399), Trust Regions – TRPO, PPO, and ACKTR (Page-427), Black-Box Optimization in RL (Page-443), Beyond Model-Free – Imagination (Page-467), AlphaGo Zero (Page-491).
650 _aDeep Learning
_9120211
942 _2ddc
_cBK
_k006.31,LAP
999 _c613739
_d613739