MARC View

000			01183nam a2200157 4500
020			_a9781788834247
040			_cNUST
082			_a006.31,LAP
100			_aLapan, Maxim _952053
245			_aDeep Reinforcement Learning Hands-On : _bApply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / _cMaxim Lapan
260			_aUK _bPackt _c2018
300			_axvi,523 Pages.;
505			_aWhat is Reinforcement Learning? (Page-1), OpenAI Gym (Page-25), Deep Learning with PyTorch (Page-49), The Cross-Entropy Method (Page-77), Tabular Learning and the Bellman Equation (Page-99), Deep Q-Networks(Page-119), DQN Extensions (Page-155), Stocks Trading Using RL (Page-217), Policy Gradients – An Alternative (Page-241), The Actor-Critic Method (Page-263), Asynchronous Advantage Actor-Critic (Page-283), Chatbots Training with RL (Page-303), Web Navigation (Page-351), Continuous Action Space (Page-399), Trust Regions – TRPO, PPO, and ACKTR (Page-427), Black-Box Optimization in RL (Page-443), Beyond Model-Free – Imagination (Page-467), AlphaGo Zero (Page-491).
650			_aDeep Learning _9120211
942			_2ddc _cBK _k006.31,LAP
999			_c613739 _d613739