| 000 | 01183nam a2200157 4500 | ||
|---|---|---|---|
| 020 | _a9781788834247 | ||
| 040 | _cNUST | ||
| 082 | _a006.31,LAP | ||
| 100 |
_aLapan, Maxim _952053 |
||
| 245 |
_aDeep Reinforcement Learning Hands-On : _bApply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / _cMaxim Lapan |
||
| 260 |
_aUK _bPackt _c2018 |
||
| 300 | _axvi,523 Pages.; | ||
| 505 | _aWhat is Reinforcement Learning? (Page-1), OpenAI Gym (Page-25), Deep Learning with PyTorch (Page-49), The Cross-Entropy Method (Page-77), Tabular Learning and the Bellman Equation (Page-99), Deep Q-Networks(Page-119), DQN Extensions (Page-155), Stocks Trading Using RL (Page-217), Policy Gradients – An Alternative (Page-241), The Actor-Critic Method (Page-263), Asynchronous Advantage Actor-Critic (Page-283), Chatbots Training with RL (Page-303), Web Navigation (Page-351), Continuous Action Space (Page-399), Trust Regions – TRPO, PPO, and ACKTR (Page-427), Black-Box Optimization in RL (Page-443), Beyond Model-Free – Imagination (Page-467), AlphaGo Zero (Page-491). | ||
| 650 |
_aDeep Learning _9120211 |
||
| 942 |
_2ddc _cBK _k006.31,LAP |
||
| 999 |
_c613739 _d613739 |
||