TY  - BOOK
AU  - Lapan, Maxim
TI  - Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more 
SN  - 9781788834247
U1  - 006.31,LAP 
PY  - 2018///
CY  - UK
PB  - Packt
KW  - Deep Learning
N1  - What is Reinforcement Learning? (Page-1), OpenAI Gym (Page-25), Deep Learning with PyTorch (Page-49), The Cross-Entropy Method (Page-77), Tabular Learning and the Bellman Equation (Page-99), Deep Q-Networks(Page-119), DQN Extensions (Page-155), Stocks Trading Using RL (Page-217), Policy Gradients – An Alternative (Page-241), The Actor-Critic Method (Page-263), Asynchronous Advantage Actor-Critic (Page-283), Chatbots Training with RL (Page-303), Web Navigation (Page-351), Continuous Action Space (Page-399), Trust Regions – TRPO, PPO, and ACKTR (Page-427), Black-Box Optimization in RL (Page-443), Beyond Model-Free – Imagination (Page-467), AlphaGo Zero (Page-491)
ER  -