Deep Reinforcement Learning with Python
eBook - ePub

Deep Reinforcement Learning with Python

Master classic RL, deep RL, distributional RL, inverse RL, and more with OpenAI Gym and TensorFlow, 2nd Edition

Sudharsan Ravichandiran

  1. 760 pagine
  2. English
  3. ePUB (disponibile sull'app)
  4. Disponibile su iOS e Android
eBook - ePub

Deep Reinforcement Learning with Python

Master classic RL, deep RL, distributional RL, inverse RL, and more with OpenAI Gym and TensorFlow, 2nd Edition

Sudharsan Ravichandiran

Dettagli del libro
Anteprima del libro
Indice dei contenuti
Citazioni

Informazioni sul libro

An example-rich guide for beginners to start their reinforcement and deep reinforcement learning journey with state-of-the-art distinct algorithms

Key Features

  • Covers a vast spectrum of basic-to-advanced RL algorithms with mathematical explanations of each algorithm
  • Learn how to implement algorithms with code by following examples with line-by-line explanations
  • Explore the latest RL methodologies such as DDPG, PPO, and the use of expert demonstrations

Book Description

With significant enhancements in the quality and quantity of algorithms in recent years, this second edition of Hands-On Reinforcement Learning with Python has been revamped into an example-rich guide to learning state-of-the-art reinforcement learning (RL) and deep RL algorithms with TensorFlow 2 and the OpenAI Gym toolkit.

In addition to exploring RL basics and foundational concepts such as Bellman equation, Markov decision processes, and dynamic programming algorithms, this second edition dives deep into the full spectrum of value-based, policy-based, and actor-critic RL methods. It explores state-of-the-art algorithms such as DQN, TRPO, PPO and ACKTR, DDPG, TD3, and SAC in depth, demystifying the underlying math and demonstrating implementations through simple code examples.

The book has several new chapters dedicated to new RL techniques, including distributional RL, imitation learning, inverse RL, and meta RL. You will learn to leverage stable baselines, an improvement of OpenAI's baseline library, to effortlessly implement popular RL algorithms. The book concludes with an overview of promising approaches such as meta-learning and imagination augmented agents in research.

By the end, you will become skilled in effectively employing RL and deep RL in your real-world projects.

What you will learn

  • Understand core RL concepts including the methodologies, math, and code
  • Train an agent to solve Blackjack, FrozenLake, and many other problems using OpenAI Gym
  • Train an agent to play Ms Pac-Man using a Deep Q Network
  • Learn policy-based, value-based, and actor-critic methods
  • Master the math behind DDPG, TD3, TRPO, PPO, and many others
  • Explore new avenues such as the distributional RL, meta RL, and inverse RL
  • Use Stable Baselines to train an agent to walk and play Atari games

Who this book is for

If you're a machine learning developer with little or no experience with neural networks interested in artificial intelligence and want to learn about reinforcement learning from scratch, this book is for you.

Basic familiarity with linear algebra, calculus, and the Python programming language is required. Some experience with TensorFlow would be a plus.

Domande frequenti

Come faccio ad annullare l'abbonamento?
È semplicissimo: basta accedere alla sezione Account nelle Impostazioni e cliccare su "Annulla abbonamento". Dopo la cancellazione, l'abbonamento rimarrà attivo per il periodo rimanente già pagato. Per maggiori informazioni, clicca qui
È possibile scaricare libri? Se sì, come?
Al momento è possibile scaricare tramite l'app tutti i nostri libri ePub mobile-friendly. Anche la maggior parte dei nostri PDF è scaricabile e stiamo lavorando per rendere disponibile quanto prima il download di tutti gli altri file. Per maggiori informazioni, clicca qui
Che differenza c'è tra i piani?
Entrambi i piani ti danno accesso illimitato alla libreria e a tutte le funzionalità di Perlego. Le uniche differenze sono il prezzo e il periodo di abbonamento: con il piano annuale risparmierai circa il 30% rispetto a 12 rate con quello mensile.
Cos'è Perlego?
Perlego è un servizio di abbonamento a testi accademici, che ti permette di accedere a un'intera libreria online a un prezzo inferiore rispetto a quello che pagheresti per acquistare un singolo libro al mese. Con oltre 1 milione di testi suddivisi in più di 1.000 categorie, troverai sicuramente ciò che fa per te! Per maggiori informazioni, clicca qui.
Perlego supporta la sintesi vocale?
Cerca l'icona Sintesi vocale nel prossimo libro che leggerai per verificare se è possibile riprodurre l'audio. Questo strumento permette di leggere il testo a voce alta, evidenziandolo man mano che la lettura procede. Puoi aumentare o diminuire la velocità della sintesi vocale, oppure sospendere la riproduzione. Per maggiori informazioni, clicca qui.
Deep Reinforcement Learning with Python è disponibile online in formato PDF/ePub?
Sì, puoi accedere a Deep Reinforcement Learning with Python di Sudharsan Ravichandiran in formato PDF e/o ePub, così come ad altri libri molto apprezzati nelle sezioni relative a Computer Science e Computer Science General. Scopri oltre 1 milione di libri disponibili nel nostro catalogo.

Informazioni

Anno
2020
ISBN
9781839215599

Appendix 1 – Reinforcement Learning Algorithms

Let's have a look at all the reinforcement learning algorithms we have learned about in this book.

Reinforcement learning algorithm

The steps involved in a typical reinforcement learning algorithm are given as follows:
  1. First, the agent interacts with the environment by performing an action.
  2. The agent performs an action and moves from one state to another.
  3. Then the agent will receive a reward based on the action it performed.
  4. Based on the reward, the agent will understand whether the action is good or bad.
  5. If the action was good, that is, if the agent received a positive reward, then the agent will prefer performing that action, else the agent will try performing other actions that can result in a positive reward. So reinforcement learning is basically a trial-and-error learning process.

Value Iteration

The algorithm of value iteration is given as follows:
  1. Compute the optimal value function by taking maximum over the Q function, that is,
  2. Extract the optimal policy from the computed optimal value function

Policy Iteration

The algorithm of policy iteration is given as follows:
  1. Initialize a random policy
  2. Compute the value function using the given policy
  3. Extract a new policy using the value function obtained from step 2
  4. If the extracted policy is the same as the policy used in step 2 then stop, else send the extracted new policy to step 2 and repeat steps 2 to 4

First-Visit MC Prediction

The algorithm of first-visit MC prediction is given as follows:
  1. Let total_return(s) be the sum of the return of a state across several episodes and N(s) be the counter, that is, the number of times a state is visited ac...

Indice dei contenuti

  1. Preface
  2. Fundamentals of Reinforcement Learning
  3. A Guide to the Gym Toolkit
  4. The Bellman Equation and Dynamic Programming
  5. Monte Carlo Methods
  6. Understanding Temporal Difference Learning
  7. Case Study – The MAB Problem
  8. Deep Learning Foundations
  9. A Primer on TensorFlow
  10. Deep Q Network and Its Variants
  11. Policy Gradient Method
  12. Actor-Critic Methods – A2C and A3C
  13. Learning DDPG, TD3, and SAC
  14. TRPO, PPO, and ACKTR Methods
  15. Distributional Reinforcement Learning
  16. Imitation Learning and Inverse RL
  17. Deep Reinforcement Learning with Stable Baselines
  18. Reinforcement Learning Frontiers
  19. Appendix 1 – Reinforcement Learning Algorithms
  20. Appendix 2 – Assessments
  21. Other Books You May Enjoy
  22. Index
Stili delle citazioni per Deep Reinforcement Learning with Python

APA 6 Citation

Ravichandiran, S. (2020). Deep Reinforcement Learning with Python (2nd ed.). Packt Publishing. Retrieved from https://www.perlego.com/book/2094762/deep-reinforcement-learning-with-python-master-classic-rl-deep-rl-distributional-rl-inverse-rl-and-more-with-openai-gym-and-tensorflow-2nd-edition-pdf (Original work published 2020)

Chicago Citation

Ravichandiran, Sudharsan. (2020) 2020. Deep Reinforcement Learning with Python. 2nd ed. Packt Publishing. https://www.perlego.com/book/2094762/deep-reinforcement-learning-with-python-master-classic-rl-deep-rl-distributional-rl-inverse-rl-and-more-with-openai-gym-and-tensorflow-2nd-edition-pdf.

Harvard Citation

Ravichandiran, S. (2020) Deep Reinforcement Learning with Python. 2nd edn. Packt Publishing. Available at: https://www.perlego.com/book/2094762/deep-reinforcement-learning-with-python-master-classic-rl-deep-rl-distributional-rl-inverse-rl-and-more-with-openai-gym-and-tensorflow-2nd-edition-pdf (Accessed: 15 October 2022).

MLA 7 Citation

Ravichandiran, Sudharsan. Deep Reinforcement Learning with Python. 2nd ed. Packt Publishing, 2020. Web. 15 Oct. 2022.