eBook - ePub

Deep Reinforcement Learning with Python

Name: Deep Reinforcement Learning with Python
Author: Sudharsan Ravichandiran

Master classic RL, deep RL, distributional RL, inverse RL, and more with OpenAI Gym and TensorFlow, 2nd Edition

Sudharsan Ravichandiran

760 páginas
English
ePUB (apto para móviles)
Disponible en iOS y Android

eBook - ePub

Deep Reinforcement Learning with Python

Master classic RL, deep RL, distributional RL, inverse RL, and more with OpenAI Gym and TensorFlow, 2nd Edition

Sudharsan Ravichandiran

Detalles del libro

Vista previa del libro

Índice

Citas

Información del libro

An example-rich guide for beginners to start their reinforcement and deep reinforcement learning journey with state-of-the-art distinct algorithms

Key Features

Covers a vast spectrum of basic-to-advanced RL algorithms with mathematical explanations of each algorithm
Learn how to implement algorithms with code by following examples with line-by-line explanations
Explore the latest RL methodologies such as DDPG, PPO, and the use of expert demonstrations

Book Description

With significant enhancements in the quality and quantity of algorithms in recent years, this second edition of Hands-On Reinforcement Learning with Python has been revamped into an example-rich guide to learning state-of-the-art reinforcement learning (RL) and deep RL algorithms with TensorFlow 2 and the OpenAI Gym toolkit.

In addition to exploring RL basics and foundational concepts such as Bellman equation, Markov decision processes, and dynamic programming algorithms, this second edition dives deep into the full spectrum of value-based, policy-based, and actor-critic RL methods. It explores state-of-the-art algorithms such as DQN, TRPO, PPO and ACKTR, DDPG, TD3, and SAC in depth, demystifying the underlying math and demonstrating implementations through simple code examples.

The book has several new chapters dedicated to new RL techniques, including distributional RL, imitation learning, inverse RL, and meta RL. You will learn to leverage stable baselines, an improvement of OpenAI's baseline library, to effortlessly implement popular RL algorithms. The book concludes with an overview of promising approaches such as meta-learning and imagination augmented agents in research.

By the end, you will become skilled in effectively employing RL and deep RL in your real-world projects.

What you will learn

Understand core RL concepts including the methodologies, math, and code
Train an agent to solve Blackjack, FrozenLake, and many other problems using OpenAI Gym
Train an agent to play Ms Pac-Man using a Deep Q Network
Learn policy-based, value-based, and actor-critic methods
Master the math behind DDPG, TD3, TRPO, PPO, and many others
Explore new avenues such as the distributional RL, meta RL, and inverse RL
Use Stable Baselines to train an agent to walk and play Atari games

Who this book is for

If you're a machine learning developer with little or no experience with neural networks interested in artificial intelligence and want to learn about reinforcement learning from scratch, this book is for you.

Basic familiarity with linear algebra, calculus, and the Python programming language is required. Some experience with TensorFlow would be a plus.

Preguntas frecuentes

¿Cómo cancelo mi suscripción?

Simplemente, dirígete a la sección ajustes de la cuenta y haz clic en «Cancelar suscripción». Así de sencillo. Después de cancelar tu suscripción, esta permanecerá activa el tiempo restante que hayas pagado. Obtén más información aquí.

¿Cómo descargo los libros?

Por el momento, todos nuestros libros ePub adaptables a dispositivos móviles se pueden descargar a través de la aplicación. La mayor parte de nuestros PDF también se puede descargar y ya estamos trabajando para que el resto también sea descargable. Obtén más información aquí.

¿En qué se diferencian los planes de precios?

Ambos planes te permiten acceder por completo a la biblioteca y a todas las funciones de Perlego. Las únicas diferencias son el precio y el período de suscripción: con el plan anual ahorrarás en torno a un 30 % en comparación con 12 meses de un plan mensual.

¿Qué es Perlego?

Somos un servicio de suscripción de libros de texto en línea que te permite acceder a toda una biblioteca en línea por menos de lo que cuesta un libro al mes. Con más de un millón de libros sobre más de 1000 categorías, ¡tenemos todo lo que necesitas! Obtén más información aquí.

¿Perlego ofrece la función de texto a voz?

Busca el símbolo de lectura en voz alta en tu próximo libro para ver si puedes escucharlo. La herramienta de lectura en voz alta lee el texto en voz alta por ti, resaltando el texto a medida que se lee. Puedes pausarla, acelerarla y ralentizarla. Obtén más información aquí.

¿Es Deep Reinforcement Learning with Python un PDF/ePUB en línea?

Sí, puedes acceder a Deep Reinforcement Learning with Python de Sudharsan Ravichandiran en formato PDF o ePUB, así como a otros libros populares de Computer Science y Computer Science General. Tenemos más de un millón de libros disponibles en nuestro catálogo para que explores.

Información

Editorial

Packt Publishing

Año

2020

ISBN

9781839215599

Edición

Categoría

Computer Science

Categoría

Computer Science General

Appendix 1 – Reinforcement Learning Algorithms

Let's have a look at all the reinforcement learning algorithms we have learned about in this book.

Reinforcement learning algorithm

The steps involved in a typical reinforcement learning algorithm are given as follows:

First, the agent interacts with the environment by performing an action.
The agent performs an action and moves from one state to another.
Then the agent will receive a reward based on the action it performed.
Based on the reward, the agent will understand whether the action is good or bad.
If the action was good, that is, if the agent received a positive reward, then the agent will prefer performing that action, else the agent will try performing other actions that can result in a positive reward. So reinforcement learning is basically a trial-and-error learning process.

Value Iteration

The algorithm of value iteration is given as follows:

Compute the optimal value function by taking maximum over the Q function, that is,
Extract the optimal policy from the computed optimal value function

Policy Iteration

The algorithm of policy iteration is given as follows:

Initialize a random policy
Compute the value function using the given policy
Extract a new policy using the value function obtained from step 2
If the extracted policy is the same as the policy used in step 2 then stop, else send the extracted new policy to step 2 and repeat steps 2 to 4

First-Visit MC Prediction

The algorithm of first-visit MC prediction is given as follows:

Let total_return(s) be the sum of the return of a state across several episodes and N(s) be the counter, that is, the number of times a state is visited ac...

Índice

Preface
Fundamentals of Reinforcement Learning
A Guide to the Gym Toolkit
The Bellman Equation and Dynamic Programming
Monte Carlo Methods
Understanding Temporal Difference Learning
Case Study – The MAB Problem
Deep Learning Foundations
A Primer on TensorFlow
Deep Q Network and Its Variants
Policy Gradient Method
Actor-Critic Methods – A2C and A3C
Learning DDPG, TD3, and SAC
TRPO, PPO, and ACKTR Methods
Distributional Reinforcement Learning
Imitation Learning and Inverse RL
Deep Reinforcement Learning with Stable Baselines
Reinforcement Learning Frontiers
Appendix 1 – Reinforcement Learning Algorithms
Appendix 2 – Assessments
Other Books You May Enjoy
Index

Estilos de citas para Deep Reinforcement Learning with Python

APA 6 Citation

Ravichandiran, S. (2020). Deep Reinforcement Learning with Python (2nd ed.). Packt Publishing. Retrieved from https://www.perlego.com/book/2094762/deep-reinforcement-learning-with-python-master-classic-rl-deep-rl-distributional-rl-inverse-rl-and-more-with-openai-gym-and-tensorflow-2nd-edition-pdf (Original work published 2020)

Chicago Citation

Ravichandiran, Sudharsan. (2020) 2020. Deep Reinforcement Learning with Python. 2nd ed. Packt Publishing. https://www.perlego.com/book/2094762/deep-reinforcement-learning-with-python-master-classic-rl-deep-rl-distributional-rl-inverse-rl-and-more-with-openai-gym-and-tensorflow-2nd-edition-pdf.

Harvard Citation

Ravichandiran, S. (2020) Deep Reinforcement Learning with Python. 2nd edn. Packt Publishing. Available at: https://www.perlego.com/book/2094762/deep-reinforcement-learning-with-python-master-classic-rl-deep-rl-distributional-rl-inverse-rl-and-more-with-openai-gym-and-tensorflow-2nd-edition-pdf (Accessed: 15 October 2022).

MLA 7 Citation

Ravichandiran, Sudharsan. Deep Reinforcement Learning with Python. 2nd ed. Packt Publishing, 2020. Web. 15 Oct. 2022.