Tags
Language
Tags
December 2024
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30 31 1 2 3 4

Python Reinforcement Learning, Deep Q-Learning And Trfl

Posted By: ELK1nG
Python Reinforcement Learning, Deep Q-Learning And Trfl

Python Reinforcement Learning, Deep Q-Learning And Trfl
Last updated 5/2019
MP4 | Video: h264, 1280x720 | Audio: AAC, 44.1 KHz
Language: English | Size: 1.94 GB | Duration: 5h 13m

Leverage the power of Reinforcement Learning techniques to develop intelligent systems using Python

What you'll learn

Implement state-of-the-art Reinforcement Learning algorithms from the basics

Discover various techniques of Reinforcement Learning such as MDP, Q Learning, and more

Dive into Temporal Difference Learning, an algorithm that combines Monte Carlo methods and dynamic programming

Create a virtual Self Driving Car application with Deep Q-Learning

Teach a Reinforcement Learning model to play a game using TensorFlow and the OpenAI gym

Build projects with TRFL and TensorFlow and integrate essential RL building blocks into existing code

Discover improvements to RL algorithms such as DQN and DDPG with TRFL blocks—for example, advanced target network updating, Double Q Learning, and Distributional Q Learning

Modify RL agents to include multistep reward techniques such as TD lambda

Create TRFL-based RL agents with classic RL methods such as TD Learning, Q Learning, and SARSA

Requirements

Basic knowledge of Python is required.

Description

Reinforcement Learning (RL), allows you to develop smart, quick and self-learning systems in your business surroundings. It is an effective method to train your learning agents and solve a variety of problems in Artificial Intelligence—from games, self-driving cars and robots to enterprise applications that range from data centre energy saving (cooling data centres) to smart warehousing solutions.This course covers the major advancements and successes achieved in deep reinforcement learning by synergizing deep neural network architectures with reinforcement learning. You will be introduced to the concept of Reinforcement Learning, its advantages and why it's gaining so much popularity. This course also discusses on Markov Decision Process (MDPs), Monte Carlo tree searches, dynamic programmings such as policy and value iteration, temporal difference learning such as Q-learning and SARSA. You will learn to build convolutional neural network models using TensorFlow and Keras. You will also learn the use of artificial intelligence in a gaming environment with the help of OpenAI Gym.By the end of this course, you will explore reinforcement learning and will have hands-on experience with real data and artificial intelligence (AI) to build intelligent systems.Meet Your Expert(s):We have the best work of the following esteemed author(s) to ensure that your learning journey is smooth:● Lauren Washington is currently the Lead Data Scientist and Machine Learning Developer for smartQED, an AI driven start-up. Lauren worked as a Data Scientist for Topix, Payments Risk Strategist for Google (Google Wallet/Android Pay), Statistical Analyst for Nielsen, and Big Data Intern for the National Opinion Research Center through the University of Chicago. Lauren is also passionate about teaching Machine Learning. She’s currently giving back to the data science community as a Thinkful Data Science Bootcamp Mentor  and a Packt Publishing technical video reviewer. She also earned a Data Science certificate from General Assembly San Francisco (2016), a MA in the Quantitative Methods in the Social Sciences (Applied Statistical Methods) from Columbia University (2012), and a BA in Economics from Spelman College (2010). Lauren is a leader in AI, in Silicon Valley, with a passion for knowledge gathering and sharing.● Kaiser Hamid Rabbi is a Data Scientist who is super-passionate about Artificial Intelligence, Machine Learning, and Data Science. He has entirely devoted himself to learning more about Big Data Science technologies such as Python, Machine Learning, Deep Learning, Artificial Intelligence, Reinforcement Learning, Data Mining, Data Analysis, Recommender Systems and so on over the last 4 years. Kaiser also has a huge interest in Lygometry (things we know we do not know!) and always tries to understand domain knowledge based on his project experience as much as possible.● Colibri Digital is a technology consultancy company founded in 2015 by James Cross and Ingrid Funie. The company works to help its clients navigate the rapidly changing and complex world of emerging technologies, with deep expertise in areas such as big data, data science, machine learning, and Cloud computing. Over the past few years, they have worked with some of the World's largest and most prestigious companies, including a tier 1 investment bank, a leading management consultancy group, and one of the World's most popular soft drinks companies, helping each of them to better make sense of its data, and process it in more intelligent ways. The company lives by its motto: Data -> Intelligence -> Action.● Jim DiLorenzo is a freelance programmer and Reinforcement Learning enthusiast. He graduated from Columbia University and is working on his Masters in Computer Science. He has used TRFL in his own RL experiments and when implementing scientific papers into code.

Overview

Section 1: Practical Reinforcement Learning - Agents and Environments

Lecture 1 The Course Overview

Lecture 2 Install RStudio

Lecture 3 Install Python

Lecture 4 Launch Jupyter Notebook

Lecture 5 Learning Type Distinctions

Lecture 6 Get Started with Reinforcement Learning

Lecture 7 Real-world Reinforcement Learning Examples

Lecture 8 Key Terms in Reinforcement Learning

Lecture 9 OpenAI Gym

Lecture 10 Monte Carlo Method

Lecture 11 Monte Carlo Method in Python

Lecture 12 Monte Carlo Method in R

Lecture 13 Practical Reinforcement Learning in OpenAI Gym

Lecture 14 Markov Decision Process Concepts

Lecture 15 Python MDP Toolbox

Lecture 16 Value and Policy Iteration in Python

Lecture 17 MDP Toolbox in R

Lecture 18 Value Iteration and Policy Iteration in R

Lecture 19 Temporal Difference Learning

Lecture 20 Temporal Difference Learning in Python

Lecture 21 Temporal Difference Learning in R

Section 2: Advanced Practical Reinforcement Learning

Lecture 22 The Course Overview

Lecture 23 Introduction to Deep Reinforcement Learning

Lecture 24 Deep Q-Learning and Double Deep Q-Learning

Lecture 25 Q-Learning in Python

Lecture 26 Q-Learning in R

Lecture 27 TensorFlow

Lecture 28 TensorFlow in Python

Lecture 29 Deep Q-Learning with TensorFlow in Python

Lecture 30 Keras

Lecture 31 Keras in Python

Lecture 32 Deep Q-Learning with Keras in Python

Lecture 33 Deep Q-Learning with Keras in R

Lecture 34 Case Study – Reinforcement Learning

Section 3: Hands-On Deep Q-Learning

Lecture 35 The Course Overview

Lecture 36 Artificial Intelligence in a Nutshell

Lecture 37 Reinforcement Learning Dynamics

Lecture 38 The Bellman Equation

Lecture 39 Markov Decision Process

Lecture 40 Policy versus Plan and Living Penalty

Lecture 41 Q-Learning Intuition

Lecture 42 Temporal Difference

Lecture 43 Learning Phase of Deep Q-Learning

Lecture 44 Acting Phase of Deep Q-Learning

Lecture 45 Experience Reply and Action Selection Policies

Lecture 46 Installing PYTORCH environment

Lecture 47 Self Driving Car – Part 1

Lecture 48 Self Driving Car – Part 2

Lecture 49 Self Driving Car – Part 3

Lecture 50 Playing with Our SDC AI

Lecture 51 Convolutional Neural Network

Lecture 52 Deep Convolutional Q-Learning

Lecture 53 Eligibility Trace

Lecture 54 Installing OpenAIGym and ppaquette

Lecture 55 Build an AI for DOOM – Part 1

Lecture 56 Build an AI for DOOM – Part 2

Lecture 57 Build an AI for DOOM – Part 3

Lecture 58 Playing with our AI in DOOM

Section 4: Reinforcement Learning with TensorFlow & TRFL

Lecture 59 The Course Overview

Lecture 60 Set Up and Installation

Lecture 61 Getting Started with TD Learning

Lecture 62 Exploiting Off-policy Efficiency Using Q Learning

Lecture 63 Comparing On-policy Methods with SARSA and SARSE

Lecture 64 Implementing a Deep Q Network and Applying Target Network Updates

Lecture 65 Modifying a DQN with Double DQN, Persistent DQN, and Huber Loss

Lecture 66 Improving a DQN with Distributional Q Learning

Lecture 67 Utilizing Policy Gradient Methods

Lecture 68 Increasing Exploration with Policy Entropy Loss

Lecture 69 Applying Actor-Critic with A3C and A2C

Lecture 70 Performing Deterministic Policy Gradients

Lecture 71 Deploying TD(λ)

Lecture 72 Balancing Bias and Variance with Generalized λ Returns

Lecture 73 Applying Q(λ)

Lecture 74 Working with Multi-step Forward View

Lecture 75 Using Importance Sampling with Retrace (λ)

Lecture 76 Getting Started with Impala with V-Trace

Lecture 77 Augmenting an Agent with Unreal and Pixel Control

This course is designed for AI engineers, Machine Learning engineers, aspiring Reinforcement Learning and Data Science professionals keen to extend their skill set to Reinforcement Learning using Python.