Reinforcement Learning for Robotics Course - Python

Learn the main reinforcement learning techniques and algorithms.

Course Summary

The course will give you the state-of-the-art opportunity to be familiar with the general concept of reinforcement learning and to deploy theory into practice by running coding exercises and simulations in ROS.

What you will learn

The reinforcement learning problem, Multi-armed bandit problem, State-Value and Action-Value functions, Markov Decision Processes, and Bellman Equation
Dynamic programming
Monte Carlo methods
Temporal-difference methods

Course Overview

Introduction to the Course

Unit for previewing the contents of the Course. Includes a practical demo.

The reinforcement learning problem

Learn some reinforcement learning basic concepts and terminology.

Dynamic programming problem

Learn about the dynamic programming (DP) concept, which in our case is tailored for solving reinforcement learning problems - Bellman equations.

Monte Carlo methods

In this unit, we are going to continue our discussion about optimal policies, which the agent evaluates, improves, and follows through Monte Carlo methods.

Temporal-Difference methods

In this unit, we are going to continue our journey of finding the most optimal way to solve MDP, for the environment where the dynamics (transitions) are unknown in advance (model-free reinforcement learning).

Course Project

In this final project, your task is to deploy a Q-learning algorithm to solve a maze environment with 3 obstacles.

Teachers

Markus Buchholz

PhD in Robotics and M.Sc in Electronics and Computer Science and M.sc in Economics. His main passion is programming (C++, Python) the Autonomous Systems by use of AI, Deep Learning and Reinforcement Learning.

Robots used

Parrot AR.Drone robot

Turtlebot robot

Learning Path

Machine Learning for Robots