From Markov Decision Processes to Reinforcement Learning with Python

7 Applications

1 Preface
2 Abstract
3 Introduction
4 Multi-armed Bandits
5 Finite Markov Decision Processes (MDPs)
6 Dynamic Programming (DP)
7 Applications
8 Project proposal
9 Evaluation Rubric
10 List of Home Works and due dates
11 Homework grades
References

Table of contents

Recycling Robot
A robot with randomly moves in a grid world.

Edit this page
Report an issue

Applications

Recycling Robot

A robot with randomly moves in a grid world.

6 Dynamic Programming (DP)

8 Project proposal

Source Code

# Applications

## Recycling Robot

## A robot with randomly moves in a grid world.

This notes for RL, are the first draft of for the course: From Markov Decision Processes to Reinforcement Learning

Edit this page
Report an issue