Chod Matrix > Development > Data Science > Artificial Intelligence IV – Reinforcement Learning in Java

4.6 out of 5

170 reviews on Udemy

Artificial Intelligence IV – Reinforcement Learning in Java

All you need to know about Markov Decision processes, value- and policy-iteation as well as about Q learning approach

Instructor:

Holczer Balazs

1,952 students enrolled

English [Auto]

Understand reinforcement learning

Understand Markov Decision Processes

Understand value- and policy-iteration

Understand Q-learning approach and it's applications

This course is about Reinforcement Learning. The first step is to talk about the mathematical background: we can use a Markov Decision Process as a model for reinforcement learning. We can solve the problem 3 ways: value-iteration, policy-iteration and Q-learning. Q-learning is a model free approach so it is state-of-the-art approach. It learns the optimal policy by interacting with the environment. So these are the topics:

Markov Decision Processes
value-iteration and policy-iteration
Q-learning fundamentals
pathfinding algorithms with Q-learning
Q-learning with neural networks

Introduction

Types of learning

Applications of reinforcement learning

Markov Decision Process (MDP) Theory

Markov decision processes basics I

Markov decision processes basics II

Markov decision processes - equations

Markov decision processes - illustration

Bellman-equation

How to solve MDP problems?

Mathematical formulation of reinforcement learning

Reinforcement Learning Basics Quiz

Markov Decision Process - Value Iteration

What is value iteration?

Value iteration implementation I

Value iteration implementation II

Value iteration implementation III

Value iteration implementation IV

Value iteration implementation V

Markov Decision Process - Policy Iteration

What is policy iteration?

Value iteration vs policy iteration

Q Learning Theory

Q learning introduction

Q learning introduction - the algorithm

Q learning illustration

Mathematical formulation of Q learning

Q Learning Quiz

Pathfinding with Q-Learning

---- PATHFINDING ----

Pathfinding with Q-learning I

Pathfinding with Q-learning II

Pathfinding with Q-learning III

Pathfinding with Q-learning IV

---- SHORTEST PATH ----

Shortest path with Q-learning

Exploration vs. Exploitation Problem

Exploration vs exploitation problem

N-armed bandit problem introduction

N-armed bandit problem implementation I

N-armed bandit problem implementation II

Applications: A/B testing in marketing

Exploration vs. Exploitation Quiz

Deep Reinforcement Learning Theory

What is deep Q learning?

Deep Q learning and ε-greedy strategy

Deep Q-learning introduction - remember and replay

Mathematical formulation of deep Q learning

Deep Q Learning Quiz

Course Materials (DOWNLOADS)

Course materials

How long do I have access to the course materials?

You can view and review the lecture materials indefinitely, like an on-demand channel.

Can I take my courses with me wherever I go?

Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don't have an internet connection, some instructors also let their students download course lectures. That's up to the instructor though, so make sure you get on their good side!

4.6

4.6 out of 5

170 Ratings