2019-12-07

Upper Confidence Bound (UCB) Algorithm Explained with Python code

7 mins read In this tutorial, I will explain to you the application of the Upper Confidence Bound(UCB) algorithm to solve the Multi […]
2019-12-07

Difference between model-based and model-free reinforcement learning

3 mins read To answer this question, let’s revisit the components of an MDP, the most typical decision-making framework for RL. An MDP […]
2019-12-06

Reinforcement Q-Learning from Scratch in Python with OpenAI Gym

24 mins read Most of you have probably heard of AI learning to play computer games on their own, a very popular example […]
2019-12-06

Understand Q-Learning in Reinforcement Learning with a numerical example and Python implementation

14 mins read This tutorial introduces the concept of Q-learning through a simple but comprehensive numerical example.  The example describes an agent which […]
2019-12-06

What is q-learning?

5 mins read Introduction One of my favorite algorithms that I learned while taking a reinforcement learning course was q-learning. Probably because it […]