Jakob Zeitler
  • Blog
  • Causality
  • About
  • Portfolio

Thoughts on Reinforcement Learning

12/26/2016

0 Comments

 
Picture
As part of coursework I have been working with reinforcement learning algorithms lately. Reinforcement learning has gained interest with quite a lot of academic and commercial communites that are interested in deploying "intelligent" agents to make decision for them, better precision than humanly possible.

I read Sutton's book "Reinforcement Learning" to better understand the historical and mathemtical background and it turns out that all reinforcement learning is, at least in its basic parts, is a a collection of Markov Decision problem solving procedures. Specificially, Dynmaic Programming based on the Bellman equation is a good way to solve learning problems where the model is known as a whole. If you don't know the model as a whole or only want to solve a part of the problem, then the application of Marov Chain Monte Carlo methods helps you pull in Bayesian learning to deal with that. Eventually, all you do is simulate or process data and analyse corresponding Markov Chains. 

So as a whole, I was a bit disappointed, because the idea of Markov Chains as such is not to challenging (the mathematics of it is, though). Markov Chains and Monte Carlo are very intutitive concepts and solving them also coincides with goals people try to achieve in reinforcement learning.

Anyway, I had a lot of fun exploring all the different ways to solve Grid World problems and I might dive deeper into the issue with an application closer to the real world, hopefully soon.
0 Comments



Leave a Reply.

    Picture

    Jakob Zeitler

    There is no shortcut to happiness.

    Archives

    February 2020
    November 2019
    February 2019
    December 2018
    May 2018
    December 2017
    August 2017
    April 2017
    January 2017
    December 2016
    October 2016
    September 2016
    August 2016
    July 2016
    June 2016
    May 2016
    April 2016
    January 2016
    November 2015
    October 2015
    June 2015
    May 2015

    Categories

    All
    Clouds
    Computers
    Cooking
    Face Recognition
    Food
    Life
    Microsoft
    Music
    Open Source
    Project Oxford
    Python
    Raspberry Pi
    Sports
    Talks

    RSS Feed

Powered by Create your own unique website with customizable templates.