Reinforcement learning is supervised learning on optimized data

By The Berkeley Artificial Intelligence Research Blog - 2020-10-13

Description

The BAIR Blog

Summary

  • The two most common perspectives on Reinforcement learning (RL) are optimization and dynamic programming.
  • The penultimate section will discuss how goal relabeling, a modified problem definition, and inverse RL extract “good data” in the multi-task setting.
  • For example, reward-weighted regression [Williams 2007] and advantage-weighted regression [Neumann 2009, Peng 2019] combine the two steps by doing behavior cloning on reward-weighted data.
  • More generally, this result is exciting Future Directions In this article, we discussed how RL can be viewed as solving a sequence of standard supervised learning problems but using optimized (relabled) data.

 

Topics

  1. Machine_Learning (0.43)
  2. Backend (0.35)
  3. Database (0.13)

Similar Articles

Introduction to Active Learning

By KDnuggets - 2020-12-15

An extensive overview of Active Learning, with an explanation into how it works and can assist with data labeling, as well as its performance and potential limitations.

Data Science Learning Roadmap for 2021

By freeCodeCamp.org - 2021-01-12

Although nothing really changes but the date, a new year fills everyone with the hope of starting things afresh. If you add in a bit of planning, some well-envisioned goals, and a learning roadmap, yo ...

What is semi-supervised machine learning?

By TechTalks - 2021-01-04

Semi-supervised learning helps you solve classification problems when you don't have labeled data to train your machine learning model.

30 Most Asked Machine Learning Questions Answered

By Medium - 2021-03-18

Machine Learning is the path to a better and advanced future. A Machine Learning Developer is the most demanding job in 2021 and it is going to increase by 20–30% in the upcoming 3–5 years. Machine…