Robust Regression for Machine Learning in Python

By Machine Learning Mastery - 2020-10-04

Description

Regression is a modeling task that involves predicting a numerical value given an input. Algorithms used for regression tasks are also referred to as “regression” algorithms, with the most widely know ...

Summary

  • Regression is a modeling task that involves predicting a numerical value given an input.
  • Outliers in a dataset can skew summary statistics calculated for the variable, such as the mean and standard deviation, which in turn can skew the model towards the outlier values, away from the central mass of observations.
  • We can see that Theil Sen regression achieves a MAE of about 4.371 on this dataset, outperforming the linear regression model as well as RANSAC and Huber regression.
  • In this case, we can see that the line of best fit is aligned with the main body of the data.

 

Topics

  1. Machine_Learning (0.27)
  2. Backend (0.07)
  3. NLP (0.05)

Similar Articles

Random Forest for Time Series Forecasting

By Machine Learning Mastery - 2020-11-01

Random Forest is a popular and effective ensemble machine learning algorithm. It is widely used for classification and regression predictive modeling problems with structured (tabular) data sets, e.g. ...

Regression for Imbalanced Data with Application

By Medium - 2020-07-17

Imbalanced data are the situation where the less represented observations of the data are of the main interest. In some contexts, they are expressed as “outliers” which is rather more dangerous. As a…