
quantile_regression

January 28, 2019

1 Quantile Regression
scikit-learn does not implement quantile regression. mlinsights provides a version of it, QuantileLinearRegression.
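For a quantile q, the regression minimizes the pinball loss instead of the squared error: under-predictions are weighted by q and over-predictions by 1 - q, so q = 0.5 reduces to the mean absolute error (L1). As a minimal sketch (the function pinball_loss is ours for illustration, not part of mlinsights):

    import numpy

    def pinball_loss(y_true, y_pred, quantile=0.5):
        # Positive residuals (under-predictions) cost quantile * r,
        # negative residuals cost (1 - quantile) * |r|.
        residual = y_true - y_pred
        return numpy.mean(numpy.maximum(quantile * residual,
                                        (quantile - 1) * residual))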

In [1]: from jyquickhelper import add_notebook_menu
        add_notebook_menu()

Out[1]: <IPython.core.display.HTML object>

In [2]: %matplotlib inline

1.1 Simple example


We generate some dummy data.

In [3]: import numpy

        X = numpy.random.random(1000)
        # 900 observations with small centered noise, 100 with large
        # positive noise acting as outliers
        eps1 = (numpy.random.random(900) - 0.5) * 0.1
        eps2 = (numpy.random.random(100)) * 10
        eps = numpy.hstack([eps1, eps2])
        X = X.reshape((1000, 1))
        Y = X.ravel() * 3.4 + 5.6 + eps

In [4]: from sklearn.linear_model import LinearRegression

        clr = LinearRegression()
        clr.fit(X, Y)

Out[4]: LinearRegression(copy_X=True, fit_intercept=True, n_jobs=1, normalize=False)

In [5]: from mlinsights.mlmodel import QuantileLinearRegression

        clq = QuantileLinearRegression()
        clq.fit(X, Y)

Out[5]: QuantileLinearRegression(copy_X=True, delta=0.0001, fit_intercept=True,
                 max_iter=10, n_jobs=1, normalize=False, quantile=0.5,
                 verbose=False)

In [6]: from pandas import DataFrame

        data = dict(X=X.ravel(), Y=Y, clr=clr.predict(X), clq=clq.predict(X))
        df = DataFrame(data)
        df.head()

Out[6]:           X         Y       clr       clq
        0  0.710310  8.031079  8.515732  8.024375
        1  0.246556  6.409345  6.936975  6.448834
        2  0.851280  8.475841  8.995636  8.503300
        3  0.140727  6.058996  6.576702  6.089295
        4  0.731571  8.070341  8.588110  8.096605

In [7]: import matplotlib.pyplot as plt

        fig, ax = plt.subplots(1, 1, figsize=(10, 4))
        # plot a random subsample of the data (X.shape[0], not X.shape[0] - 1,
        # so that the last observation can be drawn as well)
        choice = numpy.random.choice(X.shape[0], size=100)
        xx = X.ravel()[choice]
        yy = Y[choice]
        ax.plot(xx, yy, '.', label="data")
        xx = numpy.array([[0], [1]])
        y1 = clr.predict(xx)
        y2 = clq.predict(xx)
        ax.plot(xx, y1, "--", label="L2")
        ax.plot(xx, y2, "--", label="L1")
        ax.set_title("Quantile (L1) vs Square (L2)");
        ax.legend();

The L1 fit is clearly less sensitive to outliers. The optimization algorithm is based on Iteratively Reweighted Least Squares (IRLS): it first fits a linear regression with the L2 error, then reweights each observation by the inverse of its L1 error and fits again, repeating until convergence.
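The scheme fits in a few lines of numpy. The following is only an illustrative sketch of IRLS for the median (quantile=0.5), not the mlinsights implementation; the names irls_median_fit, max_iter and delta are ours, although delta plays the same role as the parameter of the same name shown above:

    import numpy

    def irls_median_fit(X, y, max_iter=20, delta=1e-4):
        # X: (n, d) features, y: (n,) targets; returns (d + 1,)
        # coefficients with the intercept last.
        Xb = numpy.hstack([X, numpy.ones((X.shape[0], 1))])
        w = numpy.ones(X.shape[0])
        beta = numpy.zeros(Xb.shape[1])
        for _ in range(max_iter):
            # weighted least squares: solve Xb' W Xb beta = Xb' W y
            WX = Xb * w[:, None]
            beta = numpy.linalg.solve(Xb.T @ WX, WX.T @ y)
            # reweight by the inverse absolute residual, floored at delta,
            # so that large errors contribute less at the next iteration
            residual = y - Xb @ beta
            w = 1.0 / numpy.maximum(numpy.abs(residual), delta)
        return beta

On the data above, irls_median_fit(X, Y) should recover a slope close to 3.4 and an intercept close to 5.6, while the plain L2 fit is pulled upwards by the outliers.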

In [8]: clq = QuantileLinearRegression(verbose=True, max_iter=20)
        clq.fit(X, Y)
[QuantileLinearRegression.fit] iter=1 error=901.3803392180542
[QuantileLinearRegression.fit] iter=2 error=562.663383515471
[QuantileLinearRegression.fit] iter=3 error=522.8970177647805
[QuantileLinearRegression.fit] iter=4 error=522.3766707482777
[QuantileLinearRegression.fit] iter=5 error=522.0288331540892
[QuantileLinearRegression.fit] iter=6 error=521.6797263072117
[QuantileLinearRegression.fit] iter=7 error=521.4702236617843
[QuantileLinearRegression.fit] iter=8 error=521.3419287524464
[QuantileLinearRegression.fit] iter=9 error=521.206723757895
[QuantileLinearRegression.fit] iter=10 error=521.1212078810222
[QuantileLinearRegression.fit] iter=11 error=521.0410686984816
[QuantileLinearRegression.fit] iter=12 error=520.9841924800792
[QuantileLinearRegression.fit] iter=13 error=520.9349774362781
[QuantileLinearRegression.fit] iter=14 error=520.907415015473
[QuantileLinearRegression.fit] iter=15 error=520.8939558844767
[QuantileLinearRegression.fit] iter=16 error=520.8845502333198
[QuantileLinearRegression.fit] iter=17 error=520.8791552281199
[QuantileLinearRegression.fit] iter=18 error=520.874494484882
[QuantileLinearRegression.fit] iter=19 error=520.8709629795006
[QuantileLinearRegression.fit] iter=20 error=520.8680582590082

Out[8]: QuantileLinearRegression(copy_X=True, delta=0.0001, fit_intercept=True,
                 max_iter=20, n_jobs=1, normalize=False, quantile=0.5,
                 verbose=True)

In [9]: clq.score(X, Y)

Out[9]: 0.5208680582590082
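Note that this score is not scikit-learn's usual R²: it matches the final training error above averaged over the observations, which suggests it reports the mean L1 error (the mean pinball loss for quantile=0.5).

    # final IRLS error divided by the number of observations
    520.8680582590082 / 1000  # == 0.5208680582590082, the value score returns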

1.2 Regression with various quantiles


In [10]: import numpy

         X = numpy.random.random(1200)
         eps1 = (numpy.random.random(900) - 0.5) * 0.5
         eps2 = (numpy.random.random(300)) * 2
         eps = numpy.hstack([eps1, eps2])
         X = X.reshape((1200, 1))
         # the quadratic term makes the dataset only almost linear
         Y = X.ravel() * 3.4 + 5.6 + eps + X.ravel() * X.ravel() * 8

In [11]: fig, ax = plt.subplots(1, 1, figsize=(10, 4))

         choice = numpy.random.choice(X.shape[0], size=100)
         xx = X.ravel()[choice]
         yy = Y[choice]
         ax.plot(xx, yy, '.', label="data")
         ax.set_title("Almost linear dataset");

In [12]: clqs = {}
         for qu in [0.1, 0.25, 0.5, 0.75, 0.9]:
             clq = QuantileLinearRegression(quantile=qu)
             clq.fit(X, Y)
             clqs['q=%1.2f' % qu] = clq
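To compare the fitted lines numerically, one can read the usual linear-model attributes; this assumes QuantileLinearRegression exposes coef_ and intercept_ the way scikit-learn's LinearRegression does:

    for name in sorted(clqs):
        model = clqs[name]
        # one slope per feature (here a single one) and one intercept
        print(name, "slope=%.2f" % model.coef_[0],
              "intercept=%.2f" % model.intercept_)

Higher quantiles should yield higher lines, since they track the upper part of the positively skewed noise.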

In [13]: import matplotlib.pyplot as plt

         fig, ax = plt.subplots(1, 1, figsize=(10, 4))
         choice = numpy.random.choice(X.shape[0], size=100)
         xx = X.ravel()[choice]
         yy = Y[choice]
         ax.plot(xx, yy, '.', label="data")
         xx = numpy.array([[0], [1]])
         for qu in sorted(clqs):
             y = clqs[qu].predict(xx)
             ax.plot(xx, y, "--", label=qu)
         ax.set_title("Various quantiles");
         ax.legend();
