Gradient Boosting on Stochastic Data Streams

Hu, Hanzhang; Sun, Wen; Venkatraman, Arun; Hebert, Martial; Bagnell, J. Andrew

Computer Science > Machine Learning

arXiv:1703.00377 (cs)

[Submitted on 1 Mar 2017]


Abstract: Boosting is a popular ensemble algorithm that generates more powerful learners by linearly combining base models from a simpler hypothesis class. In this work, we investigate the problem of adapting batch gradient boosting for minimizing convex loss functions to the online setting, where the loss at each iteration is sampled i.i.d. from an unknown distribution. To generalize from batch to online, we first introduce the definition of an online weak learning edge. With it, for strongly convex and smooth loss functions, we present an algorithm, Streaming Gradient Boosting (SGB), with exponential shrinkage guarantees in the number of weak learners. We further present an adaptation of SGB to optimize non-smooth loss functions, for which we derive an O(ln N/N) convergence rate. We also show that our analysis can extend to the adversarial online learning setting under the stronger assumption that the online weak learning edge holds in the adversarial setting. Finally, we demonstrate experimental results showing that in practice our algorithms can achieve results competitive with classic gradient boosting while using less computation.
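
To make the streaming setting concrete, here is a minimal toy sketch of gradient boosting over a data stream. It is not the paper's SGB algorithm, whose details the abstract does not give. It assumes a squared loss, hypothetical linear weak learners updated by online gradient descent, and a fixed shrinkage factor; the names `OnlineLinearLearner`, `streaming_boost`, and `make_stream` are invented for illustration. Each weak learner sees every sample once and is trained on the negative gradient of the loss at the partial ensemble prediction.

```python
import random


class OnlineLinearLearner:
    """Hypothetical weak learner: a linear model trained by online gradient descent."""

    def __init__(self, dim, lr=0.05):
        self.w = [0.0] * dim
        self.lr = lr

    def predict(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x))

    def update(self, x, target):
        # One gradient step on the squared error 0.5 * (predict(x) - target)^2.
        err = self.predict(x) - target
        self.w = [wi - self.lr * err * xi for wi, xi in zip(self.w, x)]


def streaming_boost(stream, dim, n_learners=5, shrinkage=0.5):
    """Process each (x, y) once; return the learners and the per-sample ensemble loss."""
    learners = [OnlineLinearLearner(dim) for _ in range(n_learners)]
    losses = []
    for x, y in stream:
        partial = 0.0  # running ensemble prediction for this sample
        for h in learners:
            # Negative gradient of 0.5 * (partial - y)^2 with respect to partial.
            residual = y - partial
            step = h.predict(x)    # predict with the current weights first
            h.update(x, residual)  # then train this learner on the residual
            partial += shrinkage * step
        losses.append(0.5 * (partial - y) ** 2)
    return learners, losses


def make_stream(n, rng, w_true=(1.0, -2.0, 0.5), noise=0.01):
    """Assumed synthetic i.i.d. stream: linear target plus small Gaussian noise."""
    for _ in range(n):
        x = [rng.uniform(-1.0, 1.0) for _ in w_true]
        y = sum(wi * xi for wi, xi in zip(w_true, x)) + rng.gauss(0.0, noise)
        yield x, y


if __name__ == "__main__":
    _, losses = streaming_boost(make_stream(3000, random.Random(0)), dim=3)
    print("mean loss, first 20 samples:", sum(losses[:20]) / 20)
    print("mean loss, last 200 samples:", sum(losses[-200:]) / 200)
```

In this toy setup each additional learner, once converged, removes a fixed fraction of the remaining residual, which loosely mirrors the idea of error shrinking with the number of weak learners. The guarantees stated in the abstract rest on the paper's own assumptions and analysis, not on this sketch.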

Comments:	To appear in AISTATS 2017
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1703.00377 [cs.LG]
	(or arXiv:1703.00377v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1703.00377

Submission history

From: Hanzhang Hu
[v1] Wed, 1 Mar 2017 16:46:54 UTC (331 KB)
