Abstract
Prediction from expert advice is a fundamental problem in machine learning. A major pillar of the field is the existence of learning algorithms whose average loss approaches that of the best expert in hindsight (in other words, whose average regret approaches zero). Traditionally, the regret of online algorithms was bounded in terms of the number of prediction rounds.
Cesa-Bianchi, Mansour and Stoltz (Mach. Learn. 66(2–3):321–352, 2007) posed the question of whether it is possible to bound the regret of an online algorithm by the variation of the observed costs. In this paper we resolve this question and prove such bounds in the fully adversarial setting, in two important online learning scenarios: prediction from expert advice, and online linear optimization.
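For concreteness, below is a minimal sketch (not taken from the paper) of the classical multiplicative-weights (Hedge) algorithm of Littlestone and Warmuth (1994) and Freund and Schapire (1997) for prediction from expert advice, whose regret against the best expert is O(√(T log n)) when losses lie in [0, 1]. The results of this paper replace the dependence on the number of rounds T with the variation of the observed cost vectors; the fixed learning rate used here is the standard horizon-based tuning, not the variation-adaptive tuning developed in the paper.

```python
import numpy as np

def hedge(loss_matrix, eta=None):
    """Classical multiplicative-weights (Hedge) algorithm for prediction
    from expert advice. Losses are assumed to lie in [0, 1].

    loss_matrix: (T, n) array; loss_matrix[t, i] is expert i's loss in round t.
    Returns the algorithm's cumulative (expected) loss and the cumulative
    loss of the best expert in hindsight.
    """
    T, n = loss_matrix.shape
    if eta is None:
        # Standard tuning giving O(sqrt(T log n)) regret; the paper's
        # algorithms instead adapt to the variation of the cost sequence.
        eta = np.sqrt(np.log(n) / T)
    weights = np.ones(n)
    alg_loss = 0.0
    for t in range(T):
        probs = weights / weights.sum()           # play the normalized weights
        alg_loss += probs @ loss_matrix[t]        # expected loss this round
        weights *= np.exp(-eta * loss_matrix[t])  # multiplicative update
    best_expert_loss = loss_matrix.sum(axis=0).min()
    return alg_loss, best_expert_loss

# Illustrative usage on synthetic losses:
rng = np.random.default_rng(0)
losses = rng.random((1000, 10))  # 1000 rounds, 10 experts, losses in [0, 1]
alg, best = hedge(losses)
print(f"regret: {alg - best:.2f}")
```

The quantity `alg - best` is the regret; dividing by T gives the average regret, which vanishes as T grows. The paper's bounds improve on this whenever the cost vectors vary little across rounds.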
References
Allenberg-Neeman, C., & Neeman, B. (2004). Full information game with gains and losses. In 15th international conference on algorithmic learning theory.
Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (2003). The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1), 48–77.
Cesa-Bianchi, N., & Lugosi, G. (2006). Prediction, learning, and games. Cambridge: Cambridge University Press.
Cesa-Bianchi, N., Mansour, Y., & Stoltz, G. (2007). Improved second-order bounds for prediction with expert advice. Machine Learning, 66(2–3), 321–352.
Cover, T. (1991). Universal portfolios. Mathematical Finance, 1, 1–19.
Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119–139.
Hannan, J. (1957). Approximation to Bayes risk in repeated play. In M. Dresher, A. W. Tucker, & P. Wolfe (Eds.), Contributions to the theory of games (Vol. III, pp. 97–139). Princeton: Princeton University Press.
Hazan, E., & Kale, S. (2009a). On stochastic and worst-case models for investing. In Advances in neural information processing systems (NIPS) (Vol. 22).
Hazan, E., & Kale, S. (2009b). Better algorithms for benign bandits. In ACM-SIAM symposium on discrete algorithms (SODA09).
Helmbold, D. P., Kivinen, J., & Warmuth, M. K. (1999). Relative loss bounds for single neurons. IEEE Transactions on Neural Networks, 10(6), 1291–1304.
Herbster, M., & Warmuth, M. K. (2001). Tracking the best linear predictor. Journal of Machine Learning Research, 1, 281–309.
Kalai, A., & Vempala, S. (2005). Efficient algorithms for online decision problems. Journal of Computer and System Sciences, 71(3), 291–307.
Kivinen, J., & Warmuth, M. K. (1997). Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132(1), 1–63.
Littlestone, N., & Warmuth, M. K. (1994). The weighted majority algorithm. Information and Computation, 108(2), 212–261.
Vovk, V. (1998). A game of prediction with expert advice. Journal of Computer and System Sciences, 56(2), 153–173.
Zinkevich, M. (2003). Online convex programming and generalized infinitesimal gradient ascent. In 20th international conference on machine learning (ICML) (pp. 928–936).
Additional information
Editors: Sham Kakade and Ping Li.
Work done while S. Kale was at Microsoft Research.
Cite this article
Hazan, E., Kale, S. Extracting certainty from uncertainty: regret bounded by variation in costs. Mach Learn 80, 165–188 (2010). https://doi.org/10.1007/s10994-010-5175-x