research-article

Online Active Learning with Expert Advice

Authors:

Steven C. H. Hoi,

Chunyan MiaoAuthors Info & Claims

ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 12, Issue 5

Article No.: 58, Pages 1 - 22

https://doi.org/10.1145/3201604

Published: 27 June 2018 Publication History

Abstract

In literature, learning with expert advice methods usually assume that a learner always obtain the true label of every incoming training instance at the end of each trial. However, in many real-world applications, acquiring the true labels of all instances can be both costly and time consuming, especially for large-scale problems. For example, in the social media, data stream usually comes in a high speed and volume, and it is nearly impossible and highly costly to label all of the instances. In this article, we address this problem with active learning with expert advice, where the ground truth of an instance is disclosed only when it is requested by the proposed active query strategies. Our goal is to minimize the number of requests while training an online learning model without sacrificing the performance. To address this challenge, we propose a framework of active forecasters, which attempts to extend two fully supervised forecasters, Exponentially Weighted Average Forecaster and Greedy Forecaster, to tackle the task of online active learning (OAL) with expert advice. Specifically, we proposed two OAL with expert advice algorithms, named Active Exponentially Weighted Average Forecaster (AEWAF) and active greedy forecaster (AGF), by considering the difference of expert advices. To further improve the robustness of the proposed AEWAF and AGF algorithms in the noisy scenarios (where noisy experts exist), we also proposed two robust active learning with expert advice algorithms, named Robust Active Exponentially Weighted Average Forecaster and Robust Active Greedy Forecaster. We validate the efficacy of the proposed algorithms by an extensive set of experiments in both normal scenarios (where all of experts are comparably reliable) and noisy scenarios.

References

[1]

Jacob D. Abernethy, Peter L. Bartlett, and Alexander Rakhlin. 2007. Multitask learning with expert advice. In Proceedings of the 20th Annual Conference on Learning Theory (COLT’07). San Diego, CA. 484--498.

Digital Library

[2]

Martin Aleksandrov, Haris Aziz, Serge Gaspers, and Toby Walsh. 2015. Online fair division: Analysing a food bank problem. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 2540--2546.

Digital Library

[3]

Kareem Amin, Satyen Kale, Gerald Tesauro, and Deepak S. Turaga. 2015. Budgeted prediction with expert advice. In Proceedings of the 29th AAAI Conference on Artificial Intelligence. Austin, Texas. 2490--2496.

Digital Library

[4]

Les E. Atlas, David A. Cohn, and Richard E. Ladner. 1990. Training connectionist networks with queries and selective sampling. In Advances in Neural Information Processing Systems. 566--573.

Digital Library

[5]

H. D. Block. 1962. The Perceptron: A model for brain functioning. I. Reviews of Modern Physics 34, 1 (1962), 123--135.

[6]

Olivier Bousquet and Manfred K. Warmuth. 2002. Tracking a small set of experts by mixing past posteriors. Journal of Machine Learning Research 3 (2002), 363--396.

Digital Library

[7]

Nicolò Cesa-Bianchi, Alex Conconi, and Claudio Gentile. 2005. A second-order Perceptron algorithm. SIAM Journal on Computing 34, 3 (Jan. 2005), 640--668.

Digital Library

[8]

Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, and Manfred K. Warmuth. 1997. How to use expert advice. Journal of the Association for Computing Machinery 44, 3 (May 1997), 427--485.

Digital Library

[9]

Nicolò Cesa-Bianchi, Claudio Gentile, and Luca Zaniboni. 2006. Worst-case analysis of selective sampling for linear classification. The Journal of Machine Learning Research 7 (Dec. 2006), 1205--1230.

Digital Library

[10]

N. Cesa-Bianchi and G. Lugosi. 2006. Prediction, Learning, and Games. Cambridge University Press.

Digital Library

[11]

K. Crammer, O. Dekel, Joseph Keshet, and Shai Shalev-shwartz. 2006. Online Passive-Aggressive algorithms. The Journal of Machine Learning 7 (2006), 551--585.

Digital Library

[12]

Koby Crammer, Alex Kulesza, and Mark Dredze. 2009. Adaptive regularization of weight vectors. In Proceedings of the 22nd International Conference on Neural Information Processing Systems. 414--422.

Digital Library

[13]

Koby Crammer, Alex Kulesza, and Mark Dredze. 2013. Adaptive regularization of weight vectors. Machine Learning 91, 2 (March 2013), 155--187.

Digital Library

[14]

Mark Dredze, Koby Crammer, and Fernando Pereira. 2008. Confidence-weighted linear classification. In Proceedings of the 25th International Conference on Machine Learning. ACM, 264--271.

Digital Library

[15]

Dean P. Foster and Rakesh V. Vohra. 1993. A randomization rule for selecting forecasts. Operations Research 41, 4 (1993), 704--709.

Digital Library

[16]

Yoav Freund and Yishay Mansour. 1997. Learning under persistent drift. In Proceedings of the European Conference on Computational Learning Theory. 109--118.

Digital Library

[17]

Yoav Freund and Robert E. Schapire. 1997. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55, 1 (1997), 119--139.

Digital Library

[18]

Kaito Fujii and Hisashi Kashima. 2016. Budgeted stream-based active learning via adaptive submodular maximization. In Proceedings of the 30th International Conference on Neural Information Processing Systems. D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett (Eds.). 514--522.

Digital Library

[19]

Claudio Gentile. 2001. A new approximate maximal margin classification algorithm. Journal of Machine Learning Research 2 (2001), 213--242.

Digital Library

[20]

Xiaojie Guo. 2015. Online robust low rank matrix recovery. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 3540--3546.

Digital Library

[21]

Yuhong Guo and Dale Schuurmans. 2008. Discriminative batch mode active learning. In Proceedings of the International Conference on Neural Information Processing Systems. 593--600.

Digital Library

[22]

Steve Hanneke. 2012. Activized learning: Transforming passive to active with improved label complexity *. Journal of Machine Learning Research 13 (2012), 1469--1587.

Digital Library

[23]

Steve Hanneke. 2014. Theory of disagreement-based active learning. Foundations and Trends® in Machine Learning 7, 2--3 (2014), 131--309.

Digital Library

[24]

Steve Hanneke. 2016. The optimal sample complexity of PAC learning. Journal of Machine Learning Research 17, 38 (2016), 1--15.

Digital Library

[25]

Steve Hanneke and Liu Yang. 2015. Minimax analysis of active learning. Journal of Machine Learning Research 16 (2015), 3487--3602.

Digital Library

[26]

Shuji Hao, Jing Lu, Peilin Zhao, Chi Zhang, Steven C. H. Hoi, and Chunyan Miao. 2017a. Second-order online active learning and its applications. IEEE Transactions on Knowledge and Data Engineering 30 (2017), 1338--1351.

[27]

Shuji Hao, Peilin Zhao, Steven C. H. Hoi, and Chunyan Miao. 2015. Learning relative similarity from data streams: Active online learning approaches. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM’15). ACM, 1181--1190.

Digital Library

[28]

Shuji Hao, Peilin Zhao, Yong Liu, Steven C. H. Hoi, and Chunyan Miao. 2017b. Online multi-task relative similarity learning. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17).

Digital Library

[29]

Shuji Hao, Peilin Zhao, Jing Lu, Steven C. H. Hoi, Chunyan Miao, and Chi Zhang. 2016. SOAL: Second-order online active learning. In Proceedings of the 16th International Conference on Data Mining (ICDM’16). IEEE, 931--936.

[30]

David Haussler, Jyrki Kivinen, and Manfred K. Warmuth. 1995. Tight worst-case loss bounds for predicting with expert advice. In Proceedings of the European Conference on Computational Learning Theory. Springer, 69--83.

Digital Library

[31]

Keiichiro Hayakawa, Enrico H. Gerding, Sebastian Stein, and Takahiro Shiga. 2015. Online mechanisms for charging electric vehicles in settings with varying marginal electricity costs. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 2610--2616.

Digital Library

[32]

Mark Herbster and Manfred K. Warmuth. 1998. Tracking the best expert. Machine Learning 32, 2 (1998), 151--178.

Digital Library

[33]

Steven C. H. Hoi, Doyen Sahoo, Jing Lu, and Peilin Zhao. 2018. Online learning: A comprehensive survey. arXiv:1802.02871.

[34]

Steven C. H. Hoi, Jialei Wang, and Peilin Zhao. 2014. LIBOL: A library for online learning algorithms. Journal of Machine Learning Research 15, 1 (2014), 495--499.

Digital Library

[35]

Fatemeh Jahedpari. 2015. Artificial prediction markets for online prediction. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 4371--4372.

Digital Library

[36]

J. Zico Kolter and Marcus A. Maloof. 2007. Dynamic weighted majority: An ensemble method for drifting concepts. The Journal of Machine Learning Research 8 (2007), 2755--2790.

Digital Library

[37]

Guangxia Li, Steven C. H. Hoi, Kuiyu Chang, and Ramesh Jain. 2010. Micro-blogging sentiment detection by collaborative online learning. In Proceedings of the 10th International Conference on Data Mining (ICDM’10). IEEE, 893--898.

Digital Library

[38]

Yi Li and Philip M. Long. 1999. The relaxed online maximum margin algorithm. In Proceedings of the Neural Information Processing Systems (NIPS’99). 498--504.

Digital Library

[39]

Yi Li and Philip M. Long. 2002. The relaxed online maximum margin algorithm. Machine Learning 46, 1--3 (2002), 361--387.

Digital Library

[40]

Nick Littlestone and Manfred K. Warmuth. 1994. The weighted majority algorithm. Information and Computation 108, 2 (1994), 212--261.

Digital Library

[41]

Jing Lu, Zhao Peilin, and C. H. Steven Hoi. 2014. Online Passive-Aggressive active learning and its applications. In Proceedings of the Asian Conference on Machine Learning Research (ACML’14).

[42]

Jing Lu, Zhao Peilin, and C. H. Steven Hoi. 2016. Online Passive-Aggressive active learning and its applications. 103, 2 (2016), 141--183.

Digital Library

[43]

Mehryar Mohri and Afshin Rostamizadeh. 2013. Perceptron mistake bounds. arxiv:1305.0208v2.

[44]

Francesco Orabona and Koby Crammer. 2010. New adaptive algorithms for online classification. In Proceedings of the 23rd International Conference on Neural Information Processing Systems. 1840--1848.

Digital Library

[45]

F Rosenblatt. 1958. The Perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65, 6 (Nov. 1958), 386--408.

[46]

Paul Ruvolo and Eric Eaton. 2014. Online multi-task learning via sparse dictionary optimization. In Proceedings of the 28th Conference on Artificial Intelligence. AAAI Press, Québec City, Québec, Canada, 2062--2068.

Digital Library

[47]

Burr Settles. 2010. Active learning. Synthesis Lectures on Artificial Intelligence and Machine Learning 6, 1 (Jun. 2010), 1--114.

Digital Library

[48]

Victor S. Sheng, Foster Provost, and Panagiotis G. Ipeirotis. 2008. Get another label? Improving data quality and data mining using multiple, noisy labelers. In Proceedings of the 14th SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 614--622.

Digital Library

[49]

S. Tong. 2001. Active Learning: Theory and Applications. Ph.D. Dissertation. Stanford University.

Digital Library

[50]

Simon Tong and Daphne Koller. 2002. Support vector machine active learning with applications to text classification. The Journal of Machine Learning Research 2, 3/1 (2002), 45--66.

Digital Library

[51]

Joel Veness, Marcus Hutter, Laurent Orseau, and Marc G. Bellemare. 2015. Online learning of k-CNF Boolean functions. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15), AAAI Press, Buenos Aires, Argentina, 3865--3873.

Digital Library

[52]

Volodimir G. Vovk. 1990. Aggregating strategies. In Proceedings of the 3rd Annual Workshop on Computational Learning Theory (COLT’90). 371--386.

Digital Library

[53]

Jialei Wang, Steven CH Hoi, Peilin Zhao, and Zhi-Yong Liu. 2013. Online multi-task collaborative filtering for on-the-fly recommender systems. In Proceedings of the 7th Conference on Recommender Systems. ACM, 237--244.

Digital Library

[54]

Jialei Wang, Peilin Zhao, and Steven C. Hoi. 2012. Exact soft confidence-weighted learning. In Proceedings of the 29th International Conference on Machine Learning (ICML’12). J. Langford and J. Pineau (Eds.), ACM, New York, NY, 121--128.

Digital Library

[55]

Jialei Wang, Peilin Zhao, and Steven CH Hoi. 2014. Cost-sensitive online classification. IEEE Transactions on Knowledge and Data Engineering 26, 10 (2014), 2425--2438.

[56]

Chi Zhang, Peilin Zhao, Shuji Hao, and Steven Hoi. 2018. Distributed multi-task classification: A decentralized online learning approach. Machine Learning 107, 4 (April 2018), 727--747.

Digital Library

[57]

Chi Zhang, Peilin Zhao, Shuji Hao, Yeng Chai Soh, and Bu Sung Lee. 2016. ROM: A robust online multi-task learning approach. In Proceedings of the16th International Conference on Data Mining (ICDM’16). IEEE, 1341--1346.

[58]

Peilin Zhao and Steven C. H. Hoi. 2013. Cost-sensitive online active learning with application to malicious URL detection. In Proceedings of the 19th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’13). ACM, New York, NY, 919.

Digital Library

[59]

Peilin Zhao, Steven C. H. Hoi, and Rong Jin. 2011a. Double updating online learning. Journal of Machine Learning Research 12 (2011), 1587--1615.

Digital Library

[60]

Peilin Zhao, Rong Jin, Tianbao Yang, and Steven C. Hoi. 2011b. Online AUC maximization. In Proceedings of the 28th International Conference on Machine Learning (ICML’11). 233--240.

Digital Library

Cited By

Cacciarelli DKulahci M(2024)Active learning for data streams: a surveyMachine Language10.1007/s10994-023-06454-2113:1(185-239)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1007/s10994-023-06454-2
Shui CWang WHedhli IWong CWan FWang BGagné C(2023)Lifelong Online Learning from Accumulated KnowledgeACM Transactions on Knowledge Discovery from Data10.1145/356394717:4(1-23)Online publication date: 24-Feb-2023
https://dl.acm.org/doi/10.1145/3563947
Gu SQian YHou C(2022)Incremental Feature Spaces Learning with Label ScarcityACM Transactions on Knowledge Discovery from Data10.1145/351636816:6(1-26)Online publication date: 8-Sep-2022
https://dl.acm.org/doi/10.1145/3516368
Show More Cited By

Index Terms

Online Active Learning with Expert Advice
1. Theory of computation
  1. Models of computation
    1. Streaming models

Recommendations

Active learning with expert advice
UAI'13: Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence

Conventional learning with expert advice methods assume a learner is always receiving the outcome (e.g., class labels) of every incoming training instance at the end of each trial. In real applications, acquiring the outcome from oracle can be costly or ...
Knowledge transfer for multi-labeler active learning
ECMLPKDD'13: Proceedings of the 2013th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I

In this paper, we address multi-labeler active learning, where data labels can be acquired from multiple labelers with various levels of expertise. Because obtaining labels for data instances can be very costly and time-consuming, it is highly desirable ...
Online Portfolio Selection Strategy Based on Combining Experts' Advice

The weak aggregating algorithm (WAA) developed from learning and prediction with expert advice makes decisions by considering all the experts' advice, and each expert's weight is updated according to his performance in previous periods. In this paper, ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data

ACM Transactions on Knowledge Discovery from Data Volume 12, Issue 5

October 2018

354 pages

ISSN:1556-4681

EISSN:1556-472X

DOI:10.1145/3234931

Editors:
Charu Aggarwal
IBM T. J. Watson Research, USA
,
Xindong Wu
University of Louisiana at Lafayette, USA

Issue’s Table of Contents

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 June 2018

Accepted: 01 March 2018

Revised: 01 March 2018

Received: 01 February 2017

Published in TKDD Volume 12, Issue 5

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

National Research Foundation, Prime Minister's Office, Singapore under its International Research Centres in Singapore Funding Initiative

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

14
Total Citations
View Citations
472
Total Downloads

Downloads (Last 12 months)44
Downloads (Last 6 weeks)6

Reflects downloads up to 12 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Cacciarelli DKulahci M(2024)Active learning for data streams: a surveyMachine Language10.1007/s10994-023-06454-2113:1(185-239)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1007/s10994-023-06454-2
Shui CWang WHedhli IWong CWan FWang BGagné C(2023)Lifelong Online Learning from Accumulated KnowledgeACM Transactions on Knowledge Discovery from Data10.1145/356394717:4(1-23)Online publication date: 24-Feb-2023
https://dl.acm.org/doi/10.1145/3563947
Gu SQian YHou C(2022)Incremental Feature Spaces Learning with Label ScarcityACM Transactions on Knowledge Discovery from Data10.1145/351636816:6(1-26)Online publication date: 8-Sep-2022
https://dl.acm.org/doi/10.1145/3516368
Hong SChae J(2022)Active Learning With Multiple KernelsIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2020.304795333:7(2980-2994)Online publication date: Jul-2022
https://doi.org/10.1109/TNNLS.2020.3047953
Agarwal DCovarrubias-Zambrano OBossmann SNatarajan B(2022)Impacts of Behavioral Biases on Active Learning Strategies2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)10.1109/ICAIIC54071.2022.9722689(256-261)Online publication date: 21-Feb-2022
https://doi.org/10.1109/ICAIIC54071.2022.9722689
Singh CSharma A(2022)A review of online supervised learningEvolving Systems10.1007/s12530-022-09448-y14:2(343-364)Online publication date: 8-Jul-2022
https://doi.org/10.1007/s12530-022-09448-y
Gao LKonomi S(2022)A Cost-Effective and Quality-Ensured Framework for Crowdsourced Indoor LocalizationHuman-Automation Interaction10.1007/978-3-031-10784-9_27(451-467)Online publication date: 1-Nov-2022
https://doi.org/10.1007/978-3-031-10784-9_27
Del Pino Espinoza ARuiz Guzmán NVeloz de la Torre FLloret Romero MGarcía León APérez AArredondo S(2022)Covid-19: Study of Online Teaching, Availability and Use of Technological ResourcesRe-imagining Educational Futures in Developing Countries10.1007/978-3-030-88234-1_11(201-219)Online publication date: 1-Jan-2022
https://doi.org/10.1007/978-3-030-88234-1_11
Hoi SSahoo DLu JZhao P(2021)Online learning: A comprehensive surveyNeurocomputing10.1016/j.neucom.2021.04.112459(249-289)Online publication date: Oct-2021
https://doi.org/10.1016/j.neucom.2021.04.112
Le Thi HHo V(2021)DCA for online prediction with expert adviceNeural Computing and Applications10.1007/s00521-021-05709-033:15(9521-9544)Online publication date: 1-Aug-2021
https://dl.acm.org/doi/10.1007/s00521-021-05709-0
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents