Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Online Active Learning with Expert Advice

Published: 27 June 2018 Publication History

Abstract

In literature, learning with expert advice methods usually assume that a learner always obtain the true label of every incoming training instance at the end of each trial. However, in many real-world applications, acquiring the true labels of all instances can be both costly and time consuming, especially for large-scale problems. For example, in the social media, data stream usually comes in a high speed and volume, and it is nearly impossible and highly costly to label all of the instances. In this article, we address this problem with active learning with expert advice, where the ground truth of an instance is disclosed only when it is requested by the proposed active query strategies. Our goal is to minimize the number of requests while training an online learning model without sacrificing the performance. To address this challenge, we propose a framework of active forecasters, which attempts to extend two fully supervised forecasters, Exponentially Weighted Average Forecaster and Greedy Forecaster, to tackle the task of online active learning (OAL) with expert advice. Specifically, we proposed two OAL with expert advice algorithms, named Active Exponentially Weighted Average Forecaster (AEWAF) and active greedy forecaster (AGF), by considering the difference of expert advices. To further improve the robustness of the proposed AEWAF and AGF algorithms in the noisy scenarios (where noisy experts exist), we also proposed two robust active learning with expert advice algorithms, named Robust Active Exponentially Weighted Average Forecaster and Robust Active Greedy Forecaster. We validate the efficacy of the proposed algorithms by an extensive set of experiments in both normal scenarios (where all of experts are comparably reliable) and noisy scenarios.

References

[1]
Jacob D. Abernethy, Peter L. Bartlett, and Alexander Rakhlin. 2007. Multitask learning with expert advice. In Proceedings of the 20th Annual Conference on Learning Theory (COLT’07). San Diego, CA. 484--498.
[2]
Martin Aleksandrov, Haris Aziz, Serge Gaspers, and Toby Walsh. 2015. Online fair division: Analysing a food bank problem. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 2540--2546.
[3]
Kareem Amin, Satyen Kale, Gerald Tesauro, and Deepak S. Turaga. 2015. Budgeted prediction with expert advice. In Proceedings of the 29th AAAI Conference on Artificial Intelligence. Austin, Texas. 2490--2496.
[4]
Les E. Atlas, David A. Cohn, and Richard E. Ladner. 1990. Training connectionist networks with queries and selective sampling. In Advances in Neural Information Processing Systems. 566--573.
[5]
H. D. Block. 1962. The Perceptron: A model for brain functioning. I. Reviews of Modern Physics 34, 1 (1962), 123--135.
[6]
Olivier Bousquet and Manfred K. Warmuth. 2002. Tracking a small set of experts by mixing past posteriors. Journal of Machine Learning Research 3 (2002), 363--396.
[7]
Nicolò Cesa-Bianchi, Alex Conconi, and Claudio Gentile. 2005. A second-order Perceptron algorithm. SIAM Journal on Computing 34, 3 (Jan. 2005), 640--668.
[8]
Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, and Manfred K. Warmuth. 1997. How to use expert advice. Journal of the Association for Computing Machinery 44, 3 (May 1997), 427--485.
[9]
Nicolò Cesa-Bianchi, Claudio Gentile, and Luca Zaniboni. 2006. Worst-case analysis of selective sampling for linear classification. The Journal of Machine Learning Research 7 (Dec. 2006), 1205--1230.
[10]
N. Cesa-Bianchi and G. Lugosi. 2006. Prediction, Learning, and Games. Cambridge University Press.
[11]
K. Crammer, O. Dekel, Joseph Keshet, and Shai Shalev-shwartz. 2006. Online Passive-Aggressive algorithms. The Journal of Machine Learning 7 (2006), 551--585.
[12]
Koby Crammer, Alex Kulesza, and Mark Dredze. 2009. Adaptive regularization of weight vectors. In Proceedings of the 22nd International Conference on Neural Information Processing Systems. 414--422.
[13]
Koby Crammer, Alex Kulesza, and Mark Dredze. 2013. Adaptive regularization of weight vectors. Machine Learning 91, 2 (March 2013), 155--187.
[14]
Mark Dredze, Koby Crammer, and Fernando Pereira. 2008. Confidence-weighted linear classification. In Proceedings of the 25th International Conference on Machine Learning. ACM, 264--271.
[15]
Dean P. Foster and Rakesh V. Vohra. 1993. A randomization rule for selecting forecasts. Operations Research 41, 4 (1993), 704--709.
[16]
Yoav Freund and Yishay Mansour. 1997. Learning under persistent drift. In Proceedings of the European Conference on Computational Learning Theory. 109--118.
[17]
Yoav Freund and Robert E. Schapire. 1997. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55, 1 (1997), 119--139.
[18]
Kaito Fujii and Hisashi Kashima. 2016. Budgeted stream-based active learning via adaptive submodular maximization. In Proceedings of the 30th International Conference on Neural Information Processing Systems. D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett (Eds.). 514--522.
[19]
Claudio Gentile. 2001. A new approximate maximal margin classification algorithm. Journal of Machine Learning Research 2 (2001), 213--242.
[20]
Xiaojie Guo. 2015. Online robust low rank matrix recovery. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 3540--3546.
[21]
Yuhong Guo and Dale Schuurmans. 2008. Discriminative batch mode active learning. In Proceedings of the International Conference on Neural Information Processing Systems. 593--600.
[22]
Steve Hanneke. 2012. Activized learning: Transforming passive to active with improved label complexity *. Journal of Machine Learning Research 13 (2012), 1469--1587.
[23]
Steve Hanneke. 2014. Theory of disagreement-based active learning. Foundations and Trends® in Machine Learning 7, 2--3 (2014), 131--309.
[24]
Steve Hanneke. 2016. The optimal sample complexity of PAC learning. Journal of Machine Learning Research 17, 38 (2016), 1--15.
[25]
Steve Hanneke and Liu Yang. 2015. Minimax analysis of active learning. Journal of Machine Learning Research 16 (2015), 3487--3602.
[26]
Shuji Hao, Jing Lu, Peilin Zhao, Chi Zhang, Steven C. H. Hoi, and Chunyan Miao. 2017a. Second-order online active learning and its applications. IEEE Transactions on Knowledge and Data Engineering 30 (2017), 1338--1351.
[27]
Shuji Hao, Peilin Zhao, Steven C. H. Hoi, and Chunyan Miao. 2015. Learning relative similarity from data streams: Active online learning approaches. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management (CIKM’15). ACM, 1181--1190.
[28]
Shuji Hao, Peilin Zhao, Yong Liu, Steven C. H. Hoi, and Chunyan Miao. 2017b. Online multi-task relative similarity learning. In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI’17).
[29]
Shuji Hao, Peilin Zhao, Jing Lu, Steven C. H. Hoi, Chunyan Miao, and Chi Zhang. 2016. SOAL: Second-order online active learning. In Proceedings of the 16th International Conference on Data Mining (ICDM’16). IEEE, 931--936.
[30]
David Haussler, Jyrki Kivinen, and Manfred K. Warmuth. 1995. Tight worst-case loss bounds for predicting with expert advice. In Proceedings of the European Conference on Computational Learning Theory. Springer, 69--83.
[31]
Keiichiro Hayakawa, Enrico H. Gerding, Sebastian Stein, and Takahiro Shiga. 2015. Online mechanisms for charging electric vehicles in settings with varying marginal electricity costs. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 2610--2616.
[32]
Mark Herbster and Manfred K. Warmuth. 1998. Tracking the best expert. Machine Learning 32, 2 (1998), 151--178.
[33]
Steven C. H. Hoi, Doyen Sahoo, Jing Lu, and Peilin Zhao. 2018. Online learning: A comprehensive survey. arXiv:1802.02871.
[34]
Steven C. H. Hoi, Jialei Wang, and Peilin Zhao. 2014. LIBOL: A library for online learning algorithms. Journal of Machine Learning Research 15, 1 (2014), 495--499.
[35]
Fatemeh Jahedpari. 2015. Artificial prediction markets for online prediction. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15). AAAI Press, Buenos Aires, Argentina, 4371--4372.
[36]
J. Zico Kolter and Marcus A. Maloof. 2007. Dynamic weighted majority: An ensemble method for drifting concepts. The Journal of Machine Learning Research 8 (2007), 2755--2790.
[37]
Guangxia Li, Steven C. H. Hoi, Kuiyu Chang, and Ramesh Jain. 2010. Micro-blogging sentiment detection by collaborative online learning. In Proceedings of the 10th International Conference on Data Mining (ICDM’10). IEEE, 893--898.
[38]
Yi Li and Philip M. Long. 1999. The relaxed online maximum margin algorithm. In Proceedings of the Neural Information Processing Systems (NIPS’99). 498--504.
[39]
Yi Li and Philip M. Long. 2002. The relaxed online maximum margin algorithm. Machine Learning 46, 1--3 (2002), 361--387.
[40]
Nick Littlestone and Manfred K. Warmuth. 1994. The weighted majority algorithm. Information and Computation 108, 2 (1994), 212--261.
[41]
Jing Lu, Zhao Peilin, and C. H. Steven Hoi. 2014. Online Passive-Aggressive active learning and its applications. In Proceedings of the Asian Conference on Machine Learning Research (ACML’14).
[42]
Jing Lu, Zhao Peilin, and C. H. Steven Hoi. 2016. Online Passive-Aggressive active learning and its applications. 103, 2 (2016), 141--183.
[43]
Mehryar Mohri and Afshin Rostamizadeh. 2013. Perceptron mistake bounds. arxiv:1305.0208v2.
[44]
Francesco Orabona and Koby Crammer. 2010. New adaptive algorithms for online classification. In Proceedings of the 23rd International Conference on Neural Information Processing Systems. 1840--1848.
[45]
F Rosenblatt. 1958. The Perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65, 6 (Nov. 1958), 386--408.
[46]
Paul Ruvolo and Eric Eaton. 2014. Online multi-task learning via sparse dictionary optimization. In Proceedings of the 28th Conference on Artificial Intelligence. AAAI Press, Québec City, Québec, Canada, 2062--2068.
[47]
Burr Settles. 2010. Active learning. Synthesis Lectures on Artificial Intelligence and Machine Learning 6, 1 (Jun. 2010), 1--114.
[48]
Victor S. Sheng, Foster Provost, and Panagiotis G. Ipeirotis. 2008. Get another label? Improving data quality and data mining using multiple, noisy labelers. In Proceedings of the 14th SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 614--622.
[49]
S. Tong. 2001. Active Learning: Theory and Applications. Ph.D. Dissertation. Stanford University.
[50]
Simon Tong and Daphne Koller. 2002. Support vector machine active learning with applications to text classification. The Journal of Machine Learning Research 2, 3/1 (2002), 45--66.
[51]
Joel Veness, Marcus Hutter, Laurent Orseau, and Marc G. Bellemare. 2015. Online learning of k-CNF Boolean functions. In Proceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI’15), AAAI Press, Buenos Aires, Argentina, 3865--3873.
[52]
Volodimir G. Vovk. 1990. Aggregating strategies. In Proceedings of the 3rd Annual Workshop on Computational Learning Theory (COLT’90). 371--386.
[53]
Jialei Wang, Steven CH Hoi, Peilin Zhao, and Zhi-Yong Liu. 2013. Online multi-task collaborative filtering for on-the-fly recommender systems. In Proceedings of the 7th Conference on Recommender Systems. ACM, 237--244.
[54]
Jialei Wang, Peilin Zhao, and Steven C. Hoi. 2012. Exact soft confidence-weighted learning. In Proceedings of the 29th International Conference on Machine Learning (ICML’12). J. Langford and J. Pineau (Eds.), ACM, New York, NY, 121--128.
[55]
Jialei Wang, Peilin Zhao, and Steven CH Hoi. 2014. Cost-sensitive online classification. IEEE Transactions on Knowledge and Data Engineering 26, 10 (2014), 2425--2438.
[56]
Chi Zhang, Peilin Zhao, Shuji Hao, and Steven Hoi. 2018. Distributed multi-task classification: A decentralized online learning approach. Machine Learning 107, 4 (April 2018), 727--747.
[57]
Chi Zhang, Peilin Zhao, Shuji Hao, Yeng Chai Soh, and Bu Sung Lee. 2016. ROM: A robust online multi-task learning approach. In Proceedings of the16th International Conference on Data Mining (ICDM’16). IEEE, 1341--1346.
[58]
Peilin Zhao and Steven C. H. Hoi. 2013. Cost-sensitive online active learning with application to malicious URL detection. In Proceedings of the 19th SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’13). ACM, New York, NY, 919.
[59]
Peilin Zhao, Steven C. H. Hoi, and Rong Jin. 2011a. Double updating online learning. Journal of Machine Learning Research 12 (2011), 1587--1615.
[60]
Peilin Zhao, Rong Jin, Tianbao Yang, and Steven C. Hoi. 2011b. Online AUC maximization. In Proceedings of the 28th International Conference on Machine Learning (ICML’11). 233--240.

Cited By

View all

Index Terms

  1. Online Active Learning with Expert Advice

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Knowledge Discovery from Data
    ACM Transactions on Knowledge Discovery from Data  Volume 12, Issue 5
    October 2018
    354 pages
    ISSN:1556-4681
    EISSN:1556-472X
    DOI:10.1145/3234931
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 June 2018
    Accepted: 01 March 2018
    Revised: 01 March 2018
    Received: 01 February 2017
    Published in TKDD Volume 12, Issue 5

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Online learning
    2. active learning
    3. data streaming
    4. expert advice

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Funding Sources

    • National Research Foundation, Prime Minister's Office, Singapore under its International Research Centres in Singapore Funding Initiative

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)44
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Active learning for data streams: a surveyMachine Language10.1007/s10994-023-06454-2113:1(185-239)Online publication date: 1-Jan-2024
    • (2023)Lifelong Online Learning from Accumulated KnowledgeACM Transactions on Knowledge Discovery from Data10.1145/356394717:4(1-23)Online publication date: 24-Feb-2023
    • (2022)Incremental Feature Spaces Learning with Label ScarcityACM Transactions on Knowledge Discovery from Data10.1145/351636816:6(1-26)Online publication date: 8-Sep-2022
    • (2022)Active Learning With Multiple KernelsIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2020.304795333:7(2980-2994)Online publication date: Jul-2022
    • (2022)Impacts of Behavioral Biases on Active Learning Strategies2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)10.1109/ICAIIC54071.2022.9722689(256-261)Online publication date: 21-Feb-2022
    • (2022)A review of online supervised learningEvolving Systems10.1007/s12530-022-09448-y14:2(343-364)Online publication date: 8-Jul-2022
    • (2022)A Cost-Effective and Quality-Ensured Framework for Crowdsourced Indoor LocalizationHuman-Automation Interaction10.1007/978-3-031-10784-9_27(451-467)Online publication date: 1-Nov-2022
    • (2022)Covid-19: Study of Online Teaching, Availability and Use of Technological ResourcesRe-imagining Educational Futures in Developing Countries10.1007/978-3-030-88234-1_11(201-219)Online publication date: 1-Jan-2022
    • (2021)Online learning: A comprehensive surveyNeurocomputing10.1016/j.neucom.2021.04.112459(249-289)Online publication date: Oct-2021
    • (2021)DCA for online prediction with expert adviceNeural Computing and Applications10.1007/s00521-021-05709-033:15(9521-9544)Online publication date: 1-Aug-2021
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media