Abstract
Multi-label classification is very common in practical applications. Compared with multi-class classification, multi-label classification has larger label space and thus the annotations of multi-label instances are typically more time-consuming. It is significant to develop active learning methods for multi-label classification problems. In addition, multi-view learning is more and more popular, which treats data from different views discriminatively and integrates information from all the views effectively. Introducing multi-view methods into active learning can further enhance its performance when processing multi-view data. In this paper, we propose multi-view active learning methods for multi-label classifications. The proposed methods are developed based on the conditional Bernoulli mixture model which is an effective model for multi-label classification. For making active selection criteria, we consider selecting informative and representative instances. From the informative perspective, least confidence and entropy of the predicting results are employed. From the representative perspective, clustering results on the unlabeled data are exploited. Particularly for multi-view active learning, novel multi-view prediction methods are designed to make final prediction and view consistency is additionally considered to make selection criteria. Finally, we demonstrate the effectiveness of the proposed methods through experiments on real-world datasets.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Boutell MR, Luo J, Shen X, Brown CM (2004) Learning multi-label scene classification. Pattern Recogn 37:1757–1771
Brinker K (2006) On active learning in multi-label classification. From data and information analysis to knowledge engineering pp. 206–213
Chatzilari E, Nikolopoulos S, Kompatsiaris Y, Kittler J (2016) Salic: social active learning for image classification. IEEE Trans Multimedia 18(8):1488–1503
Chen J, Sun S, Zhao J (2018) Multi-label active learning with conditional Bernoulli mixtures. In: Pacific rim international conference on artificial intelligence, pp. 954–967
Chen X, Yu G, Domeniconi C, Wang J, Li Z, Zhang Z (2018) Cost effective multi-label active learning via querying subexamples. IEEE Int Conf Data Mining (ICDM). https://doi.org/10.1109/ICDM.2018.00109
Dembczyński K, Waegeman W, Cheng W, Hüllermeier E (2012) On label dependence and loss minimization in multi-label classification. Mach Learn 88:5–45
Di W, Crawford MM (2011) Active learning via multi-view and local proximity co-regularization for hyperspectral image classification. IEEE J Sel Top Signal Process 5:618–628
Ding X, Li B, Xiong W, Guo W, Hu W, Wang B (2016) Multi-instance multi-label learning combining hierarchical context and its application to image annotation. IEEE Transactions on Multimedia 18(8):1616–1627
Du B, Wang Z, Zhang L, Zhang L, Tao D (2017) Robust and discriminative labeling for multi-label active learning based on maximum correntropy criterion. IEEE Trans Image Process 26(4):1694–1707. https://doi.org/10.1109/TIP.2017.2651372
Esuli A, Sebastiani F (2009) Active learning strategies for multi-label text classification. In: Advances in information retrieval, pp. 102–113
Gao N, Huang S, Chen S (2016) Multi-label active learning by model guided distribution matching. Front Comput Sci 10:845–855
Gibaja E, Ventura S (2015) A tutorial on multilabel learning. ACM Comput. Surv. 47:52:1-52:38
Hassani MS (2018) Active and multi-view machine learning for microRNA prediction. Master’s thesis, Carleton University, Ottawa, Ontario, Canada
Herrera F, Charte F, Rivera AJ, del Jesus MJ (2016) Multilabel classification. Springer International Publishing, Berlin, pp 17–31
Huang H, Zhang C, Hu Q, Zhu P (2016) Multi-view representative and informative induced active learning. In: Pacific rim international conference on artificial intelligence, pp. 139–151
Huang S, Chen S, Zhou Z (2015) Multi-label active learning: query type matters. In: International joint conference on artificial intelligence, pp. 946–952
Huang S, Zhou Z (2013) Active query driven by uncertainty and diversity for incremental multi-label learning. In: International conference on data mining, pp. 1079–1084
Huang Y, Wang W, Wang L (2015) Unconstrained multimodal multi-label learning. IEEE Trans Multimedia 17(11):1923–1935
Joachims T (1998) Text categorization with support vector machines: learning with many relevant features. In: European conference on machine learning, pp. 137–142
Jordan MI, Jacobs RA (1994) Hierarchical mixtures of experts and the EM algorithm. Neural Comput 6:181–214
Katakis I, Tsoumakas G, Vlahavas I (2008) Multilabel text classification for automated tag suggestion. In: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, pp. 1–9
Kazawa H, Izumitani T, Taira H, Maeda E (2005) Maximal margin labeling for multi-topic text categorization. In: Advances in neural information processing systems, pp. 649–656
Lee KH, Hwang JN (2015) On-road pedestrian tracking across multiple driving recorders. IEEE Trans Multimedia 17(9):1429–1438
Lewis DD, Yang Y, Rose TG, Li F (2004) Rcv1: a new benchmark collection for text categorization research. J Mach Learn Res 5:361–397
Li C, Wang B, Pavlu V, Aslam J (2016) Conditional Bernoulli mixtures for multi-label classification. In: International conference on machine learning, pp. 2482–2491
Li X, Guo Y (2013) Active learning with multi-label SVM classification. In: International joint conference on artificial intelligence, pp. 1479–1485
Li X, Kuang D, Ling CX (2012) Active learning for hierarchical text classification. In: Advances in knowledge discovery and data mining, pp. 14–25
Li X, Wang L, Sung E (2004) Multilabel SVM active learning for image classification. In: International conference on image processing, pp. 2207–2210
McCallum A (1999) Multi-label text classification with a mixture model trained by EM. In: AAAI Workshop on text learning, pp. 1–7
Muslea I, Minton S, Knoblock C (2006) Active learning with multiple views. J Artif Intell Res 27:203–233
Muslea I, Minton S, Knoblock CA (2000) Selective sampling with redundant views. In: AAAI conference on artificial intelligence, pp. 621–626
Muslea I, Minton S, Knoblock CA (2002) Active + semi-supervised learning = robust multiview learning. In: International conference on machine learning, pp. 435–442
Qi GJ, Hua XS, Rui Y, Tang J, Zhang HJ (2008) ’Two-dimensional active learning for image classification. In: Computer vision and pattern recognition, pp. 1–8
Reyes O, Morell C, Ventura S (2018) Effective active learning strategy for multi-label learning. Neurocomputing 273:494–508
Schapire RE, Singer Y (2000) Boostexter: a boosting-based system for text categorization. Mach Learn 39:135–168
Singh M, Curran E, Cunningham P (2009) Active learning for multi-label image annotation. University College Dublin, Tech. rep
Song Y, Zhang L, Giles CL (2008) A sparse Gaussian processes classification framework for fast tag suggestions. In: ACM Conference on information and knowledge management, pp. 93–102
Sun S (2008) Semantic features for multi-view semi-supervised and active learning of text classification. In: IEEE international conference on data mining workshops, pp. 731–735
Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23:2031–2038
Sun S, Shawe-Taylor J, Mao L (2017) PAC-Bayes analysis of multi-view learning. Inf Fus 35:117–131
Tsoumakas G, Katakis I (2007) Multi-label classification: an overview. Int J Data Wareh Min 3:1–13
Ueda N, Saito K (2003) Parametric mixture models for multi-labeled text. In: Advances in neural information processing systems, pp. 737–744
Wang W, Zhou Z (2010) Multi-view active learning in the non-realizable case. In: Advances in Neural Information Processing Systems, pp. 2388–2396
Wu J, Zhao S, Sheng VS, Zhang J, Ye C, Zhao P, Cui Z (2017) Weak-labeled active learning with conditional label dependence for multilabel image classification. IEEE Trans Multimedia 19:1156–1169
Xie X (2018) Regularized multi-view least squares twin support vector machines. Appl Intell 48(9):3108–3115. https://doi.org/10.1007/s10489-017-1129-3
Xie X, Sun S (2019) General multi-view learning with maximum entropy discrimination. Neurocomputing 332:184–192
Xie X, Sun S (2020) General multi-view semi-supervised least squares support vector machines with multi-manifold regularization. Inf Fus 62:63–72
Xie X, Sun S (2020) Multi-view support vector machines with the consensus and complementarity information. IEEE Trans Knowl Data Eng 32(12):2401–2413. https://doi.org/10.1109/TKDE.2019.2933511
Xing Y, Yu GX, Domeniconi C, Wang J, Zhang Z, Guo M (2019) Multi-view multi-instance multi-label learning based on collaborative matrix factorization. Proc AAAI Conf Artif Intell 33:5508–5515. https://doi.org/10.1609/aaai.v33i01.33015508
Xu X, Li J, Li S (2018) Multiview intensity-based active learning for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing 56:669–680
Yan Y, Nie F, Li W, Gao C, Yang Y, Xu D (2016) Image classification by cross-media active learning with privileged information. IEEE Trans Multimedia 18(12):2494–2502
Yang B, Sun JT, Wang T, Chen Z (2009) Effective multi-label active learning for text classification. In: ACM SIGKDD international conference on knowledge discovery and data mining, pp. 917–926
Yang Y, Yu GX, Wang J, Domeniconi C, Zhang X (2020) Multi-typed objects multi-view multi-instance multi-label learning. In: IEEE international conference on data mining, pp. 1370–1375
Yin J, Sun S (2020) Multiview uncorrelated locality preserving projection. IEEE Transact Neural Netw Learn Syst 31(9):3442–3455. https://doi.org/10.1109/TNNLS.2019.2944664
Yu G, Chen X, Domeniconi C, Wang J, Li Z, Zhang Z, Zhang X (2020) Cmal: Cost-effective multi-label active learning by querying subexamples. IEEE Transactions on Knowledge and Data Engineering pp. 1–1. https://doi.org/10.1109/TKDE.2020.3003899
Zhang M, Zhou Z (2014) A review on multi-label learning algorithms. IEEE Trans Knowl Data Eng 26:1819–1837
Zhao J, Xie X, Xu X, Sun S (2017) Multi-view learning overview: recent progress and new challenges. Inf Fus 38:43–54
Zhou J, Sun S (2015) Gaussian process versus margin sampling active learning. Neurocomputing 167:122–131
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under Project 62076096 and Project 62006078, in part by the Shanghai Municipal Project 20511100900, in part by the Shanghai Knowledge Service Platform Project under Grant ZF1213, in part by the Chenguang Program of the Shanghai Education Development Foundation and the Shanghai Municipal Education Commission under Grant 19CG25, and in part by the Fundamental Research Funds for the Central Universities.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhao, J., Qiu, Z. & Sun, S. Multi-view multi-label active learning with conditional Bernoulli mixtures. Int. J. Mach. Learn. & Cyber. 13, 1589–1601 (2022). https://doi.org/10.1007/s13042-021-01467-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-021-01467-6