Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/3060832.3060848guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Multi-label informed feature selection

Published: 09 July 2016 Publication History

Abstract

Multi-label learning has been extensively studied in the area of bioinformatics, information retrieval, multimedia annotation, etc. In multi-label learning, each instance is associated with multiple interdependent class labels, the label information can be noisy and incomplete. In addition, multilabeled data often has noisy, irrelevant and redundant features of high dimensionality. As an effective data preprocessing step, feature selection has shown its effectiveness to prepare high-dimensional data for numerous data mining and machine learning tasks. Most of existing multi-label feature selection algorithms either boil down to solving multiple single-labeled feature selection problems or directly make use of imperfect labels. Therefore, they may not be able to find discriminative features that are shared by multiple labels. In this paper, we propose a novel multi-label informed feature selection framework MIFS, which exploits label correlations to select discriminative features across multiple labels. Specifically, to reduce the negative effects of imperfect label information in finding label correlations, we decompose the multi-label information into a low-dimensional space and then employ the reduced space to steer the feature selection process. Empirical studies on real-world datasets demonstrate the effectiveness and efficiency of the proposed framework.

References

[1]
Dimitri P Bertsekas. Nonlinear Programming . Athena scientific, 1999.
[2]
Matthew R Boutell, Jiebo Luo, Xipeng Shen, and Christopher M Brown. Learning multilabel scene classification. Pattern Recognition , 37(9):1757-1771, 2004.
[3]
Deng Cai, Chiyuan Zhang, and Xiaofei He. Unsupervised feature selection for multi-cluster data. In KDD , pages 333-342, 2010.
[4]
Xiaojun Chang, Feiping Nie, Yi Yang, and Heng Huang. A convex formulation for semisupervised multi-label feature selection. In AAAI , pages 1171-1177, 2014.
[5]
Richard O Duda, Peter E Hart, and David G Stork. Pattern Classification . John Wiley & Sons, 2012.
[6]
Susan T Dumais. Latent semantic analysis. Annual Review of Information Science and Technology , 38(1):188-230, 2004.
[7]
André Elisseeff and Jason Weston. A kernel method for multi-labelled classification. In NIPS , pages 681-687, 2001.
[8]
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. Liblinear: A library for large linear classification. The Journal of Machine Learning Research , 9:1871-1874, 2008.
[9]
Siddharth Gopal and Yiming Yang. Multilabel classification with meta-level features. In SIGIR , pages 315-322, 2010.
[10]
Xian-Sheng Hua and Guo-Jun Qi. Online multi-label active annotation: towards large-scale content-based video search. In ACM MM , pages 141-150, 2008.
[11]
Sheng-Jun Huang, Zhi-Hua Zhou, and ZH Zhou. Multi-label learning by exploiting label correlations locally. In AAAI , pages 949-955, 2012.
[12]
Shuiwang Ji, Lei Tang, Shipeng Yu, and Jieping Ye. Extracting shared subspace for multilabel classification. In KDD , pages 381-389, 2008.
[13]
Ioannis Katakis, Grigorios Tsoumakas, and Ioannis Vlahavas. Multilabel text classification for automated tag suggestion. ECML PKDD discovery challenge , 75, 2008.
[14]
David D Lewis, Yiming Yang, Tony G Rose, and Fan Li. Rcv1: A new benchmark collection for text categorization research. The Journal of Machine Learning Research , 5:361-397, 2004.
[15]
Jundong Li, Xia Hu, Jiliang Tang, and Huan Liu. Unsupervised streaming feature selection in social media. In CIKM , pages 1041-1050, 2015.
[16]
Jundong Li, Kewei Cheng, Suhang Wang, Fred Morstatter, Robert P Trevino, Jiliang Tang, and Huan Liu. Feature selection: A data perspective. arXiv preprint arXiv:1601.07996 , 2016.
[17]
Jundong Li, Xia Hu, Liang Wu, and Huan Liu. Robust unsupervised feature selection on networked data. In SDM , 2016.
[18]
Huan Liu and Hiroshi Motoda. Computational Methods of Feature Selection . CRC Press, 2007.
[19]
Jun Liu, Shuiwang Ji, and Jieping Ye. Multi-task feature learning via efficient l 2,1 -norm minimization. In UAI , pages 339-348, 2009.
[20]
Zhigang Ma, Feiping Nie, Yi Yang, Jasper RR Uijlings, and Nicu Sebe. Web image annotation via subspace-sparsity collaborated feature selection. IEEE Transactions on Multimedia , 14(4):1021-1030, 2012.
[21]
Donald Michie, David J Spiegelhalter, and Charles C Taylor. Machine Learning, Neural and Statistical Classification . Ellis Horwood, 1994.
[22]
Feiping Nie, Heng Huang, Xiao Cai, and Chris H Ding. Efficient and robust feature selection via joint l 2,1 -norms minimization. In NIPS , pages 1813-1821, 2010.
[23]
Hanchuan Peng, Fuhui Long, and Chris Ding. Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Transactions on Pattern Analysis and Machine Intelligence , 27(8):1226-1238, 2005.
[24]
Yang Song, Lu Zhang, and C Lee Giles. A sparse gaussian processes classification framework for fast tag suggestions. In CIKM , pages 93-102, 2008.
[25]
Lei Tang, Suju Rajan, and Vijay K Narayanan. Large scale multi-label classification via metalabeler. In WWW , pages 211-220, 2009.
[26]
Robert Tibshirani. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) , pages 267-288, 1996.
[27]
Yi Yang, Heng Tao Shen, Zhigang Ma, Zi Huang, and Xiaofang Zhou. l 2,1 -norm regularized discriminative feature selection for unsupervised learning. In IJCAI , pages 1589-1594, 2011.
[28]
Kai Yu, Shipeng Yu, and Volker Tresp. Multi-label informed latent semantic indexing. In SIGIR , pages 258-265, 2005.
[29]
Min-Ling Zhang and Zhi-Hua Zhou. Multilabel neural networks with applications to functional genomics and text categorization. IEEE Transactions on Knowledge and Data Engineering , 18(10):1338-1351, 2006.
[30]
Yin Zhang and Zhi-Hua Zhou. Multilabel dimensionality reduction via dependence maximization. ACM Transactions on Knowledge Discovery from Data , 4(3):14, 2010.

Cited By

View all
  • (2024)Evolutionary Sparse Coding and Graph Regularisation for Embedded Multi-label Feature SelectionProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3638530.3654094(271-274)Online publication date: 14-Jul-2024
  • (2023)Co-operative Co-evolutionary Many-objective Embedded Multi-label Feature Selection with Decomposition-based PSOProceedings of the Genetic and Evolutionary Computation Conference10.1145/3583131.3590373(438-446)Online publication date: 15-Jul-2023
  • (2022)Learning common and label-specific features for multi-Label classification with correlation informationPattern Recognition10.1016/j.patcog.2021.108259121:COnline publication date: 1-Jan-2022
  • Show More Cited By

Index Terms

  1. Multi-label informed feature selection
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    IJCAI'16: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence
    July 2016
    4277 pages
    ISBN:9781577357704

    Sponsors

    • Sony: Sony Corporation
    • Arizona State University: Arizona State University
    • Microsoft: Microsoft
    • Facebook: Facebook
    • AI Journal: AI Journal

    Publisher

    AAAI Press

    Publication History

    Published: 09 July 2016

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 23 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Evolutionary Sparse Coding and Graph Regularisation for Embedded Multi-label Feature SelectionProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3638530.3654094(271-274)Online publication date: 14-Jul-2024
    • (2023)Co-operative Co-evolutionary Many-objective Embedded Multi-label Feature Selection with Decomposition-based PSOProceedings of the Genetic and Evolutionary Computation Conference10.1145/3583131.3590373(438-446)Online publication date: 15-Jul-2023
    • (2022)Learning common and label-specific features for multi-Label classification with correlation informationPattern Recognition10.1016/j.patcog.2021.108259121:COnline publication date: 1-Jan-2022
    • (2021)Sparsity-based evolutionary multi-objective feature selection for multi-label classificationProceedings of the Genetic and Evolutionary Computation Conference Companion10.1145/3449726.3459467(147-148)Online publication date: 7-Jul-2021
    • (2021)Multi-objective Cuckoo Search-based Streaming Feature Selection for Multi-label DatasetACM Transactions on Knowledge Discovery from Data10.1145/344758615:6(1-24)Online publication date: 19-May-2021
    • (2019)Latent semantics encoding for label distribution learningProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367471.3367595(3982-3988)Online publication date: 10-Aug-2019
    • (2019)Learning for tail label dataProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367471.3367575(3842-3848)Online publication date: 10-Aug-2019
    • (2019)Correlated Multi-label Classification with Incomplete Label Space and Class ImbalanceACM Transactions on Intelligent Systems and Technology10.1145/334251210:5(1-26)Online publication date: 5-Sep-2019
    • (2019)Feature selection for text classificationMultimedia Tools and Applications10.1007/s11042-018-6083-578:3(3797-3816)Online publication date: 1-Feb-2019
    • (2019)Semi-supervised dual low-rank feature mapping for multi-label image annotationMultimedia Tools and Applications10.1007/s11042-018-5719-978:10(13149-13168)Online publication date: 1-May-2019
    • Show More Cited By

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media