Abstract
Unsupervised feature selection (UFS) plays a critical role in the maintenance of representative feature subset from high dimensional data. Both the spectral analysis model and the self-expression model are effective in selecting important features. However, few alternative methods embed these two models into a joint FS framework. To address this problem, we propose a novel UFS method that simultaneously selects the most representative feature subset and makes the selected feature subset discriminative by mapping the original features into the label space. Specifically, both the self-expression and spectral analysis are introduced into our method and the self-expression matrix is used as the FS matrix. The two modules are not simply added together, but interact with each other through two graph constraints which preserve the local structure and the manifold structure of the original data, respectively. Furthermore, this paper proposes an alternative iterative algorithm to solve the four matrices involved in the proposed method. To verify the effectiveness of our method, extensive experiments are implemented, and the experimental results prove that the proposed method achieves the best performance among the current state-of-the-art UFS methods. Moreover, an ablation study is performed to show the effectiveness of each part of the proposed method.



Similar content being viewed by others
References
Ali A, Zhu Y, Zakarya M (2021) A data aggregation based approach to exploit dynamic spatio-temporal correlations for citywide crowd flows prediction in fog computing. Multim Tools Appl 80(20):31,401–31,433
Ali A, Zhu Y, Zakarya M (2021) Exploiting dynamic spatio-temporal correlations for citywide traffic flow prediction using attention based neural networks. Inf Sci 577:852–870
Ali A, Zhu Y, Zakarya M (2022) Exploiting dynamic spatio-temporal graph convolutional neural networks for citywide traffic flows prediction. Neural Netw 145:233–247
Cai D, Zhang C, He X (2010) Unsupervised feature selection for multi-cluster data. In: Proceedings of the ACM SIGKDD international conference on knowledge discovery and data mining, pp 333–342
Cao X, Zhang C, Fu H, Liu S, Zhang H (2015) Diversity-induced multi-view subspace clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 586–594
Cui G, Li X, Dong Y (2018) Subspace clustering guided convex nonnegative matrix factorization. Neurocomputing 292:38–48
Gadekallu TR, Rajput DS, Reddy MPK, Lakshmanna K, Bhattacharya S, Singh S, Jolfaei A, Alazab M (2020) A novel pca–whale optimization-based deep neural network model for classification of tomato plant diseases using gpu. J Real-Time Image Proc:1–14
Gui J, Sun Z, Ji S, Tao D, Tan T (2017) Feature selection based on structured sparsity: a comprehensive study. IEEE Trans Neural Netw Learning Syst 28(7):1490–1507
Hakak S, Alazab M, Khan S, Gadekallu TR, Maddikunta PKR, Khan WZ (2021) An ensemble machine learning approach through effective feature extraction to classify fake news. Futur Gener Comput Syst 117:47–58
Hou C, Nie F, Li X, Yi D, Wu Y (2014) Joint embedding learning and sparse regression: a framework for unsupervised feature selection. IEEE Trans Cybernetics 44(6):793–804
Huang D, Cai X, Wang CD (2019) Unsupervised feature selection with multi-subspace randomization and collaboration. Knowl-Based Syst 182 (104):856
Huang Q, Xia T, Sun H, Yamada M, Chang Y (2020) Unsupervised nonlinear feature selection from high-dimensional signed networks. In: The thirty-fourth AAAI conference on artificial intelligence, AAAI 2020, the thirty-second innovative applications of artificial intelligence conference, IAAI 2020, the tenth AAAI symposium on educational advances in artificial intelligence. AAAI Press, EAAI 2020, New York, 7-12 February 2020, pp 4182–4189
Li W, Chen H, Li T, Wan J, Sang B (2022) Unsupervised feature selection via self-paced learning and low-redundant regularization. Knowl Based Syst 240(108):150
Li Z, Yang Y, Liu J, Zhou X, Lu H (2012) Unsupervised feature selection using nonnegative spectral analysis. In: Proceedings of the AAAI conference on artificial intelligence
Li X, Yuan A, Lu X (2019) Vision-to-language tasks based on attributes and attention mechanism. IEEE Trans Cybern:1–14. https://doi.org/10.1109/TCYB.2019.2914351
Li X, Zhang H, Zhang R, Liu Y, Nie F (2019) Generalized uncorrelated regression with adaptive graph for unsupervised feature selection. IEEE Trans Neural Netw Learning Syst 30(5):1587–1595
Lu Q, Li X, Dong Y (2018) Structure preserving unsupervised feature selection. Neurocomputing 301:36–45
Lu C, Min H, Gui J, Zhu L, Lei Y (2013) Face recognition via weighted sparse representation. J Visual Commun Image Representation 24(2):111–116
Luo C, Zheng J, Li T, Chen H, Huang Y, Peng X (2022) Orthogonally constrained matrix factorization for robust unsupervised feature selection with local preserving. Inf Sci 586:662–675. https://doi.org/10.1016/j.ins.2021.11.068
Lyons MJ, Budynek J, Akamatsu S (1999) Automatic classification of single facial images. IEEE Trans Pattern Anal Mach Intell 21(12):1357–1362
Mi JX, Lei D, Gui J (2013) A novel method for recognizing face with partial occlusion via sparse representation. Optik 124(24):6786–6789
Nene SA, Nayar SK, Murase H (1996) Columbia object image library (coil-20). Tech Rep, Department of Computer Science Columbia University
Nie F, Yuan J, Huang H (2014) Optimal mean robust principal component analysis. In: Proceedings of the 31th international conference on machine learning, ICML 2014, Beijing, China. JMLR.org, 21–26 June 2014, vol 32, pp 1062–1070
Nie F, Zhang R, Li X (2017) A generalized power iteration method for solving quadratic problem on the stiefel manifold. SCIENCE CHINA Inf Sci 60 (11):112,101:1–112,101:10
Parsons L, Haque E, Liu H (2004) Subspace clustering for high dimensional data: a review. SIGKDD Explorations 6(1):90–105
Qian M, Zhai C (2013) Robust unsupervised feature selection. In: Proceedings of the international joint conference on artificial intelligence (IJCAI), pp 1621–1627
Strehl A, Ghosh J (2002) Cluster ensembles—a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3(Dec):583–617
Vasan D, Alazab M, Wassan S, Naeem H, Safaei B, Zheng Q (2020) Imcfn: Image-based malware classification using fine-tuned convolutional neural network architecture. Comput Netw 171:107–138
Venkatraman S, Alazab M (2018) Use of data visualisation for zero-day malware detection. Security Commun Netw, vol 2018
Wang Q, He X, Jiang X, Li X (2020) Robust bi-stochastic graph regularized matrix factorization for data clustering. IEEE Trans Pattern Anal Mach Intell
Wang Q, Liu R, Chen M, Li X (2021) Robust rank-constrained sparse learning: a graph-based framework for single view and multiview clustering. IEEE Trans Cybern:1–12. https://doi.org/10.1109/TCYB.2021.3067137
Wang Q, Zhang F, Li X (2020) Hyperspectral band selection via optimal neighborhood reconstruction. IEEE Trans Geosci Remote Sens 58 (12):8465–8476
Xing EP, Jordan MI, Karp RM (2001) Feature selection for high-dimensional genomic microarray data. In: Proceedings of the eighteenth international conference on machine learning (ICML), pp 601–608
Yang Y, Shen HT, Ma Z, Huang Z (2011) Zhou, x.: l(_2, 1)-norm regularized discriminative feature selection for unsupervised learning. In: Proceedings of the 22nd international joint conference on artificial intelligence (IJCAI), pp 1589–1594
You M, Yuan A, Zou M, D.j H, Li X (2021) Robust unsupervised feature selection via multi-group adaptive graph representation. IEEE Trans Knowl Data Eng:1–1. https://doi.org/10.1109/TKDE.2021.3124255
Yu L, Liu H (2003) Feature selection for high-dimensional data: a fast correlation-based filter solution. In: Proceedings of the twentieth international conference (ICML), pp 856–863
Yuan A, Gao X, You M, He D (2020) Joint self-expression with adaptive graph for unsupervised feature selection. In: peng Y, Liu Q, Lu H, Sun Z, Liu C, Chen X, Zha H, Yang J (eds) Pattern recognition and computer vision - third chinese conference. Springer, PRCV 2020, Nanjing, China, 16–18 October 2020, Proceedings, Part III, vol 12307, pp 185–196
Yuan A, Gao X, You M, He D (2020) Joint self-expression with adaptive graph for unsupervised feature selection. In: Chinese conference on pattern recognition and computer vision (PRCV). Springer, pp 185–196
Yuan A, You M, He D, Li X (2020) Convex non-negative matrix factorization with adaptive graph for unsupervised feature selection. IEEE Trans Cybern:1–13. https://doi.org/10.1109/TCYB.2020.3034462
Zhang R, Nie F, Li X (2017) Projected clustering via robust orthogonal least square regression with optimal scaling. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 2784–2791
Zhang R, Nie F, Li X (2018) Self-weighted supervised discriminative feature selection. IEEE Trans Neural Netw Learning Syst 29(8):3913–3918
Zhang R, Nie F, Wang Y, Li X (2019) Unsupervised feature selection via adaptive multimeasure fusion. IEEE Trans Neural Netw Learning Syst 30 (9):2886–2892
Zhao Z, Liu H (2007) Spectral feature selection for supervised and unsupervised learning. In: Proceedings of the international conference on machine learning (ICML), pp 1151–1157
Zhou P, Du L, Li X, Shen Y, Qian Y (2020) Unsupervised feature selection with adaptive multiple graph learning. Pattern Recognit 105(107):375
Zhu X, Li X, Zhang S, Ju C, Wu X (2017) Robust joint graph sparse coding for unsupervised spectral feature selection. IEEE Trans Neural Netw Learning Syst 28(6):1263–1275
Acknowledgements
This work was supported in part by the National College Students Innovation and Entrepreneurship Training Program under Grant S202010712042, in part by the Natural Science Foundation of Shaanxi Province under Grant 2020JQ-279, in part by the Doctoral Start-up Foundation of Northwest A&F University under Grant Z1090219095, and Grant Z109021803.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
The authors have no financial or proprietary interests in any material discussed in this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
You, M., Ban, L., Wang, Y. et al. Unsupervised feature selection with joint self-expression and spectral analysis via adaptive graph constraints. Multimed Tools Appl 82, 5879–5898 (2023). https://doi.org/10.1007/s11042-022-13426-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13426-6