Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Variable selection in model-based clustering: A general variable role modeling

Published: 01 September 2009 Publication History

Abstract

The currently available variable selection procedures in model-based clustering assume that the irrelevant clustering variables are all independent or are all linked with the relevant clustering variables. A more versatile variable selection model is proposed, taking into account three possible roles for each variable: The relevant clustering variables, the irrelevant clustering variables dependent on a part of the relevant clustering variables and the irrelevant clustering variables totally independent of all the relevant variables. A model selection criterion and a variable selection algorithm are derived for this new variable role modeling. The model identifiability and the consistency of the variable selection criterion are also established. Numerical experiments highlight the interest of this new modeling.

References

[1]
Model-based Gaussian and non-Gaussian clustering. Biometrics. v49 i3. 803-821.
[2]
Model-based cluster and discriminant analysis with the mixmod software. Computational Statistics and Data Analysis. v51 i2. 587-600.
[3]
Classification and Regression Trees. Wadsworth International, Belmont, California.
[4]
Gaussian parsimonious clustering models. Pattern Recognition. v28 i5. 781-793.
[5]
Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society, Series B. v39 i1. 1-38.
[6]
Simultaneous feature selection and clustering using mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence. v26 i9. 1154-1166.
[7]
Finite Mixture Models. Wiley-Interscience, New York.
[8]
Maugis, C., Celeux, G., Martin-Magniette, M.-L., 2009. Variable selection for clustering with Gaussian mixture models. Biometrics (in press)
[9]
Maugis, C., Celeux, G., Martin-Magniette, M.-L., 2008. Variable selection in model-based clustering: A general variable role modelling. Technical Report INRIA, RR-6744
[10]
Variable selection for model-based clustering. Journal of the American Statistical Association. v101 i473. 168-178.
[11]
Bayesian variable selection in clustering high-dimensional data. Journal of the American Statistical Association. v100 i470. 602-617.

Cited By

View all
  • (2023)A Survey on Model-Based Co-Clustering: High Dimension and Estimation ChallengesJournal of Classification10.1007/s00357-023-09441-340:2(332-381)Online publication date: 17-Jul-2023
  • (2022)Unobserved classes and extra variables in high-dimensional discriminant analysisAdvances in Data Analysis and Classification10.1007/s11634-021-00474-316:1(55-92)Online publication date: 1-Mar-2022
  • (2022)Improving the Stability of the Variable Selection with Small Datasets in Classification and Regression TasksNeural Processing Letters10.1007/s11063-022-10916-455:5(5331-5356)Online publication date: 10-Jun-2022
  • Show More Cited By
  1. Variable selection in model-based clustering: A general variable role modeling

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Computational Statistics & Data Analysis
    Computational Statistics & Data Analysis  Volume 53, Issue 11
    September, 2009
    177 pages

    Publisher

    Elsevier Science Publishers B. V.

    Netherlands

    Publication History

    Published: 01 September 2009

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 09 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)A Survey on Model-Based Co-Clustering: High Dimension and Estimation ChallengesJournal of Classification10.1007/s00357-023-09441-340:2(332-381)Online publication date: 17-Jul-2023
    • (2022)Unobserved classes and extra variables in high-dimensional discriminant analysisAdvances in Data Analysis and Classification10.1007/s11634-021-00474-316:1(55-92)Online publication date: 1-Mar-2022
    • (2022)Improving the Stability of the Variable Selection with Small Datasets in Classification and Regression TasksNeural Processing Letters10.1007/s11063-022-10916-455:5(5331-5356)Online publication date: 10-Jun-2022
    • (2022)Group-Wise Shrinkage Estimation in Penalized Model-Based ClusteringJournal of Classification10.1007/s00357-022-09421-z39:3(648-674)Online publication date: 11-Oct-2022
    • (2022)High-Dimensional Clustering via Random ProjectionsJournal of Classification10.1007/s00357-021-09403-739:1(191-216)Online publication date: 1-Mar-2022
    • (2021)Simultaneous dimension reduction and clustering via the NMF-EM algorithmAdvances in Data Analysis and Classification10.1007/s11634-020-00398-415:1(231-260)Online publication date: 1-Mar-2021
    • (2021)A Combined Approach for Enhancing the Stability of the Variable Selection Stage in Binary Classification TasksAdvances in Computational Intelligence10.1007/978-3-030-85099-9_20(248-259)Online publication date: 16-Jun-2021
    • (2019)Unifying data units and models in (co-)clusteringAdvances in Data Analysis and Classification10.1007/s11634-018-0325-213:1(7-31)Online publication date: 1-Mar-2019
    • (2019)Variable selection in model-based clustering and discriminant analysis with a regularization approachAdvances in Data Analysis and Classification10.1007/s11634-018-0322-513:1(259-278)Online publication date: 1-Mar-2019
    • (2019)A survey of feature selection methods for Gaussian mixture models and hidden Markov modelsArtificial Intelligence Review10.1007/s10462-017-9581-352:3(1739-1779)Online publication date: 1-Oct-2019
    • Show More Cited By

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media