Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1143844.1143923acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
Article

The uniqueness of a good optimum for K-means

Published: 25 June 2006 Publication History

Abstract

If we have found a "good" clustering C of a data set, can we prove that C is not far from the (unknown) best clustering Copt of these data? Perhaps surprisingly, the answer to this question is sometimes yes. When "goodness" is measured by the distortion of K-means clustering, this paper proves spectral bounds on the distance d(C, Copt). The bounds exist in the case when the data admits a low distortion clustering.

References

[1]
Ben-Hur, A., Elisseeff, A., & Guyon, I. (2002). A stability based method for discovering structure in clustered data. Pacific Symposium on Biocomputing (pp. 6--17).
[2]
Dasgupta, S. (1999). Learning mixtures of gaussians. FOCS '99: Proceedings of the 40th Annual Symposium on Foundations of Computer Science (p. 634). Washington, DC, USA: IEEE Computer Society.
[3]
Ding, C., & He, X. (2004). K-means clustering via principal component analysis. Proceedings of the International Machine Learning Conference (ICML). Morgan Kauffman.
[4]
Hubert, L., & Arabie, P. (1985). Comparing partitions. Journal of Classification, 2, 193--218.
[5]
Lancaster, H. (1969). The Chi-squared distribution. Wiley.
[6]
Lange, T., Roth, V., Braun, M. L., & Buhmann, J. M. (2004). Stability-based validation of clustering solutions. Neural Comput., 16, 1299--1323.
[7]
Meilă, M. (2005). Comparing clusterings-an axiomatic view. Proceedings of the International Machine Learning Conference (ICML). ACM Press.
[8]
Meilă, M. (2006). The local equivalence of two distances between finite random variables: the misclassification error metric and the X2 distance. (submitted).
[9]
Meilă, M., Shortreed, S., & Xu, L. (2005). Regularized spectral learning. Proceedings of the Artificial Intelligence and Statistics Workshop(AISTATS 05).
[10]
Papadimitriou, C., & Steiglitz, K. (1998). Combinatorial optimization. algorithms and complexity. Minneola, NY: Dover Publication, Inc.
[11]
Vempala, S., & Wang, G. (2004). A spectral algorithm for learning mixture models. J. Comput. Syst. Sci., 68, 841--860.

Cited By

View all
  • (2024)Fractal-Based Multi-Criteria Feature Selection to Enhance Predictive Capability of AI-Driven Mineral Prospectivity MappingFractal and Fractional10.3390/fractalfract80402248:4(224)Online publication date: 12-Apr-2024
  • (2023)Pattern Selection for Graph DatabasesPlug-and-Play Visual Subgraph Query Interfaces10.1007/978-3-031-16162-9_6(49-81)Online publication date: 14-Mar-2023
  • (2022)Aerodynamic data predictions based on multi-task learning▪Applied Soft Computing10.1016/j.asoc.2021.108369116:COnline publication date: 6-May-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICML '06: Proceedings of the 23rd international conference on Machine learning
June 2006
1154 pages
ISBN:1595933832
DOI:10.1145/1143844
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2006

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Acceptance Rates

ICML '06 Paper Acceptance Rate 140 of 548 submissions, 26%;
Overall Acceptance Rate 140 of 548 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)1
Reflects downloads up to 30 Aug 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Fractal-Based Multi-Criteria Feature Selection to Enhance Predictive Capability of AI-Driven Mineral Prospectivity MappingFractal and Fractional10.3390/fractalfract80402248:4(224)Online publication date: 12-Apr-2024
  • (2023)Pattern Selection for Graph DatabasesPlug-and-Play Visual Subgraph Query Interfaces10.1007/978-3-031-16162-9_6(49-81)Online publication date: 14-Mar-2023
  • (2022)Aerodynamic data predictions based on multi-task learning▪Applied Soft Computing10.1016/j.asoc.2021.108369116:COnline publication date: 6-May-2022
  • (2021)A K-Means Clustering-Based Multiple Importance Sampling Algorithm for Integral Global OptimizationJournal of the Operations Research Society of China10.1007/s40305-021-00353-wOnline publication date: 6-Jul-2021
  • (2020)Vehicle Trajectory SimilarityACM Computing Surveys10.1145/340609653:5(1-32)Online publication date: 28-Sep-2020
  • (2020)Tunneling parameters optimization based on multi-objective differential evolution algorithmSoft Computing10.1007/s00500-020-05392-8Online publication date: 4-Nov-2020
  • (2019)Machine Learning Techniques Applied to Multiband Spectrum Sensing in Cognitive RadiosSensors10.3390/s1921471519:21(4715)Online publication date: 30-Oct-2019
  • (2019)CATAPULTProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3300072(900-917)Online publication date: 25-Jun-2019
  • (2019)Multiple Target Exploration Approach for Design Exploration Using a Swarm Intelligence and ClusteringJournal of Mechanical Design10.1115/1.4043201141:9(091401)Online publication date: 22-Apr-2019
  • (2019)Machine Learning in the Tasks of Identifying Unwanted Content2019 Wave Electronics and its Application in Information and Telecommunication Systems (WECONF)10.1109/WECONF.2019.8840130(1-6)Online publication date: Jun-2019
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media