Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/380752.380754acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
Article

Approximating min-sum k-clustering in metric spaces

Published: 06 July 2001 Publication History

Abstract

The min-sum k-clustering problem in a metric space is to find a partition of the space into k clusters as to minimize the total sum of distances between pairs of points assigned to the same cluster. We give the first polynomial time non-trivial approximation algorithm for this problem. The algorithm provides an $\ratio$ approximation to the min-sum k-clustering problem in general metric spaces, with running time $\runtime$. The result is based on embedding of metric spaces into hierarchically separated trees. We also provide a bicriteria approximation result that provides a constant approximation factor solution with only a constant factor increase in the number of clusters. This result is obtained by modifying and drawing ideas from recently developed primal dual approximation algorithms for facility location.

References

[1]
Y. Bartal. Probabilistic approximation of metric spaces and its algorithmic application. In Proceedings of the 37th Annual Symposium on Foundations of Computer Science, pages 184-193, 1996.
[2]
Y. Bartal. On approximating arbitrary metrics by tree metrics. In Proceedings of the 30th Annual ACM Symposium on Theory of Computing, pages 161-168, 1998.
[3]
M. Charikar, C. Chekuri, A. Goel, S. Guha, and S. Plotkin. Approximating an arbitrary metric by a small number of tree metrics. In Proceedings of the 39th Annual Symposium on Foundations of Computer Science, pages 379-388, 1998.
[4]
N. Guttman-Beck and R. Hassin. Approximation algorithms for min-sum p-clustering. Discrete Applied Mathematics, 89:125-142, 1998.
[5]
P. Indyk. A sublinear time approximation scheme for clustering in metric spaces. In Proceedings of the 40th Annual Symposium on Foundations of Computer Science, pages 154-159, 1999.
[6]
K. Jain and V. Vazirani. Primal-dual approximation algorithms for metric facility location and k-median problems. In Proceedings of the 40th Annual Symposium on Foundations of Computer Science, pages 2-13, 1999. to appear in Journal of the ACM.
[7]
V. Kann, S. Khanna, J. Lagergren, and A. Panconesi. On the hardness of max k-cut and its dual. In Israeli Symposium on Theoretical Computer Science, pages 61-67, 1996.
[8]
M. V. Marathe, R. Ravi, R. Sundaram, S. S. Ravi, D. J. Rosenkrantz, and H. B. Hunt. Bicriteria network design problems. Journal of Algorithms, 28(1):142-171, 1998.
[9]
S. Sahni and T. Gonzalez. P-complete approximation problems. Journal of the ACM, 23:555-566, 1976.
[10]
L. Schulman. Clustering for edge-cost minimization. In Proceedings of the 32nd Annual ACM Symposium on Theory of Computing, 2000.

Cited By

View all
  • (2023)Introducing the Partial Clustering Problem2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386398(6148-6151)Online publication date: 15-Dec-2023
  • (2021)On approximability of clustering problems without candidate centersProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458220(2635-2648)Online publication date: 10-Jan-2021
  • (2021)A Multi-population BRKGA for the Automatic Clustering Problem2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC52423.2021.9658983(368-373)Online publication date: 17-Oct-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
STOC '01: Proceedings of the thirty-third annual ACM symposium on Theory of computing
July 2001
755 pages
ISBN:1581133499
DOI:10.1145/380752
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 July 2001

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

STOC01
Sponsor:

Acceptance Rates

STOC '01 Paper Acceptance Rate 83 of 230 submissions, 36%;
Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)77
  • Downloads (Last 6 weeks)6
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Introducing the Partial Clustering Problem2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386398(6148-6151)Online publication date: 15-Dec-2023
  • (2021)On approximability of clustering problems without candidate centersProceedings of the Thirty-Second Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3458064.3458220(2635-2648)Online publication date: 10-Jan-2021
  • (2021)A Multi-population BRKGA for the Automatic Clustering Problem2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC)10.1109/SMC52423.2021.9658983(368-373)Online publication date: 17-Oct-2021
  • (2020)Massively Distributed Graph DistancesIEEE Transactions on Signal and Information Processing over Networks10.1109/TSIPN.2020.30220036(667-683)Online publication date: 2020
  • (2019)The Informativeness of $k$ -Means for Learning Mixture ModelsIEEE Transactions on Information Theory10.1109/TIT.2019.292756065:11(7460-7479)Online publication date: Nov-2019
  • (2019)Approximation Algorithms for Min-Sum k-Clustering and Balanced k-MedianAlgorithmica10.1007/s00453-018-0454-181:3(1006-1030)Online publication date: 1-Mar-2019
  • (2016)Approximating Metric Spaces by Tree MetricsEncyclopedia of Algorithms10.1007/978-1-4939-2864-4_25(113-116)Online publication date: 22-Apr-2016
  • (2015)TEGRAProceedings of the 2015 ACM SIGMOD International Conference on Management of Data10.1145/2723372.2723725(1713-1728)Online publication date: 27-May-2015
  • (2015)Interactive Clustering of Linear Classes and Cryptographic Lower BoundsProceedings of the 26th International Conference on Algorithmic Learning Theory - Volume 935510.1007/978-3-319-24486-0_11(165-176)Online publication date: 4-Oct-2015
  • (2013)Clustering under approximation stabilityJournal of the ACM10.1145/2450142.245014460:2(1-34)Online publication date: 3-May-2013
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media