Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
article

Local search approximation algorithms for the k-means problem with penalties

Published: 01 February 2019 Publication History

Abstract

In this paper, we study the k-means problem with (nonuniform) penalties (k-MPWP) which is a natural generalization of the classic k-means problem. In the k-MPWP, we are given an n-client set $$ {\mathcal {D}} \subset {\mathbb {R}}^d$$D?Rd, a penalty cost $$p_j>0$$pj>0 for each $$j \in {\mathcal {D}}$$j?D, and an integer $$k \le n$$k≤n. The goal is to open a center subset $$F \subset {\mathbb {R}}^d$$F?Rd with $$ |F| \le k$$|F|≤k and to choose a client subset $$P \subseteq {\mathcal {D}} $$P⊆D as the penalized client set such that the total cost (including the sum of squares of distance for each client in $$ {\mathcal {D}} \backslash P $$D\P to the nearest open center and the sum of penalty cost for each client in P) is minimized. We offer a local search $$( 81+ \varepsilon )$$(81+?)-approximation algorithm for the k-MPWP by using single-swap operation. We further improve the above approximation ratio to $$( 25+ \varepsilon )$$(25+?) by using multi-swap operation.

References

[1]
Ahmadian S, Norouzi-Fard A, Svensson O, Ward J (2017) Better guarantees for k-means and Euclidean k-median by primal-dual algorithms. In: Proceedings of FOCS, pp 61-72.
[2]
Aloise D, Deshpande A, Hansen P, Popat P (2009) NP-hardness of Euclidean sum-of-squares clustering. Mach Learn 75:245-249.
[3]
Arya V, Garg N, Khandekar R, Meyerson A, Munagala K, Pandit V (2004) Local search heuristics for k-median and facility location problems. SIAM J Comput 33:544-562.
[4]
Bandyapadhyay S, Varadarajan K (2016) On variants of k-means clustering. In: Proceedings of SoCG, article no. 14:14:1-14:15.
[5]
Byrka J, Pensyl T, Rybicki B, Srinivasan A, Trinh K (2017) An improved approximation for k-median, and positive correlation in budgeted optimization. ACM Transactions on Algorithms, 13(2): Article No. 23.
[6]
Charikar M, Guha S (1999) Improved combinatorial algorithms for the facility location and k-median problems. In: Proceedings of FOCS, pp 378-388.
[7]
Charikar M, Guha S, Tardos É, Shmoys DB (1999) A constant-factor approximation algorithm for the k-median problem. In: Proceedings of STOC, pp 1-10.
[8]
Charikar M, Khuller S, Mount DM, Narasimhan G (2001) Algorithms for facility location problems with outliers. In: Proceedings of SODA, pp 642-651.
[9]
Dasgupta S (2007) The hardness of k-means clustering. Technical report CS2007-0890, University of California, San Diego.
[10]
Georgogiannis A (2016) Robust k-means: a theoretical revisit. In: Proceedings of NIPS, pp 2883-2891.
[11]
Jain K, Vazirani VV (2001) Approximation algorithms for metric facility location and k-median problems using the primal-dual schema and Lagrangian relaxation. J ACM 48:274-296.
[12]
Kanungo T, Mount DM, Netanyahu NS, Piatko CD, Silverman R, Wu AY (2004) A local search approximation algorithm for k-means clustering. Comput Geom Theory Appl 28:89-112.
[13]
Li Y, Du D, Xiu N, Xu D (2015) Improved approximation algorithms for the facility location problems with linear/submodular penalties. Algorithmica 73:460-482.
[14]
Li S, Svensson O (2016) Approximating k-median via pseudo-approximation. SIAM J Comput 45:530-547.
[15]
Lloyd S (1957) Least squares quantization in PCM. Technical report, Bell Laboratories.
[16]
Lloyd S (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28:129-137.
[17]
Mahajan M, Nimbhorkar P, Varadarajan K (2009) The planar k-means problem is NP-hard. In: Proceedings of WALCOM, pp 274-285.
[18]
Makarychev K, Makarychev Y, Sviridenko M, Ward J (2016) A bi-criteria approximation algorithm for k-means. In: Proceedings of APPROX/RONDOM, article no. 14, pp 14:1-14:20.
[19]
Matou?ek J (2000) On approximate geometric k-clustering. Discrete Comput Geom 24:61-84.
[20]
Tseng GC (2007) Penalized and weighted k-means for clustering with scattered objects and prior information in high-throughput biological data. Bioinformatics 23:2247-2255.
[21]
Wang Y, Xu D, Du D, Wu C An approximation algorithm for k-facility location problem with linear penalties using local search scheme. J Comb Optim.
[22]
Ward J (2017) Private communication.
[23]
Zhang P (2007) A new approximation algorithm for the k-facility location problem. Theor Comput Sci 384:126-135.

Cited By

View all
  1. Local search approximation algorithms for the k-means problem with penalties

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Journal of Combinatorial Optimization
    Journal of Combinatorial Optimization  Volume 37, Issue 2
    February 2019
    333 pages

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 01 February 2019

    Author Tags

    1. Approximation algorithm
    2. Local search
    3. Penalty
    4. k-means

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 26 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Approximation Algorithms for Robust Clustering Problems Using Local Search TechniquesTheory and Applications of Models of Computation10.1007/978-981-97-2340-9_17(197-208)Online publication date: 13-May-2024
    • (2023)Approximation algorithms for the capacitated correlation clustering problem with penaltiesJournal of Combinatorial Optimization10.1007/s10878-022-00930-645:1Online publication date: 1-Jan-2023
    • (2022)The bi-criteria seeding algorithms for two variants of k-means problemJournal of Combinatorial Optimization10.1007/s10878-020-00537-944:3(1693-1704)Online publication date: 1-Oct-2022
    • (2021)MOEA/UENeurocomputing10.1016/j.neucom.2020.04.149458:C(535-545)Online publication date: 11-Oct-2021
    • (2021)Approximation Algorithm for the Capacitated Correlation Clustering Problem with PenaltiesAlgorithmic Aspects in Information and Management10.1007/978-3-030-93176-6_2(15-26)Online publication date: 20-Dec-2021
    • (2020)An Improved Approximation Algorithm for the Prize-Collecting Red-Blue Median ProblemTheory and Applications of Models of Computation10.1007/978-3-030-59267-7_9(94-106)Online publication date: 18-Oct-2020
    • (2020)A Primal-Dual Algorithm for Euclidean k-Means Problem with PenaltiesTheory and Applications of Models of Computation10.1007/978-3-030-59267-7_32(377-389)Online publication date: 18-Oct-2020

    View Options

    View options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media