Abstract
Given a data set in metric space, we study the problem of hierarchical clustering to minimize the maximum cluster diameter, and the hierarchical k-supplier problem with customers arriving online.
We prove that two previously known algorithms for hierarchical clustering, one (offline) due to Dasgupta and Long and the other (online) due to Charikar, Chekuri, Feder and Motwani, are essentially the same algorithm when points are considered in the same order. We show that the analysis of both algorithms are tight and exhibit a new lower bound for hierarchical clustering. Finally we present the first constant factor approximation algorithm for the online hierarchical k-supplier problem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Arabie, P., Hubert, L.J., De Soete, G. (eds.): Clustering and Classification. World Scientific, River Edge (1998)
Charikar, M., Chekuri, C., Feder, T., Motwani, R.: Incremental clustering and dynamic information retrieval. In: Proceedings of the 29th Annual ACM Symposium on the Theory of Computing, pp. 626–635 (1997)
Chrobak, M., Kenyon, C., Noga, J., Young, N.E.: Online Medians via Online Bribery. In: Correa, J.R., Hevia, A., Kiwi, M. (eds.) LATIN 2006. LNCS, vol. 3887, pp. 311–322. Springer, Heidelberg (2006)
Dasgupta, S., Long, P.: Performance guarantees for hierarchical clustering. Journal of Computer and System Sciences 70(4), 555–569 (2005)
Duda, R.O., Hart, P.E., Sork, D.G.: Pattern Classification. Wiley and Sons, Chichester (2001)
Dyer, M.E., Frieze, A.M.: A simple heuristic for the p-center problem. Operations Research Letters 3, 285–288 (1985)
Gonzalez, T.F.: Clustering to Minimize the Maximum Intercluster Distance. In: Proceedings of the 17th Annual ACM Symposium on the Theory of Computing, vol. 38, pp. 293–306 (1985)
Hochbaum, D.S.: Various Notions of Approximations: Good, Better, Best and More. In: Hochbaum, D.S. (ed.) Approximation Algorithms for NP-Hard Problems. PWS Publishing Company (1996)
Hochbaum, D.S., Shmoys, D.B.: A best possible heuristic for the k-center problem. Mathematics of Operations Research 10, 180–184 (1985)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs (1988)
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, NY (1990)
Lin, G., Nagarajan, C., Rajamaran, R., Williamson, D.P.: A general approach for incremental approximation and hierarchical clustering. In: Proceedings of the seventeenth annual ACM-SIAM Symposium on Discrete algorithm (SODA), pp. 1147–1156 (2006)
NSF Workshop Report on Emerging Issues in Aerosol Particle Science and Technology (NAST), UCLA, 2003, Chapter 1, Section 18, Improved and rapid data analysis tools (Chemical Characterization), Available at http://www.nano.gov/html/res/NSFAerosolParteport.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Das, A., Kenyon, C. (2007). On Hierarchical Diameter-Clustering, and the Supplier Problem. In: Erlebach, T., Kaklamanis, C. (eds) Approximation and Online Algorithms. WAOA 2006. Lecture Notes in Computer Science, vol 4368. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11970125_11
Download citation
DOI: https://doi.org/10.1007/11970125_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69513-4
Online ISBN: 978-3-540-69514-1
eBook Packages: Computer ScienceComputer Science (R0)