Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/2448556.2448628acmconferencesArticle/Chapter ViewAbstractPublication PagesicuimcConference Proceedingsconference-collections
research-article

A data partitioning approach for hierarchical clustering

Published: 17 January 2013 Publication History

Abstract

In this paper, we propose a parameter-insensitive data partitioning approach for Chameleon, a hierarchical clustering algorithm. The proposed method splits a given dataset into every possible number of clusters by using existing algorithms that do allow arbitrary-sized sub-clusters in partitioning. After that, it evaluates the quality of every set of initial sub-clusters by using our measurement function, and decides the optimal set of initial sub-clusters such that they show the highest value of measurement. Finally, it merges these optimal initial sub-clusters repeatedly and produces the final clustering result. We perform extensive experiments, and the results show that the proposed approach is insensitive to parameters and also produces a set of final clusters whose quality is better than the previous one.

References

[1]
J. Han and M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2006.
[2]
X. Yin, J. Han, and P. Yu, "LinkClus: Efficient Clustering via Heterogeneous Semantic Links," In Proc. of Int'l. Conf. on Very Large Data Bases, pp. 427--438, 2006.
[3]
T. Li and S. Anand, "DIVA: A Variance-based Clustering Approach for Multi-type Relational Data," In Proc. of ACM Int'l. Conf. on Information and Knowledge Management, pp. 147--156, 2007.
[4]
L. Kaufman and P. J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, John Wiley & Sons, 1990.
[5]
T. Zhang, R. Ramakrishnan, and M. Livny, "BIRCH: An Efficient Data Clustering Method for Very Large Databases," ACM SIGMOD Record, Vol. 25, No. 2, pp. 103--114, 1996.
[6]
S. Guha, R. Rastogi, and K. Shim, "ROCK: A Robust Clustering Algorithm for Categorical Attributes," In Proc. of IEEE Int'l. Conf. Data Engineering, pp. 512--521, 1999.
[7]
S. Guha, R. Rastogi, and K. Shim, "CURE: An Efficient Clustering Algorithm for Large Databases," In Proc. of ACM SIGMOD Int'l. Conf. Management of Data, pp. 73--84, 1998.
[8]
G. Karypis, E.-H. Han, and V. Kumar, "Chameleon: Hierarchical Clustering using Dynamic Modeling," IEEE Computer, Vol. 32, pp. 68--74, 1999.
[9]
G. Karypis, and V. Kumar, "A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs," Journal of Society of Industrial and Applied Mathematics, Vol. 20, No. 1, pp. 359--392, 2008.
[10]
J. Shi and J. Malik, "Normalized Cuts and Image Segmentation," In Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 731--737, 1997.
[11]
A. Likas, N. Vlassis, and J. Verbeek, "The Global K-means Clustering Algorithm," Pattern Recognition, Vol. 36, No. 2, pp. 451--461, 2003.
[12]
R. Ng and J. Han, "Efficient and Effective Clustering Method for Spatial Data Mining," In Proc. of Int'l Conf. Very Large Data Bases, pp. 144--155, 1994.

Cited By

View all
  • (2021)Prediction of the Target User Segment for Paid Advertisement2021 IEEE 8th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)10.1109/UPCON52273.2021.9667580(1-5)Online publication date: 11-Nov-2021

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICUIMC '13: Proceedings of the 7th International Conference on Ubiquitous Information Management and Communication
January 2013
772 pages
ISBN:9781450319584
DOI:10.1145/2448556
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 January 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data partitioning
  2. hierarchical clustering
  3. parameter-insensitive

Qualifiers

  • Research-article

Funding Sources

Conference

ICUIMC '13
Sponsor:

Acceptance Rates

Overall Acceptance Rate 251 of 941 submissions, 27%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Prediction of the Target User Segment for Paid Advertisement2021 IEEE 8th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)10.1109/UPCON52273.2021.9667580(1-5)Online publication date: 11-Nov-2021

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media