Abstract
Clustering is a data mining activity that aims to differentiate groups inside a given set of objects, with respect to a set of relevant attributes of the analyzed objects. Generally, existing clustering methods, such as k-means algorithm, start with a known set of objects, measured against a known set of attributes. But there are numerous applications where the attribute set characterizing the objects evolves. We propose an incremental, k-means based clustering method, Core Based Incremental Clustering (CBIC), that is capable to re-partition the objects set, when the attribute set increases. The method starts from the partitioning into clusters that was established by applying k-means or CBIC before the attribute set changed. The result is reached more efficiently than running k-means again from the scratch on the feature-extended object set. Experiments proving the method’s efficiency are also reported.
Chapter PDF
Similar content being viewed by others
Key words
References
Serban, G.: A Programming Interface for Non-Hierarchical Clustering, Studia Universitatis “Babes-Bolyai”, Informatica, XLX(1), 2005, to appear.
Serban, G., Campan, A.: Core Based Incremental Clustering, Studia Universitatis “Babes Bolyai”, Informatica, XLXI(2), 2005, to appear.
Wolberg, W., Mangasarian, O.L.: “Multisurface method of pattern separation for medical diagnosis applied to breast cytology”, Proceedings of the National Academy of Sciences, U.S.A., Volume 87, December 1990: 9193–9196.
Demiroz, G., Govenir, H. A., Ilter, N.: “Learning Differential Diagnosis of Eryhemato Squamous Diseases using Voting Feature Intervals”, Artificial Intelligence in Medicine.
Aeberhard, S., Coomans, D., de Vel, O.: “THE CLASSIFICATION PERFORMANCE OF RDA” Tech. Rep. no. 92—01, Dept. of Computer Science and Dept. of Mathematics and Statistics, James Cook University of North Queensland, 1992.
Jain, A., Dubes, R, “Algorithms for Clustering Data”, Prentice Hall, Englewood Cliffs, New Jersey, 1998.
Han, J., Kamber, M., “Data Mining: Concepts and Techniques”, Morgan Kaufmann Publishers, 2001.
http://www.cormactech.com/neunet, “Discover the Patterns in Your Data”, CorMac Technologies Inc, Canada.
Quinlan, J. R., “C4.5: Programs for Machine Learning, Morgan Kaufmann”, 1993.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 International Federation for Information Processing
About this paper
Cite this paper
Câmpan, A., Şerban, G. (2005). A New Incremental Core-Based Clustering Method. In: Li, D., Wang, B. (eds) Artificial Intelligence Applications and Innovations. AIAI 2005. IFIP — The International Federation for Information Processing, vol 187. Springer, Boston, MA. https://doi.org/10.1007/0-387-29295-0_29
Download citation
DOI: https://doi.org/10.1007/0-387-29295-0_29
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-28318-0
Online ISBN: 978-0-387-29295-3
eBook Packages: Computer ScienceComputer Science (R0)