Abstract
This paper introduces a clustering algorithm for Chinese text that combines a SOM (Self-Organizing Map) neural network with density-based clustering. The algorithm has two stages. In the first stage, Chinese texts are transformed into text vectors, which serve as training data for the SOM; mapping the vectors through the trained SOM yields an initial clustering of the text data in the form of a set of virtual coordinates. In the second stage, this set of virtual coordinates is clustered further according to density. The first stage differs from existing algorithms, and the second stage takes less computing time than other algorithms because it works on data of reduced dimension. Numerical experiments show that the algorithm is efficient for clustering text data and high-dimensional data.
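The two-stage idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the SOM variant or the density method, so the sketch assumes a basic rectangular SOM and a DBSCAN-style density step. The function names (`train_som`, `map_to_coordinates`, `density_cluster`), all parameter values, and the random toy vectors standing in for Chinese text vectors are hypothetical.

```python
import numpy as np

def train_som(data, rows=6, cols=6, epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Train a small rectangular SOM; returns weights of shape (rows, cols, dim)."""
    rng = np.random.default_rng(seed)
    weights = rng.random((rows, cols, data.shape[1]))
    # grid[r, c] == (r, c); used for neighbourhood distances on the map
    grid = np.stack(np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij"), axis=-1)
    total, step = epochs * len(data), 0
    for _ in range(epochs):
        for x in data[rng.permutation(len(data))]:
            frac = step / total
            lr = lr0 * (1.0 - frac)                 # decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 0.5     # shrinking neighbourhood
            dists = np.linalg.norm(weights - x, axis=2)
            bmu = np.unravel_index(np.argmin(dists), dists.shape)
            grid_d2 = np.sum((grid - np.array(bmu)) ** 2, axis=2)
            h = np.exp(-grid_d2 / (2.0 * sigma ** 2))
            weights += lr * h[..., None] * (x - weights)
            step += 1
    return weights

def map_to_coordinates(data, weights):
    """Stage 1 output: 2-D grid coordinates of each vector's best-matching unit."""
    flat = weights.reshape(-1, weights.shape[2])
    d = np.linalg.norm(data[:, None, :] - flat[None, :, :], axis=2)
    idx = np.argmin(d, axis=1)
    return np.stack(np.unravel_index(idx, weights.shape[:2]), axis=1).astype(float)

def density_cluster(points, eps=1.5, min_pts=3):
    """Stage 2 (assumed DBSCAN-style): label -1 marks noise."""
    n = len(points)
    dist = np.linalg.norm(points[:, None] - points[None, :], axis=2)
    neighbours = [np.flatnonzero(dist[i] <= eps) for i in range(n)]
    labels = np.full(n, -1)
    visited = np.zeros(n, dtype=bool)
    cluster = 0
    for i in range(n):
        if visited[i]:
            continue
        visited[i] = True
        if len(neighbours[i]) < min_pts:
            continue                                # not a core point (may become border later)
        labels[i] = cluster
        queue = list(neighbours[i])
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster
            if not visited[j]:
                visited[j] = True
                if len(neighbours[j]) >= min_pts:   # expand only through core points
                    queue.extend(neighbours[j])
        cluster += 1
    return labels

# Toy stand-in for text vectors: two groups of 20-dimensional term weights.
rng = np.random.default_rng(1)
group_a = rng.random((20, 20)) * 0.2
group_a[:, :10] += 0.8
group_b = rng.random((20, 20)) * 0.2
group_b[:, 10:] += 0.8
vectors = np.vstack([group_a, group_b])

som = train_som(vectors)
coords = map_to_coordinates(vectors, som)   # virtual coordinates (2-D)
labels = density_cluster(coords)            # density clustering in 2-D
print(coords[:3], labels)
```

In this sketch the density step computes pairwise distances between 2-D map coordinates rather than between the original 20-dimensional vectors, which is the dimensionality reduction the abstract credits for the lower computing time.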
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Meng, Z., Zhu, H., Zhu, Y., Zhou, G. (2005). A Clustering Algorithm for Chinese Text Based on SOM Neural Network and Density. In: Wang, J., Liao, XF., Yi, Z. (eds) Advances in Neural Networks – ISNN 2005. ISNN 2005. Lecture Notes in Computer Science, vol 3497. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11427445_40
DOI: https://doi.org/10.1007/11427445_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25913-8
Online ISBN: 978-3-540-32067-8
eBook Packages: Computer Science (R0)