Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Computer Science and Information Systems 2006 Volume 3, Issue 1, Pages: 23-32
https://doi.org/10.2298/CSIS0601023H
Full text ( 736 KB)
Cited by


Improving categorical data clustering algorithm by weighting uncommon attribute value matches

He Zengyou (Department of Computer Science and Engineering, Harbin Institute of Technology, China)
Xu Xiaofei (Department of Computer Science and Engineering, Harbin Institute of Technology, China)
Deng Shenchun (Department of Computer Science and Engineering, Harbin Institute of Technology, China)

This paper presents an improved Squeezer algorithm for categorical data clustering by giving greater weight to uncommon attribute value matches in similarity computations. Experimental results on real life datasets show that, the modified algorithm is superior to the original Squeezer algorithm and other clustering algorithm with respect to clustering accuracy.