Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Multivariate fuzzy k-modes algorithm

  • Theoretical Advances
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

In the fuzzy k-modes clustering, there is just one membership degree of interest by class for each individual which cannot be sufficient to model ambiguity of data precisely. It is known that the essence of a multivariate thinking allows to expose the inherent structure and meaning revealed within a set of variables classified. In this paper, a multivariate approach for membership degrees is presented to better handle ambiguous data that share properties of different clusters. This method is compared with other fuzzy k-modes methods of the literature based on a multivariate internal index that is also proposed in this paper. Synthetic and real categorical data sets are considered in this study.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Dodge Y (2008) The concise encyclopedia of statistics. Springer, New York

    MATH  Google Scholar 

  2. Kaufman L, Rouseeuw PJ (1990) Finding groups in data: an introduction to cluster analysis. Wiley, New York

    Book  Google Scholar 

  3. Xu R, Wunsch D (2005) Survey of clustering algorithms. Trans Neural Netw 16:645–678

    Article  Google Scholar 

  4. Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, New York

    Book  MATH  Google Scholar 

  5. Ismail MA, Selim SZ (1986) Fuzzy K-means: optimality of solutions and effective termination of the problem. Pattern Recognit 19:481–485

    Article  MATH  Google Scholar 

  6. Simar L, Hardle W (2007) Applied multivariate statistical analysis, 2nd edn. Springer, New York

    MATH  Google Scholar 

  7. Ralambondrainy H (1995) A conceptual version of the k-means algorithm. Pattern Recognit Lett 16:1147–1157

    Article  Google Scholar 

  8. Huang Z (1997) Clustering large data sets with mixed numeric and categorical values. In: Proceedings of the first Pacific Asia knowledge discovery and data mining conference, vol 4, pp 21–34

  9. Huang Z, Ng MK (1999) A fuzzy k-modes algorithm for clustering categorical data. Trans Fuzzy Syst 7:446–452

    Article  Google Scholar 

  10. Zadeh L (1995) Fuzzy sets. Inf Control 3:338–353

    MathSciNet  MATH  Google Scholar 

  11. Ruspini EH (1969) A new approach to clustering. Inf Control 15:22–32

    Article  MATH  Google Scholar 

  12. Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum Press, New York

    Book  MATH  Google Scholar 

  13. Tamura S, Higuchi S, Tanaka K (1971) Pattern classification based on fuzzy relations. Trans Syst 1:61–66

    MathSciNet  MATH  Google Scholar 

  14. Pimentel BA, Souza RMCR (2013) A multivariate fuzzy c-means method. Appl Soft Comput 13(4):1592–1607

    Article  Google Scholar 

  15. Trigo MM (2005) Using fuzzy k-modes to analyse patterns of system calls for intrusion detection. Thesis, California State University

  16. Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:283–286

    Article  MATH  Google Scholar 

  17. Campello RJGB, Hruschka ER (2006) A fuzzy extension of the silhouette width criterion for cluster analysis. Fuzzy Sets Syst 157:2858–2875

    Article  MathSciNet  MATH  Google Scholar 

  18. Hullermeier E, Rifqi M, Henzgen S, Senge R (2012) Comparing fuzzy partitions: a generalization of the Rand index and related measures. Trans Fuzzy Syst 20:546–556

    Article  Google Scholar 

  19. Rand WM (1971) Objective criteria for the evaluation of clustering methods. J Am Stat Assoc 66:846–850

    Article  Google Scholar 

  20. Mingoti SA, Matos RA (2012) Clustering algorithms for categorical data: a Monte Carlo study. Int J Stat Appl 2:24–32

    Google Scholar 

Download references

Acknowledgments

The authors would like to thank Brazilian agencies CNPq (National Council for Scientific and Technological Development) and CAPES (Coordination for the Improvement of Higher Education Personnel) for financial support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Renata M. C. R. de Souza.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maciel, D.B.M., Amaral, G.J.A., de Souza, R.M.C.R. et al. Multivariate fuzzy k-modes algorithm. Pattern Anal Applic 20, 59–71 (2017). https://doi.org/10.1007/s10044-015-0465-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10044-015-0465-3

Keywords