Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
Skip header Section
Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability)May 2007
Publisher:
  • Society for Industrial and Applied Mathematics
  • 3600 University City Science Center Philadelphia, PA
  • United States
ISBN:978-0-89871-623-8
Published:01 May 2007
Skip Bibliometrics Section
Bibliometrics
Abstract

No abstract available.

Cited By

  1. ACM
    Bohlouli M and He Z EMR: Scalable Clustering of Big HR Data using Evolutionary MapReduce Companion Proceedings of the Web Conference 2021, (26-34)
  2. Vela-Rincón V, Ramos-Palencia C, Mújica-Vargas D, Kinani J and Ramos-Díaz E Comparison Methods for Fuzzy C-Means Initialization Applied to Image Segmentation Advances in Computational Intelligence, (350-362)
  3. Su H, Liu S, Zheng B, Zhou X and Zheng K (2019). A survey of trajectory distance measures and performance evaluation, The VLDB Journal — The International Journal on Very Large Data Bases, 29:1, (3-32), Online publication date: 1-Jan-2020.
  4. Kim S and Gil J (2019). Research paper classification systems based on TF-IDF and LDA schemes, Human-centric Computing and Information Sciences, 9:1, (1-21), Online publication date: 1-Dec-2019.
  5. Poddar S and Jacob M (2019). Clustering of Data With Missing Entries Using Non-Convex Fusion Penalties, IEEE Transactions on Signal Processing, 67:22, (5865-5880), Online publication date: 15-Nov-2019.
  6. ACM
    Huang Q, Liang Z, Wang H, Zuo S and Bajaj C (2019). Tensor maps for synchronizing heterogeneous shape collections, ACM Transactions on Graphics, 38:4, (1-18), Online publication date: 31-Aug-2019.
  7. Xue H, Li S, Chen X and Wang Y (2019). A maximum margin clustering algorithm based on indefinite kernels, Frontiers of Computer Science: Selected Publications from Chinese Universities, 13:4, (813-827), Online publication date: 1-Aug-2019.
  8. Yassin N and Farouk M (2019). Application of Quantum-Clustering on Thermograms of WiFi Circuits in Different Operation Modes, Pattern Recognition and Image Analysis, 29:3, (565-571), Online publication date: 1-Jul-2019.
  9. Rizo Rodríguez S and de Assis Tenório de Carvalho F A new fuzzy clustering algorithm for interval-valued data based on City-Block distance 2019 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), (1-6)
  10. Louhichi S, Gzara M and Ben-Abdallah H (2019). MDCUT2, Distributed and Parallel Databases, 37:1, (73-99), Online publication date: 1-Mar-2019.
  11. Mashtalir S, Stolbovyi M and Yakovlev S (2019). Clustering Video Sequences by the Method of Harmonic k-Means, Cybernetics and Systems Analysis, 55:2, (200-206), Online publication date: 1-Mar-2019.
  12. Sameer F, Abu Bakar M, Zaidan A and Zaidan B (2019). A new algorithm of modified binary particle swarm optimization based on the Gustafson-Kessel for credit risk assessment, Neural Computing and Applications, 31:2, (337-346), Online publication date: 1-Feb-2019.
  13. Gowthul Alam M and Baulkani S (2019). Geometric structure information based multi-objective function to increase fuzzy clustering performance with artificial and real-life data, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 23:4, (1079-1098), Online publication date: 1-Feb-2019.
  14. ACM
    Wang C, Chi C, She Z, Cao L and Stantic B (2018). Coupled Clustering Ensemble by Exploring Data Interdependence, ACM Transactions on Knowledge Discovery from Data, 12:6, (1-38), Online publication date: 17-Oct-2018.
  15. Alshamaa D, Chehade F and Honeine P (2018). A hierarchical classification method using belief functions, Signal Processing, 148:C, (68-77), Online publication date: 1-Jul-2018.
  16. Ishak Boushaki S, Kamel N and Bendjeghaba O (2018). A new quantum chaotic cuckoo search algorithm for data clustering, Expert Systems with Applications: An International Journal, 96:C, (358-372), Online publication date: 15-Apr-2018.
  17. ACM
    Souza I, Escarpinati M and Abdala D A curve completion algorithm for agricultural planning Proceedings of the 33rd Annual ACM Symposium on Applied Computing, (284-291)
  18. Kumar A and Kumar S (2018). A Support Based Initialization Algorithm for Categorical Data Clustering, Journal of Information Technology Research, 11:2, (53-67), Online publication date: 1-Apr-2018.
  19. Bijari K, Zare H, Veisi H and Bobarshad H (2018). Memory-enriched big bang---big crunch optimization algorithm for data clustering, Neural Computing and Applications, 29:6, (111-121), Online publication date: 1-Mar-2018.
  20. Wang C, Bo T, Zhao Y, Chi C, Lam K, Wang S, Shu M and Zhang X (2018). Behavior-Interior-Aware User Preference Analysis Based on Social Networks, Complexity, 2018, Online publication date: 1-Jan-2018.
  21. Dutta P and Saha S (2017). Fusion of expression values and protein interaction information using multi-objective optimization for improving gene clustering, Computers in Biology and Medicine, 89:C, (31-43), Online publication date: 1-Oct-2017.
  22. Rodrigues r, Torok L, Liatsis P, Viterbo J and Conci A (2017). k-MS, Pattern Recognition, 66:C, (392-403), Online publication date: 1-Jun-2017.
  23. Peng H, Shi P, Wang J, Riscos-Nez A and Prez-Jimnez M (2017). Multiobjective fuzzy clustering approach based on tissue-like membrane systems, Knowledge-Based Systems, 125:C, (74-82), Online publication date: 1-Jun-2017.
  24. Yu Y, Wang C, Wang H and Gao Y (2017). Attributes coupling based matrix factorization for item recommendation, Applied Intelligence, 46:3, (521-533), Online publication date: 1-Apr-2017.
  25. ACM
    Kang Y, Zhou Y, Xu H and Lyu M DiagDroid: Android performance diagnosis via anatomizing asynchronous executions Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, (410-421)
  26. ACM
    Chen L, Zhang D, Wang L, Yang D, Ma X, Li S, Wu Z, Pan G, Nguyen T and Jakubowicz J Dynamic cluster-based over-demand prediction in bike sharing systems Proceedings of the 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing, (841-852)
  27. Shariat Yazdi H, Angelis L, Kehrer T and Kelter U (2016). A framework for capturing, statistically modeling and analyzing the evolution of software models, Journal of Systems and Software, 118:C, (176-207), Online publication date: 1-Aug-2016.
  28. Zhang Z, Fang H and Wang H (2016). Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth, Journal of Medical Systems, 40:6, (1-9), Online publication date: 1-Jun-2016.
  29. Yıldırım N and Uzunoğlu B Data Mining via Association Rules for Power Ramps Detected by Clustering or Optimization Special Issue on Cyberworlds and Cybersecurity on Transactions on Computational Science XXVIII - Volume 9590, (163-176)
  30. Liu L, Chen X, Liu M, Jia Y, Zhong J, Gao R and Zhao Y (2016). An influence power-based clustering approach with PageRank-like model, Applied Soft Computing, 40:C, (17-32), Online publication date: 1-Mar-2016.
  31. ACM
    Calzarossa M, Massari L and Tessera D (2016). Workload Characterization, ACM Computing Surveys, 48:3, (1-43), Online publication date: 8-Feb-2016.
  32. ACM
    Gourgaris P and Makris C A Density Based k-Means Initialization Scheme Proceedings of the 16th International Conference on Engineering Applications of Neural Networks (INNS), (1-9)
  33. De Jesus Silva L, Claro D and Lopes D (2015). Semantic-based clustering of web services, Journal of Web Engineering, 14:3-4, (325-345), Online publication date: 1-Jul-2015.
  34. ACM
    Lahiouel O, Zaki M and Tahar S Towards enhancing analog circuits sizing using SMT-based techniques Proceedings of the 52nd Annual Design Automation Conference, (1-6)
  35. Sabo K and Scitovski R (2015). An approach to cluster separability in a partition, Information Sciences: an International Journal, 305:C, (208-218), Online publication date: 1-Jun-2015.
  36. Mishulina O and Sukonkin I (2015). Genetic algorithm for data clustering based on SV-criterion, Optical Memory and Neural Networks, 24:2, (82-92), Online publication date: 1-Apr-2015.
  37. Bhardwaj S, Srivastava S and Gupta J (2015). Pattern-Similarity-Based Model for Time Series Prediction, Computational Intelligence, 31:1, (106-131), Online publication date: 1-Feb-2015.
  38. Scitovski R and Marošević T (2015). Multiple circle detection based on center-based clustering, Pattern Recognition Letters, 52:C, (9-16), Online publication date: 15-Jan-2015.
  39. Thomann P, Steinwart I and Schmid N (2015). Towards an axiomatic approach to hierarchical clustering of measures, The Journal of Machine Learning Research, 16:1, (1949-2002), Online publication date: 1-Jan-2015.
  40. Morales-Esteban A, Martínez-Álvarez F, Scitovski S and Scitovski R (2014). A fast partitioning algorithm using adaptive Mahalanobis clustering with application to seismic zoning, Computers & Geosciences, 73:C, (132-141), Online publication date: 1-Dec-2014.
  41. Wikaisuksakul S (2014). A multi-objective genetic algorithm with fuzzy c-means for automatic data clustering, Applied Soft Computing, 24:C, (679-691), Online publication date: 1-Nov-2014.
  42. Hadian A and Shahrivari S (2014). High performance parallel $$k$$k-means clustering for disk-resident datasets on multi-core CPUs, The Journal of Supercomputing, 69:2, (845-863), Online publication date: 1-Aug-2014.
  43. ACM
    Wenzel F and Kießling W Aggregation and Analysis of Enriched Spatial User Models from Location-Based Social Networks Proceedings of Workshop on Managing and Mining Enriched Geo-Spatial Data, (1-6)
  44. Skibski O, Michalak T, Rahwan T and Wooldridge M Algorithms for the shapley and myerson values in graph-restricted games Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems, (197-204)
  45. ACM
    Silva J, Faria E, Barros R, Hruschka E, Carvalho A and Gama J (2013). Data stream clustering, ACM Computing Surveys, 46:1, (1-31), Online publication date: 1-Oct-2013.
  46. Rafiee G, Dlay S and Woo W (2013). Region-of-interest extraction in low depth of field images using ensemble clustering and difference of Gaussian approaches, Pattern Recognition, 46:10, (2685-2699), Online publication date: 1-Oct-2013.
  47. Wang J, Wang S, Chung F and Deng Z (2013). Fuzzy partition based soft subspace clustering and its applications in high dimensional data, Information Sciences: an International Journal, 246, (133-154), Online publication date: 1-Oct-2013.
  48. Lipovetsky S (2013). Finding cluster centers and sizes via multinomial parameterization, Applied Mathematics and Computation, 221:C, (571-580), Online publication date: 15-Sep-2013.
  49. Reyes-Vargas M, Sánchez-Gutiérrez M, Rufiner L, Albornoz M, Vignolo L, Martínez-Licona F and Goddard-Close J Hierarchical Clustering and Classification of Emotions in Human Speech Using Confusion Matrices Proceedings of the 15th International Conference on Speech and Computer - Volume 8113, (162-169)
  50. ACM
    Jiang Y and Duan H Spatial codebook for robust background detection in visual information analysis Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service, (399-402)
  51. Zhai H, Haraguchi M, Okubo Y, Hashimoto K and Hirokawa S Shifting concepts to their associative concepts via bridges Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition, (586-600)
  52. Yu T and Peng H (2013). Hierarchical Clustering of High- Throughput Expression Data Based on General Dependences, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 10:4, (1080-1085), Online publication date: 1-Jul-2013.
  53. Louati A, Aufaure M, Cuvelier E and Pimentel B Soft and Adaptive Aggregation of Heterogeneous Graphs with Heterogeneous Attributes Revised Selected and Invited Papers of the International Workshop on Semantic Web Collaborative Spaces - Volume 9507, (145-180)
  54. Toledano-Kitai D, Avros R, Volkovich Z, Weber G and Yahalom O (2013). A binomial noised model for cluster validation, Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology, 24:3, (417-427), Online publication date: 1-May-2013.
  55. Bilge A and Polat H (2013). A comparison of clustering-based privacy-preserving collaborative filtering schemes, Applied Soft Computing, 13:5, (2478-2489), Online publication date: 1-May-2013.
  56. Liu K, Xu J, Zhang L, Ding Z and Li M (2013). Discovering hot topics from geo-tagged video, Neurocomputing, 105, (90-99), Online publication date: 1-Apr-2013.
  57. Pimentel B and De Souza R (2013). A multivariate fuzzy c-means method, Applied Soft Computing, 13:4, (1592-1607), Online publication date: 1-Apr-2013.
  58. Tan K, Mat Isa N and Lim W (2013). Color image segmentation using adaptive unsupervised clustering approach, Applied Soft Computing, 13:4, (2017-2036), Online publication date: 1-Apr-2013.
  59. Lerman L, Medeiros S, Veshchikov N, Meuter C, Bontempi G and Markowitch O Semi-Supervised template attack Proceedings of the 4th international conference on Constructive Side-Channel Analysis and Secure Design, (184-199)
  60. Liu S, Matzavinos A and Sethuraman S (2013). Random walk distances in data clustering and applications, Advances in Data Analysis and Classification, 7:1, (83-108), Online publication date: 1-Mar-2013.
  61. Costa A, Pimentel B and Souza R (2013). Clustering interval data through kernel-induced feature space, Journal of Intelligent Information Systems, 40:1, (109-140), Online publication date: 1-Feb-2013.
  62. Saez M, González-Vázquez S, González-Penedo M, Barceló M, Pena-Seijo M, Coll de Tuero G and Pose-Reino A (2012). Development of an automated system to classify retinal vessels into arteries and veins, Computer Methods and Programs in Biomedicine, 108:1, (367-376), Online publication date: 1-Oct-2012.
  63. Garcia-Piquer A, Fornells A, Orriols-Puig A, Corral G and Golobardes E (2012). Data classification through an evolutionary approach based on multiple criteria, Knowledge and Information Systems, 33:1, (35-56), Online publication date: 1-Oct-2012.
  64. Kaleli C and Polat H (2012). Privacy-preserving SOM-based recommendations on horizontally distributed data, Knowledge-Based Systems, 33, (124-135), Online publication date: 1-Sep-2012.
  65. Gazi V and Kayis E Comparing Clustering Techniques for Real Microarray Data Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), (788-791)
  66. Ben-Arieh D and Gullipalli D (2012). Data Envelopment Analysis of clinics with sparse data, Computers and Industrial Engineering, 63:1, (13-21), Online publication date: 1-Aug-2012.
  67. Zanda A, Eibe S and Menasalvas E (2012). SOMAR, Expert Systems with Applications: An International Journal, 39:9, (8423-8429), Online publication date: 1-Jul-2012.
  68. Aridhi H, Zaki M and Tahar S Towards improving simulation of analog circuits using model order reduction Proceedings of the Conference on Design, Automation and Test in Europe, (1337-1342)
  69. Lo J (2012). A cortex-like learning machine for temporal hierarchical pattern clustering, detection, and recognition, Neurocomputing, 78:1, (89-103), Online publication date: 1-Feb-2012.
  70. David G and Averbuch A (2012). SpectralCAT, Pattern Recognition, 45:1, (416-433), Online publication date: 1-Jan-2012.
  71. Thomas J A new clustering algorithm based on k-means using a line segment as prototype Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications, (638-645)
  72. ACM
    Wang C, Cao L, Wang M, Li J, Wei W and Ou Y Coupled nominal similarity in unsupervised learning Proceedings of the 20th ACM international conference on Information and knowledge management, (973-978)
  73. ACM
    Zhang W, Yang Y and Wang Q Handling missing data in software effort prediction with naive Bayes and EM algorithm Proceedings of the 7th International Conference on Predictive Models in Software Engineering, (1-10)
  74. Ashour W and Sunoallah S Multi density DBSCAN Proceedings of the 12th international conference on Intelligent data engineering and automated learning, (446-453)
  75. Li A, Haraguchi M and Okubo Y Contrasting correlations by an efficient double-clique condition Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition, (469-483)
  76. Jiang H, Li J, Yi S, Wang X and Hu X (2011). A new hybrid method based on partitioning-based DBSCAN and ant clustering, Expert Systems with Applications: An International Journal, 38:8, (9373-9381), Online publication date: 1-Aug-2011.
  77. ACM
    Yan X, Lau R, Song D, Li X and Ma J (2011). Toward a semantic granularity model for domain-specific information retrieval, ACM Transactions on Information Systems, 29:3, (1-46), Online publication date: 1-Jul-2011.
  78. Elkamel A, Gzara M and Ben-Abdallah H Clustering aggregation for improving ant based clustering Proceedings of the Second international conference on Advances in swarm intelligence - Volume Part I, (250-259)
  79. ACM
    Gullo F, Domeniconi C and Tagarelli A Advancing data clustering via projective clustering ensembles Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, (733-744)
  80. Luo T and Zhong C A neighborhood density estimation clustering algorithm based on minimum spanning tree Proceedings of the 5th international conference on Rough set and knowledge technology, (557-565)
  81. Drapała J, Żatuchin D and Sobecki J Multidimensional data visualization applied for user's questionnaire data quality assessment Proceedings of the 4th KES international conference on Agent and multi-agent systems: technologies and applications, Part I, (351-360)
  82. Vázquez S, Barreira N, Penedo M, Saez M and Pose-Reino A Using retinex image enhancement to improve the artery/vein classification in retinal images Proceedings of the 7th international conference on Image Analysis and Recognition - Volume Part II, (50-59)
  83. Pereira J, Schmidt F, Contreras P, Murtagh F and Astudillo H Clustering and semantics preservation in cultural heritage information spaces Adaptivity, Personalization and Fusion of Heterogeneous Information, (100-105)
  84. ACM
    Miljenovic I The SourceGraph program Proceedings of the 2010 ACM SIGPLAN workshop on Partial evaluation and program manipulation, (151-154)
  85. Estrada E and Hatano N (2009). Communicability graph and community structures in complex networks, Applied Mathematics and Computation, 214:2, (500-511), Online publication date: 1-Aug-2009.
  86. Guimarães F, Wanner E and Takahashi R A quality metric for multi-objective optimization based on hierarchical clustering techniques Proceedings of the Eleventh conference on Congress on Evolutionary Computation, (3292-3299)
  87. Phillips R, Watson L and Wynne R Examining the potential parallel scalability of a fuzzy semi-supervised classification algorithm Proceedings of the 2009 Spring Simulation Multiconference, (1-7)
  88. Li A, Haraguchi M and Okubo Y Implicit Groups of Web Pages as Constrained Top N Concepts Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03, (190-194)
  89. Dráždilová P, Martinovic J, Slaninová K and Snášel V Analysis of Relations in eLearning Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03, (373-376)
  90. Gan G and Wu J (2008). A convergence theorem for the fuzzy subspace clustering (FSC) algorithm, Pattern Recognition, 41:6, (1939-1947), Online publication date: 1-Jun-2008.
  91. Sannen D, Nuttin M, Smith J, Tahir M, Caleb-Solly P, Lughofer E and Eitzinger C An on-line interactive self-adaptive image classification framework Proceedings of the 6th international conference on Computer vision systems, (171-180)
  92. ACM
    Phillips R, Watson L and Wynne R A study of fuzzy clustering within the IGSCR framework Proceedings of the 46th annual ACM Southeast Conference, (140-145)
Contributors
  • Guilin University of Technology

Reviews

Fatih Kurugollu

Data clustering seeks to partition data into subsets that contain data showing common properties. It is an indispensable tool in many applications, such as data analysis, pattern analysis, image processing, machine learning, and data mining. In this book, the authors examine data-clustering problems systematically. They present the existing data-clustering methods in a logical and understandable way, without going into too deep theoretical analysis. Methods are explained clearly, with enough mathematical formulation, and pseudocode for most of the algorithms is provided. In this sense, I really like how this book provides a quick reference for the clustering algorithms. I believe many researchers and engineers who need to implement a clustering algorithm will find this book very useful. However, it may not be appropriate for one who seeks a more theoretical explanation of data clustering. I do not think it can be used as a textbook in the classroom, since the book suffers from a lack of rigorous examples and questions for each chapter. However, it can be used as supplementary reading. The book is organized into four major parts, containing a total of 20 chapters. The first part is devoted to a general introduction to data clustering and related matters. Data clustering is explained in the first chapter, with required background information for the next chapters. Data types, conversions in data, standardization and transformation data before applying it to an algorithm, and the problem of visualizing the structure of data are presented in chapters 2 to 5. The last chapter of this part, chapter 6, is reserved for explanations of the similarity and dissimilarity measures that are used in the clustering algorithms. The second part, chapters 7 to 17, is the main part of the book, presenting the clustering algorithms. In each chapter, a different clustering methodology is examined, providing enough mathematical background and pseudocode for the algorithms in this category. The final chapter in this part, chapter 17, introduces the evaluation of clustering algorithms, which is of paramount importance in unsupervised clustering schemes. Since there is no prior information about the clusters, cluster shapes, and number of clusters in the data in unsupervised clustering, the result should be validated in terms of some criteria. This chapter explains the methods and criteria that are used to test the performance of clustering algorithms, as well as the ones used to estimate the number of clusters in the given data set. The third part is reserved for applications of clustering algorithms. Although the name of the part implies different applications, there is only one application: clustering gene expression data, which is considered as a case study. The background for the problem is introduced, and then a fuzzy subspace clustering algorithm is applied to the problem by providing full C++ code. The results are tabulated using a small MATLAB program, showing the closeness of data in each cluster after the clustering. In the final part of the book, some MATLAB and C++ based tools for clustering are presented. Chapter 19 offers preliminary information and techniques, as well as some built-in functions in MATLAB that are useful for clustering. Finally, in chapter 20, clustering tools in C++ are briefly presented. Online Computing Reviews Service

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.

Recommendations