Abstract
Consensus clustering is the problem of reconciling clustering information about the same data set obtained from different runs of the same algorithm. It is becoming a state-of-the-art approach in a growing number of applications. However, determining the optimal number of clusters remains an open problem. In this paper, we propose a novel consensus clustering algorithm based on the Minkowski distance. Combined with the Newman greedy algorithm from complex network analysis, the proposed algorithm can set the number of clusters automatically. It is less sensitive to noise and can integrate solutions from multiple samples of data or attributes when handling data from the process industry. A numerical simulation is given to demonstrate the effectiveness of the proposed algorithm. Finally, the algorithm is applied to a froth flotation process.
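The ingredients named in the abstract can be illustrated with a small sketch. It is not the authors' algorithm, whose details are not given here. It assumes that base clusterings come from a hypothetical nearest-seed rule under Minkowski distances of different orders p, that they are combined into a co-association matrix, and that a naive version of Newman's greedy modularity agglomeration picks the number of clusters. All function names, the seed choices, and the toy data are assumptions made for illustration.

```python
from itertools import combinations

def minkowski(x, y, p=2.0):
    """Minkowski distance of order p between two equal-length vectors."""
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)

def assign(points, seeds, p):
    """Hypothetical base clusterer: label each point by its nearest seed."""
    return [min(range(len(seeds)), key=lambda k: minkowski(x, seeds[k], p))
            for x in points]

def co_association(labelings):
    """Fraction of base clusterings in which each pair of points co-occurs."""
    n = len(labelings[0])
    counts = [[0] * n for _ in range(n)]
    for labels in labelings:
        for i in range(n):
            for j in range(n):
                if i != j and labels[i] == labels[j]:
                    counts[i][j] += 1
    return [[c / len(labelings) for c in row] for row in counts]

def modularity(w, groups):
    """Newman modularity Q of a partition of a weighted, undirected graph."""
    total = sum(sum(row) for row in w)  # twice the total edge weight
    if total == 0:
        return 0.0
    q = 0.0
    for g in groups:
        inside = sum(w[i][j] for i in g for j in g)
        degree = sum(sum(w[i]) for i in g)
        q += inside / total - (degree / total) ** 2
    return q

def greedy_modularity(w):
    """Naive greedy agglomeration: at each step apply the merge with the
    highest Q, and return the best partition seen along the way."""
    groups = [[i] for i in range(len(w))]
    best_q, best = modularity(w, groups), [list(g) for g in groups]
    while len(groups) > 1:
        candidates = []
        for a, b in combinations(range(len(groups)), 2):
            rest = [g for k, g in enumerate(groups) if k not in (a, b)]
            merged = rest + [groups[a] + groups[b]]
            candidates.append((modularity(w, merged), merged))
        q, groups = max(candidates, key=lambda c: c[0])
        if q > best_q:
            best_q, best = q, [list(g) for g in groups]
    return best, best_q

# Toy data (assumed): two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
labelings = [
    assign(points, [points[0], points[3]], p=1),
    assign(points, [points[1], points[4]], p=2),
    assign(points, [points[0], points[1], points[3]], p=3),  # over-segments
]
w = co_association(labelings)
best, q = greedy_modularity(w)
print(len(best), "clusters:", sorted(sorted(g) for g in best))
```

On this toy data the third base clustering splits one group in two, yet the consensus step still recovers two clusters without the number being specified in advance. The exhaustive merge search is chosen for readability; Newman's fast algorithm updates the modularity change incrementally and only considers connected pairs.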
Additional information
This work was supported by the National Natural Science Foundation of China (Nos. 61473319 and 61104135), the Key Project of the National Natural Science Foundation of China (Nos. 61621062 and 61134006), the Innovation Research Funds of Central South University (No. 2016CX014), and the National High Technology Research and Development Program (“863” Program) (No. 2013AA040301-3).
Recommended by Guest Editor Dong-Bing Gu
De-Gang Xu received the Ph.D. degree in control science and engineering from Zhejiang University, China in 2007. He is currently a professor with College of Information Science and Engineering, Central South University, China.
His research interests include intelligent control, process control, machine learning and computation algorithms.
ORCID iD: 0000-0003-1730-9410
Pan-Lei Zhao received the M. Sc. degree in control science and engineering from Central South University, China in 2014. He is currently an engineer with China Railway Rolling Stock Corporation Limited.
His research interests include intelligent control, process control and computation algorithms.
Chun-Hua Yang received the Ph.D. degree in control science and engineering from Zhejiang University, China in 2002. She is currently a professor with College of Information Science and Engineering, Central South University, China.
Her research interests include intelligent control, process control, machine learning and dispatching control systems.
Wei-Hua Gui received the M. Sc. degree in control science and engineering from Zhejiang University, China in 1984. He is currently a professor with College of Information Science and Engineering, Central South University, China.
His research interests include large-scale control, process control and computer control systems.
Jian-Jun He received the Ph.D. degree in control science and engineering from Zhejiang University, China in 2003. He is currently a professor with the School of Information Science and Engineering.
His research interests include large-scale control, process control and computer control systems.
About this article
Cite this article
Xu, DG., Zhao, PL., Yang, CH. et al. A novel Minkowski-distance-based consensus clustering algorithm. Int. J. Autom. Comput. 14, 33–44 (2017). https://doi.org/10.1007/s11633-016-1033-z
DOI: https://doi.org/10.1007/s11633-016-1033-z