Abstract
In this paper, a clustering model based on deep learning RBM encoding is proposed for the further data mining of the massive, complex and high-dimensional data. This model includes two major parts: pre-training and fine-tuning & optimization. In the pre-training part, proper parameters are adopted for RBM encoding to reduce the high-dimensional and large-scaled data, and then pre-clustering is done with k-means and other algorithms. The fine-tuning & optimization part is developed from the deep structure of pre-training to form a deep fine-tuning, and network is initialized with the parameters generated from the pre-training, and then the initial clustering center generated from pre-training process is further clustered and optimized. At the same time, encoding features are optimized and the final clustering center and membership matrix are obtained. In order to validate this model, some data are selected from the UCI dataset for clustering comparison. It is indicated in the data analysis that this clustering model based on RBM encoding has little impact on the clustering effect, but the execution is more efficient.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Jinjia, W., et al.: The study of deep learning under big data. High Technol. Lett. 27(1), 27–37 (2017). (in Chinese)
Weng, S.: The construction of the cognitive modeling for deep learning based on the micro-MOOC learning system. Mod. Educ. Technol. 27(6), 87–93 (2017). (in Chinese)
Cai, H.: Research of clustering algorithms in big data analysis. Hehui University Of Science Technology, An Hui, pp. 1–33 (2016). (in Chinese)
Qi, Y.: Research of key technologies of clustering based on deep learning. Southwest Jiaotong University, Si Chuan, pp. 1–58 (2016). (in Chinese)
Li, F., Xie, D., Qi, D., Xie, G., Chen, W., Peng, L.: Research on effective and intelligent resource management in internet computing. Appl. Math. Inf. Sci. 8(2), 625–631 (2014)
Ma, S.: ETc. Deep learning with big data: state of art and development. CAAI Trans. Intell. Syst. 11(6), 728–740 (2016). (in Chinese)
Zhang, J.: ETc. Review of deep learning. Appl. Res. Comput. 35(7), 27–37 (2018). (in Chinese)
Keguang, Y.: Research on incremental clustering method for large dataset. Mod. Electron. Tech. 40(9), 176–182 (2017). (in Chinese)
Acknowledgments
We would like to thank the anonymous referees for their careful readings of the manuscripts and many useful suggestions. This work has been co-supported by: Natural Science Foundation of China under Grant No. 61472092; Guangdong Provincial Science and Technology Plan Fund with grant No. 2013B010401037; Natural Science Foundation of Guangdong Province under Grant No. S2011040003843; GuangZhou Municipal High School Science Research Fund under grant No. 1201421317; State Scholarship Fund by China Scholarship Council under Grant No. [2013]3018-201308440096; and Yuexiu District Science and Technology Plan Fund of GuangZhou City with grant No. 2013-GX-005.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Yuan, L., Xiao, X., Li, F., Deng, N. (2018). Clustering Model Based on RBM Encoding in Big Data. In: Sun, X., Pan, Z., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2018. Lecture Notes in Computer Science(), vol 11063. Springer, Cham. https://doi.org/10.1007/978-3-030-00006-6_35
Download citation
DOI: https://doi.org/10.1007/978-3-030-00006-6_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00005-9
Online ISBN: 978-3-030-00006-6
eBook Packages: Computer ScienceComputer Science (R0)