Abstract
In order to improve the stability of heterogeneous big data mining operations in complex attribute environment, such as data analysis and cleaning, a heterogeneous big data intelligent clustering algorithm is established. The data cleaning classification method is applied to clean the parameter space in complex attribute environment, and the regular term of sparse subspace clustering is introduced to eliminate the irrelevant and redundant information of heterogeneous big data, and the intelligent clustering index of heterogeneous big data is obtained. By measuring the clustering results, the design of heterogeneous big data intelligent clustering algorithm in complex attribute environment is completed. The experimental results show that the heterogeneous big data intelligent clustering algorithm in complex attribute environment has strong stability in the process of data analysis and cleaning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Anonymous: Large data optimal clustering algorithms in cloud computing environment based on PSO. Electron. Des. Eng. 26(19), 86–89+94 (2018)
Qujie: Research on intelligent parallel clustering method for large data in virtual environment. Comput. Measur. Control 25(6), 257–260 (2017)
Yi, M., Ting, X., Shaobin, L.: Research on NoSQL distributed large data mining method in complex attribute environment. Sci. Technol. Eng. 17(09), 244–248 (2017)
Chunhua, H.: Clustering algorithm analysis of multidimensional data de-duplication in large data environment. Comput. Prod. Circ. 32(11), 151 (2017)
Anonymous: Prediction and analysis of energy consumption behavior of integrated energy system users under multi-source heterogeneous large data. Smart Power 46(10), 92–101 (2018)
Li, B.H., Junhua, C., et al.: Distributed clustering algorithms of attribute graph under multi-agent architecture. Comput. Sci. 44(S1), 407–413 (2017)
Linjing, W., Lulu, N., Bin, G., et al.: Sparse fractional feature selection clustering algorithms based on entropy weighting in large data. Comput. Appl. Res. 35(8), 59–60 + 69 (2018)
Houlisa: Clustering algorithm design for eliminating redundant features in large data sets. Mod. Electron. Technol. 41(14), 56–58+62 (2018)
Xiaoyu, C., Xiaojing, L., Haiying, M.: A fast automatic clustering algorithm for large data. Comput. Appl. Res. 34(9), 2651–2654 (2017)
Xiaoyan, T.: A large data text clustering algorithm based on word embedding and density peak strategy. Innov. Appl. Sci. Technol. 6, 90–90 (2017)
Fu, W., Liu, S., Srivastava, G.: Optimization of big data scheduling in social networks. Entropy 21(9), 902 (2019)
Sun, G., Liu, S. (eds.): ADHIP 2017. LNICST, vol. 219. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73317-3
Shuai, L., Weiling, B., Nianyin, Z., et al.: A fast fractal based compression for MRI images. IEEE Access 7, 62412–62420 (2019)
Liu, S., Li, Z., Zhang, Y., et al.: Introduction of key problems in long-distance learning and training. Mob. Netw. Appl. 24(1), 1–4 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Wang, Y., Zhai, Jl. (2021). Heterogeneous Big Data Intelligent Clustering Algorithm in Complex Attribute Environment. In: Liu, S., Xia, L. (eds) Advanced Hybrid Information Processing. ADHIP 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 347. Springer, Cham. https://doi.org/10.1007/978-3-030-67871-5_32
Download citation
DOI: https://doi.org/10.1007/978-3-030-67871-5_32
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67870-8
Online ISBN: 978-3-030-67871-5
eBook Packages: Computer ScienceComputer Science (R0)