Abstract
In recent years, machine learning, deep learning, neural network and other artificial intelligence technologies have made significant development and breakthroughs. Thus more mature and reliable technologies start to be applied in the field of data analysis. Data analysis is the process of studying and summarizing data in detail in order to extract useful information, that is, collecting, sorting, processing and analyzing data. Moreover, in order to make the analysis results easier to understand, data visualization technology is widely employed. This paper tries to review the state-of-the-art data analysis methods based on artificial intelligence. Besides, the mainstream data visualization technology and cases are elaborated. Finally, some open problems and challenges are also put forward for the future research.
This work was supported by the National Key Research and Development Program of China (Project No. 2018YFC0806903,2018YFC0820104) and Key Lab of Information Network Security of Ministry of Public Security C19606 (The Third Research Institute of Ministry of Public Security).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Genyuan, D., Genyuan, Z.: Introduction to Information Technology. Wuhan University Press, (in Chinese), 09, p. 222 (2015)
Lei, R., Yi, D., Shuai, M.: Review of big data visual analysis. J. Softw. 25(09), 1909–1936 (2014). (in Chinese)
Pang, S.: Analysis of multidimensional data de-duplication clustering algorithm based on large data mining. Mod. Electron. Tech. 42(23), 150–153 (2019)
Xiaohui, H., Cheng, W., Liyan, X.: A weighting k-means clustering approach by integrating intra-cluster and inter-cluster distances. Chin. J. Comput. 42(12), 2836–2848 (2019)
Ezhilvathani, A., Raja, K.: Implementation of parallel apriori algorithm on Hadoop cluster. Int. J. Comput. Sci. Mobile Comput. 2(4), 513–516 (2013)
Qiu, H., Gu, R., Yuan, C.: Yafim: a parallel frequent itemset mining algorithm with spark. In: 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, Washington, D.C., USA, pp. 1664-1671. IEEE (2014)
Jie, W., Qinghao, D., Zeng, Y.: Optimization of parallel frequent modulus growth algorithm in cloud manufacturing environment. Comput. Integr. Manuf. Syst. 18(9), 2124–2129 (2012). (in Chinese)
Zhiyong, L.: Research on Parallel Algorithm for Mining Association Rules. Southeast University, Nanjing (2016). (in Chinese)
Singh, S., Garg, R., Mishra, P.: Review of apriori based algorithms on mapreduce framework, pp. 593–604. IEEE, Washington, D.C., USA (2014)
Lopez, V., Del R.S., Benitez, J.M.: Cost sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data. Fuzzy Sets Syst. 258, 5–38 (2015)
Huang, G., Huang, G.B., Song, S.: Trends in extreme learning machines: a review. Neural Netw. 61, 32–48 (2015)
Kamal, S., Ripon, S.H., Dey, N.: A MapReduce approach to diminish imbalance parametersfor big deoxyribonucleic acid dataset. Comput. Methods Programs Biomed. 131, 191–206 (2016)
Fernandez-Delgado, M., Cernadas, E., Barro, S.: Do we need hundreds of classifiers to solve real world classification problems. J. Mach. Learn. Res. 15(1), 3133–3181 (2014)
Huiying, Q., Yuhe, J.: Predicting breast cancer survival length with multi-omics data fusion. Data Anal. Knowl. Discov. 3(8), (2019). (in Chinese)
Shiming, T., Taorong, G., Xiaoqing, H.: Forecasting regional E-GDP value using power big data. Electric Power Autom. Equipment, 39(11) (2019). (in Chinese)
Jihua, Y., Changbin, Y.: Wear predication of tunnel boring machine cutters based on in-situ measured data. J. Southwest Jiaotong Univ. 54(6), 1283–1292 (2019). (in Chinese)
Wanling, W.: Artificial Intelligence and its Applications (3rd ed). Higher Education Press, Beijing (2016). (in Chinese)
Zhang, K., Chen, X.W.: Large-scale deep belief nets with mapreduce. IEEE Access 2, 395–403 (2014)
Zhang, H.J., Xiao, N.F.: Parallel implementation of multilayered neural networks based on MapReduce on cloud computing clusters. Soft. Comput. 20(4), 1471–1483 (2016)
Cao, J., Cui, H., Shi, H.: Big data: a parallel particle swarm optimization-back-propagation neural network algorithm based on MapReduce. PLoS ONE 11(6), e0157551 (2016)
Guojun, M., Dianjun, H., Songyan, X.: Big data classification model and algorithm based on distributed data flow. Chin. J. Comput. 40(1), 161–175 (2017). (in Chinese)
Jingyue, H., Bei, M.: Collaborative filtering recommendation algorithm using real-valued conditional Boltzmann machine with social relations. Chin. J. Comput. 39(1), 183–195 (2016)
Li, H., Su, P., Chi, Z.: Image retrieval and classification on deep convolutional SparkNet. In: 2016 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC), Washington, D.C., USA, pp. 1–6. IEEE (2016)
Yan, Y., Chen, M., Sadiq, S.: Efficient imbalanced multimedia concept retrieval by deep learning on spark clusters. Int. J. Multimedia Data Eng. Manage. 8(1), 1–20 (2017)
Huan, Y., Yinan, L., Kang, Y.: Proper use of color in visualization. J. Comput. Aided Des. Comput. Graph. 27(9), 1587 (2015) (in Chinese)
Ran, X.: TOP50 + 5 big data visualization analysis tool. Internet Wkly 17, 58–59 (2014). (in Chinese)
Xiaoyan, C., Liping, G., Wenpin, G.: Comparison and application of big data visualization tools. Comput. Educ. 06, 97–102 (2018). (in Chinese)
Kadembo, E.M.: Anchored in the story: the core of human understanding, branding, education, socialisation and the shaping of values. Mark Rev. 12(3), 221–231 (2012)
Yongchao, W., Shengwen, L., Yingbao, Y.: Review of knowledge graph visualization. J. Comput. Aided Des. Comput. Graph. 31(10), 1666–1676 (2019). (in Chinese)
Dudáš, M., Lohmann, S., Svátek, V., et al.: Ontology visualization methods and tools: a survey of the state of the art. Knowl. Eng. Rev. 33, e10 (2018)
Hussain, A., Latif, K., Rextin, A.T., et al.: Scalable visualization of semantic nets using power-law graphs. Appl. Math. Inf. Sci. 8(1), 355–367 (2014)
Sichen, J., Yubo, T., Yuyu, Y., et al.: Analysis of infectious disease patterns based on multi-dimensional spatiotemporal data visualization. J. Comput. Aided Des. Comput. Graph. 31(2), 241–255 (2019). (in Chinese)
Buchin, K., Speckmann, B., Verbeek, K.: Flow map layout via spiral trees. IEEE Trans. on Visual. Comput. Graph. 17(12), 2536–2544 (2011)
Abello, J., van Ham, F., Krishnan, N.: ASK-Graphview: a large scale graph visualization system. IEEE Trans. on Visual. Comput. Graph., 12(5), 669–676 (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Xie, K., Han, L., Jing, M., Luan, J., Yang, T., Fan, R. (2021). Review of Intelligent Data Analysis and Data Visualization. In: Barolli, L., Takizawa, M., Enokido, T., Chen, HC., Matsuo, K. (eds) Advances on Broad-Band Wireless Computing, Communication and Applications. BWCCA 2020. Lecture Notes in Networks and Systems, vol 159. Springer, Cham. https://doi.org/10.1007/978-3-030-61108-8_36
Download citation
DOI: https://doi.org/10.1007/978-3-030-61108-8_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-61107-1
Online ISBN: 978-3-030-61108-8
eBook Packages: EngineeringEngineering (R0)