Abstract
The operation and management of a municipality generate large amounts of complex data, enclosing information that is not easy to infer or extract. Their analysis is challenging and requires specialized approaches and tools, usually based on statistical techniques or on machine learning and artificial intelligence algorithms. These Big Data is often created by combining many data sources that correspond to different operational groups in the city, such as transport, energy consumption, water consumption, maintenance, and many others. Each group exhibits unique characteristics that are usually not shared by others. This paper provides a detailed systematic literature review on applying different algorithms to urban data processing. The study aims to figure out how this kind of information was collected, stored, pre-processed, and analyzed, to compare various methods, and to select feasible solutions for further research. The review finds that clustering, classification, correlation, anomaly detection, and prediction algorithms are frequently used. Moreover, the interpretation of relevant and available research results is presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alshami, A., Guo, W., Pogrebna, G.: Fuzzy partition technique for clustering Big Urban dataset, pp. 212–216 (2016). https://doi.org/10.1109/SAI.2016.7555984
Azri, S., Ujang, U., Abdul Rahman, A.: Dendrogram clustering for 3D data analytics in smart city. 42(4/W9), 247–253 (2018). https://doi.org/10.5194/isprs-archives-XLII-4-W9-247-2018
Bermudez-Edo, M., Barnaghi, P., Moessner, K.: Analysing real world data streams with spatio-temporal correlations: entropy vs. Pearson correlation. Autom. Constr. 88, 87–100 (2018). https://doi.org/10.1016/j.autcon.2017.12.036
Bordogna, G., Cuzzocrea, A., Frigerio, L., Psaila, G.: An effective and efficient similarity-matrix-based algorithm for clustering big mobile social data, pp. 514–521 (2017). https://doi.org/10.1109/ICMLA.2016.188
Chae, S., Kwon, S., Lee, D.: Predicting infectious disease using deep learning and big data. Int. J. Environ. Res. Public Health 15(8), 1596 (2018). https://doi.org/10.3390/ijerph15081596. http://www.mdpi.com/1660-4601/15/8/1596
Chang, C.S., Chang, C.T., Lee, D.S., Liou, L.H.: K-sets+: a linear-Time clustering algorithm for data points with a sparse similarity measure, pp. 1–8 (2018). https://doi.org/10.1109/UIC-ATC.2017.8397636
Chen, W., Huang, Z., Wu, F., Zhu, M., Guan, H., Maciejewski, R.: VAUD: a visual analysis approach for exploring spatio-temporal urban data. IEEE Trans. Visual. Comput. Graph. 24(9), 2636–2648 (2018). https://doi.org/10.1109/TVCG.2017.2758362
Girvan, M., Newman, M.E.J.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99(12), 7821–7826 (2002). https://doi.org/10.1073/pnas.122653799
Karyotis, V., Tsitseklis, K., Sotiropoulos, K., Papavassiliou, S.: Big data clustering via community detection and hyperbolic network embedding in IoT applications. Sensors (Switzerland) 18(4) (2018). https://doi.org/10.3390/s18041205
Kwon, O., Kim, Y., Lee, N., Jung, Y.: When collective knowledge meets crowd knowledge in a smart city: a prediction method combining open data keyword analysis and case-based reasoning. J. Healthcare Eng. 2018 (2018). https://doi.org/10.1155/2018/7391793
Materla, T., Cudney, E.A., Antony, J.: The application of Kano model in the healthcare industry: a systematic literature review. Total Qual. Manag. Bus. Excellence 1–22 (2017). https://doi.org/10.1080/14783363.2017.1328980
Pérez-Chacón, R., Luna-Romera, J., Troncoso, A., Martínez-Alvarez, F., Riquelme, J.: Big data analytics for discovering electricity consumption patterns in smart cities. Energies 11(3) (2018). https://doi.org/10.3390/en11030683
Simhachalam, B., Ganesan, G.: Performance comparison of fuzzy and non-fuzzy classification methods. Egypt. Inform. J. 17 (2015). https://doi.org/10.1016/j.eij.2015.10.004
Subhash, S., Cudney, E.A.: Gamified learning in higher education: a systematic review of the literature. Comput. Hum. Behav. 87, 192–206 (2018). https://doi.org/10.1016/j.chb.2018.05.028
Trilles, S., Belmonte, O., Schade, S., Huerta, J.: A domain-independent methodology to analyze IoT data streams in real-time. A proof of concept implementation for anomaly detection from environmental data. Int. J. Digit. Earth 10(1), 103–120 (2017). https://doi.org/10.1080/17538947.2016.1209583
Wang, G., et al.: Application of a long short-term memory neural network: a burgeoning method of deep learning in forecasting HIV incidence in Guangxi, China. Epidemiol. Infect. 147, e194 (2019). https://doi.org/10.1017/S095026881900075X
Acknowledgments
This work has been supported by FCT - Fundação para a Ciência e Tecnologia within the Project Scope: DSAIPA/AI/0088/2020.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Gubareva, R., Lopes, R.P. (2023). Big Data Trends in the Analysis of City Resources. In: Nesmachnow, S., Hernández Callejo, L. (eds) Smart Cities. ICSC-CITIES 2022. Communications in Computer and Information Science, vol 1706. Springer, Cham. https://doi.org/10.1007/978-3-031-28454-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-031-28454-0_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28453-3
Online ISBN: 978-3-031-28454-0
eBook Packages: Computer ScienceComputer Science (R0)