Abstract
Accurate information about real estate in the city, and about residential buildings in particular, is the most important resource for managing the development of the urban environment. Information about residential buildings, for example, the number of residents, is used in the inventory and digitization of the urban economy and subsequently becomes the basis of digital platforms for managing urban processes. Inventory of urban property can be carried out independently by different departments within the framework of official functions, which leads to the problem of conflicting information and missing values in urban data, in building data in particular. These problems are especially pronounced when information from different sources is combined to create centralized repositories and digital twins of the city. This leads to the need to develop approaches to filling missing values and correcting distorted information about residential buildings. As part of this work, the authors propose an approach to data imputation of residential buildings, including additional information about the environment. The analysis of the effectiveness of the approach is based on data collected for St. Petersburg (Russia).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Khrulkov, A., Mityagin, S.A., Repkin, A.I.: Multi-factor approach to investment attractiveness assessment of urban spaces. Proc. Comput. Sci. 178, 94–102 (2020)
van Nes, A., Pont, M.B., Mashhoodi, B.: Combination of Space syntax with spacematrix and the mixed-use index: the Rotterdam South test case. In: 8th International Space Syntax Symposium, Santiago de Chile, 3–6 January (2012)
Boiko, D., Parygin, D., Savina, O., Golubev, A., Zelenskiy, I., Mityagin, S.: Approaches to analysis of factors affecting the residential real estate bid prices in case of open data use. In: Chugunov, A., Khodachek, I., Misnikov, Y., Trutnev, D. (eds.) EGOSE 2019. CCIS, vol. 1135, pp. 360–375. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-39296-3_27
Wang, R.Y., Strong, D.M.: Beyond accuracy: what data quality means to data consumers. J. Manag. Inf. Syst. 12, 5–33 (1996)
Rubin, D.B.: Inference and missing data. Biometrika 63, 581–592 (1976)
Curley, C., Krause, R.M., Feiock, R., Hawkins, C.: Dealing with missing data: a comparative exploration of approaches using the integrated city sustainability database. Urban Affairs Rev. 55, 591–615 (2019)
van Buuren, S.: Flexible Imputation of Missing Data. CRC Press (2018)
Lin, W.-C., Tsai, C.-F.: Missing value imputation: a review and analysis of the literature (2006–2017). Artif. Intell. Rev. 53(2), 1487–1509 (2019). https://doi.org/10.1007/s10462-019-09709-4
Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)
Jamieson, K., Talwalkar, A.: Non-stochastic best arm identification and hyperparameter optimization. In: Artificial Intelligence and Statistics, pp 240–248 (2016)
Li, L., Jamieson, K., DeSalvo, G., et al.: Hyperband: a novel bandit-based approach to hyperparameter optimization. J. Mach. Learn. Res. 18, 6765–6816 (2017)
Bergstra, J., Bardenet, R., Bengio, Y., Kégl, B.: Algorithms for hyper-parameter optimization. Adv. Neural Inf. Process. Syst. 24 (2011)
van Buuren, S., Groothuis-Oudshoorn, K.: mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 45, 1–67 (2011)
Conzen, M.R.G.: Alnwick, Northumberland: a study in town-plan analysis. Transactions and Papers (Institute of British Geographers) iii–122 (1960)
Pont, M.Y.B., Haupt, P.A.: Spacematrix. Space, Density and Urban Form. NAi Publishers (2010)
van den Hoek, J.W.: Towards a Mixed-use Index (MXI) as a tool for urban planning and analysis. Urbanism: PhD Research 2008–2012 65 (2009)
Templ, M., Kowarik, A., Filzmoser, P.: Iterative stepwise regression imputation using standard and robust methods. Comput. Stat. Data Anal. 55, 2793–2806 (2011)
Schafer, J.L., Graham, J.W.: Missing data our view of the state of the art. Psychol. Methods 7, 147 (2002)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Khrulkov, A.A., Mishina, M.E., Mityagin, S.A. (2022). Approach to Imputation Multivariate Missing Data of Urban Buildings by Chained Equations Based on Geospatial Information. In: Groen, D., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2022. ICCS 2022. Lecture Notes in Computer Science, vol 13352. Springer, Cham. https://doi.org/10.1007/978-3-031-08757-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-08757-8_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-08756-1
Online ISBN: 978-3-031-08757-8
eBook Packages: Computer ScienceComputer Science (R0)