skip to main content
research-article

Geo-Tile2Vec: A Multi-Modal and Multi-Stage Embedding Framework for Urban Analytics

Published:12 April 2023Publication History
Skip Abstract Section

Abstract

Cities are very complex systems. Representing urban regions are essential for exploring, understanding, and predicting properties and features of cities. The enrichment of multi-modal urban big data has provided opportunities for researchers to enhance urban region embedding. However, existing works failed to develop an integrated pipeline that fully utilizes effective and informative data sources within geographic units. In this article, we regard a geo-tile as a geographic unit and propose a multi-modal and multi-stage representation learning framework, namely Geo-Tile2Vec, for urban analytics, especially for urban region properties identification. Specifically, in the early stage, geo-tile embeddings are firstly inferred through dynamic mobility events which are combinations of point-of-interest (POI) data and trajectory data by a Word2Vec-like model and metric learning. Then, in the latter stage, we use static street-level imagery to further enrich the embedding information by metric learning. Lastly, the framework learns distributed geo-tile embeddings for the given multi-modal data. We conduct experiments on real-world urban datasets. Four downstream tasks, i.e., main POI category classification task, main land use category classification task, restaurant average price regression task, and firm number regression task, are adopted for validating the effectiveness of the proposed framework in representing geo-tiles. Our proposed framework can significantly improve the performances of all downstream tasks. In addition, we also demonstrate that geo-tiles with similar urban region properties are geometrically closer in the vector space.

REFERENCES

  1. [1] Ayala Daniel, Wolfson Ouri, Dasgupta Bhaskar, Lin Jie, and Xu Bo. 2018. Spatio-temporal matching for urban transportation applications. ACM Transactions on Spatial Algorithms and Systems 3, 4 (2018), 139.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Cesario Eugenio, Comito Carmela, and Talia Domenico. 2017. An approach for the discovery and validation of urban mobility patterns. Pervasive and Mobile Computing 42, 2 (Dec.2017), 7792.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Chang Buru, Park Yonggyu, Park Donghyeon, Kim Seongsoon, and Kang Jaewoo. 2018. Content-aware hierarchical point-of-interest embedding model for successive POI recommendation. In Proceedings of the 27th International Joint Conference on Artificial Intelligence. 33013307.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Chen Tianqi and Guestrin Carlos. 2016. XGBoost: A scalable tree boosting system. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, New York, NY, 785794.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Comito Carmela. 2020. NexT: A framework for next-place prediction on location based social networks. Knowledge-Based Systems 204 (Sept.2020), 106205.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Crivellari Alessandro and Resch Bernd. 2022. Investigating functional consistency of mobility-related urban zones via motion-driven embedding vectors and local POI-type distributions. Computational Urban Science 2, 1 (2022), 19.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Dong Lei, Ratti Carlo, and Zheng Siqi. 2019. Predicting neighborhoods’ socioeconomic attributes using restaurant data. Proceedings of the National Academy of Sciences 116, 31 (2019), 1544715452.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Dong Lei, Yuan Xiao-Hui, Li Meng, Ratti Carlo, and Liu Yu. 2021. A gridded establishment dataset as a proxy for economic activity in China. Scientific Data 8, 1 (2021), 1–9.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Ferreira Danielle L., Nunes Bruno A. A., Campos Carlos Alberto V., and Obraczka Katia. 2020. A deep learning approach for identifying user communities based on geographical preferences and its applications to urban and environmental planning. ACM Transactions on Spatial Algorithms and Systems 6, 3 (2020), 124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Fu Yanjie, Wang Pengyang, Du Jiadi, Wu Le, and Li Xiaolin. 2019. Efficient region embedding with multi-view spatial networks: A perspective of locality-constrained spatial autocorrelations. Proceedings of the AAAI Conference on Artificial Intelligence 33, 01 (July2019), 906913.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Gebru Timnit, Krause Jonathan, Wang Yilun, Chen Duyun, Deng Jia, Aiden Erez Lieberman, and Fei-Fei Li. 2017. Using deep learning and Google street view to estimate the demographic makeup of the US. Proceedings of the National Academy of Sciences 114 (112017), 201700035.Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Haikal Christophe, Alizadeh Pegah, Rodrigues Christophe, and Chongke Bi. 2022. Place embedding across cities in location-based social networks. In Proceedings of the 37th ACM/SIGAPP Symposium on Applied Computing (2022-04-25). 539546.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] He Kaiming, Zhang Xiangyu, Ren Shaoqing, and Sun Jian. 2016. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770778.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Herrmann Stefanie M., Brandt Martin, Rasmussen Kjeld, and Fensholt Rasmus. 2020. Accelerating land cover change in West Africa over four decades as population pressure increased. Communications Earth & Environment 1, 1 (2020), 110.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Hu Sheng, Gao Song, Wu Liang, Xu Yongyang, Zhang Ziwei, Cui Haifu, and Gong Xi. 2021. Urban function classification at road segment level using taxi trajectory data: A graph convolutional neural network approach. Computers, Environment and Urban Systems 87 (2021), 101619.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Huang Tianyuan, Wang Zhecheng, Sheng Hao, Ng Andrew Y., and Rajagopal Ram. 2021. M3G: Learning urban neighborhood representation from multi-modal multi-graph. In Proceedings of the DeepSpatial 2021: 2nd ACM KDD Workshop on Deep Learning for Spatio-Temporal Data, Applications and Systems.Google ScholarGoogle Scholar
  17. [17] Huang Weiming, Cui Lizhen, Chen Meng, Zhang Daokun, and Yao Yao. 2022. Estimating urban functional distributions with semantics preserved POI embedding. International Journal of Geographical Information Science 0, 0 (2022), 126.Google ScholarGoogle Scholar
  18. [18] Jean Neal, Wang Sherrie, Samar Anshul, Azzari George, Lobell David, and Ermon Stefano. 2019. Tile2Vec: Unsupervised representation learning for spatially distributed data. Proceedings of the AAAI Conference on Artificial Intelligence 33, 01 (July2019), 39673974.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Jenkins Porter, Farag Ahmad, Wang Suhang, and Li Zhenhui. 2019. Unsupervised representation learning of spatial data via multimodal embedding. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 19932002.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] Jiang Renhe, Song Xuan, Fan Zipei, Xia Tianqi, Wang Zhaonan, Chen Quanjun, Cai Zekun, and Shibasaki Ryosuke. 2021. Transfer urban human mobility via POI embedding over multiple cities. ACM/IMS Transactions on Data Science 2, 1 (2021), 4:1–4:26.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Law Stephen and Neira Mateo. 2019. An unsupervised approach to geographical knowledge discovery using street level and street network images. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery. 5665.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Li Bin, Gao Song, Liang Yunlei, Kang Yuhao, Prestby Timothy, Gao Yuqi, and Xiao Run mou. 2020. Estimation of regional economic development indicator from transportation network analytics. Scientific Reports 10, 1 (2020), 1–15.Google ScholarGoogle Scholar
  23. [23] Li Shen, Zhao Zhe, Hu Renfen, Li Wensi, Liu Tao, and Du Xiaoyong. 2018. Analogical reasoning on chinese morphological and semantic relations. In Proceedings of the Annual Conference of the Association for Computational Linguistics. Association for Computational Linguistics, 138143.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Lin Yan, Wan Huaiyu, Guo Shengnan, and Lin Youfang. 2021. Pre-training context and time aware location embeddings from spatial-temporal trajectories for user next location prediction. Proceedings of the AAAI Conference on Artificial Intelligence 35, 5 (May2021), 42414248.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Liu Xi, Kang Chaogui, Gong Li, and Liu Yu. 2016. Incorporating spatial interaction patterns in classifying and understanding urban land use. International Journal of Geographical Information Science 30, 2 (2016), 334350.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Liu Yu, Wang Fahui, Xiao Yu, and Gao Song. 2012. Urban land uses and traffic “source-sink areas”: Evidence from GPS-enabled taxi data in Shanghai. Landscape and Urban Planning 106, 1 (2012), 7387.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Lu Dengsheng and Weng Qihao. 2006. Use of impervious surface in urban land-use classification. Remote Sensing of Environment 102, 1–2 (2006), 146160.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Tomas Mikolov, Kai Chen, Gregory S. Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. International Conference on Learning Representations, 112.Google ScholarGoogle Scholar
  29. [29] Mikolov Tomas, Sutskever Ilya, Chen Kai, Corrado Greg, and Dean Jeffrey. 2013. Distributed representations of words and phrases and their compositionality. In Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2. 31113119.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Pan Gang, Qi Guande, Wu Zhaohui, Zhang Daqing, and Li Shijian. 2012. Land-use classification using taxi GPS traces. IEEE Transactions on Intelligent Transportation Systems 14, 1 (2012), 113123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Pearson Karl. 1901. LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2, 11 (1901), 559572.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Pei Tao, Sobolevsky Stanislav, Ratti Carlo, Shaw Shih-Lung, Li Ting, and Zhou Chenghu. 2014. A new insight into land use classification based on aggregated mobile phone data. International Journal of Geographical Information Science 28, 9 (2014), 19882007.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Rodrigue Jean-Paul, Comtois Claude, and Slack Brian. 2016. The Geography of Transport Systems.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Salton Gerard and Buckley Christopher. 1988. Term-weighting approaches in automatic text retrieval. Information Processing & Management 24, 5 (1988), 513523.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Schroff Florian, Kalenichenko Dmitry, and Philbin James. 2015. FaceNet: A unified embedding for face recognition and clustering. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 815823.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Shimizu Toru, Yabe Takahiro, and Tsubouchi Kota. 2020. Enabling finer grained place embeddings using spatial hierarchy from human mobility trajectories. In Proceedings of the 28th International Conference on Advances in Geographic Information Systems. 187190.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Sun Zhihao, Jiao Hongzan, Wu Hao, Peng Zhenghong, and Liu Lingbo. 2021. Block2vec: An approach for identifying urban functional regions by integrating sentence embedding model and points of interest. ISPRS International Journal of Geo-Information 10, 5 (2021), 339.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Tobler Waldo Rudolph. 1970. A computer movie simulating urban growth in the detroit region. Economic Geography 46, Sup1 (1970), 234240.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Wang Senzhang, Cao Jiannong, Chen Hao, Peng Hao, and Huang Zhiqiu. 2020. SeqST-GAN: Seq2Seq generative adversarial nets for multi-step urban crowd flow prediction. ACM Transactions on Spatial Algorithms and Systems 6, 4 (2020), Article 22.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Wang Zhecheng, Li Haoyuan, and Rajagopal Ram. 2020. Urban2Vec: Incorporating street view imagery and POIs for multi-modal urban neighborhood embedding. Proceedings of the AAAI Conference on Artificial Intelligence 34, 01 (April2020), 10131020.Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Kim Nam woo and Yoon Yoonjin. 2021. Representation learning of urban regions via mobility-signature-based zone embedding: A case study of Seoul, South Korea. In Proceedings of the 5th ACM SIGSPATIAL International Workshop on Location-Based Recommendations, Geosocial Networks and Geoadvertising (2021-11-02). 14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Yabe Takahiro, Tsubouchi Kota, Shimizu Toru, Sekimoto Yoshihide, and Ukkusuri Satish V.. 2019. City2City: Translating place representations across cities. In Proceedings of the 27th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (2019-11-05). 412415.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Yan Bo, Janowicz Krzysztof, Mai Gengchen, and Gao Song. 2017. From ITDL to Place2Vec: Reasoning about place type similarity and relatedness by learning embeddings from augmented spatial contexts. In Proceedings of the 25th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. 110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Yao Zijun, Fu Yanjie, Liu Bin, Hu Wangsu, and Xiong Hui. 2018. Representing urban functions through zone embedding with human mobility patterns. In Proceedings of the 27th International Joint Conference on Artificial Intelligence. 39193925.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Ye Chao, Zhang Fan, Mu Lan, Gao Yong, and Liu Yu. 2020. Urban function recognition by integrating social media and street-level imagery. Environment and Planning B: Urban Analytics and City Science 48, 6 (2020), 1430–1444.Google ScholarGoogle Scholar
  46. [46] Yuan Jing, Zheng Yu, and Xie Xing. 2012. Discovering regions of different functions in a city using human mobility and POIs. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, New York, NY, 186194.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Yuan Jing, Zheng Yu, Xie Xing, Wang Yingzi, Zheng Kai, and Xiong Hui. 2015. Discovering urban functional zones using latent activity trajectories. IEEE Transactions on Knowledge and Data Engineering 27, 3 (2015), 712725.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Zhai Wei, Bai Xueyin, Shi Yu, Han Yu, Peng Zhong-Ren, and Gu Chaolin. 2019. Beyond Word2vec: An approach for urban functional region extraction and identification by combining Place2vec and POIs. Computers, Environment and Urban Systems 74 (March2019), 112.Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Zhang Chengkun, Xu Liuchang, Yan Zhen, and Wu Sensen. 2021. A GloVe-Based POI type embedding model for extracting and identifying urban functional regions. ISPRS International Journal of Geo-Information 10, 6 (2021), 372.Google ScholarGoogle ScholarCross RefCross Ref
  50. [50] Zhang Chao, Zhang Keyang, Yuan Quan, Peng Haoruo, Zheng Yu, Hanratty Tim, Wang Shaowen, and Han Jiawei. 2017. Regions, periods, activities: Uncovering urban dynamics via cross-modal representation learning. In Proceedings of the 26th International Conference on World Wide Web (2017-04-03). 361370.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Zhang Mingyang, Li Tong, Li Yong, and Hui Pan. 2020. Multi-view joint graph representation learning for urban region embedding. In Proceedings of the 29th International Joint Conference on Artificial Intelligence, Vol. 5. 44314437.Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Zhang Yunchao, Fu Yanjie, Wang Pengyang, Li Xiaolin, and Zheng Yu. 2019. Unifying inter-region autocorrelation and intra-region structures for spatial embedding via collective adversarial learning. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 17001708.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. [53] Zhou Bolei, Lapedriza Agata, Xiao Jianxiong, Torralba Antonio, and Oliva Aude. 2014. Learning deep features for scene recognition using places database. In Proceedings of the Conference on Neural Information Processing Systems. MIT Press, Cambridge, MA, 487495.Google ScholarGoogle Scholar
  54. [54] Zhou Bolei, Lapedriza Àgata, Khosla Aditya, Oliva Aude, and Torralba Antonio. 2017. Places: A 10 million image database for scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 6 (2017), 1452–1464.Google ScholarGoogle Scholar
  55. [55] Zhou Yang and Huang Yan. 2018. DeepMove: Learning place representations through large scale movement data. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data). 24032412.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Geo-Tile2Vec: A Multi-Modal and Multi-Stage Embedding Framework for Urban Analytics

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image ACM Transactions on Spatial Algorithms and Systems
            ACM Transactions on Spatial Algorithms and Systems  Volume 9, Issue 2
            June 2023
            201 pages
            ISSN:2374-0353
            EISSN:2374-0361
            DOI:10.1145/3592535
            Issue’s Table of Contents

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 12 April 2023
            • Online AM: 18 November 2022
            • Accepted: 7 November 2022
            • Revised: 14 September 2022
            • Received: 22 August 2021
            Published in tsas Volume 9, Issue 2

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
          • Article Metrics

            • Downloads (Last 12 months)645
            • Downloads (Last 6 weeks)49

            Other Metrics

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Full Text

          View this article in Full Text.

          View Full Text

          HTML Format

          View this article in HTML Format .

          View HTML Format