skip to main content
research-article

STICAP: Spatio-temporal Interactive Attention for Citywide Crowd Activity Prediction

Published:15 January 2024Publication History
Skip Abstract Section

Abstract

Accurate citywide crowd activity prediction (CAP) can enable proactive crowd mobility management and timely responses to urban events, which has become increasingly important for a myriad of smart city planning and management purposes. However, complex correlations across the crowd activities, spatial and temporal urban environment features and their interactive dependencies, and relevant external factors (e.g., weather conditions) make it highly challenging to predict crowd activities accurately in terms of different venue categories (for instance, venues related to dining, services, and residence) and varying degrees (e.g., daytime and nighttime).

To address the above concerns, we propose STICAP, a citywide spatio-temporal interactive crowd activity prediction approach. In particular, STICAP takes in the location-based social network check-in data (e.g., from Foursquare/Gowalla) as the model inputs and forecasts the crowd activity within each time step for each venue category. Furthermore, we have integrated multiple levels of temporal discretization to interactively capture the relations with historical data. Then, three parallel Residual Spatial Attention Networks (RSAN) in the Spatial Attention Component exploit the hourly, daily, and weekly spatial features of crowd activities, which are further fused and processed by the Temporal Attention Component for interactive CAP. Along with other external factors such as weather conditions and holidays, STICAP adaptively and accurately forecasts the final crowd activities per venue category, enabling potential activity recommendation and other smart city applications. Extensive experimental studies based on three different real-world crowd activity datasets have demonstrated that our proposed STICAP outperforms the baseline and state-of-the-art algorithms in CAP accuracy, with an average error reduction of 35.02%.

REFERENCES

  1. [1] 2009–2010. Gowalla Data. Retrieved from https://go.gowalla.com/Google ScholarGoogle Scholar
  2. [2] 2012. Foursquare Data. Retrieved from https://foursquare.com/Google ScholarGoogle Scholar
  3. [3] 2021. Weather Data API. Retrieved from https://api.weather.com/Google ScholarGoogle Scholar
  4. [4] Barraco Manuele, Bicocchi Nicola, Mamei Marco, and Zambonelli Franco. 2021. Forecasting parking lots availability: Analysis from a real-world deployment. In IEEE PerCom Workshops. IEEE, 299304.Google ScholarGoogle Scholar
  5. [5] Cao Hancheng, Sankaranarayanan Jagan, Feng Jie, Li Yong, and Samet Hanan. 2019. Understanding metropolitan crowd mobility via mobile cellular accessing data. ACM Trans. Spatial Algor. Syst. 5, 2 (2019), 118.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Chauhan Jatin, Raghuveer Aravindan, Nandy Jay, Saket Rishi, and Ravindran Balaraman. 2022. Multi-variate time series forecasting on variable subsets. In ACM SIGKDD.Google ScholarGoogle Scholar
  7. [7] Chen Long, Zhang Hanwang, Xiao Jun, Nie Liqiang, Shao Jian, Liu Wei, and Chua Tat-Seng. 2017. SCA-CNN: Spatial and channel-wise attention in convolutional networks for image captioning. In IEEE CVPR. 56595667.Google ScholarGoogle Scholar
  8. [8] Civitarese Gabriele, Mascetti Sergio, Butifar Alberto, and Bettini Claudio. 2019. Automatic detection of urban features from wheelchair users’ movements. In IEEE PerCom. IEEE, 110.Google ScholarGoogle Scholar
  9. [9] Cramer Henriette, Rost Mattias, and Holmquist Lars Erik. 2011. Performing a check-in: Emerging practices, norms and “conflicts” in location-sharing using Foursquare. In ACM MobileHCI. 5766.Google ScholarGoogle Scholar
  10. [10] Deng Jinliang, Chen Xiusi, Jiang Renhe, Song Xuan, and Tsang Ivor W.. 2021. ST-Norm: Spatial and temporal normalization for multi-variate time series forecasting. In ACM SIGKDD. 269278.Google ScholarGoogle Scholar
  11. [11] Fan Zipei, Song Xuan, Shibasaki Ryosuke, and Adachi Ryutaro. 2015. CityMomentum: An online approach for crowd behavior prediction at a citywide level. In ACM UbiComp. 559569.Google ScholarGoogle Scholar
  12. [12] Fernando Tharindu, Denman Simon, Sridharan Sridha, and Fookes Clinton. 2018. Soft+ hardwired attention: An LSTM framework for human trajectory prediction and abnormal event detection. Neural Netw. 108 (2018), 466478.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Hemminki Samuli, Kuribayashi Keisuke, Konomi Shin’ichi, Nurmi Petteri, and Tarkoma Sasu. 2019. Crowd replication: Sensing-assisted quantification of human behavior in public spaces. ACM Trans. Spatial Algor. Syst. 5, 3 (2019), 134.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Hoang Minh X., Zheng Yu, and Singh Ambuj K.. 2016. FCCF: Forecasting citywide crowd flows based on big data. In ACM SIGSPATIAL. 110.Google ScholarGoogle Scholar
  15. [15] Huang Chao, Zhang Chuxu, Dai Peng, and Bo Liefeng. 2021. Cross-interaction hierarchical attention networks for urban anomaly prediction. In IJCAI. 43594365.Google ScholarGoogle Scholar
  16. [16] Huang Huiqun, Yang Xi, and He Suining. 2021. Multi-head spatio-temporal attention mechanism for urban anomaly event prediction. Proc. ACMInteract., Mob., Wear. Ubiq. Technol. 5, 3 (2021), 121.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Iwata Tomoharu, Shimizu Hitoshi, Naya Futoshi, and Ueda Naonori. 2017. Estimating people flow from spatiotemporal population data via collective graphical mixture models. ACM Trans. Spatial Algor. Syst. 3, 1 (2017), 118.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Jiang Renhe, Song Xuan, Huang Dou, Song Xiaoya, Xia Tianqi, Cai Zekun, Wang Zhaonan, Kim Kyoung-Sook, and Shibasaki Ryosuke. 2019. Deepurbanevent: A system for predicting citywide crowd dynamics at big events. In ACM SIGKDD. 21142122.Google ScholarGoogle Scholar
  19. [19] Liang Yuxuan, Ke Songyu, Zhang Junbo, Yi Xiuwen, and Zheng Yu. 2018. Geoman: Multi-level attention networks for geo-sensory time series prediction.. In IJCAI. 34283434.Google ScholarGoogle Scholar
  20. [20] Liu Lingbo, Zhang Ruimao, Peng Jiefeng, Li Guanbin, Du Bowen, and Lin Liang. 2018. Attentive crowd flow machines. In ACM MM. 15531561.Google ScholarGoogle Scholar
  21. [21] Liu Ning, Long Yongchao, Zou Changqing, Niu Qun, Pan Li, and Wu Hefeng. 2019. ADCrowdNet: An attention-injective deformable convolutional network for crowd understanding. In IEEE CVPR. 32253234.Google ScholarGoogle Scholar
  22. [22] Pei Wenjie, Baltrusaitis Tadas, Tax David M. J., and Morency Louis-Philippe. 2017. Temporal attention-gated model for robust sequence classification. In IEEE CVPR. 67306739.Google ScholarGoogle Scholar
  23. [23] Purushotham Sanjay and Kuo C.-C. Jay. 2016. Personalized group recommender systems for location-and event-based social networks. ACM Trans. Spatial Algor. Syst. 2, 4 (2016), 129.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Qin Yao, Song Dongjin, Chen Haifeng, Cheng Wei, Jiang Guofei, and Cottrell Garrison. 2017. A dual-stage attention-based recurrent neural network for time series prediction. arXiv preprint arXiv:1704.02971 (2017).Google ScholarGoogle Scholar
  25. [25] Rojas Julian A. Ramos, Rosas Johana, Shen Yilin, Jin Hongxia, and Dey Anind K.. 2020. Activity recommendation: Optimizing life in the long term. In IEEE PerCom. IEEE, 110.Google ScholarGoogle Scholar
  26. [26] Sadri Amin, Salim Flora D., Ren Yongli, Shao Wei, Krumm John C., and Mascolo Cecilia. 2018. What will you do for the rest of the day? An approach to continuous trajectory prediction. Proc. ACMInteract., Mob., Wear. Ubiq. Technol. 2, 4 (2018), 126.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Shi Xingjian, Chen Zhourong, Wang Hao, Yeung Dit-Yan, Wong Wai-Kin, and Woo Wang-chun. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. arXiv preprint arXiv:1506.04214 (2015).Google ScholarGoogle Scholar
  28. [28] Song Xuan, Zhang Quanshi, Sekimoto Yoshihide, and Shibasaki Ryosuke. 2014. Prediction of human emergency behavior and their mobility following large-scale disaster. In ACM SIGKDD. 514.Google ScholarGoogle Scholar
  29. [29] Teixeira Douglas Do Couto, Viana Aline Carneiro, Almeida Jussara M., and Alvim Mrio S.. 2021. The impact of stationarity, regularity, and context on the predictability of individual human mobility. ACM Trans. Spatial Algor. Syst. 7, 4 (2021), 124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Tomaras Dimitrios, Boutsis Ioannis, and Kalogeraki Vana. 2018. Modeling and predicting bike demand in large city situations. In IEEE PerCom. IEEE, 110.Google ScholarGoogle Scholar
  31. [31] Verma Rohit, Mitra Bivas, and Chakraborty Sandip. 2019. Avoiding stress driving: Online trip recommendation from driving behavior prediction. In IEEE PerCom. IEEE, 110.Google ScholarGoogle Scholar
  32. [32] Wang Fei, Jiang Mengqing, Qian Chen, Yang Shuo, Li Cheng, Zhang Honggang, Wang Xiaogang, and Tang Xiaoou. 2017. Residual attention network for image classification. In IEEE CVPR. 31563164.Google ScholarGoogle Scholar
  33. [33] Wang Haiquan, Li Yilin, Liu Guoping, Wen Xiang, and Qie Xiaohu. 2019. Accurate detection of road network anomaly by understanding crowd’s driving strategies from human mobility. ACM Trans. Spatial Algor. Syst. 5, 2 (2019), 117.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Wang Senzhang, Miao Hao, Chen Hao, and Huang Zhiqiu. 2020. Multi-task adversarial spatial-temporal networks for crowd flow prediction. In ACM CIKM. 15551564.Google ScholarGoogle Scholar
  35. [35] Yang Dongfang, Özgüner Ümit, and Redmill Keith. 2020. A social force based pedestrian motion model considering multi-pedestrian interaction with a vehicle. ACM Trans. Spatial Algor. Syst. 6, 2 (2020), 127.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Yang Dingqi, Zhang Daqing, Zheng Vincent W., and Yu Zhiyong. 2014. Modeling user activity preference by leveraging user spatial temporal characteristics in LBSNs. IEEE Trans. Syst., Man, Cybern.: Syst. 45, 1 (2014), 129142.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Yang Zheng, Wang Feng, and Gong Wei. 2021. Mobi-Track: Distilling direct path in time domain for high accuracy WiFi tracking. In IEEE PerCom Workshops. IEEE, 287292.Google ScholarGoogle Scholar
  38. [38] Zhang Junbo, Zheng Yu, and Qi Dekang. 2017. Deep spatio-temporal residual networks for citywide crowd flows prediction. In AAAI. AAAI Press, 16551661.Google ScholarGoogle Scholar
  39. [39] Zhang Jason Shuo, Gartrell Mike, Han Richard, Lv Qin, and Mishra Shivakant. 2019. GEVR: An event venue recommendation system for groups of mobile users. Proc. ACMInteract., Mob., Wear. Ubiq. Technol. 3, 1 (2019), 125.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Zhou Xian, Shen Yanyan, Zhu Yanmin, and Huang Linpeng. 2018. Predicting multi-step citywide passenger demands using attention-based neural networks. In ACM WSDM. 736744.Google ScholarGoogle Scholar

Index Terms

  1. STICAP: Spatio-temporal Interactive Attention for Citywide Crowd Activity Prediction

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Spatial Algorithms and Systems
      ACM Transactions on Spatial Algorithms and Systems  Volume 10, Issue 1
      March 2024
      174 pages
      ISSN:2374-0353
      EISSN:2374-0361
      DOI:10.1145/3613533
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 15 January 2024
      • Online AM: 1 December 2023
      • Accepted: 24 April 2023
      • Revised: 6 December 2022
      • Received: 11 April 2022
      Published in tsas Volume 10, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)108
      • Downloads (Last 6 weeks)36

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text