skip to main content
10.1145/3366423.3380296acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Hierarchically Structured Transformer Networks for Fine-Grained Spatial Event Forecasting

Published: 20 April 2020 Publication History

Abstract

Spatial event forecasting is challenging and crucial for urban sensing scenarios, which is beneficial for a wide spectrum of spatial-temporal mining applications, ranging from traffic management, public safety, to environment policy making. In spite of significant progress has been made to solve spatial-temporal prediction problem, most existing deep learning based methods based on a coarse-grained spatial setting and the success of such methods largely relies on data sufficiency. In many real-world applications, predicting events with a fine-grained spatial resolution do play a critical role to provide high discernibility of spatial-temporal data distributions. However, in such cases, applying existing methods will result in weak performance since they may not well capture the quality spatial-temporal representations when training triple instances are highly imbalanced across locations and time.
To tackle this challenge, we develop a hierarchically structured Spatial-Temporal ransformer network (STtrans) which leverages a main embedding space to capture the inter-dependencies across time and space for alleviating the data imbalance issue. In our STtrans framework, the first-stage transformer module discriminates different types of region and time-wise relations. To make the latent spatial-temporal representations be reflective of the relational structure between categories, we further develop a cross-category fusion transformer network to endow STtrans with the capability to preserve the semantic signals in a fully dynamic manner. Finally, an adversarial training strategy is introduced to yield a robust spatial-temporal learning under data imbalance. Extensive experiments on real-world imbalanced spatial-temporal datasets from NYC and Chicago demonstrate the superiority of our method over various state-of-the-art baselines.

References

[1]
Laura Alfers, Phumzile Xulu, and Richard Dobson. 2016. Promoting workplace health and safety in urban public space: reflections from Durban, South Africa. Environment and Urbanization 28, 2 (2016), 391–404.
[2]
Dzmitry Bahdanau, Jan Chorowski, Dmitriy Serdyuk, Philemon Brakel, and Yoshua Bengio. 2016. End-to-end attention-based large vocabulary speech recognition. In International conference on acoustics, speech and signal processing (ICASSP). IEEE, 4945–4949.
[3]
Chih-Chung Chang and Chih-Jen Lin. 2011. LIBSVM: a library for support vector machines. Transactions on Intelligent Systems and Technology (TIST) 2, 3(2011), 27.
[4]
Jingyuan Chen, Hanwang Zhang, Xiangnan He, Liqiang Nie, Wei Liu, and Tat-Seng Chua. 2017. Attentive collaborative filtering: Multimedia recommendation with item-and component-level attention. In Proceedings of the conference on Research and Development in Information Retrieval (SIGIR). ACM, 335–344.
[5]
Quanjun Chen, Xuan Song, Harutoshi Yamada, and Ryosuke Shibasaki. 2016. Learning deep representation from big and heterogeneous data for traffic accident inference. In Proceedings of AAAI Conference on Artificial Intelligence (AAAI).
[6]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).
[7]
Jie Feng, Yong Li, Chao Zhang, Funing Sun, Fanchao Meng, Ang Guo, and Depeng Jin. 2018. Deepmove: Predicting human mobility with attentional recurrent networks. In Proceedings of International Conference on World Wide Web (WWW). ACM, 1459–1468.
[8]
Xu Geng, Yaguang Li, Leye Wang, Lingyu Zhang, Qiang Yang, Jieping Ye, and Yan Liu. 2019. Spatiotemporal Multi-Graph Convolution Network for Ride-hailing Demand Forecasting. In Proceedings of AAAI Conference on Artificial Intelligence (AAAI).
[9]
Ian J. Goodfellow, Jonathon Shlens, and Christian Szegedy. 2015. Explaining and Harnessing Adversarial Examples. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings.
[10]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In Proceedings of International Conference on World Wide Web (WWW). ACM, 173–182.
[11]
Chao Huang, Dong Wang, and Shenglong Zhu. 2017. Where are you from: Home location profiling of crowd sensors from noisy and sparse crowdsourcing data. In International Conference on Computer Communications (Infocom). IEEE, 1–9.
[12]
Chao Huang, Xian Wu, and Dong Wang. 2016. Crowdsourcing-based urban anomaly prediction system for smart cities. In International on Conference on Information and Knowledge Management (CIKM). 1969–1972.
[13]
Chao Huang, Chuxu Zhang, Peng Dai, and Liefeng Bo. 2019. Deep Dynamic Fusion Network for Traffic Accident Forecasting. In International Conference on Information and Knowledge Management (CIKM). 2673–2681.
[14]
Chao Huang, Chuxu Zhang, Jiashu Zhao, Xian Wu, Nitesh Chawla, and Dawei Yin. 2019. MiST: A Multiview and Multimodal Spatial-Temporal Learning Framework for Citywide Abnormal Event Forecasting. In The World Wide Web Conference (WWW). ACM, 717–728.
[15]
Chao Huang, Junbo Zhang, Yu Zheng, and Nitesh V Chawla. 2018. DeepCrime: Attentive Hierarchical Recurrent Networks for Crime Prediction. In Proceedings of International Conference on Information and Knowledge Management (CIKM). ACM, 1423–1432.
[16]
Srinivasan Iyer, Ioannis Konstas, Alvin Cheung, and Luke Zettlemoyer. 2016. Summarizing source code using a neural attention model. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL). 2073–2083.
[17]
Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).
[18]
İbrahim Kök, Mehmet Ulvi Şimşek, and Suat Özdemir. 2017. A deep learning model for air quality prediction in smart cities. In Proceedings of International Conference on Big Data (Big Data). IEEE, 1983–1990.
[19]
Nikolay Laptev, Jason Yosinski, Li Erran Li, and Slawek Smyl. 2017. Time-series extreme event forecasting with neural networks at uber. In Proceedings of International Conference on Machine Learning (ICML). 1–5.
[20]
Quoc V Le, Jiquan Ngiam, Adam Coates, Abhik Lahiri, 2011. On optimization methods for deep learning. In Proceedings of International Conference on Machine Learning (ICML). 265–272.
[21]
Ruirui Li, Jyun-Yu Jiang, Chelsea J-T Ju, and Wei Wang. 2019. CORALS: Who Are My Potential New Customers? Tapping into the Wisdom of Customers’ Decisions. In International Conference on Web Search and Data Mining (WSDM). 69–77.
[22]
Yaguang Li, Rose Yu, Cyrus Shahabi, and Yan Liu. 2018. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In ICLR.
[23]
Yuxuan Liang, Kun Ouyang, Lin Jing, Sijie Ruan, Ye Liu, Junbo Zhang, David S Rosenblum, and Yu Zheng. 2019. UrbanFM: Inferring Fine-Grained Urban Flows. In International Conference on Knowledge Discovery & Data Mining (KDD). ACM, 3132–3142.
[24]
Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2016. Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts. In Proceedings of AAAI Conference on Artificial Intelligence (AAAI). 194–200.
[25]
Zuozhu Liu, Wenyu Zhang, Shaowei Lin, and Tony QS Quek. 2017. Heterogeneous sensor data fusion by deep multimodal encoding. Journal of Selected Topics in Signal Processing 11, 3(2017), 479–491.
[26]
Bei Pan, Ugur Demiryurek, 2012. Utilizing real-world transportation data for accurate traffic prediction. In Proceedings of International Conference on Data Mining (ICDM). IEEE, 595–604.
[27]
Zheyi Pan, Yuxuan Liang, Weifeng Wang, Yong Yu, Yu Zheng, and Junbo Zhang. 2019. Urban Traffic Prediction from Spatio-Temporal Data Using Deep Meta Learning. In Proceedings of International Conference on Knowledge Discovery and Data Mining (KDD). ACM.
[28]
Garvesh Raskutti, Martin J Wainwright, and Bin Yu. 2014. Early stopping and non-parametric regression: an optimal data-dependent stopping rule. Journal of Machine Learning Research (JMLR) 15, 1 (2014), 335–366.
[29]
Skipper Seabold and Josef Perktold. 2010. Statsmodels: Econometric and statistical modeling with python. In Python in Science Conference, Vol. 57. SciPy society Austin, 61.
[30]
Xuan Song, Hiroshi Kanasugi, and Ryosuke Shibasaki. 2016. DeepTransport: Prediction and Simulation of Human Mobility and Transportation Mode at a Citywide Level. In Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), Vol. 16. 2618–2624.
[31]
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research (JMLR) 15, 1 (2014), 1929–1958.
[32]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of International Conference on Neural Information Processing Systems (NIPS). 5998–6008.
[33]
Xian Wu, Yuxiao Dong, Chao Huang, Jian Xu, Dong Wang, and Nitesh V Chawla. 2017. Uapd: Predicting urban anomalies from spatial-temporal data. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML/PKDD). Springer, 622–638.
[34]
Xian Wu, Baoxu Shi, Yuxiao Dong, Chao Huang, Louis Faust, and Nitesh V Chawla. 2018. RESTFul: Resolution-Aware Forecasting of Behavioral Time Series Data. In Proceedings of International Conference on Information and Knowledge Management (CIKM). ACM, 1073–1082.
[35]
Jun Xu, Rouhollah Rahmatizadeh, Ladislau Bölöni, and Damla Turgut. 2017. Real-time prediction of taxi demand using recurrent neural networks. Transactions on Intelligent Transportation Systems 19, 8 (2017), 2572–2581.
[36]
Huaxiu Yao, Fei Wu, Jintao Ke, Xianfeng Tang, Yitian Jia, Siyu Lu, Pinghua Gong, and Jieping Ye. 2018. Deep multi-view spatial-temporal network for taxi demand prediction. In Proceedings of AAAI Conference on Artificial Intelligence (AAAI). 2588–2595.
[37]
Chih-Kuan Yeh, Wei-Chieh Wu, Wei-Jen Ko, and Yu-Chiang Frank Wang. 2017. Learning deep latent space for multi-label classification. In Proceedings of AAAI Conference on Artificial Intelligence (AAAI).
[38]
Rose Yu, Dehua Cheng, and Yan Liu. 2015. Accelerated online low rank tensor learning for multivariate spatiotemporal streams. In Proceedings of International Conference on Machine Learning (ICML). 238–247.
[39]
Rose Yu, Yaguang Li, Ugur Demiryurek, Cyrus Shahabi, and Yan Liu. 2017. Deep learning: a generic approach for extreme condition traffic forecasting. In Proceedings of SIAM International Conference on Data Mining (SDM). SIAM, 777–785.
[40]
Rose Yu, Yaguang Li, Cyrus Shahabi, Ugur Demiryurek, and Yan Liu. 2017. Deep learning: A generic approach for extreme condition traffic forecasting. In Proceedings of SIAM International Conference on Data Mining (SDM). SIAM, 777–785.
[41]
Biao Zhang, Deyi Xiong, and Jinsong Su. 2018. Neural machine translation with deep attention. Transactions on Pattern Analysis and Machine Intelligence (T-PAMI) (2018).
[42]
Chao Zhang, Liyuan Liu, Dongming Lei, Quan Yuan, Honglei Zhuang, Tim Hanratty, and Jiawei Han. 2017. Triovecevent: Embedding-based online local event detection in geo-tagged tweet streams. In Proceedings of International Conference on Knowledge Discovery and Data Mining (KDD). ACM, 595–604.
[43]
Chuxu Zhang, Dongjin Song, Yuncong Chen, Xinyang Feng, Cristian Lumezanu, Wei Cheng, Jingchao Ni, Bo Zong, Haifeng Chen, and Nitesh V Chawla. 2019. A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. In AAAI Conference on Artificial Intelligence (AAAI), Vol. 33. 1409–1416.
[44]
Junbo Zhang, Yu Zheng, and Dekang Qi. 2017. Deep spatio-temporal residual networks for citywide crowd flows prediction. In AAAI Conference on Artificial Intelligence (AAAI).
[45]
Liang Zhao, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2016. Multi-resolution spatial event forecasting in social media. In Proceedings of International Conference on Data Mining (ICDM). IEEE, 689–698.
[46]
Guanjie Zheng, Susan Brantely, Thomas Lauvaus, and Li Zhenhui. 2017. Contextual spatial outlier detection with metric learning. In Proceedings of International Conference on Knowledge Discovery and Data Mining (KDD). ACM, 2161–2170.

Cited By

View all
  • (2025)Multi-view collaborative signal fusion and representation property optimization for recommendationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2025.110085144(110085)Online publication date: Mar-2025
  • (2024)Multi-modality spatio-temporal forecasting via self-supervised learningProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/223(2018-2026)Online publication date: 3-Aug-2024
  • (2024)Hawkes-enhanced spatial-temporal hypergraph contrastive learning based on criminal correlationsProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i8.28719(8733-8741)Online publication date: 20-Feb-2024
  • Show More Cited By

Index Terms

  1. Hierarchically Structured Transformer Networks for Fine-Grained Spatial Event Forecasting
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Information & Contributors

            Information

            Published In

            cover image ACM Conferences
            WWW '20: Proceedings of The Web Conference 2020
            April 2020
            3143 pages
            ISBN:9781450370233
            DOI:10.1145/3366423
            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Sponsors

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            Published: 20 April 2020

            Permissions

            Request permissions for this article.

            Check for updates

            Author Tags

            1. Deep neural networks
            2. Spatial-temporal data mining

            Qualifiers

            • Research-article
            • Research
            • Refereed limited

            Conference

            WWW '20
            Sponsor:
            WWW '20: The Web Conference 2020
            April 20 - 24, 2020
            Taipei, Taiwan

            Acceptance Rates

            Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • Downloads (Last 12 months)127
            • Downloads (Last 6 weeks)4
            Reflects downloads up to 28 Feb 2025

            Other Metrics

            Citations

            Cited By

            View all
            • (2025)Multi-view collaborative signal fusion and representation property optimization for recommendationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2025.110085144(110085)Online publication date: Mar-2025
            • (2024)Multi-modality spatio-temporal forecasting via self-supervised learningProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence10.24963/ijcai.2024/223(2018-2026)Online publication date: 3-Aug-2024
            • (2024)Hawkes-enhanced spatial-temporal hypergraph contrastive learning based on criminal correlationsProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i8.28719(8733-8741)Online publication date: 20-Feb-2024
            • (2024)Empirical and Experimental Insights into Data Mining Techniques for Crime Prediction: A Comprehensive SurveyACM Transactions on Intelligent Systems and Technology10.1145/369951516:2(1-75)Online publication date: 7-Oct-2024
            • (2024)Spatio-Temporal Graph Neural Networks for Predictive Learning in Urban Computing: A SurveyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.333382436:10(5388-5408)Online publication date: Oct-2024
            • (2024)Semantics and Geography Aware Hierarchical Learning for Sequential Crime PredictionIEEE Signal Processing Letters10.1109/LSP.2024.339386331(1234-1238)Online publication date: 2024
            • (2024)A survey on personalized itinerary recommendation: From optimisation to deep learningApplied Soft Computing10.1016/j.asoc.2023.111200152(111200)Online publication date: Feb-2024
            • (2023)Learning Gaussian mixture representations for tensor time series forecastingProceedings of the Thirty-Second International Joint Conference on Artificial Intelligence10.24963/ijcai.2023/231(2077-2085)Online publication date: 19-Aug-2023
            • (2023)TTS-Norm: Forecasting Tensor Time Series via Multi-Way NormalizationACM Transactions on Knowledge Discovery from Data10.1145/360589418:1(1-25)Online publication date: 10-Aug-2023
            • (2023)Modeling Spatial–Temporal Constraints and Spatial-Transfer Patterns for Couriers’ Package Pick-up Route PredictionIEEE Transactions on Intelligent Transportation Systems10.1109/TITS.2023.330166124:12(13787-13800)Online publication date: Dec-2023
            • Show More Cited By

            View Options

            Login options

            View options

            PDF

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            HTML Format

            View this article in HTML Format.

            HTML Format

            Figures

            Tables

            Media

            Share

            Share

            Share this Publication link

            Share on social media