research-article

Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search

Authors:
Chunnan Wang

Harbin Institute of Technology, Heilongjiang, China

Harbin Institute of Technology, Heilongjiang, China

0000-0002-8971-7096
View Profile

,
Kaixin Zhang

Harbin Institute of Technology, Heilongjiang, China

Harbin Institute of Technology, Heilongjiang, China

0000-0002-1596-7287
View Profile

,
Hongzhi Wang

Harbin Institute of Technology, Heilongjiang, China

Harbin Institute of Technology, Heilongjiang, China

0000-0002-7521-2871
View Profile

,
Bozhou Chen

Harbin Institute of Technology, Heilongjiang, China

Harbin Institute of Technology, Heilongjiang, China

0000-0002-9086-2995
View Profile

ACM Transactions on Knowledge Discovery from Data Volume 17 Issue 5Article No.: 73pp 1–21https://doi.org/10.1145/3571285

Published:07 April 2023Publication History

ACM Transactions on Knowledge Discovery from Data

Abstract

In recent years, many spatial-temporal graph convolutional network (STGCN) models are proposed to deal with the spatial-temporal network data forecasting problem. These STGCN models have their own advantages, i.e., each of them puts forward many effective operations and achieves good prediction results in the real applications. If users can effectively utilize and combine these excellent operations integrating the advantages of existing models, then they may obtain more effective STGCN models thus create greater value using existing work. However, they fail to do so due to the lack of domain knowledge, and there is lack of automated system to help users to achieve this goal. In this article, we fill this gap and propose Auto-STGCN algorithm, which makes use of existing models to automatically explore high-performance STGCN model for specific scenarios. Specifically, we design Unified-STGCN framework, which summarizes the operations of existing architectures, and use parameters to control the usage and characteristic attributes of each operation, so as to realize the parameterized representation of the STGCN architecture and the reorganization and fusion of advantages. Then, we present Auto-STGCN, an optimization method based on reinforcement learning, to quickly search the parameter search space provided by Unified-STGCN, and generate optimal STGCN models automatically. Extensive experiments on real-world benchmark datasets show that our Auto-STGCN can find STGCN models superior to existing STGCN models used for search space construction, which demonstrates the effectiveness of our proposed method.

REFERENCES

[1] Bacciu Davide, Errica Federico, and Micheli Alessio. 2018. Contextual graph Markov model: A deep and generative approach to graph processing. In Proceedings of the 35th International Conference on Machine Learning. Vol. 80, 304–313.Google Scholar
[2] Bai Lei, Yao Lina, Kanhere Salil S., Wang Xianzhi, Liu Wei, and Yang Zheng. 2019. Spatio-temporal graph convolutional and recurrent networks for citywide passenger demand prediction. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. ACM, 2293–2296.Google ScholarDigital Library
[3] Bai Lei, Yao Lina, Kanhere Salil S., Yang Zheng, Chu Jing, and Wang Xianzhi. 2019. Passenger demand forecasting with multi-task convolutional recurrent neural networks. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Vol. 11440, 29–42.Google ScholarDigital Library
[4] Bellman Richard and Kalaba Robert. 1957. On the role of dynamic programming in statistical communication theory. IRE Transactions on Information Theory 3, 3 (1957), 197–203.Google ScholarCross Ref
[5] Bello Irwan, Zoph Barret, Vasudevan Vijay, and Le Quoc V.. 2017. Neural optimizer search with reinforcement learning. In Proceedings of the 34th International Conference on Machine Learning. Vol. 70, PMLR, 459–468.Google Scholar
[6] Chen Chao, Petty Karl, Skabardonis Alexander, Varaiya Pravin, and Jia Zhanfeng. 2001. Freeway performance measurement system: Mining loop detector data. Transportation Research Record 1748, 1 (2001), 96–102.Google ScholarCross Ref
[7] Chen Tianqi, Li Mu, Li Yutian, Lin Min, Wang Naiyan, Wang Minjie, Xiao Tianjun, Xu Bing, Zhang Chiyuan, and Zhang Zheng. 2015. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv:1512.01274. Retrieved from https://arxiv.org/abs/1512.01274.Google Scholar
[8] Chen Yukang, Meng Gaofeng, Zhang Qian, Xiang Shiming, Huang Chang, Mu Lisen, and Wang Xinggang. 2019. RENAS: Reinforced evolutionary neural architecture search. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4787–4796.Google ScholarCross Ref
[9] Chiang Wei-Lin, Liu Xuanqing, Si Si, Li Yang, Bengio Samy, and Hsieh Cho-Jui. 2019. Cluster-GCN: An efficient algorithm for training deep and large graph convolutional networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 257–266.Google ScholarDigital Library
[10] Deng Jinliang, Chen Xiusi, Fan Zipei, Jiang Renhe, Song Xuan, and Tsang Ivor W.. 2021. The pulse of urban transport: Exploring the co-evolving pattern for spatio-temporal forecasting. ACM Transactions on Knowledge Discovery from Data 15, 6 (2021), 103:1–103:25.Google ScholarDigital Library
[11] Drucker Harris, Burges Christopher J. C., Kaufman Linda, Smola Alexander J., and Vapnik Vladimir. 1996. Support vector regression machines. In Proceedings of the 9th International Conference on Neural Information Processing Systems. 155–161.Google ScholarDigital Library
[12] Gao Yang, Yang Hong, Zhang Peng, Zhou Chuan, and Hu Yue. 2020. Graph neural architecture search. In Proceedings of the International Joint Conference on Artificial Intelligence. Bessiere Christian (Ed.), ijcai.org, 1403–1409.Google ScholarCross Ref
[13] Guo Shengnan, Lin Youfang, Feng Ning, Song Chao, and Wan Huaiyu. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 922–929.Google ScholarDigital Library
[14] Guo Shengnan, Lin Youfang, Li Shijie, Chen Zhaoming, and Wan Huaiyu. 2019. Deep spatial-temporal 3D convolutional neural networks for traffic data forecasting. IEEE Transactions on Intelligent Transportation Systems. 20, 10 (2019), 3913–3926.Google ScholarCross Ref
[15] Hamilton William L., Ying Zhitao, and Leskovec Jure. 2017. Inductive representation learning on large graphs. In Proceedings of the International Conference on Neural Information Processing Systems. 1024–1034.Google Scholar
[16] Hosseini Ramtin, Yang Xingyi, and Xie Pengtao. 2021. DSRNA: Differentiable search of robust neural architectures. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Computer Vision Foundation/ IEEE, 6196–6205.Google ScholarCross Ref
[17] Jaakkola Tommi S., Jordan Michael I., and Singh Satinder P.. 1994. On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation 6, 6 (1994), 1185–1201.Google ScholarDigital Library
[18] Jain Ashesh, Zamir Amir Roshan, Savarese Silvio, and Saxena Ashutosh. 2016. Structural-RNN: Deep learning on spatio-temporal graphs. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. 5308–5317.Google ScholarCross Ref
[19] Kong Xiangyuan, Xing Weiwei, Wei Xiang, Bao Peng, Zhang Jian, and Lu Wei. 2020. STGAT: Spatial-temporal graph attention networks for traffic flow forecasting. IEEE Access 8 (2020), 134363–134372.Google ScholarCross Ref
[20] Kröse Ben J. A.. 1995. Learning from delayed rewards. Robotics and Autonomous Systems. 15, 4 (1995), 233–235.Google ScholarCross Ref
[21] Li Yaguang, Yu Rose, Shahabi Cyrus, and Liu Yan. 2018. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In Proceedings of the International Conference on Learning Representations.Google Scholar
[22] Liu Hanxiao, Simonyan Karen, and Yang Yiming. 2019. DARTS: Differentiable architecture search. In Proceedings of the International Conference on Learning Representations.Google Scholar
[23] Liu Ziqi, Chen Chaochao, Li Longfei, Zhou Jun, Li Xiaolong, Song Le, and Qi Yuan. 2019. GeniePath: Graph neural networks with adaptive receptive paths. In Proceedings of the AAAI Conference on Artificial Intelligence. 4424–4431.Google ScholarDigital Library
[24] Mnih Volodymyr, Kavukcuoglu Koray, Silver David, Rusu Andrei A., Veness Joel, Bellemare Marc G., Graves Alex, Riedmiller Martin A., Fidjeland Andreas, Ostrovski Georg, Petersen Stig, Beattie Charles, Sadik Amir, Antonoglou Ioannis, King Helen, Kumaran Dharshan, Wierstra Daan, Legg Shane, and Hassabis Demis. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529–533.Google ScholarCross Ref
[25] Ng Andrew Y., Harada Daishi, and Russell Stuart J.. 1999. Policy invariance under reward transformations: Theory and application to reward shaping. In Proceedings of the 16th International Conference on Machine Learning. 278–287.Google ScholarDigital Library
[26] Noy Asaf, Nayman Niv, Ridnik Tal, Zamir Nadav, Doveh Sivan, Friedman Itamar, Giryes Raja, and Zelnik Lihi. 2020. ASAP: Architecture search, anneal and prune. In Proceedings of the International Conference on Artificial Intelligence and Statistics. Vol. 108, PMLR, 493–503.Google Scholar
[27] Puterman Martin L.. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley.Google ScholarCross Ref
[28] Real Esteban, Aggarwal Alok, Huang Yanping, and Le Quoc V.. 2019. Regularized evolution for image classifier architecture search. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 4780–4789.Google ScholarDigital Library
[29] Ren Pengzhen, Xiao Yun, Chang Xiaojun, Huang Poyao, Li Zhihui, Chen Xiaojiang, and Wang Xin. 2021. A comprehensive survey of neural architecture search: Challenges and solutions. ACM Computing Surveys 54, 4 (2021), 76:1–76:34.Google Scholar
[30] Shi Xingjian, Chen Zhourong, Wang Hao, Yeung Dit-Yan, Wong Wai-Kin, and Woo Wang-chun. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In Proceedings of the International Conference on Neural Information Processing Systems. 802–810.Google Scholar
[31] Song Chao, Lin Youfang, Guo Shengnan, and Wan Huaiyu. 2020. Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 914–921.Google ScholarCross Ref
[32] Tan Mingxing, Chen Bo, Pang Ruoming, Vasudevan Vijay, Sandler Mark, Howard Andrew, and Le Quoc V.. 2019. MnasNet: Platform-aware neural architecture search for mobile. In Proceedings of the Conference on Computer Vision and Pattern Recognition. 2820–2828.Google ScholarCross Ref
[33] Velickovic Petar, Cucurull Guillem, Casanova Arantxa, Romero Adriana, Liò Pietro, and Bengio Yoshua. 2018. Graph attention networks. In Proceedings of the International Conference on Learning Representations.Google Scholar
[34] Williams Billy M. and Hoel Lester A.. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of Transportation Engineering 129, 6 (2003), 664–672.Google ScholarCross Ref
[35] Xu Keyulu, Hu Weihua, Leskovec Jure, and Jegelka Stefanie. 2019. How powerful are graph neural networks? In Proceedings of the International Conference on Learning Representations.Google Scholar
[36] Xu Mingxing, Dai Wenrui, Liu Chunmiao, Gao Xing, Lin Weiyao, Qi Guo-Jun, and Xiong Hongkai. 2020. Spatial-temporal transformer networks for traffic flow forecasting. arXiv:2001.02908. Retrieved from https://arxiv.org/abs/2001.02908.Google Scholar
[37] Yan Caixia, Chang Xiaojun, Li Zhihui, Ge Zongyuan, Guan Weili, Zhu Lei, and Zheng Qinghua. 2021. ZeroNAS: Differentiable generative adversarial networks search for zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 12 (112021), 9733–9740. DOI:Google ScholarCross Ref
[38] Yan Sijie, Xiong Yuanjun, and Lin Dahua. 2018. Spatial temporal graph convolutional networks for skeleton-based action recognition. In Proceedings of the AAAI Conference on Artificial Intelligence. 7444–7452.Google ScholarCross Ref
[39] Yang Zhaohui, Wang Yunhe, Chen Xinghao, Shi Boxin, Xu Chao, Xu Chunjing, Tian Qi, and Xu Chang. 2020. CARS: Continuous evolution for efficient neural architecture search. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE.Google ScholarCross Ref
[40] Yao Huaxiu, Wu Fei, Ke Jintao, Tang Xianfeng, Jia Yitian, Lu Siyu, Gong Pinghua, Ye Jieping, and Li Zhenhui. 2018. Deep multi-view spatial-temporal network for taxi demand prediction. In Proceedings of the AAAI Conference on Artificial Intelligence. 2588–2595.Google ScholarCross Ref
[41] Yu Bing, Yin Haoteng, and Zhu Zhanxing. 2018. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. In Proceedings of the International Joint Conference on Artificial Intelligence. 3634–3640.Google ScholarCross Ref
[42] Zang Tianzi, Zhu Yanmin, Xu Yanan, and Yu Jiadi. 2021. Jointly modeling spatio-temporal dependencies and daily flow correlations for crowd flow prediction. ACM Transactions on Knowledge Discovery from Data 15, 4 (2021), 58:1–58:20.Google ScholarDigital Library
[43] Zhang Dalin, Yao Lina, Chen Kaixuan, Wang Sen, Chang Xiaojun, and Liu Yunhao. 2020. Making sense of spatio-temporal preserving representations for EEG-based human intention recognition. IEEE Transactions on Cybernetics 50, 7 (2020), 3033–3044. DOI:Google ScholarCross Ref
[44] Zhang Yuyu, Chen Xinshi, Yang Yuan, Ramamurthy Arun, Li Bo, Qi Yuan, and Song Le. 2020. Efficient probabilistic logic reasoning with graph neural networks. In Proceedings of the International Conference on Learning Representations.Google Scholar
[45] Zhong Zhao, Yan Junjie, Wu Wei, Shao Jing, and Liu Cheng-Lin. 2018. Practical block-wise neural network architecture generation. In Proceedings of the Conference on Computer Vision and Pattern Recognition. 2423–2432.Google ScholarCross Ref
[46] Zoph Barret, Vasudevan Vijay, Shlens Jonathon, and Le Quoc V.. 2018. Learning transferable architectures for scalable image recognition. In Proceedings of the Conference on Computer Vision and Pattern Recognition. 8697–8710.Google ScholarCross Ref

Index Terms

Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search
1. Computing methodologies
  1. Artificial intelligence
    1. Search methodologies
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks
2. Information systems
  1. Information systems applications
    1. Spatial-temporal systems

Recommendations

Traffic Spatial-Temporal Prediction Based on Neural Architecture Search
SSTD '23: Proceedings of the 18th International Symposium on Spatial and Temporal Data

Traffic spatial-temporal prediction is essential for intelligent transportation systems. However, the current approach relies heavily on expert knowledge and time-consuming manual modeling. Neural architecture search can build models adaptively, but it ...
Read More
Automated Search for Configurations of Convolutional Neural Network Architectures
SPLC '19: Proceedings of the 23rd International Systems and Software Product Line Conference - Volume A

Convolutional Neural Networks (CNNs) are intensively used to solve a wide variety of complex problems. Although powerful, such systems require manual configuration and tuning. To this end, we view CNNs as configurable systems and propose an end-to-end ...
Read More
Auto-Keras: An Efficient Neural Architecture Search System
KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Neural architecture search (NAS) has been proposed to automatically tune deep neural networks, but existing search algorithms, e.g., NASNet, PNAS, usually suffer from expensive computational cost. Network morphism, which keeps the functionality of a ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Knowledge Discovery from Data Volume 17, Issue 5
June 2023
386 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3583066
Editor:
Charu Aggarwal
IBM T. J. Watson Research, USA
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 7 April 2023
- Online AM: 1 December 2022
- Accepted: 3 November 2022
- Revised: 17 September 2022
- Received: 21 February 2022
Published in tkdd Volume 17, Issue 5

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Automated machine learning
spatial-temporal graph convolutional network
neural architecture search
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 3
  Total Citations
  View Citations
- 731
  Total Downloads
- Downloads (Last 12 months)518
- Downloads (Last 6 weeks)59
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

HTML Format

View this article in HTML Format .

View HTML Format

Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search

ACM Transactions on Knowledge Discovery from Data

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Traffic Spatial-Temporal Prediction Based on Neural Architecture Search

Automated Search for Configurations of Convolutional Neural Network Architectures

Auto-Keras: An Efficient Neural Architecture Search System