skip to main content
research-article

Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search

Published:07 April 2023Publication History
Skip Abstract Section

Abstract

In recent years, many spatial-temporal graph convolutional network (STGCN) models are proposed to deal with the spatial-temporal network data forecasting problem. These STGCN models have their own advantages, i.e., each of them puts forward many effective operations and achieves good prediction results in the real applications. If users can effectively utilize and combine these excellent operations integrating the advantages of existing models, then they may obtain more effective STGCN models thus create greater value using existing work. However, they fail to do so due to the lack of domain knowledge, and there is lack of automated system to help users to achieve this goal. In this article, we fill this gap and propose Auto-STGCN algorithm, which makes use of existing models to automatically explore high-performance STGCN model for specific scenarios. Specifically, we design Unified-STGCN framework, which summarizes the operations of existing architectures, and use parameters to control the usage and characteristic attributes of each operation, so as to realize the parameterized representation of the STGCN architecture and the reorganization and fusion of advantages. Then, we present Auto-STGCN, an optimization method based on reinforcement learning, to quickly search the parameter search space provided by Unified-STGCN, and generate optimal STGCN models automatically. Extensive experiments on real-world benchmark datasets show that our Auto-STGCN can find STGCN models superior to existing STGCN models used for search space construction, which demonstrates the effectiveness of our proposed method.

REFERENCES

  1. [1] Bacciu Davide, Errica Federico, and Micheli Alessio. 2018. Contextual graph Markov model: A deep and generative approach to graph processing. In Proceedings of the 35th International Conference on Machine Learning. Vol. 80, 304313.Google ScholarGoogle Scholar
  2. [2] Bai Lei, Yao Lina, Kanhere Salil S., Wang Xianzhi, Liu Wei, and Yang Zheng. 2019. Spatio-temporal graph convolutional and recurrent networks for citywide passenger demand prediction. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. ACM, 22932296.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Bai Lei, Yao Lina, Kanhere Salil S., Yang Zheng, Chu Jing, and Wang Xianzhi. 2019. Passenger demand forecasting with multi-task convolutional recurrent neural networks. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining. Vol. 11440, 2942.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Bellman Richard and Kalaba Robert. 1957. On the role of dynamic programming in statistical communication theory. IRE Transactions on Information Theory 3, 3 (1957), 197203.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Bello Irwan, Zoph Barret, Vasudevan Vijay, and Le Quoc V.. 2017. Neural optimizer search with reinforcement learning. In Proceedings of the 34th International Conference on Machine Learning. Vol. 70, PMLR, 459468.Google ScholarGoogle Scholar
  6. [6] Chen Chao, Petty Karl, Skabardonis Alexander, Varaiya Pravin, and Jia Zhanfeng. 2001. Freeway performance measurement system: Mining loop detector data. Transportation Research Record 1748, 1 (2001), 96102.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Chen Tianqi, Li Mu, Li Yutian, Lin Min, Wang Naiyan, Wang Minjie, Xiao Tianjun, Xu Bing, Zhang Chiyuan, and Zhang Zheng. 2015. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv:1512.01274. Retrieved from https://arxiv.org/abs/1512.01274.Google ScholarGoogle Scholar
  8. [8] Chen Yukang, Meng Gaofeng, Zhang Qian, Xiang Shiming, Huang Chang, Mu Lisen, and Wang Xinggang. 2019. RENAS: Reinforced evolutionary neural architecture search. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 47874796.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] Chiang Wei-Lin, Liu Xuanqing, Si Si, Li Yang, Bengio Samy, and Hsieh Cho-Jui. 2019. Cluster-GCN: An efficient algorithm for training deep and large graph convolutional networks. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 257266.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Deng Jinliang, Chen Xiusi, Fan Zipei, Jiang Renhe, Song Xuan, and Tsang Ivor W.. 2021. The pulse of urban transport: Exploring the co-evolving pattern for spatio-temporal forecasting. ACM Transactions on Knowledge Discovery from Data 15, 6 (2021), 103:1–103:25.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Drucker Harris, Burges Christopher J. C., Kaufman Linda, Smola Alexander J., and Vapnik Vladimir. 1996. Support vector regression machines. In Proceedings of the 9th International Conference on Neural Information Processing Systems. 155161.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Gao Yang, Yang Hong, Zhang Peng, Zhou Chuan, and Hu Yue. 2020. Graph neural architecture search. In Proceedings of the International Joint Conference on Artificial Intelligence. Bessiere Christian (Ed.), ijcai.org, 14031409.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Guo Shengnan, Lin Youfang, Feng Ning, Song Chao, and Wan Huaiyu. 2019. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 922929.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Guo Shengnan, Lin Youfang, Li Shijie, Chen Zhaoming, and Wan Huaiyu. 2019. Deep spatial-temporal 3D convolutional neural networks for traffic data forecasting. IEEE Transactions on Intelligent Transportation Systems. 20, 10 (2019), 39133926.Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Hamilton William L., Ying Zhitao, and Leskovec Jure. 2017. Inductive representation learning on large graphs. In Proceedings of the International Conference on Neural Information Processing Systems. 10241034.Google ScholarGoogle Scholar
  16. [16] Hosseini Ramtin, Yang Xingyi, and Xie Pengtao. 2021. DSRNA: Differentiable search of robust neural architectures. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Computer Vision Foundation/ IEEE, 61966205.Google ScholarGoogle ScholarCross RefCross Ref
  17. [17] Jaakkola Tommi S., Jordan Michael I., and Singh Satinder P.. 1994. On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation 6, 6 (1994), 11851201.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Jain Ashesh, Zamir Amir Roshan, Savarese Silvio, and Saxena Ashutosh. 2016. Structural-RNN: Deep learning on spatio-temporal graphs. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. 53085317.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Kong Xiangyuan, Xing Weiwei, Wei Xiang, Bao Peng, Zhang Jian, and Lu Wei. 2020. STGAT: Spatial-temporal graph attention networks for traffic flow forecasting. IEEE Access 8 (2020), 134363134372.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Kröse Ben J. A.. 1995. Learning from delayed rewards. Robotics and Autonomous Systems. 15, 4 (1995), 233235.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Li Yaguang, Yu Rose, Shahabi Cyrus, and Liu Yan. 2018. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. In Proceedings of the International Conference on Learning Representations.Google ScholarGoogle Scholar
  22. [22] Liu Hanxiao, Simonyan Karen, and Yang Yiming. 2019. DARTS: Differentiable architecture search. In Proceedings of the International Conference on Learning Representations.Google ScholarGoogle Scholar
  23. [23] Liu Ziqi, Chen Chaochao, Li Longfei, Zhou Jun, Li Xiaolong, Song Le, and Qi Yuan. 2019. GeniePath: Graph neural networks with adaptive receptive paths. In Proceedings of the AAAI Conference on Artificial Intelligence. 44244431.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Mnih Volodymyr, Kavukcuoglu Koray, Silver David, Rusu Andrei A., Veness Joel, Bellemare Marc G., Graves Alex, Riedmiller Martin A., Fidjeland Andreas, Ostrovski Georg, Petersen Stig, Beattie Charles, Sadik Amir, Antonoglou Ioannis, King Helen, Kumaran Dharshan, Wierstra Daan, Legg Shane, and Hassabis Demis. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529533.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Ng Andrew Y., Harada Daishi, and Russell Stuart J.. 1999. Policy invariance under reward transformations: Theory and application to reward shaping. In Proceedings of the 16th International Conference on Machine Learning. 278287.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Noy Asaf, Nayman Niv, Ridnik Tal, Zamir Nadav, Doveh Sivan, Friedman Itamar, Giryes Raja, and Zelnik Lihi. 2020. ASAP: Architecture search, anneal and prune. In Proceedings of the International Conference on Artificial Intelligence and Statistics. Vol. 108, PMLR, 493503.Google ScholarGoogle Scholar
  27. [27] Puterman Martin L.. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Real Esteban, Aggarwal Alok, Huang Yanping, and Le Quoc V.. 2019. Regularized evolution for image classifier architecture search. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 47804789.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Ren Pengzhen, Xiao Yun, Chang Xiaojun, Huang Poyao, Li Zhihui, Chen Xiaojiang, and Wang Xin. 2021. A comprehensive survey of neural architecture search: Challenges and solutions. ACM Computing Surveys 54, 4 (2021), 76:1–76:34.Google ScholarGoogle Scholar
  30. [30] Shi Xingjian, Chen Zhourong, Wang Hao, Yeung Dit-Yan, Wong Wai-Kin, and Woo Wang-chun. 2015. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In Proceedings of the International Conference on Neural Information Processing Systems. 802810.Google ScholarGoogle Scholar
  31. [31] Song Chao, Lin Youfang, Guo Shengnan, and Wan Huaiyu. 2020. Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence. 914921.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Tan Mingxing, Chen Bo, Pang Ruoming, Vasudevan Vijay, Sandler Mark, Howard Andrew, and Le Quoc V.. 2019. MnasNet: Platform-aware neural architecture search for mobile. In Proceedings of the Conference on Computer Vision and Pattern Recognition. 28202828.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Velickovic Petar, Cucurull Guillem, Casanova Arantxa, Romero Adriana, Liò Pietro, and Bengio Yoshua. 2018. Graph attention networks. In Proceedings of the International Conference on Learning Representations.Google ScholarGoogle Scholar
  34. [34] Williams Billy M. and Hoel Lester A.. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of Transportation Engineering 129, 6 (2003), 664672.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Xu Keyulu, Hu Weihua, Leskovec Jure, and Jegelka Stefanie. 2019. How powerful are graph neural networks? In Proceedings of the International Conference on Learning Representations.Google ScholarGoogle Scholar
  36. [36] Xu Mingxing, Dai Wenrui, Liu Chunmiao, Gao Xing, Lin Weiyao, Qi Guo-Jun, and Xiong Hongkai. 2020. Spatial-temporal transformer networks for traffic flow forecasting. arXiv:2001.02908. Retrieved from https://arxiv.org/abs/2001.02908.Google ScholarGoogle Scholar
  37. [37] Yan Caixia, Chang Xiaojun, Li Zhihui, Ge Zongyuan, Guan Weili, Zhu Lei, and Zheng Qinghua. 2021. ZeroNAS: Differentiable generative adversarial networks search for zero-shot learning. IEEE Transactions on Pattern Analysis and Machine Intelligence 44, 12 (112021), 97339740. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Yan Sijie, Xiong Yuanjun, and Lin Dahua. 2018. Spatial temporal graph convolutional networks for skeleton-based action recognition. In Proceedings of the AAAI Conference on Artificial Intelligence. 74447452.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Yang Zhaohui, Wang Yunhe, Chen Xinghao, Shi Boxin, Xu Chao, Xu Chunjing, Tian Qi, and Xu Chang. 2020. CARS: Continuous evolution for efficient neural architecture search. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] Yao Huaxiu, Wu Fei, Ke Jintao, Tang Xianfeng, Jia Yitian, Lu Siyu, Gong Pinghua, Ye Jieping, and Li Zhenhui. 2018. Deep multi-view spatial-temporal network for taxi demand prediction. In Proceedings of the AAAI Conference on Artificial Intelligence. 25882595.Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Yu Bing, Yin Haoteng, and Zhu Zhanxing. 2018. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. In Proceedings of the International Joint Conference on Artificial Intelligence. 36343640.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Zang Tianzi, Zhu Yanmin, Xu Yanan, and Yu Jiadi. 2021. Jointly modeling spatio-temporal dependencies and daily flow correlations for crowd flow prediction. ACM Transactions on Knowledge Discovery from Data 15, 4 (2021), 58:1–58:20.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Zhang Dalin, Yao Lina, Chen Kaixuan, Wang Sen, Chang Xiaojun, and Liu Yunhao. 2020. Making sense of spatio-temporal preserving representations for EEG-based human intention recognition. IEEE Transactions on Cybernetics 50, 7 (2020), 30333044. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Zhang Yuyu, Chen Xinshi, Yang Yuan, Ramamurthy Arun, Li Bo, Qi Yuan, and Song Le. 2020. Efficient probabilistic logic reasoning with graph neural networks. In Proceedings of the International Conference on Learning Representations.Google ScholarGoogle Scholar
  45. [45] Zhong Zhao, Yan Junjie, Wu Wei, Shao Jing, and Liu Cheng-Lin. 2018. Practical block-wise neural network architecture generation. In Proceedings of the Conference on Computer Vision and Pattern Recognition. 24232432.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Zoph Barret, Vasudevan Vijay, Shlens Jonathon, and Le Quoc V.. 2018. Learning transferable architectures for scalable image recognition. In Proceedings of the Conference on Computer Vision and Pattern Recognition. 86978710.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Auto-STGCN: Autonomous Spatial-Temporal Graph Convolutional Network Search

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Knowledge Discovery from Data
          ACM Transactions on Knowledge Discovery from Data  Volume 17, Issue 5
          June 2023
          386 pages
          ISSN:1556-4681
          EISSN:1556-472X
          DOI:10.1145/3583066
          Issue’s Table of Contents

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 7 April 2023
          • Online AM: 1 December 2022
          • Accepted: 3 November 2022
          • Revised: 17 September 2022
          • Received: 21 February 2022
          Published in tkdd Volume 17, Issue 5

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        View Full Text

        HTML Format

        View this article in HTML Format .

        View HTML Format