Astral: An Autoencoder-Based Model for Pedestrian Trajectory Prediction of Variable-Length

Diao, Yupeng; Su, Yiteng; Zeng, Ximu; Chen, Xu; Liu, Shuncheng; Su, Han

doi:10.1007/978-3-031-11217-1_16


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13248)

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

Abstract

Predicting pedestrians' future behavior in a crowd plays an important role in many fields, such as autonomous driving, machine navigation, video surveillance, and intelligent security systems. The task is challenging because pedestrian motion is easily influenced by interactions with surrounding pedestrians. Previous work uses these interactions to make prediction more effective, but it assumes fixed-length input and therefore ignores shorter pedestrian trajectories. This leads to insufficient feature information and inaccurate predictions in some scenarios. In this paper, we propose an autoencoder-based model for pedestrian trajectory prediction of variable length (ASTRAL). First, we use an autoencoder to process pedestrian data with variable-length trajectories. Then, we use an optimized multi-head attention mechanism to extract the interactions between neighbors. Finally, we use an LSTM to decode the vectors and make predictions. We also fine-tune the model to improve its performance. We test our model and state-of-the-art methods on public benchmark datasets. Compared with the others, our model improves ADE (average displacement error) and FDE (final displacement error) by 9% and 33%, respectively, and thus predicts the future trajectories of pedestrians more effectively.
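
For readers unfamiliar with the two metrics named in the abstract, the following is a minimal sketch of how ADE and FDE are commonly computed for a single predicted trajectory. It assumes 2D positions and the usual definitions (mean Euclidean error over all predicted steps, and Euclidean error at the last step). The function name `displacement_errors` and the sample coordinates are hypothetical, and the paper's exact evaluation protocol is not described in this preview.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def displacement_errors(pred: List[Point], truth: List[Point]) -> Tuple[float, float]:
    """Return (ADE, FDE) for one predicted trajectory against its ground truth.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(pred) != len(truth) or not pred:
        raise ValueError("trajectories must be non-empty and of equal length")
    # Euclidean distance between predicted and true position at each time step.
    dists = [math.dist(p, t) for p, t in zip(pred, truth)]
    ade = sum(dists) / len(dists)  # average over all predicted steps
    fde = dists[-1]                # error at the final predicted step only
    return ade, fde


# Made-up example data: the prediction drifts upward by one unit per step.
truth = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
pred = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]
ade, fde = displacement_errors(pred, truth)
print(f"ADE={ade:.2f} FDE={fde:.2f}")
```

Benchmark results are typically reported as these quantities averaged over all pedestrians and scenes, so lower values of both indicate better predictions.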

Author information

Authors and Affiliations

School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
Yupeng Diao, Yiteng Su, Ximu Zeng, Xu Chen, Shuncheng Liu & Han Su
Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Chengdu, China
Yiteng Su & Han Su

Corresponding author

Correspondence to Han Su.

Editor information

Editors and Affiliations

University of Aizu, Aizu, Japan
Uday Kiran Rage
Indraprastha Institute of Information Technology, Delhi, India
Vikram Goyal
Data Sciences and Analytics Center, International Institute of Information Technology, Hyderabad, Telangana, India
P. Krishna Reddy

About this paper

Cite this paper

Diao, Y., Su, Y., Zeng, X., Chen, X., Liu, S., Su, H. (2022). Astral: An Autoencoder-Based Model for Pedestrian Trajectory Prediction of Variable-Length. In: Rage, U.K., Goyal, V., Reddy, P.K. (eds) Database Systems for Advanced Applications. DASFAA 2022 International Workshops. DASFAA 2022. Lecture Notes in Computer Science, vol 13248. Springer, Cham. https://doi.org/10.1007/978-3-031-11217-1_16

DOI: https://doi.org/10.1007/978-3-031-11217-1_16
Published: 16 July 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11216-4
Online ISBN: 978-3-031-11217-1
eBook Packages: Computer Science, Computer Science (R0)
