Skip to main content

Pose Based Trajectory Forecast of Vulnerable Road Users Using Recurrent Neural Networks

  • Conference paper
  • First Online:
  • 2380 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12661))

Abstract

In this work, we use Recurrent Neural Networks (RNNs) in form of Gated Recurrent Unit (GRU) networks to forecast trajectories of vulnerable road users (VRUs), such as pedestrians and cyclists, in road traffic utilizing the past trajectory and 3D poses as input. The 3D poses represent the postures and movements of limbs and torso and contain early indicators for the transition between motion types, e.g. wait, start, move, and stop. VRUs often only become visible from the perspective of an approaching vehicle shortly before dangerous situations occur. Therefore, a network architecture is required which is able to forecast trajectories after short time periods and is able to improve the forecasts in case of longer observations. This motivates us to use GRU networks, which are able to use time series of varying duration as inputs, and to investigate the effects of different observation periods on the forecasting results. Our approach is able to make reasonable forecasts even for short observation periods. The use of poses improves the forecasting accuracy, especially for short observation periods compared to a solely head trajectory based approach. Different motion types benefit to different extent from the use of poses and longer observation periods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Alahi, A., Goel, K., Ramanathan, V., Robicquet, A., Fei-Fei, L., Savarese, S.: Social LSTM: human trajectory prediction in crowded spaces. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 961–971. Las Vegas (2016)

    Google Scholar 

  2. Cao, Z., Simon, T., Wei, S.E., Sheikh, Y.: Realtime multi-person 2D pose estimation using part affinity fields. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1302–1310. Honolulu (2017)

    Google Scholar 

  3. Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. In: Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1724–1734. ACL (2014)

    Google Scholar 

  4. Fang, Z., López, A.M.: Is the pedestrian going to cross? answering by 2D pose estimation. In: IEEE Intelligent Vehicles Symposium (IV), Changshu, pp. 1271–1276 (2018)

    Google Scholar 

  5. Fang, Z., López, A.M.: Intention recognition of pedestrians and cyclists by 2D pose estimation. IEEE Trans. Intell. Transp. Syst. 21, 1–11 (2019)

    Google Scholar 

  6. Fang, Z., Vázquez, D., López, A.M.: On-board detection of pedestrian intentions. Sensors 17(10), 2193 (2017)

    Article  Google Scholar 

  7. Goldhammer, M.: Selbstlernende Algorithmen zur videobasierten Absichtserkennung von Fußgängern. Ph.D. thesis, University of Kassel (2016)

    Google Scholar 

  8. Hubert, A., Zernetsch, S., Doll, K., Sick, B.: Cyclists’ starting behavior at intersections. In: IEEE Intelligent Vehicles Symposium (IV), pp. 1071–1077. Los Angeles (2017)

    Google Scholar 

  9. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR), San Diego (2015)

    Google Scholar 

  10. Kitani, K.M., Ziebart, B.D., Bagnell, J.A., Hebert, M.: Activity forecasting. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7575, pp. 201–214. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33765-9_15

    Chapter  Google Scholar 

  11. Kress, V., Jung, J., Zernetsch, S., Doll, K., Sick, B.: Human pose estimation in real traffic scenes. In: IEEE Symposium Series on Computational Intelligence (SSCI), Bangalore, pp. 518–523 (2018)

    Google Scholar 

  12. Kress, V., Jung, J., Zernetsch, S., Doll, K., Sick, B.: Pose based start intention detection of cyclists. In: IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, pp. 2381–2386 (2019)

    Google Scholar 

  13. Kress, V., Zernetsch, S., Doll, K., Sick, B.: Pose based trajectory forecast of vulnerable road users. In: IEEE Symposium Series on Computational Intelligence (SSCI), Xiamen, pp. 1200–1207 (2019)

    Google Scholar 

  14. Lee, N., Choi, W., Vernaza, P., Choy, C.B., Torr, P.H.S., Chandraker, M.: DESIRE: distant future prediction in dynamic scenes with interacting agents. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2165–2174 (2017)

    Google Scholar 

  15. Pool, E.A.I., Kooij, J.F.P., Gavrila, D.M.: Context-based cyclist path prediction using Recurrent Neural Networks. In: IEEE Intelligent Vehicles Symposium (IV), pp. 824–830 (2019)

    Google Scholar 

  16. Quintero, R., Parra, I., Fernández-Llorca, D., Sotelo, M.A.: Pedestrian path, pose, and intention prediction through gaussian process dynamical models and pedestrian activity recognition. IEEE Trans. Intell. Transp. Syst. 20(5), 1803–1814 (2019)

    Article  Google Scholar 

  17. Radwan, N., Valada, A., Burgard, W.: Multimodal interaction-aware motion prediction for autonomous street crossing. arXiv:1808.06887 (2019)

  18. Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation In: Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1, pp. 318–362. MIT Press, Cambridge (1986)

    Google Scholar 

  19. Schneemann, F., Heinemann, P.: Context-based detection of pedestrian crossing intention for autonomous driving in urban environments. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, pp. 2243–2248 (2016)

    Google Scholar 

  20. Tome, D., Russell, C., Agapito, L.: Lifting from the deep: Convolutional 3d pose estimation from a single image. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5689–5698. Honolulu (2017)

    Google Scholar 

  21. Wang, S., et al.: Leverage of limb detection in pose estimation for vulnerable road users. In: IEEE Intelligent Transportation Systems Conference (ITSC), pp. 528–534 (2019)

    Google Scholar 

  22. Werbos, P.J.: Backpropagation through time: what it does and how to do it. Proc. IEEE 78(10), 1550–1560 (1990)

    Article  Google Scholar 

Download references

Acknowledgment

This work was supported by “Zentrum Digitalisierung.Bayern”. In addition, the work is backed by the project DeCoInt\(^2\), supported by the German Research Foundation (DFG) within the priority program SPP 1835: “Kooperativ interagierende Automobile”, grant numbers DO 1186/1-2 and SI 674/11-2.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Viktor Kress .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kress, V., Zernetsch, S., Doll, K., Sick, B. (2021). Pose Based Trajectory Forecast of Vulnerable Road Users Using Recurrent Neural Networks. In: Del Bimbo, A., et al. Pattern Recognition. ICPR International Workshops and Challenges. ICPR 2021. Lecture Notes in Computer Science(), vol 12661. Springer, Cham. https://doi.org/10.1007/978-3-030-68763-2_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-68763-2_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-68762-5

  • Online ISBN: 978-3-030-68763-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics