Skip to main content

Position Tracking in 3D Space Based on a Data of a Single Camera

  • Conference paper
  • First Online:
Computational Science and Its Applications – ICCSA 2019 (ICCSA 2019)

Abstract

A high cost of equipment that solves the problem of tracking the position and direction of users in real time is one of factors that negatively affect the speed of development of the augmented reality industry. The urgency of this problem is a premise for the development of a financially available tracking system. In this research, we propose a software and hardware architecture of a system that solves three-dimensional tracking problems in a closed space and postures classification using neural network models. Distinctive feature of our system is the feasibility in borders of strictly limited computing power and the absence of any sensors placed on monitored objects. After setting the boundaries of the active area, all the necessary input data is provided by a static camera without an infrared filter. As an example of the implementation of a resource-limited solution, we present the assembly of this solution on a Raspberry Pi version 3 single board computer equipped with the Intel Neural Stick version 2 co-processor and a Raspberry version 2 NoIR camera. The first section of the article describes technical characteristics of the equipment used in the study. The second part is dedicated to the solution algorithm and its brief description. Further, in the third stage, the ways of data collection, necessary for a correct assessment of position, direction and posture are illustrated. The fourth, final section presents the results, discussion and possible directions for further work.

Supported by Russian Foundation for Basic Research, grant number 17-29-04288.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Parkhi, O.M., Vedaldi, A., Zisserman, A.: Deep face recognition. In: British Machine Vision Conference Swansea, UK (2015)

    Google Scholar 

  2. Chan, T., Jia, K., Gao, S., Lu, J., Zeng, Z., Ma, Y.: A simple deep learning baseline for image classification? IEEE Trans. Image Process. 24(12), 5017–5032 (2015)

    Article  MathSciNet  Google Scholar 

  3. Seo, J., Han, S., Lee, S., Kim, H.: Computer vision techniques for construction safety and health monitoring. Adv. Eng. Inform. 29(2), 239–251 (2015). ISSN 1474–0346

    Article  Google Scholar 

  4. Seiferling, I., Naik, N., Ratti, C., Proulx, R.: Green streets - quantifying and mapping urban trees with street-level imagery and computer vision. Landscape Urban Plann. 165, 93–101 (2017). ISSN 0169–2046

    Article  Google Scholar 

  5. Qiu, W., Yuille, A.: UnrealCV: connecting computer vision to unreal engine. In: Hua, G., Jégou, H. (eds.) ECCV 2016. LNCS, vol. 9915, pp. 909–916. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-49409-8_75

    Chapter  Google Scholar 

  6. Shafaei, A., Little, J.J., Schmidt, M.: Play and Learn: Using Video Games to Train Computer Vision Models. CoRR, vol. abs/1608.01745 (2016)

    Google Scholar 

  7. Kermadi, M., Berkouk, E.M.: Artificial intelligence-based maximum power point tracking controllers for photovoltaic systems: comparative study. Renew. Sustain. Energy Rev. 69, 369–386 (2017). ISSN 1364–0321

    Article  Google Scholar 

  8. Thies, J., Zollhöfer, M., Stamminger, M., Theobalt, C., Nießner, M.: FaceVR: Real-Time Facial Reenactment and Eye Gaze Control in Virtual Reality. CoRR, vol.abs/1610.03151

    Google Scholar 

  9. Zank, M., Nescher, T., Kunz, A.: Tracking human locomotion by relative positional feet tracking. In: IEEE Virtual Reality, pp. 317–318, Arles (2015). https://doi.org/10.1109/VR.2015.7223423

  10. Melekhov, I., Ylioinas, J., Kannala, J., Rahtu, E.: Relative camera pose estimation using convolutional neural networks. In: Blanc-Talon, J., Penne, R., Philips, W., Popescu, D., Scheunders, P. (eds.) ACIVS 2017. LNCS, vol. 10617, pp. 675–687. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-70353-4_57

    Chapter  Google Scholar 

  11. Rambach, J.R., Tewari, A., Pagani, A., Stricker, D.: Learning to fuse: a deep learning approach to visual-inertial camera pose estimation. In: 2016 IEEE International Symposium on Mixed and Augmented Reality (ISMAR), pp. 71–76 (2016)

    Google Scholar 

  12. Avila, L., Bailey, M.: Virtual reality for the masses. IEEE Comput. Graph. Appl. 34(05), 103–104 (2014). https://doi.org/10.1109/MCG.2014.103

    Article  Google Scholar 

  13. Li, Y., Dai, A., Guibas, L., Nießner, M.: Database-assisted object retrieval for real-time 3D reconstruction. Comput. Graph. Forum 34(2), 435–446 (2015). https://doi.org/10.1111/cgf.12573

    Article  Google Scholar 

  14. Sawyer, R., Smith, A., Rowe, J., Azevedo, R., Lester, J.: Models in game-based learning with facial expression recognition. In: Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, pp. 192–201, New York, NY, USA (2017)

    Google Scholar 

  15. Ruchti, P., Steder, B., Ruhnke, M, Burgard, W.: Localization on OpenStreetMap data using a 3D laser scanner. In: 2015 IEEE International Conference on Robotics and Automation (ICRA), pp. 5260–5265, Seattle, WA (2015). https://doi.org/10.1109/ICRA.2015.7139932

  16. Sajjad, M., et al.: Raspberry Pi assisted face recognition framework for enhanced law-enforcement services in smart cities. Future Gener. Comput. Syst. (2017)

    Google Scholar 

  17. Cao, Z., Hidalgo, G., Simon, T., Wei, S.-E., Sheikh, Y.: OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. CoRR, vol. abs/1812.08008 (2018)

    Google Scholar 

  18. State of streaming protocols (2018). https://blog.wmspanel.com/2018/12/state-of-streaming-protocols-2018-summary.html. Accessed 24 Dec 2018

  19. Pavllo, D., Feichtenhofer, C., Grangier, D., Auli, M.: 3D human pose estimation in video with temporal convolutions and semi-supervised training. CoRR, vol. abs/1811.11742 (2018)

    Google Scholar 

  20. Shi, W., et al.: Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. In: the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1874–1883 (2016)

    Google Scholar 

Download references

Acknowledgments

This research was partially supported by the Russian Foundation for Basic Research grants (projects no. 17-29-04288). The authors would like to acknowledge the Reviewers for the valuable recommendations that helped in the improvement of this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Iakushkin Oleg .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Oleg, I. et al. (2019). Position Tracking in 3D Space Based on a Data of a Single Camera. In: Misra, S., et al. Computational Science and Its Applications – ICCSA 2019. ICCSA 2019. Lecture Notes in Computer Science(), vol 11622. Springer, Cham. https://doi.org/10.1007/978-3-030-24305-0_58

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-24305-0_58

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-24304-3

  • Online ISBN: 978-3-030-24305-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics