Abstract
The high cost of equipment for tracking the position and direction of users in real time is one of the factors slowing the development of the augmented reality industry. This motivates the development of an affordable tracking system. In this research, we propose a software and hardware architecture for a system that performs three-dimensional tracking in a closed space and posture classification using neural network models. The distinctive features of our system are that it runs within strictly limited computing power and requires no sensors on the monitored objects. Once the boundaries of the active area are set, all the necessary input data is provided by a static camera without an infrared filter. As an example of a resource-limited implementation, we present an assembly of this solution on a Raspberry Pi 3 single-board computer equipped with an Intel Neural Compute Stick 2 co-processor and a Raspberry Pi NoIR Camera v2. The first section of the article describes the technical characteristics of the equipment used in the study. The second section briefly describes the solution algorithm. The third section illustrates how the data needed to correctly assess position, direction and posture is collected. The fourth and final section presents the results, a discussion, and possible directions for further work.
Supported by the Russian Foundation for Basic Research, grant number 17-29-04288.
Acknowledgments
This research was partially supported by the Russian Foundation for Basic Research (project no. 17-29-04288). The authors thank the reviewers for their valuable recommendations, which helped improve this paper.
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Oleg, I. et al. (2019). Position Tracking in 3D Space Based on a Data of a Single Camera. In: Misra, S., et al. Computational Science and Its Applications – ICCSA 2019. ICCSA 2019. Lecture Notes in Computer Science, vol 11622. Springer, Cham. https://doi.org/10.1007/978-3-030-24305-0_58
DOI: https://doi.org/10.1007/978-3-030-24305-0_58
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-24304-3
Online ISBN: 978-3-030-24305-0
eBook Packages: Computer Science, Computer Science (R0)