Abstract
In the field of computer vision, research on human pose estimation aims to understand human movements by analyzing individual camera images. However, identifying individuals in multi-people pose estimation remains a challenging issue. This study explores ID assignment technology to address the technical challenge of identifying individuals and maintaining consistent identities over consecutive images. Previous studies often required additional training to perform optimally in the wild environments. Therefore, in order to address the limitation of the previous studies, we propose a novel methodology that performs real-time image analysis and evaluation without the need for any prior training or additional datasets. By identifying the number of people and analyzing individual characteristics in each frame, we compare the characteristics between the current and previous frames, assigning the same ID to the individuals showing the highest similarity. This methodology enhances the applicability of human pose estimation systems in the wild and in real-time without training. We conducted experiments across four distinct scenarios and observed that the average ID assignment accuracy across all data was approximately 86.01%. Consequently, our research is expected to make significant contributions to various applications, including human-computer interaction, sports analysis, healthcare, and surveillance systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ding, Z., Ji, Y., Gan, Y., Wang, Y., Xia, Y.: Current status and trends of technology, methods, and applications of human-computer intelligent interaction (HCII): a bibliometric research. Multimedia Tools Appl. (2024). https://doi.org/10.1007/s11042-023-18096-6
Samaila, Y.A., et al.: Video anomaly detection: a systematic review of issues and prospects. Neurocomputing 591, 127726 (2024). https://doi.org/10.1016/j.neucom.2024.127726
Andriluka, M., et al.: PoseTrack: a benchmark for human pose estimation and tracking. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018). https://doi.org/10.1109/cvpr.2018.00542
Cao, Z., Simon, T., Wei, S.-E., Sheikh, Y.: Realtime multi-person 2D pose estimation using part affinity fields. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017). https://doi.org/10.1109/cvpr.2017.143
Sun, K., Xiao, B., Liu, D., Wang, J.: Deep high-resolution representation learning for human pose estimation. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019). https://doi.org/10.1109/cvpr.2019.00584
Kocabas, M., Karagoz, S., Akbas, E.: MultiPoseNet: fast multi-person pose estimation using pose residual network. In: Computer Vision – ECCV 2018, LNCS, vol. 437, pp. 437–453. Springer, Heidelberg (2018). https://doi.org/10.1007/978-3-030-01252-626
Iqbal, U., Milan, A., Gall, J.: PoseTrack: joint multi-person pose estimation and tracking. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2017). https://doi.org/10.1109/cvpr.2017.495
Liu, Z., et al.: Deep dual consecutive network for human pose estimation. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2021). https://doi.org/10.1109/cvpr46437.2021.00059
Wu, S., Wang, B.: DRSI-Net: dual-residual spatial interaction network for multi-person pose estimation. Knowl.-Based Syst. 295, 111836 (2024). https://doi.org/10.1016/j.knosys.2024.111836
Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002). https://doi.org/10.1109/tpami.2002.1017623
Ahonen, T., Hadid, A., Pietikäinen, M.: Face recognition with local binary patterns. In: Pajdla, T., Matas, J. (eds.) ECCV 2004. LNCS, vol. 3021, pp. 469–481. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-24670-1_36
Ghosh, J., Strehl, A.: Similarity-based text clustering: a comparative study. In: Grouping Multidimensional Data, pp. 73–97. Springer, Heidelberg (2006). https://doi.org/10.1007/3-540-28349-83
Ivaturi, S.S., Thalatam, M.N., Bugatha, A.S., Ladi, S.S., Botta, D.T.: Hybrid movie recommendation system based on user preferences and item similarity. In: Springer Proceedings in Mathematics & Statistics, pp. 671–681 Springer, Heidelberg (2024). https://doi.org/10.1007/978-3-031-51167-764
Mehta, D., et al.: XNect: real-time multi-person 3D motion capture with a single RGB camera. ACM Trans. Graph. (TOG) 39(4), 82–1 (2020)
Varadarajan, S., Datta, P., Tickoo, O.: A greedy part assignment algorithm for real-time multi-person 2D pose estimation. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV) (2018). https://doi.org/10.1109/wacv.2018.00052
Cao, Z., Hidalgo, G., Simon, T., Wei, S.-E., Sheikh, Y.: OpenPose: realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans. Pattern Anal. Mach. Intell. 43(1), 172–186 (2021). https://doi.org/10.1109/tpami.2019.2929257
Bhattacharyya, A.: On a measure of divergence between two statistical populations defined by their probability distribution. Bull. Calcutta Math. Soc. 35, 99–110 (1943)
Novak, C.L., Shafer, S.A.: Method for estimating scene parameters from color histograms. JOSA A 11(11), 3020–3036 (1994)
Cha, S.H., Srihari, S.N.: On measuring the distance between histograms. Pattern Recogn. 35(6), 1355–1370 (2002)
Lin, T.-Y., Ma, H., Hays, J.: Microsoft COCO: common objects in context. In: *European Conference on Computer Vision (ECCV)*, pp. 740–755. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-319-10602-148
Andriluka, M., Pishchulin, L., Gehler, P.V., Schiele, B.: 2D human pose estimation: new benchmark and state of the art analysis. In: *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)*, pp. 3686–3693. IEEE (2014). https://doi.org/10.1109/CVPR.2014.471
Acknowledgements
This research was supported by Culture, Sports and Tourism R & D Program through Korea Creative Content Agency grant funded by the Ministry of Culture, Sports and Tourism in 2022. (Project Name: Reality and virtual creation, cooperation, and participation-type variable complex space Silk Road content platform for youth, Project Number: R2022020120, Contribution Rate: 100%)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Shin, P., Kwon, O. (2025). Human Pose Estimation-Based ID Assignment Method in the Wild: A Real-Time Approach. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2024. Lecture Notes in Computer Science, vol 15046. Springer, Cham. https://doi.org/10.1007/978-3-031-77392-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-031-77392-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-77391-4
Online ISBN: 978-3-031-77392-1
eBook Packages: Computer ScienceComputer Science (R0)