Abstract
Fully automatic algorithm for estimating the 3D human pose from multiple uncalibrated cameras is presented. Unlike the state-of-the-art methods which use the estimated pose of previous frames to restrict the candidates of current frame, the proposed method uses the viewpoint of previous frame in order to obtain an accurate pose. This paper also introduces a method to incorporate pose estimation results of several cameras without using the calibration information. The algorithm employs a rich descriptor for matching purposes. The performance of the proposed method is evaluated on a soccer database which is captured by multiple cameras. The dataset of silhouettes, in which the related 3D skeleton poses are known, is also constructed. Experimental results show that the proposed algorithm has a high accuracy rate in estimation of 3D pose of soccer players.
Access this article
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
Similar content being viewed by others
Change history
19 November 2018
In the pdf version of this article, the captured corresponding author was incorrect when it is correctly presented in the online version. The correct corresponding author should be Hadi Seyedarabi. It is corrected in this correction article.
References
Agarwal A, Triggs B (2006) Recovering 3D human pose from monocular images. IEEE Trans Pattern Anal Mach Intell 28(1):44–58
Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509–522
Burenius M, Sullivan J, Carlsson S, Halvorsen K (2011) Human 3D motion computation from a varying number of cameras. Scandinavian Conference on Image Analysis (SCIA):24–35
Burenius M, Sullivan J, Carlsson S (2011) Motion capture from dynamic orthographic cameras. IEEE International Conference on Computer Vision Workshops (ICCV Workshops): 1634-1641
Burenius M, Sullivan J, Carlsson S (2013) 3D pictorial structures for multiple view articulated pose estimation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR): 3618-3625
Eichner M et al (2012) 2D articulated human pose estimation and retrieval in (almost) unconstrained still images. Int J Comput Vis 99(2):190–214
Felzenszwalb PF, Huttenlocher DP (2005) Pictorial structures for object recognition. Int J Comput Vis 61(1):55–79
Gall J, Rosenhahn B, Brox T, Seidel HP (2010) Optimization and filtering for human motion capture. Int J Comput Vis 87(1–2):75–92
Germann M, Hornung A, Keiser R, Ziegler R, Würmlin S, Gross M (2010) Articulated billboards for video-based rendering. Comput Graph Forum 29(2):585–594
Germann M, Popa T, Ziegler R, Keiser R, Gross M (2011) Space-time body pose estimation in uncontrolled environments. International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT):244-251
Hofmann M, Gavrila DM (2012) Multi-view 3D human pose estimation in complex environment. Int J Comput Vis 96(1):103–124
Holte MB, Tran C, Trivedi MM, Moeslund TB (2012) Human pose estimation and activity recognition from multi-view videos: comparative explorations of recent developments. J Sel Topics Signal Process 6(5):538–552
Howe NR (2007) Silhouette lookup for monocular 3D pose tracking. Image Vision Comput 25(3):331–341
Ji X, Liu H (2010) Advances in view-invariant human motion analysis: a review. IEEE Trans Syst Man Cybern Part C 40(1):13–24
Kazemi V, Sullivan J (2012) Using richer models for articulated pose estimation of footballers. IEEE British Machine Vision Conference (BMVC):1-10
Kazemi V, Burenius M, Azizpour H, Sullivan J (2013) Multi-view body part recognition with random forests. IEEE British machine vision conference (BMVC)
Moeslund TB, Granum E (2001) A survey of computer vision-based human motion capture. Comput Vis Image Underst 81(3):231–268
Moeslund TB, Hilton A, Krüger V (2006) A survey of advances in vision-based human motion capture and analysis. Comput Vis Image Underst 104(2–3):90–126
Moeslund TB, Hilton A, Krüger V, Sigal L (2011) Visual analysis of humans: looking at people. Springer
Mori G, Malik J (2006) Recovering 3D human body configurations using shape contexts. IEEE Trans Pattern Anal Mach Intell 28(7):1052–1062
Poppe R (2007) Vision-based human motion analysis: an overview. Comput Vis Image Underst 108(1–2):4–18
Shakhnarovich G, Viola P, Darrell T (2003) Fast pose estimation with parameter-sensitive hashing. Ninth IEEE International Conference on Computer Vision (ICCV):750-757
Sigal L, Black M (2010) Guest editorial: state of the art in image- and video-based human pose and motion estimation. Int J Comput Vis 87(1–2):1–3
Thomas GA (2006) Real-time camera pose estimation for augmenting sports scenes. IET Conf Publ 2006 (CP516): 10–19
Thomas G (2007) Real-time camera tracking using sports pitch markings. J Real-Time Image Proc 2(2):117–132
Yang Y and Ramanan D (2011) Articulated pose estimation with flexible mixtures-of-parts. IEEE conference on Computer Vision and Pattern Recognition (CVPR):1385–1392
Yi Y, Ramanan D (2013) Articulated human detection with flexible mixtures of parts. IEEE Trans Pattern Anal Mach Intell 35(12):2878–2890
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Afrouzian, R., Seyedarabi, H. & Kasaei, S. Pose estimation of soccer players using multiple uncalibrated cameras. Multimed Tools Appl 75, 6809–6827 (2016). https://doi.org/10.1007/s11042-015-2611-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2611-8