Abstract
This paper describes a method for estimating human distributions (quantities and locations) based on multiple-viewpoint image sequences. In the field of human image analysis, inter-human occlusion is a significant problem: when a scene includes a large number of occlusions, tracking of individual persons becomes difficult. Therefore, updating a tracking-based model is not enough to estimate the distribution in complex scenes. In our method, the number of persons and their locations are directly estimated from a set of input images based on the fitting of a projected shape model. The model’s complexity (number of persons) is determined based on the MDL (minimum description length) criterion. In addition, the image areas occluded by static objects are also detected and automatically excluded from the human distribution computations. We confirmed the feasibility of the proposed method through experiments using both synthesized and real images. Results show the effectiveness of our method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
O’Rourke, J., Badler, N.J.: Model-based image analysis of human motion using constraint propagation. IEEE Pattern Analysis and Machine Intelligence 2, 522–536 (1980)
Azarbayejani, A., Pentland, A.: Real-time self-calibrating stereo person tracking using 3-d shape estimation from blob features. In: Proceedings of 13th International Conference on Pattern Recognition, pp. 627–632 (1996)
Wren, C., Azarbayejani, A., Darrell, T., Pentland, A.: Pfinder: Real-time tracking of the human body. In: SPIE proceeding, vol. 2615, pp. 89–98 (1996)
Waggg, D.K., Nixon, M.S.: On automated model-based extraction and analysis of gait. In: Proc. of the 6th IEEE International Conference on Automatic Face and Gesture Recognition, pp. 11–16 (2004)
Lim, J., Kriegman, D.: Tracking humans using prior and learned representations of shape and appearance. In: Proc. of the 6th IEEE International Conference on Automatic Face and Gesture Recognition, pp. 869–874 (2004)
Leibe, B., Seemann, E., Schiele, B.: Pedestrian detection in crowed scenes. In: Proc. of Computer Vision and Pattern Recognition, vol. 1, pp. 20–25 (2005)
Utsumi, A., Tetsutani, N.: Human detection using geometrical pixel value structures. In: Proc. of the 5th IEEE International Conference on Automatic Face and Gesture Recognition, pp. 372–377 (2002)
Segen, J., Pingali, S.: A camera-based system for tracking people in real time. In: Proceedings of 13th International Conference on Pattern Recognition, pp. 63–67 (1996)
Cai, Q., Mitiche, A., Aggarwal, J.K.: Tracking human motion in an indoor environment. In: Proceedings of 2nd International Conference on Image Processing, pp. 215–218 (1995)
Cai, Q., Aggarwal, J.K.: Tracking human motion using multiple cameras. In: Proceedings of 13th International Conference on Pattern Recognition, pp. 68–72 (1996)
Kettnaker, V., Zabih, R.: Counting people from multiple cameras. In: Proc. of the IEEE International Conference on Multimedia Computing and Systems, vol. 2, pp. 7–11 (1999)
Arita, D., ichiro, T.R., Yonemoto, S., Hamada, Y.: A real-time multi-view image processing system on pc cluster. In: Proceedings of Fourth Asian Conference on Computer Vision, pp. 270–275 (2000)
Papageorgiou, C., Evgeniou, T., Poggio, T.: A trainable pedestrian detection system. In: Proc. of Intelligent Vehicles, pp. 241–246 (1998)
Isard, M., Blake, A.: Condensation - conditional density propagation for visual tracking. International Journal of Computer Vision 29, 5–28 (1998)
Rissanen, J.: A universal prior for integers and estimation by minimum description length. The Annals of Stat. 11, 416–431 (1983)
Zhu, S.C., Yoille, A.: Region competition: Unifying snakes, region growing, and bayes/mdl for multiband image segmentation. IEEE Pattern Analysis and Machine Intelligence 18, 884–900 (1996)
Cham, T., Cipolla, R.: Automated b-spline curve representation incorporating mdl and error-minimizing control point insertion strategies. IEEE Pattern Analysis and Machine Intelligence 21, 49–53 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Utsumi, A., Yamazoe, H., Hosaka, Ki., Igi, S. (2006). Human Distribution Estimation Using Shape Projection Model Based on Multiple-Viewpoint Observations. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3851. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612032_80
Download citation
DOI: https://doi.org/10.1007/11612032_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31219-2
Online ISBN: 978-3-540-32433-1
eBook Packages: Computer ScienceComputer Science (R0)