Abstract
In this paper, the design and implementation of a hardware/software platform for parallel and distributed multiview vision processing is presented. The platform is focused at supporting the monitoring of human presence in indoor environments. Its architecture is focused at increased throughput through process pipelining as well as at reducing communication costs and hardware requirements. Using this platform, we present efficient implementations of basic visual processes such as person tracking, textured visual hull computation and head pose estimation. Using the proposed platform multiview visual operations can be combined and third-party ones integrated, to ultimately facilitate the development of interactive applications that employ visual input. Computational performance is benchmarked comparatively to state of the art and the efficacy of the approach is qualitatively assessed in the context of already developed applications related to interactive environments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ramachandran, U., Nikhil, R., Rehg, J., Angelov, Y., Paul, A., Adhikari, S., Mackenzie, K., Harel, N., Knobe, K.: Stampede: a cluster programming middleware for interactive stream-oriented applications. IEEE Trans. Parallel and Distributed Systems 14, 1140–1154 (2003)
Gualdi, G., Prati, A., Cucchiara, R., Ardizzone, E., Cascia, M.L., Presti, L.L., Morana, M.: Enabling technologies on hybrid camera networks for behavioral analysis of unattended indoor environments and their surroundings. In: ACM Multimedia Workshops, pp. 101–108 (2008)
Chen, P., Ahammad, P., Boyer, C., Huang, S., Lin, L., Lobaton, E., Meingast, M., Oh, S., Wang, S., Yan, P., Yang, A., Yeo, C., Chang, L., Tygar, J., Sastry, S.: CITRIC: A low-bandwidth wireless camera network platform. In: ACM/IEEE Int. Conference on Distributed Smart Cameras, pp. 1–10 (2008)
Jung, S.H., Bajcsy, R.: A framework for constructing real-time immersive environments for training physical activities. Journal of Multimedia 1, 9–17 (2006)
Allard, J., Franco, J., Menier, C., Boyer, E., Raffin, B.: The Grimage platform: A mixed reality environment for interactions. In: ICCVS (2006)
Ladikos, A., Benhimane, S., Navab, N.: Efficient visual hull computation for real-time 3d reconstruction using CUDA. In: CVPR Workshops, pp. 1–8 (2008)
Schick, A., Stiefelhagen, R.: Real-time GPU-based voxel carving with systematic occlusion handling. In: Denzler, J., Notni, G., Süße, H. (eds.) DAGM 2009. LNCS, vol. 5748, pp. 372–381. Springer, Heidelberg (2009)
Laurentini, A.: The visual hull concept for silhouette-based image understanding. PAMI 16, 150–162 (1994)
Sarmis, T., Zabulis, X., Argyros, A.A.: A checkerboard detection utility for intrinsic and extrinsic camera cluster calibration. Technical Report TR-397 (2009)
Bouguet, J.Y.: Camera calibration toolbox for Matlab, http://www.vision.caltech.edu/bouguetj/calib_doc
Lourakis, M., Argyros, A.: SBA: A software package for generic sparse bundle adjustment. ACM Transactions on Mathematical Software 36 (2009)
Zivkovic, Z.: Improved adaptive Gaussian mixture model for background subtraction. In: International Conference on Pattern Recognition, pp. 28–31 (2004)
INRIA Perception Group, http://4drepository.inrialpes.fr/
Tran, S., Lin, Z., Harwood, D., Davis, L.: UMD_VDT, an integration of detection and tracking methods for multiple human tracking. In: Stiefelhagen, R., Bowers, R., Fiscus, J.G. (eds.) RT 2007 and CLEAR 2007. LNCS, vol. 4625, pp. 179–190. Springer, Heidelberg (2008)
Wu, B., Singh, V., Kuo, C., Zhang, L., Lee, S., Nevatia, R.: CLEAR 2007 evaluation of USC human tracking system for surveillance videos. In: Stiefelhagen, R., Bowers, R., Fiscus, J.G. (eds.) RT 2007 and CLEAR 2007. LNCS, vol. 4625, pp. 191–196. Springer, Heidelberg (2008)
Khan, S.M., Shah, M.: A multiview approach to tracking people in crowded scenes using a planar homography constraint. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 133–146. Springer, Heidelberg (2006)
Mittal, A., Davis, L.: M2tracker: A multi-view approach to segmenting and tracking people in a cluttered scene. In: IJCV, pp. 189–203 (2003)
Reddy, D., Sankaranarayanan, A., Cevher, V., Chellappa, R.: Compressed sensing for multi-view tracking and 3-D voxel reconstruction. In: ICIP, pp. 221–224 (2008)
Fleuret, F., Berclaz, J., Lengagne, R., Fua, P.: Multicamera people tracking with a probabilistic occupancy map. PAMI 30, 267–282 (2008)
Liem, M., Gavrila, D.: Multi-person tracking with overlapping cameras in complex, dynamic environments. In: BMVC (2009)
Argyros, A.A., Lourakis, M.I.A.: Real time tracking of multiple skin-colored objects with a possibly moving camera. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 368–379. Springer, Heidelberg (2004)
Lorensen, W., Cline, H.: Marching cubes: A high resolution 3D surface construction algorithm. In: SIGGRAPH, pp. 163–169 (1987)
Tzevanidis, K., Zabulis, X., Sarmis, T., Koutlemanis, P., Kyriazis, N., Argyros, A.: From multiple views to textured 3d meshes: a gpu-powered approach. In: ECCV Workshops, pp. 5–11 (2010)
Zabulis, X., Sarmis, T., Argyros, A.A.: 3D head pose estimation from multiple distant views. In: BMVC (2009)
Voit, M., Nickel, K., Stiefelhagen, R.: Neural network-based head pose estimation and multi-view fusion. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 291–298. Springer, Heidelberg (2007)
Zhang, Z., Hu, Y., Liu, M., Huang, T.: Head pose estimation in seminar room using multi view face detectors. In: Stiefelhagen, R., Garofolo, J.S. (eds.) CLEAR 2006. LNCS, vol. 4122, pp. 299–304. Springer, Heidelberg (2007)
Tian, Y., Brown, L., Conell, J., Pankanti, S., Hapapur, A., Senior, A., Bolle, R.: Absolute head pose estimation from overhead wide-angle cameras. In: AMFG, pp. 92–99 (2003)
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. PAMI 24, 603–619 (2002)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: CVPR, pp. 511–588 (2001)
Franco, J., Menier, C., Boyer, E., Raffin, B.: A distributed approach for real time 3D modeling. In: CVPR Workshops, p. 31 (2004)
Zabulis, X., Grammenos, D., Sarmis, T., Tzevanidis, K., Argyros, A.A.: Exploration of large-scale museum artifacts through non-instrumented, location-based, multi-user interaction. In: VAST (2010)
Naruniec, J.: Using GPU for face detection. In: SPIE, vol. 7502, pp. 204–206 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zabulis, X., Sarmis, T., Tzevanidis, K., Koutlemanis, P., Grammenos, D., Argyros, A.A. (2010). A Platform for Monitoring Aspects of Human Presence in Real-Time. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2010. Lecture Notes in Computer Science, vol 6454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17274-8_57
Download citation
DOI: https://doi.org/10.1007/978-3-642-17274-8_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17273-1
Online ISBN: 978-3-642-17274-8
eBook Packages: Computer ScienceComputer Science (R0)