ABSTRACT
Video conferencing has become widespread with the growing popularity of Internet call services such as FaceTime and Skype and the availability of broadband networks. However, people still find it unnatural compared with face-to-face conversation. One problem is that the display and web camera are fixed and do not follow a person who moves, which gives the other party an incomplete view and makes the interaction feel unnatural. In this paper, we propose a more engaging telepresence solution based on a tablet mounted on a pan-tilt robotic base. During a video conference, the tablet's built-in camera tracks the user's face. The tracking results drive the robotic base in real time so that the display and camera follow the moving face. In this way, the system not only sends the captured face images but also conveys a natural sense of head movement, as in a face-to-face conversation. We conducted user studies, and the findings showed a preference for the proposed system.
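To illustrate how tracking results could drive a pan-tilt base, the following is a minimal sketch of a proportional controller that turns the offset of a tracked face from the image center into pan and tilt corrections. It is not the controller used in the paper: the `FaceBox` type, the `pan_tilt_step` function, the field-of-view values, the gain, and the deadband are all hypothetical, and the mapping from pixel offset to angle uses a simple linear approximation.

```python
from dataclasses import dataclass


@dataclass
class FaceBox:
    """Hypothetical face bounding box in pixels (top-left origin)."""
    x: float
    y: float
    w: float
    h: float


def pan_tilt_step(face, frame_w, frame_h,
                  hfov_deg=60.0, vfov_deg=45.0,
                  gain=0.5, deadband=0.05):
    """Return (pan, tilt) corrections in degrees for one video frame.

    All defaults are illustrative assumptions, not values from the paper.
    Positive pan turns toward the right of the image; positive tilt
    turns upward.
    """
    # Center of the tracked face in pixel coordinates.
    cx = face.x + face.w / 2.0
    cy = face.y + face.h / 2.0

    # Offset from the image center, normalized to [-0.5, 0.5].
    ex = cx / frame_w - 0.5
    ey = cy / frame_h - 0.5

    # Ignore small offsets so the base does not jitter around the target.
    if abs(ex) < deadband:
        ex = 0.0
    if abs(ey) < deadband:
        ey = 0.0

    # Linear approximation: a fraction of the frame maps to the same
    # fraction of the field of view. Image y grows downward, so the
    # tilt sign is flipped.
    pan = gain * ex * hfov_deg
    tilt = -gain * ey * vfov_deg
    return pan, tilt


if __name__ == "__main__":
    # Face to the right of center in a 640x480 frame.
    print(pan_tilt_step(FaceBox(430, 190, 100, 100), 640, 480))
```

A gain below 1 corrects only part of the error on each frame, which trades response speed for smoother motion; a real system would also need to respect the motor's speed and range limits.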
Index Terms
- Towards more engaging telepresence by face tracking