ABSTRACT
A system for video summarization in a ubiquitous environment is presented. Data from pressure-based floor sensors are clustered to segment the footsteps of different persons. Video handover has been implemented to retrieve a continuous video showing a person moving through the environment. Several methods for extracting key frames from the resulting video sequences have been implemented and evaluated experimentally. Most of the key frames that human subjects wanted to see could be retrieved using an adaptive algorithm based on camera changes and the number of footsteps within the view of the same camera. The system includes a graphical user interface that can be used to retrieve video summaries interactively using simple queries.
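The adaptive key-frame idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the algorithm's details, so everything here is an assumption made for illustration: the `Footstep` record, the `select_key_frames` function, and the `steps_per_frame` parameter are hypothetical names, and the rule used (one key frame at each camera change, plus one after every fixed number of footsteps seen by the same camera) is only one plausible reading of "camera changes and the number of footsteps within the view of the same camera".

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Footstep:
    """One segmented footstep (hypothetical record, not from the paper)."""
    time: float   # seconds since the start of the sequence
    camera: int   # id of the camera chosen by video handover for this step


def select_key_frames(steps: List[Footstep], steps_per_frame: int = 3) -> List[float]:
    """Return timestamps at which key frames would be sampled.

    Assumed rule: take a key frame whenever the camera changes, and again
    after every `steps_per_frame` further footsteps in the same camera view.
    """
    if steps_per_frame < 1:
        raise ValueError("steps_per_frame must be at least 1")

    key_times: List[float] = []
    current_camera: Optional[int] = None
    steps_in_view = 0  # footsteps seen since the last camera change

    for step in steps:
        if step.camera != current_camera:
            # Camera change (or first footstep): always sample a key frame.
            key_times.append(step.time)
            current_camera = step.camera
            steps_in_view = 0
        else:
            steps_in_view += 1
            if steps_in_view % steps_per_frame == 0:
                # The person stayed in this view long enough to sample again.
                key_times.append(step.time)
    return key_times
```

Calling `select_key_frames` on a time-ordered list of footsteps yields the sampling timestamps. A smaller `steps_per_frame` gives a denser summary for long stays in one camera view, while camera changes are always captured.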
Index Terms
- Evaluation of video summarization for a large number of cameras in ubiquitous home