Abstract
Rich, structured annotations of video recordings enable interesting uses, but existing techniques for manual, and even semi-automated, tagging can be too time-consuming. We present in this paper the ContextCam, a prototype of a consumer video camera that provides point of capture annotation of time, location, person presence and event information associated to recorded video. Both low- and high-level metadata are discovered via a variety of sensing and active tagging techniques, as well as through the application of machine learning techniques that use past annotations to suggest metadata for the current recordings. Furthermore, the ContextCam provides users with a minimally intrusive interface for correcting predicted high-level metadata during video recording.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abowd, G.D., Gauger, M., Lachenmann, A.: The Family Video Archive: An annotation and browsing environment for home movies. In: The Proceedings of MIR, Berkeley, CA (November 2003)
Adcock, J., Cooper, M.D., Doherty, J., Foote, J., Girgensohn, A., Wilcox, L.: Managing digital memories with the FXPAL photo application. ACM Multimedia, 598–599 (2003)
Bederson, B.B.,, P.: A Zoomable Image Browser using Quantum Treemaps and Bubblemaps. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 2001), Orlando, FL, November 2001, pp. 71–80 (2001)
Casares, J., Myers, B.A., Long, C., Bhatnagar, R., Stevens, S.M., Dabbish, L., Yocum, D., Corbett, A.: Simplifying Video Editing Using Metadata. In: Proceedings of Designing Interactive Systems (DIS 2002), London, UK, June 2002, pp. 157–166 (2002)
Cox, D., Kindratenko, V., Pointer, D.: IntelliBadge: Towards Providing Location-Aware Value-Added Services at Academic Conferences. In: Dey, A.K., Schmidt, A., McCarthy, J.F. (eds.) UbiComp 2003. LNCS, vol. 2864, pp. 264–280. Springer, Heidelberg (2003)
Currie III, D.L., Irvine, C.E.: Surmounting the Effects of Lossy Compression on Steganography. In: Proceedings of the 19th National Information System Security Conference, October 1996, pp. 194–201 (1996)
Davis, M.: Media Streams: An Iconic Visual Language for Video Representation. In: Baecker, R.M., Grudin, J., Buxton, W.A.S., Greenberg, S. (eds.) Readings in Human-Computer Interaction: Toward the Year 2000, 2nd edn., pp. 854–866. Morgan Kaufmann Publishers, Inc., San Francisco (1995)
Girgensohn, A., Boreczky, J., Chiu, P., Doherty, J., Foote, J., Golovchinsky, G., Uchihashi, S., Wilcox, L.: A Semiautomatic Approach to Home Video Editing. In: Proceedings of the ACM Symposium on Use rIinterface Software and Technology (UIST 2000), San Diego, CA, November 5-8, 2000, pp. 81–89 (2000)
Hakansson, M., Ljungblad, S., Holmquist, L.E.: Capturing the Invisible: Designing Context Aware Photography. In: Proceedings of DUX 2003, Designing for User Experience, San Francisco, CA, June 5-7, 2003, ACM/AIGA (2003)
Hazas, M., Ward, A.: A Novel Broadband Ultrasonic Location System. In: Borriello, G., Holmquist, L.E. (eds.) UbiComp 2002. LNCS, vol. 2498, pp. 264–280. Springer, Heidelberg (2002)
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic Image Annotation and Retrieval using Cross-Media Relevance Models. In: The Proceedings of SIGIR 2003 Conference, pp. 119–126 (2003)
Jiang, H., Helal, A., Elmagarmid, A., Joshi, A.: Scene Change Detection Techniques for Video Database Systems. ACM Multimedia Systems 6(3), 186–195 (1998)
Kang, H., Shneiderman, B.: Visualization Methods for Personal Photo Collections: Browsing and Searching in the PhotoFinder. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2000), New York City, New York, August 2000, pp. 1539–1542 (2000)
Kender, J.R., Yeo, B.L.: On the Structure and Analysis of Home Videos. In: Proceedings of the Asian Conference on Computer Vision (January 2000)
Kipp, M.: Anvil video annotation system. Page downloaded on August 1 (2003), http://www.dfki.de/~kipp/anvil/
Kuchinsky, A., Pering, C., Creech, M., Freeze, D., Serra, B., Gwizdka, J.: FotoFile: A Consumer Multimedia Organization and Retrieval System. In: Proceedings of the Conference on Human factors in computing systems (CHI 1999), Pittsburgh, Pennsylvania, USA, May 15-20, 1999, pp. 496–503 (1999)
Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classifiers. In: Proceedings of the Tenth National Conference on Artificial Intelligence, pp. 223–228. AAAI Press, San Jose (1992)
Lavrenko, V., Feng, S.L., Manmatha, R.: Statistical Models for Automatic Video Annotation and Retrieval, submitted to the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Montreal, QC, Canada, May 17-21 (2004)
Luo, J., Savakis, A.: Indoor vs. Outdoor Classification of Consumer Photographs. In: Int. Conf. Image Proc. ICIP 2001, hessaloniki, Greece (October 2001)
McCarthy, J.F., Nguyen, D.H., Rashid, A.M., Soroczak, S.: Proactive Displays & The Experience UbiComp Project. In: UbiComp 2003, Adjunct Proceedings, Seattle, WA, October 12-15 (2003)
Platt, J.C., Czerwinskim, M., Field, B.: PhotoTOC: Automatic Clustering for Browsing Personal Photographs. Microsoft Research Technical Report MSR-TR-2002-17 (2002)
Priyantha, N., Chakraborty, A., Balakrishnan, H.: The Cricket location-support system. In: Proceedings of the Sixth Annual ACM International Conference on Mobile Computing and Networking, August 2000, ACM Press, Boston (2000)
Priyantha, N., Miu, B.H., Teller, S.: The Cricket Compass for Context-Aware Mobile Applications. In: Proceedings of the 7th Annual ACM/IEEE International Conference on Mobile Computing and Networking, MOBICOM 2000 (2000)
Ramos, G., Balakrishnan, R.: Fluid Interaction Techniques for the Control and Annotation of Digital Video. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 2003), Vancouver, Canada, November 2-5 (2003)
Rekimoto, J., Katashi, N.: The World through the Computer: Computer Augmented Interaction with Real World Environments. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 1995), Pittsburgh, PA, pp. 29–36. ACM Press, New York (1995)
Rekimoto, J.: NaviCam: A Magnifying Glass Approach to Augmented Reality. Presence: Teleoperator and Virtual Environments 6(4), 399–412 (1997)
Sarvas, R., Herrarte, E., Wilhelm, A., Davis, M.: Metadata Creation System for Mobile Images. In: Proceedings of the Second International Conference on Mobile Systems, Applications, and Services (MobiSys2004), Boston, Massachusetts, June 2004, ACM Press, New York (2004)
Su, N.M., Park, H., Bostrom, E., Burke, J., Srivastava, M.B., Estrin, D.: Augmenting film and video footage with sensor data. In: Proceedings of the Second IEEE International Conference on Pervasive Computing and Communications (PerCom 2004), Orlando, FL, March 2004, pp. 3–12 (2004)
Virage, Inc. VideoLogger product. Page downloaded on February 21 (2004), http://www.virage.com/solutions/details.cfm?solutionID=5&categoryID=1&products=0
Wactlar, H.D., Christel, M., Gong, Y., Hauptmann, A.: Lessons Learned from the Creation and Deployment of a Terabyte Digital Video Library. IEEE Computer 32(2), 66–73 (1999)
Want, R., Hopper, A., Falcao, V., Gibbons, J.: The active badge location system. ACM Transactions on Information Systems 10(1), 91–102 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Patel, S.N., Abowd, G.D. (2004). The ContextCam: Automated Point of Capture Video Annotation. In: Davies, N., Mynatt, E.D., Siio, I. (eds) UbiComp 2004: Ubiquitous Computing. UbiComp 2004. Lecture Notes in Computer Science, vol 3205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30119-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-30119-6_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22955-1
Online ISBN: 978-3-540-30119-6
eBook Packages: Springer Book Archive