The ContextCam: Automated Point of Capture Video Annotation

Patel, Shwetak N.; Abowd, Gregory D.

doi:10.1007/978-3-540-30119-6_18

Shwetak N. Patel¹⁹ &
Gregory D. Abowd¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3205))

Included in the following conference series:

International Conference on Ubiquitous Computing

2567 Accesses
16 Citations
3 Altmetric

Abstract

Rich, structured annotations of video recordings enable interesting uses, but existing techniques for manual, and even semi-automated, tagging can be too time-consuming. We present in this paper the ContextCam, a prototype of a consumer video camera that provides point of capture annotation of time, location, person presence and event information associated to recorded video. Both low- and high-level metadata are discovered via a variety of sensing and active tagging techniques, as well as through the application of machine learning techniques that use past annotations to suggest metadata for the current recordings. Furthermore, the ContextCam provides users with a minimally intrusive interface for correcting predicted high-level metadata during video recording.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abowd, G.D., Gauger, M., Lachenmann, A.: The Family Video Archive: An annotation and browsing environment for home movies. In: The Proceedings of MIR, Berkeley, CA (November 2003)
Google Scholar
Adcock, J., Cooper, M.D., Doherty, J., Foote, J., Girgensohn, A., Wilcox, L.: Managing digital memories with the FXPAL photo application. ACM Multimedia, 598–599 (2003)
Google Scholar
Bederson, B.B.,, P.: A Zoomable Image Browser using Quantum Treemaps and Bubblemaps. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 2001), Orlando, FL, November 2001, pp. 71–80 (2001)
Google Scholar
Casares, J., Myers, B.A., Long, C., Bhatnagar, R., Stevens, S.M., Dabbish, L., Yocum, D., Corbett, A.: Simplifying Video Editing Using Metadata. In: Proceedings of Designing Interactive Systems (DIS 2002), London, UK, June 2002, pp. 157–166 (2002)
Google Scholar
Cox, D., Kindratenko, V., Pointer, D.: IntelliBadge: Towards Providing Location-Aware Value-Added Services at Academic Conferences. In: Dey, A.K., Schmidt, A., McCarthy, J.F. (eds.) UbiComp 2003. LNCS, vol. 2864, pp. 264–280. Springer, Heidelberg (2003)
Chapter Google Scholar
Currie III, D.L., Irvine, C.E.: Surmounting the Effects of Lossy Compression on Steganography. In: Proceedings of the 19th National Information System Security Conference, October 1996, pp. 194–201 (1996)
Google Scholar
Davis, M.: Media Streams: An Iconic Visual Language for Video Representation. In: Baecker, R.M., Grudin, J., Buxton, W.A.S., Greenberg, S. (eds.) Readings in Human-Computer Interaction: Toward the Year 2000, 2nd edn., pp. 854–866. Morgan Kaufmann Publishers, Inc., San Francisco (1995)
Google Scholar
Girgensohn, A., Boreczky, J., Chiu, P., Doherty, J., Foote, J., Golovchinsky, G., Uchihashi, S., Wilcox, L.: A Semiautomatic Approach to Home Video Editing. In: Proceedings of the ACM Symposium on Use rIinterface Software and Technology (UIST 2000), San Diego, CA, November 5-8, 2000, pp. 81–89 (2000)
Google Scholar
Hakansson, M., Ljungblad, S., Holmquist, L.E.: Capturing the Invisible: Designing Context Aware Photography. In: Proceedings of DUX 2003, Designing for User Experience, San Francisco, CA, June 5-7, 2003, ACM/AIGA (2003)
Google Scholar
Hazas, M., Ward, A.: A Novel Broadband Ultrasonic Location System. In: Borriello, G., Holmquist, L.E. (eds.) UbiComp 2002. LNCS, vol. 2498, pp. 264–280. Springer, Heidelberg (2002)
Chapter Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic Image Annotation and Retrieval using Cross-Media Relevance Models. In: The Proceedings of SIGIR 2003 Conference, pp. 119–126 (2003)
Google Scholar
Jiang, H., Helal, A., Elmagarmid, A., Joshi, A.: Scene Change Detection Techniques for Video Database Systems. ACM Multimedia Systems 6(3), 186–195 (1998)
Article Google Scholar
Kang, H., Shneiderman, B.: Visualization Methods for Personal Photo Collections: Browsing and Searching in the PhotoFinder. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2000), New York City, New York, August 2000, pp. 1539–1542 (2000)
Google Scholar
Kender, J.R., Yeo, B.L.: On the Structure and Analysis of Home Videos. In: Proceedings of the Asian Conference on Computer Vision (January 2000)
Google Scholar
Kipp, M.: Anvil video annotation system. Page downloaded on August 1 (2003), http://www.dfki.de/~kipp/anvil/
Kuchinsky, A., Pering, C., Creech, M., Freeze, D., Serra, B., Gwizdka, J.: FotoFile: A Consumer Multimedia Organization and Retrieval System. In: Proceedings of the Conference on Human factors in computing systems (CHI 1999), Pittsburgh, Pennsylvania, USA, May 15-20, 1999, pp. 496–503 (1999)
Google Scholar
Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classifiers. In: Proceedings of the Tenth National Conference on Artificial Intelligence, pp. 223–228. AAAI Press, San Jose (1992)
Google Scholar
Lavrenko, V., Feng, S.L., Manmatha, R.: Statistical Models for Automatic Video Annotation and Retrieval, submitted to the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Montreal, QC, Canada, May 17-21 (2004)
Google Scholar
Luo, J., Savakis, A.: Indoor vs. Outdoor Classification of Consumer Photographs. In: Int. Conf. Image Proc. ICIP 2001, hessaloniki, Greece (October 2001)
Google Scholar
McCarthy, J.F., Nguyen, D.H., Rashid, A.M., Soroczak, S.: Proactive Displays & The Experience UbiComp Project. In: UbiComp 2003, Adjunct Proceedings, Seattle, WA, October 12-15 (2003)
Google Scholar
Platt, J.C., Czerwinskim, M., Field, B.: PhotoTOC: Automatic Clustering for Browsing Personal Photographs. Microsoft Research Technical Report MSR-TR-2002-17 (2002)
Google Scholar
Priyantha, N., Chakraborty, A., Balakrishnan, H.: The Cricket location-support system. In: Proceedings of the Sixth Annual ACM International Conference on Mobile Computing and Networking, August 2000, ACM Press, Boston (2000)
Google Scholar
Priyantha, N., Miu, B.H., Teller, S.: The Cricket Compass for Context-Aware Mobile Applications. In: Proceedings of the 7th Annual ACM/IEEE International Conference on Mobile Computing and Networking, MOBICOM 2000 (2000)
Google Scholar
Ramos, G., Balakrishnan, R.: Fluid Interaction Techniques for the Control and Annotation of Digital Video. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 2003), Vancouver, Canada, November 2-5 (2003)
Google Scholar
Rekimoto, J., Katashi, N.: The World through the Computer: Computer Augmented Interaction with Real World Environments. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 1995), Pittsburgh, PA, pp. 29–36. ACM Press, New York (1995)
Chapter Google Scholar
Rekimoto, J.: NaviCam: A Magnifying Glass Approach to Augmented Reality. Presence: Teleoperator and Virtual Environments 6(4), 399–412 (1997)
Google Scholar
Sarvas, R., Herrarte, E., Wilhelm, A., Davis, M.: Metadata Creation System for Mobile Images. In: Proceedings of the Second International Conference on Mobile Systems, Applications, and Services (MobiSys2004), Boston, Massachusetts, June 2004, ACM Press, New York (2004)
Google Scholar
Su, N.M., Park, H., Bostrom, E., Burke, J., Srivastava, M.B., Estrin, D.: Augmenting film and video footage with sensor data. In: Proceedings of the Second IEEE International Conference on Pervasive Computing and Communications (PerCom 2004), Orlando, FL, March 2004, pp. 3–12 (2004)
Google Scholar
Virage, Inc. VideoLogger product. Page downloaded on February 21 (2004), http://www.virage.com/solutions/details.cfm?solutionID=5&categoryID=1&products=0
Wactlar, H.D., Christel, M., Gong, Y., Hauptmann, A.: Lessons Learned from the Creation and Deployment of a Terabyte Digital Video Library. IEEE Computer 32(2), 66–73 (1999)
Google Scholar
Want, R., Hopper, A., Falcao, V., Gibbons, J.: The active badge location system. ACM Transactions on Information Systems 10(1), 91–102 (1992)
Article Google Scholar

Download references

Author information

Authors and Affiliations

College of Computing & GVU Center, Georgia Institute of Technology, 801 Atlantic Drive, Atlanta, GA, 30332-0280, USA
Shwetak N. Patel & Gregory D. Abowd

Authors

Shwetak N. Patel
View author publications
You can also search for this author in PubMed Google Scholar
Gregory D. Abowd
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computing Department, InfoLab21, Lancaster University, LA1 4WA, Lancaster, UK
Nigel Davies
GVU Center & School of Interactive Computing, Georgia Institute of Technology, Atlanta, GA, USA
Elizabeth D. Mynatt
Department of Information Sciences, Ochanomizu University, 2-1-1 Otsuka, Bunkyo-ku, 112-8610, Tokyo, Japan
Itiro Siio

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Patel, S.N., Abowd, G.D. (2004). The ContextCam: Automated Point of Capture Video Annotation. In: Davies, N., Mynatt, E.D., Siio, I. (eds) UbiComp 2004: Ubiquitous Computing. UbiComp 2004. Lecture Notes in Computer Science, vol 3205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30119-6_18

Download citation

DOI: https://doi.org/10.1007/978-3-540-30119-6_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22955-1
Online ISBN: 978-3-540-30119-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics