Skip to main content

The ContextCam: Automated Point of Capture Video Annotation

  • Conference paper
UbiComp 2004: Ubiquitous Computing (UbiComp 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3205))

Included in the following conference series:

Abstract

Rich, structured annotations of video recordings enable interesting uses, but existing techniques for manual, and even semi-automated, tagging can be too time-consuming. We present in this paper the ContextCam, a prototype of a consumer video camera that provides point of capture annotation of time, location, person presence and event information associated to recorded video. Both low- and high-level metadata are discovered via a variety of sensing and active tagging techniques, as well as through the application of machine learning techniques that use past annotations to suggest metadata for the current recordings. Furthermore, the ContextCam provides users with a minimally intrusive interface for correcting predicted high-level metadata during video recording.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abowd, G.D., Gauger, M., Lachenmann, A.: The Family Video Archive: An annotation and browsing environment for home movies. In: The Proceedings of MIR, Berkeley, CA (November 2003)

    Google Scholar 

  2. Adcock, J., Cooper, M.D., Doherty, J., Foote, J., Girgensohn, A., Wilcox, L.: Managing digital memories with the FXPAL photo application. ACM Multimedia, 598–599 (2003)

    Google Scholar 

  3. Bederson, B.B.,, P.: A Zoomable Image Browser using Quantum Treemaps and Bubblemaps. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 2001), Orlando, FL, November 2001, pp. 71–80 (2001)

    Google Scholar 

  4. Casares, J., Myers, B.A., Long, C., Bhatnagar, R., Stevens, S.M., Dabbish, L., Yocum, D., Corbett, A.: Simplifying Video Editing Using Metadata. In: Proceedings of Designing Interactive Systems (DIS 2002), London, UK, June 2002, pp. 157–166 (2002)

    Google Scholar 

  5. Cox, D., Kindratenko, V., Pointer, D.: IntelliBadge: Towards Providing Location-Aware Value-Added Services at Academic Conferences. In: Dey, A.K., Schmidt, A., McCarthy, J.F. (eds.) UbiComp 2003. LNCS, vol. 2864, pp. 264–280. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  6. Currie III, D.L., Irvine, C.E.: Surmounting the Effects of Lossy Compression on Steganography. In: Proceedings of the 19th National Information System Security Conference, October 1996, pp. 194–201 (1996)

    Google Scholar 

  7. Davis, M.: Media Streams: An Iconic Visual Language for Video Representation. In: Baecker, R.M., Grudin, J., Buxton, W.A.S., Greenberg, S. (eds.) Readings in Human-Computer Interaction: Toward the Year 2000, 2nd edn., pp. 854–866. Morgan Kaufmann Publishers, Inc., San Francisco (1995)

    Google Scholar 

  8. Girgensohn, A., Boreczky, J., Chiu, P., Doherty, J., Foote, J., Golovchinsky, G., Uchihashi, S., Wilcox, L.: A Semiautomatic Approach to Home Video Editing. In: Proceedings of the ACM Symposium on Use rIinterface Software and Technology (UIST 2000), San Diego, CA, November 5-8, 2000, pp. 81–89 (2000)

    Google Scholar 

  9. Hakansson, M., Ljungblad, S., Holmquist, L.E.: Capturing the Invisible: Designing Context Aware Photography. In: Proceedings of DUX 2003, Designing for User Experience, San Francisco, CA, June 5-7, 2003, ACM/AIGA (2003)

    Google Scholar 

  10. Hazas, M., Ward, A.: A Novel Broadband Ultrasonic Location System. In: Borriello, G., Holmquist, L.E. (eds.) UbiComp 2002. LNCS, vol. 2498, pp. 264–280. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  11. Jeon, J., Lavrenko, V., Manmatha, R.: Automatic Image Annotation and Retrieval using Cross-Media Relevance Models. In: The Proceedings of SIGIR 2003 Conference, pp. 119–126 (2003)

    Google Scholar 

  12. Jiang, H., Helal, A., Elmagarmid, A., Joshi, A.: Scene Change Detection Techniques for Video Database Systems. ACM Multimedia Systems 6(3), 186–195 (1998)

    Article  Google Scholar 

  13. Kang, H., Shneiderman, B.: Visualization Methods for Personal Photo Collections: Browsing and Searching in the PhotoFinder. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2000), New York City, New York, August 2000, pp. 1539–1542 (2000)

    Google Scholar 

  14. Kender, J.R., Yeo, B.L.: On the Structure and Analysis of Home Videos. In: Proceedings of the Asian Conference on Computer Vision (January 2000)

    Google Scholar 

  15. Kipp, M.: Anvil video annotation system. Page downloaded on August 1 (2003), http://www.dfki.de/~kipp/anvil/

  16. Kuchinsky, A., Pering, C., Creech, M., Freeze, D., Serra, B., Gwizdka, J.: FotoFile: A Consumer Multimedia Organization and Retrieval System. In: Proceedings of the Conference on Human factors in computing systems (CHI 1999), Pittsburgh, Pennsylvania, USA, May 15-20, 1999, pp. 496–503 (1999)

    Google Scholar 

  17. Langley, P., Iba, W., Thompson, K.: An analysis of Bayesian classifiers. In: Proceedings of the Tenth National Conference on Artificial Intelligence, pp. 223–228. AAAI Press, San Jose (1992)

    Google Scholar 

  18. Lavrenko, V., Feng, S.L., Manmatha, R.: Statistical Models for Automatic Video Annotation and Retrieval, submitted to the International Conference on Acoustics, Speech and Signal Processing (ICASSP), Montreal, QC, Canada, May 17-21 (2004)

    Google Scholar 

  19. Luo, J., Savakis, A.: Indoor vs. Outdoor Classification of Consumer Photographs. In: Int. Conf. Image Proc. ICIP 2001, hessaloniki, Greece (October 2001)

    Google Scholar 

  20. McCarthy, J.F., Nguyen, D.H., Rashid, A.M., Soroczak, S.: Proactive Displays & The Experience UbiComp Project. In: UbiComp 2003, Adjunct Proceedings, Seattle, WA, October 12-15 (2003)

    Google Scholar 

  21. Platt, J.C., Czerwinskim, M., Field, B.: PhotoTOC: Automatic Clustering for Browsing Personal Photographs. Microsoft Research Technical Report MSR-TR-2002-17 (2002)

    Google Scholar 

  22. Priyantha, N., Chakraborty, A., Balakrishnan, H.: The Cricket location-support system. In: Proceedings of the Sixth Annual ACM International Conference on Mobile Computing and Networking, August 2000, ACM Press, Boston (2000)

    Google Scholar 

  23. Priyantha, N., Miu, B.H., Teller, S.: The Cricket Compass for Context-Aware Mobile Applications. In: Proceedings of the 7th Annual ACM/IEEE International Conference on Mobile Computing and Networking, MOBICOM 2000 (2000)

    Google Scholar 

  24. Ramos, G., Balakrishnan, R.: Fluid Interaction Techniques for the Control and Annotation of Digital Video. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 2003), Vancouver, Canada, November 2-5 (2003)

    Google Scholar 

  25. Rekimoto, J., Katashi, N.: The World through the Computer: Computer Augmented Interaction with Real World Environments. In: Proceedings of the ACM Symposium on User Interface Software and Technology (UIST 1995), Pittsburgh, PA, pp. 29–36. ACM Press, New York (1995)

    Chapter  Google Scholar 

  26. Rekimoto, J.: NaviCam: A Magnifying Glass Approach to Augmented Reality. Presence: Teleoperator and Virtual Environments 6(4), 399–412 (1997)

    Google Scholar 

  27. Sarvas, R., Herrarte, E., Wilhelm, A., Davis, M.: Metadata Creation System for Mobile Images. In: Proceedings of the Second International Conference on Mobile Systems, Applications, and Services (MobiSys2004), Boston, Massachusetts, June 2004, ACM Press, New York (2004)

    Google Scholar 

  28. Su, N.M., Park, H., Bostrom, E., Burke, J., Srivastava, M.B., Estrin, D.: Augmenting film and video footage with sensor data. In: Proceedings of the Second IEEE International Conference on Pervasive Computing and Communications (PerCom 2004), Orlando, FL, March 2004, pp. 3–12 (2004)

    Google Scholar 

  29. Virage, Inc. VideoLogger product. Page downloaded on February 21 (2004), http://www.virage.com/solutions/details.cfm?solutionID=5&categoryID=1&products=0

  30. Wactlar, H.D., Christel, M., Gong, Y., Hauptmann, A.: Lessons Learned from the Creation and Deployment of a Terabyte Digital Video Library. IEEE Computer 32(2), 66–73 (1999)

    Google Scholar 

  31. Want, R., Hopper, A., Falcao, V., Gibbons, J.: The active badge location system. ACM Transactions on Information Systems 10(1), 91–102 (1992)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Patel, S.N., Abowd, G.D. (2004). The ContextCam: Automated Point of Capture Video Annotation. In: Davies, N., Mynatt, E.D., Siio, I. (eds) UbiComp 2004: Ubiquitous Computing. UbiComp 2004. Lecture Notes in Computer Science, vol 3205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30119-6_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30119-6_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22955-1

  • Online ISBN: 978-3-540-30119-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics