skip to main content
10.1145/1322192.1322215acmconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
poster

Toward content-aware multimodal tagging of personal photo collections

Published: 12 November 2007 Publication History

Abstract

A growing number of tools is becoming available, that make use ofexisting tags to help organize and retrieve photos, facilitating the management and use of photo sets. The tagging on which these techniques rely remains a time consuming, labor intensive task that discourages many users. To address this problem, we aim to leverage the multimodal content of naturally occurring photo discussions among friends and families to automatically extract tags from a combination of conversational speech, handwriting, and photo content analysis. While naturally occurring discussions are rich sources of informationabout photos, methods need to be developed to reliably extract a set of discriminative tags from this noisy, unconstrained group discourse. To this end, this paper contributes ananalysis of pilot data identifying robust multimodal features examining the interplay between photo content and other modalities such as speech and handwriting. Our analysis is motivated by a search for design implications leading to the effective incorporation of automated location and person identification(e.g. based on GPS and facial recognition technologies) into a system able to extract tags from natural multimodal conversations.

References

[1]
M. T. and M. Mischa S. Harris, D. Duplaw, A. Chakravarthy, C. Brewster, N. Gibbins, K. O'Hara, F. Ciravegna, D. Sleeman, N. Shadbolt, and Y. Wilks. Image annotation with photocopain. In Proceedings First International Workshop on Semantic Web Annotations for Multimedia (SWAMM), 2006.
[2]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley, 1999.
[3]
P. Barthelmess, E. Kaiser, X. Huang, D. McGee, and P. Cohen. Collaborative multimodal photo annotation over digital paper. In Proceedings of the International Conference on Multimodal Interfaces (ICMI). ACM Press, 2006.
[4]
P. Barthelmess, D. McGee, and P. Cohen. The emergence of representations in collaborative space planning over digital paper: Preliminary observations. In CSCW 2006 Workshop on Collaborating over Paper and Digital Documents (CoPADD), 2006. Available at http://www.copadd.ethz.ch/abstracts/11.pdf.
[5]
M. Davis, M. Smith, F. Stentiford, A. Bambidele, J. Canny, N. Good, S. King, and R. Janakiraman. Using context and similarity for face and location identification. In Proceedings of the IS&T/SPIE 18th Annual Symposium on Electronic Imaging Science and Technology Internet Imaging VII. IS&T/SPIE Press, 2006.
[6]
N. Diakopoulos and I. Essa. Mediating photo collage authoring. In UIST '05: Proceedings of the 18th annual ACM symposium on User interface software and technology, pages 183--186, New York, NY, USA, 2005. ACM Press.
[7]
M. Fleck. Eavesdropping on storytelling. Technical Report HPL-2004-44, HP Laboratories Palo Alto, 2004.
[8]
E. Kaiser, P. Barthelmess, C. Erdmann, and P. Cohen. Multimodal redundancy across handwriting and speech during computer mediated human-human interactions. In Computer Human Interaction (CHI), 2007.
[9]
J. Kustanowitz and B. Shneiderman. Annotation for personal digital photo libraries: Lowering barriers while raising incentives. Technical Report HCIL--2004--18, Univ. of Maryland, January 2005.
[10]
N. Kuwahara, K. Kuwabara, N. Tetsutani, and K. Yasuda. Using photo annotations to produce a reminiscence video for dementia patients. In 3rd International Semantic Web Conference (ISWC2004), 2004. Demo Papers.
[11]
C. Marlow, M. Naaman, D. Boyd, and M. Davis. Ht06, tagging paper, taxonomy, flickr, academic article, to read. In HYPERTEXT '06: Proceedings of the seventeenth conference on Hypertext and hypermedia, pages 31--40, New York, NY, USA, 2006. ACM Press.
[12]
Y. Qian and L. M. G. Feijs. Exploring the potentials of combining photo annotating tasks with instant messaging fun. In MUM '04: Proceedings of the 3rd international conference on Mobile and ubiquitous multimedia, pages 11--17, New York, NY, USA, 2004. ACM Press.
[13]
P. Schmitz. Inducing ontology from flickr tags. In Proc. of the Collaborative Web Tagging Workshop (WWW'06), May 2006.

Cited By

View all
  • (2013)ACRONYMSemantic Web10.4018/978-1-4666-3610-1.ch009(201-234)Online publication date: 2013
  • (2013)CueNetProceedings of the 3rd ACM conference on International conference on multimedia retrieval10.1145/2461466.2461533(341-344)Online publication date: 16-Apr-2013
  • (2012)Event-based content management by spontaneous metadata generation and diffusion2012 IEEE 13th International Symposium on Computational Intelligence and Informatics (CINTI)10.1109/CINTI.2012.6496740(97-102)Online publication date: Nov-2012
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMI '07: Proceedings of the 9th international conference on Multimodal interfaces
November 2007
402 pages
ISBN:9781595938176
DOI:10.1145/1322192
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 November 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automatic label extraction
  2. collaborative interaction
  3. intelligent interfaces
  4. multimodal processing
  5. photo annotation
  6. tagging

Qualifiers

  • Poster

Conference

ICMI07
Sponsor:
ICMI07: International Conference on Multimodal Interface
November 12 - 15, 2007
Aichi, Nagoya, Japan

Acceptance Rates

Overall Acceptance Rate 453 of 1,080 submissions, 42%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2013)ACRONYMSemantic Web10.4018/978-1-4666-3610-1.ch009(201-234)Online publication date: 2013
  • (2013)CueNetProceedings of the 3rd ACM conference on International conference on multimedia retrieval10.1145/2461466.2461533(341-344)Online publication date: 16-Apr-2013
  • (2012)Event-based content management by spontaneous metadata generation and diffusion2012 IEEE 13th International Symposium on Computational Intelligence and Informatics (CINTI)10.1109/CINTI.2012.6496740(97-102)Online publication date: Nov-2012
  • (2011)ACRONYMInternational Journal on Semantic Web & Information Systems10.4018/jswis.20111001017:4(1-35)Online publication date: 1-Oct-2011
  • (2011)The picture says it all!Proceedings of the 13th international conference on multimodal interfaces10.1145/2070481.2070499(89-96)Online publication date: 14-Nov-2011
  • (2011)QooqleProceedings of the 13th international conference on Ubiquitous computing10.1145/2030112.2030203(541-542)Online publication date: 17-Sep-2011
  • (2009)Online Handwriting Recognition for Indic ScriptsGuide to OCR for Indic Scripts10.1007/978-1-84800-330-9_11(209-234)Online publication date: 28-Aug-2009
  • (2007)Cross-domain matching for automatic tag extraction across redundant handwriting and speech eventsProceedings of the 2007 workshop on Tagging, mining and retrieval of human related activity information10.1145/1330588.1330597(55-62)Online publication date: 15-Nov-2007

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media