Skip to main content

Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links

  • Conference paper
Machine Learning for Multimodal Interaction (MLMI 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3869))

Included in the following conference series:

Abstract

This article proposes to consider all the links existing between documents, as a new artifact for browsing through multimedia archives. In particular, links between static documents and other media are presented in this article through Inquisitor, FriDoc and FaericWorld, i.e. three distinct document-centric systems, which allow (a) browsing (b) validation of annotations, and (c) edition of annotations or documents. Inquisitor illustrates the intra-document links between a raw document and its abstract representations. It is the base level, i.e. the closest to the raw media. FriDoc illustrates the cross-documents links, in particular temporal ones, between documents at the event level, which strictly connect documents captured at the same occasion (e.g. a meeting, a conference, etc.). Finally, FaericWorld proposes cross-documents linking as a novel artifact for browsing and searching through a cross-event multimedia library. This article describes those three systemvs and the various types of links that can be built between documents. Finally, the paper presents the result of a user evaluation of FriDoc and briefly discusses the usefulness of cross-documents linking, and in particular document alignments, for browsing through multimedia archives.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bollacker, K.D., Lawrence, S., Lee Giles, C.: CiteSeer: an autonomous web agent for automatic retrieval and identification of interesting publications. In: 2nd International Conference on Autonomous Agents, pp. 116–123. ACM Press, New York, USA (1998)

    Google Scholar 

  2. Hu, N., Dannenberg, R.B.: A comparison of Melodic Database Retrieval Techniques Using Sung Queries. In: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries. International Conference on Digital Libraries, Portland, USA, pp. 301–307 (2002)

    Google Scholar 

  3. Janecek, P., Pu, P.: An Evaluation of Semantic Fisheye Views for Opportunistic Search in an Annotated Image Collection. Journal of Digital Libraries 5(1); Special Issue on Information Visualization Interfaces for Retrieval and Analysis, 42–56 (2005)

    Google Scholar 

  4. Kartoo, http://www.kartoo.com

  5. Lalanne, D., Sire, I.R., Behera, A., Mekhaldi, D., von Rotz, D.: A research agenda for assessing the utility of document annotations in multimedia databases of meeting recordings. In: 3rd International Workshop on Multimedia Data and Document Engineering, in conjunction with VLDB-2003, Berlin, Germany, pp. 47–55 (2003)

    Google Scholar 

  6. Lalanne, D., Ingold, R., von Rotz, D., Behera, A., Mekhaldi, D., Popescu-Belis, A.: Using static documents as structured and thematic interfaces to multimedia meeting archives. In: Bourlard, H., Bengio, S. (eds.) Multimodal Interaction and Related Machine Learning Algorithms. LNCS, pp. 87–100. Springer-Verlag, Berlin, Germany (2004)

    Google Scholar 

  7. Lalanne, D., Lisowska, A., Bruno, E., Flynn, M., Georgescul, M., Guillemot, M., Janvier, B., Marchand-Maillet, S., Melichar, M., Moenne-Loccoz, N., Popescu-Belis, A., Rajman, M., Rigamonti, M., von Rotz, D., Wellner, P.: The IM2 Multimodal Meeting Browser Family, IM2 technical report (2005)

    Google Scholar 

  8. LinkedIn, https://www.linkedin.com

  9. Lisowska, A., Rajman, M., Bui, T.H.: ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings. In: Proceedings of the Joint AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Martigny, Switzerland, pp. 291–304 (2004)

    Google Scholar 

  10. Marchand-Maillet, S., Bruno, E.: Collection Guiding: A new framework for handling large multimedia collections. In: First Workshop on Audio-visual Content and Information Visualization In Digital Libraries, AVIVDiLib 2005, Cortona, Italy (2005)

    Google Scholar 

  11. Rigamonti, M., Bloechle, J.-L., Hadjar, K., Lalanne, D., Ingold, R.: Towards a Canonical and Structured Representation of PDF Documents through Reverse Engineering. In: ICDAR 2005, Seoul, Korea, pp. 1050–1054 (2005)

    Google Scholar 

  12. Rigamonti, M., Hitz, O., Ingold, R.: A Framework for Cooperative and Interactive Analysis of Technical Documents. In: Fifth IAPR International Workshop on Graphics Recognition, Barcelona, Spain, pp. 407–414 (2003)

    Google Scholar 

  13. Shneiderman, B., Plaisant, C.: Designing the User Interface: Strategies for Effective Human-Computer Interaction, 4th edn. Addison-Wesley, Hardcover; 4th edition, 652 pages (Published, March 2004)

    Google Scholar 

  14. Alice in Wonderland, TextArc, http://www.textarc.org/

  15. Tucker, S., Whittaker, S.: Accessing multimodal meeting data: Systems, problems and possibilities. In: Bengio, S., Bourlard, H. (eds.) MLMI 2004. LNCS, vol. 3361, pp. 1–11. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  16. Yee, K.-P., Swearingen, K., Li, K., Hearst, M.: Faceted Metadata for Image Search and Browsing. In: Proceedings of the SIGCHI conference on Human factors in computing systems, Ft. Lauderdale, USA, pp. 401–408 (2003)

    Google Scholar 

  17. Wellner, P., Flynn, M., Tucker, S., Whittaker, S.: A Meeting Browser Evaluation Test, Presented at the Conference on Human Factors in Computing Systems, Portand, Oregon, USA, pp. 2021–2024 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Rigamonti, M., Lalanne, D., Evéquoz, F., Ingold, R. (2006). Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links. In: Renals, S., Bengio, S. (eds) Machine Learning for Multimodal Interaction. MLMI 2005. Lecture Notes in Computer Science, vol 3869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677482_10

Download citation

  • DOI: https://doi.org/10.1007/11677482_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32549-9

  • Online ISBN: 978-3-540-32550-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics