Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links

Rigamonti, Maurizio; Lalanne, Denis; Evéquoz, Florian; Ingold, Rolf

doi:10.1007/11677482_10

Maurizio Rigamonti¹⁸,
Denis Lalanne¹⁸,
Florian Evéquoz¹⁸ &
…
Rolf Ingold¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3869))

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

1981 Accesses
2 Citations

Abstract

This article proposes to consider all the links existing between documents, as a new artifact for browsing through multimedia archives. In particular, links between static documents and other media are presented in this article through Inquisitor, FriDoc and FaericWorld, i.e. three distinct document-centric systems, which allow (a) browsing (b) validation of annotations, and (c) edition of annotations or documents. Inquisitor illustrates the intra-document links between a raw document and its abstract representations. It is the base level, i.e. the closest to the raw media. FriDoc illustrates the cross-documents links, in particular temporal ones, between documents at the event level, which strictly connect documents captured at the same occasion (e.g. a meeting, a conference, etc.). Finally, FaericWorld proposes cross-documents linking as a novel artifact for browsing and searching through a cross-event multimedia library. This article describes those three systemvs and the various types of links that can be built between documents. Finally, the paper presents the result of a user evaluation of FriDoc and briefly discusses the usefulness of cross-documents linking, and in particular document alignments, for browsing through multimedia archives.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bollacker, K.D., Lawrence, S., Lee Giles, C.: CiteSeer: an autonomous web agent for automatic retrieval and identification of interesting publications. In: 2nd International Conference on Autonomous Agents, pp. 116–123. ACM Press, New York, USA (1998)
Google Scholar
Hu, N., Dannenberg, R.B.: A comparison of Melodic Database Retrieval Techniques Using Sung Queries. In: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries. International Conference on Digital Libraries, Portland, USA, pp. 301–307 (2002)
Google Scholar
Janecek, P., Pu, P.: An Evaluation of Semantic Fisheye Views for Opportunistic Search in an Annotated Image Collection. Journal of Digital Libraries 5(1); Special Issue on Information Visualization Interfaces for Retrieval and Analysis, 42–56 (2005)
Google Scholar
Kartoo, http://www.kartoo.com
Lalanne, D., Sire, I.R., Behera, A., Mekhaldi, D., von Rotz, D.: A research agenda for assessing the utility of document annotations in multimedia databases of meeting recordings. In: 3rd International Workshop on Multimedia Data and Document Engineering, in conjunction with VLDB-2003, Berlin, Germany, pp. 47–55 (2003)
Google Scholar
Lalanne, D., Ingold, R., von Rotz, D., Behera, A., Mekhaldi, D., Popescu-Belis, A.: Using static documents as structured and thematic interfaces to multimedia meeting archives. In: Bourlard, H., Bengio, S. (eds.) Multimodal Interaction and Related Machine Learning Algorithms. LNCS, pp. 87–100. Springer-Verlag, Berlin, Germany (2004)
Google Scholar
Lalanne, D., Lisowska, A., Bruno, E., Flynn, M., Georgescul, M., Guillemot, M., Janvier, B., Marchand-Maillet, S., Melichar, M., Moenne-Loccoz, N., Popescu-Belis, A., Rajman, M., Rigamonti, M., von Rotz, D., Wellner, P.: The IM2 Multimodal Meeting Browser Family, IM2 technical report (2005)
Google Scholar
LinkedIn, https://www.linkedin.com
Lisowska, A., Rajman, M., Bui, T.H.: ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings. In: Proceedings of the Joint AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Martigny, Switzerland, pp. 291–304 (2004)
Google Scholar
Marchand-Maillet, S., Bruno, E.: Collection Guiding: A new framework for handling large multimedia collections. In: First Workshop on Audio-visual Content and Information Visualization In Digital Libraries, AVIVDiLib 2005, Cortona, Italy (2005)
Google Scholar
Rigamonti, M., Bloechle, J.-L., Hadjar, K., Lalanne, D., Ingold, R.: Towards a Canonical and Structured Representation of PDF Documents through Reverse Engineering. In: ICDAR 2005, Seoul, Korea, pp. 1050–1054 (2005)
Google Scholar
Rigamonti, M., Hitz, O., Ingold, R.: A Framework for Cooperative and Interactive Analysis of Technical Documents. In: Fifth IAPR International Workshop on Graphics Recognition, Barcelona, Spain, pp. 407–414 (2003)
Google Scholar
Shneiderman, B., Plaisant, C.: Designing the User Interface: Strategies for Effective Human-Computer Interaction, 4th edn. Addison-Wesley, Hardcover; 4th edition, 652 pages (Published, March 2004)
Google Scholar
Alice in Wonderland, TextArc, http://www.textarc.org/
Tucker, S., Whittaker, S.: Accessing multimodal meeting data: Systems, problems and possibilities. In: Bengio, S., Bourlard, H. (eds.) MLMI 2004. LNCS, vol. 3361, pp. 1–11. Springer, Heidelberg (2005)
Chapter Google Scholar
Yee, K.-P., Swearingen, K., Li, K., Hearst, M.: Faceted Metadata for Image Search and Browsing. In: Proceedings of the SIGCHI conference on Human factors in computing systems, Ft. Lauderdale, USA, pp. 401–408 (2003)
Google Scholar
Wellner, P., Flynn, M., Tucker, S., Whittaker, S.: A Meeting Browser Evaluation Test, Presented at the Conference on Human Factors in Computing Systems, Portand, Oregon, USA, pp. 2021–2024 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

DIVA Group, Department of Informatics, University of Fribourg, Bd. de Pérolles 90, CH-1700, Fribourg, Switzerland
Maurizio Rigamonti, Denis Lalanne, Florian Evéquoz & Rolf Ingold

Authors

Maurizio Rigamonti
View author publications
You can also search for this author in PubMed Google Scholar
Denis Lalanne
View author publications
You can also search for this author in PubMed Google Scholar
Florian Evéquoz
View author publications
You can also search for this author in PubMed Google Scholar
Rolf Ingold
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Edinburgh, Edinburgh, Scotland
Steve Renals
IDIAP Research Institute, Martigny, Switzerland
Samy Bengio

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rigamonti, M., Lalanne, D., Evéquoz, F., Ingold, R. (2006). Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links. In: Renals, S., Bengio, S. (eds) Machine Learning for Multimodal Interaction. MLMI 2005. Lecture Notes in Computer Science, vol 3869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677482_10

Download citation

DOI: https://doi.org/10.1007/11677482_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32549-9
Online ISBN: 978-3-540-32550-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics