Skip to main content

Promoting free Dialog Video Corpora: The IFADV Corpus Example

  • Chapter
Multimodal Corpora (MMCorp 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5509))

Included in the following conference series:

  • 1152 Accesses

Abstract

Research into spoken language has become more visual over the years. Both fundamental and applied research have progressively included gestures, gaze, and facial expression. Corpora of multi-modal conversational speech are rare and frequently difficult to use due to privacy and copyright restrictions. In contrast, Free-and-Libre corpora would allow anyone to add incremental annotations and improvement, distributing the cost of construction and maintenance. A freely available annotated corpus is presented with high quality video recordings of face-to-face conversational speech. An effort has been made to remove copyright and use restrictions. Annotations have been processed to RDBMS tables that allow SQL queries and direct connections to statistical software. A few simple examples are presented to illustrate the use of a databases of annotated speech. From our experiences we would like to advocate the formulation of “best practises” for both legal handling and database storage of recordings and annotations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Wheeler, D.L.: Genbank. Nucleic Acids Research 35(Database-Issue), 21–25 (2007)

    Article  Google Scholar 

  2. Kolbitsch, J., Maurer, H.: The transformation of the web: How emerging communities shape the information we consume. Journal of Universal Computer Science 12(2), 187–213 (2006)

    Google Scholar 

  3. Lerner, J., Tirole, J.: Some simple economics of open source. Journal of Industrial Economics 50, 197–234 (2002)

    Article  Google Scholar 

  4. Ciffolilli, A.: The economics of open source hijacking and declining quality of digital information resources: A case for copyleft. Development and Comp. Systems 0404008, EconWPA (April 2004)

    Google Scholar 

  5. Rullani, F.: Dragging developers towards the core. how the free/libre/open source software community enhances developers’ contribution. LEM Papers Series 2006/22, Laboratory of Economics and Management (LEM), Sant’Anna School of Advanced Studies, Pisa, Italy (September 2006)

    Google Scholar 

  6. ELRA: European Language Resources Association: Catalogue of Language Resources (2004–2007), http://catalog.elra.info/

  7. LDC: The Language Data Consortium Corpus Catalog (1992–2007), http://www.ldc.upenn.edu/Catalog/

  8. HLT-Agency: Centrale voor Taal- en Spraaktechnologie, TST-centrale (2007), http://www.tst.inl.nl/producten/

  9. MAPtask: HCRC Map Task Corpus (1992–2007), http://www.hcrc.ed.ac.uk/maptask/

  10. Blache, P., Rauzy, S., Ferré, G.: An XML Coding Scheme for Multimodal Corpus Annotation. In: Proceedings of Corpus Linguistics (2007)

    Google Scholar 

  11. Bertrand, R.: Corpus d’interactions dilogales, CID (2007), http://crdo.fr/voir_depot.php?langue=en&id=27

  12. CRDO: Licences (2008), http://crdo.up.univ-aix.fr/phpwiki/index.php?pagename=Licences

  13. CGN: The Spoken Dutch Corpus project (2006), http://www.tst.inl.nl/cgndocs/doc_English/topics/index.htm

  14. SMIL: W3C Synchronized Multimedia Integration Language (2008), http://www.w3.org/AudioVideo/

  15. Ide, N., Romary, L.: Outline of the international standard linguistic annotation framework. In: Proceedings of the ACL 2003 workshop on Linguistic annotation, Morristown, NJ, USA, Association for Computational Linguistics, pp. 1–5 (2003)

    Google Scholar 

  16. Schmidt, T., Chiarcos, C., Lehmberg, T., Rehm, G., Witt, A., Hinrichs, E.: Avoiding data graveyards: From heterogeneous data collected in multiple research projects to sustainable linguistic resources. In: Proceedings of the E-MELD 2006 Workshop on Digital Language Documentation: Tools and Standards: The State of the Art, Lansing, Michigan (2006)

    Google Scholar 

  17. Rehm, G., Witt, A., Hinrichs, E., Reis, M.: Sustainability of annotated resources in linguistics. In: Proceedings of Digital Humanities 2008, Oulu, Finland, pp. 27–29 (2008)

    Google Scholar 

  18. IDABC: European Interoperability Framework for Pan-European eGovernmentservices (2004), http://europa.eu.int/idabc/en/document/3761

  19. Ken Coar: The Open Source Definition, Annotated (2006), http://www.opensource.org/docs/definition.php

  20. Schmidt, T., Duncan, S., Ehmer, O., Hoyt, J., Kipp, M., Loehr, D., Magnusson, M., Rose, T., Sloetjes, H.: An exchange format for multimodal annotations. In: (ELRA), E.L.R.A. (ed.) Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (May 2008)

    Google Scholar 

  21. Carletta, J., Isard, A., Isard, S., Kowtko, J., Doherty-Sneddon, G., Anderson, A.: The reliability of a dialogue structure coding scheme. Computational Linguistics 23, 13–31 (1997)

    Google Scholar 

  22. Core, M., Allen, J.: Coding dialogs with the damsl annotation scheme. In: AAAI Fall Symposium on Communicative Action in Humans and Machines, pp. 28–35 (1997)

    Google Scholar 

  23. ELAN: ELAN is a professional tool for the creation of complex annotations on video and audio resources (2002–2007), http://www.lat-mpi.eu/tools/elan/

  24. Caspers, J.: Local speech melody as a limiting factor in the turn-taking system in dutch. Journal of Phonetics 31(2), 251–276 (2003)

    Article  Google Scholar 

  25. Wesseling, W., van Son, R.J.J.H.: Early Preparation of Experimentally Elicited Minimal Responses. In: Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue, pp. 11–18 (2005)

    Google Scholar 

  26. Mengel, A., Heid, U.: Enhancing reusability of speech corpora by hyperlinked query output. In: Proceedings of EUROSPEECH 1999, Budapest, pp. 2703–2706 (1999)

    Google Scholar 

  27. Cassidy, S.: Compiling multi-tiered speech databases into the relational model: Experiments with the EMU system. In: Proceedings of EUROSPEECH 1999, Budapest, pp. 2239–2242 (1999)

    Google Scholar 

  28. Van Son, R., Binnenpoorte, D., van den Heuvel, H., Pols, L.: The IFA corpus: a phonemically segmented Dutch Open Source speech database. In: Proceedings of EUROSPEECH 2001, Aalborg, pp. 2051–2054 (2001)

    Google Scholar 

  29. Van Son, R., Pols, L.: Structure and access of the open source IFA Corpus. In: Proceedings of the IRCS workshop on Linguistic Databases, Philadelphia, pp. 245–253 (2001)

    Google Scholar 

  30. R Core Team: The R Project for Statistical Computing (1998–2008), http://www.r-project.org/

  31. IMDI: ISLE Meta Data Initiative (1999–2007), http://www.mpi.nl/IMDI/

  32. WIPO: Berne Convention for the Protection of Literary and Artistic Works (1979), http://www.wipo.int/treaties/en/ip/berne/index.html

  33. WIPO: 5: International Treaties and Conventions on Intellectual Property. In: WIPO Handbook on Intellectual Property: Policy, Law and Use, 2nd edn., pp. 237–364. WIPO (2004), http://www.wipo.int/about-ip/en/iprm/ (Date of access: March 2008)

  34. Maurer, S.M., Hugenholtz, P.B., Onsrud, H.J.: Europe’s database experiment. Science 294, 789–790 (2001)

    Google Scholar 

  35. Kienle, H.M., German, D., Tilley, S., Müller, H.A.: Intellectual property aspects of web publishing. In: SIGDOC 2004: Proceedings of the 22nd annual international conference on Design of communication, pp. 136–144. ACM, New York (2004)

    Google Scholar 

  36. EC: First evaluation of Directive 96/9/EC on the legal protection of databases, DG Internal Market and Services Working Paper (2005), http://europa.eu.int/comm/internal_market/copyright/docs/databases/evaluation_report_en.pdf

  37. FSF: GNU General Public License, version 2 (1991), http://www.gnu.org/licenses/old-licenses/gpl-2.0.html

  38. IDABC: European Union Public Licence, EUPL v.1.0 (2008), http://ec.europa.eu/idabc/eupl

  39. ten Bosch, L., Oostdijk, N., Boves, L.: On temporal aspects of turn taking in conversational dialogues. Speech Communication 47(1-2), 80–86 (2005)

    Article  Google Scholar 

  40. : Community cleverness required. Nature 455(7209), 1 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

van Son, R.J.J.H., Wesseling, W., Sanders, E., van den Heuvel, H. (2009). Promoting free Dialog Video Corpora: The IFADV Corpus Example. In: Kipp, M., Martin, JC., Paggio, P., Heylen, D. (eds) Multimodal Corpora. MMCorp 2008. Lecture Notes in Computer Science(), vol 5509. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04793-0_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04793-0_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04792-3

  • Online ISBN: 978-3-642-04793-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics