Promoting free Dialog Video Corpora: The IFADV Corpus Example

van Son, R. J. J. H.; Wesseling, Wieneke; Sanders, Eric; van den Heuvel, Henk

doi:10.1007/978-3-642-04793-0_2

R. J. J. H. van Son²³,
Wieneke Wesseling²³,
Eric Sanders²⁴ &
…
Henk van den Heuvel²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5509))

Included in the following conference series:

International LREC Workshop on Multimodal Corpora

1222 Accesses

Abstract

Research into spoken language has become more visual over the years. Both fundamental and applied research have progressively included gestures, gaze, and facial expression. Corpora of multi-modal conversational speech are rare and frequently difficult to use due to privacy and copyright restrictions. In contrast, Free-and-Libre corpora would allow anyone to add incremental annotations and improvement, distributing the cost of construction and maintenance. A freely available annotated corpus is presented with high quality video recordings of face-to-face conversational speech. An effort has been made to remove copyright and use restrictions. Annotations have been processed to RDBMS tables that allow SQL queries and direct connections to statistical software. A few simple examples are presented to illustrate the use of a databases of annotated speech. From our experiences we would like to advocate the formulation of “best practises” for both legal handling and database storage of recordings and annotations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

The Corpus of Interactional Data: A Large Multimodal Annotated Resource

Case Study: The AusTalk Corpus

The Danish NOMCO corpus: multimodal interaction in first acquaintance conversations

Article 19 October 2016

References

Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., Wheeler, D.L.: Genbank. Nucleic Acids Research 35(Database-Issue), 21–25 (2007)
Article Google Scholar
Kolbitsch, J., Maurer, H.: The transformation of the web: How emerging communities shape the information we consume. Journal of Universal Computer Science 12(2), 187–213 (2006)
Google Scholar
Lerner, J., Tirole, J.: Some simple economics of open source. Journal of Industrial Economics 50, 197–234 (2002)
Article Google Scholar
Ciffolilli, A.: The economics of open source hijacking and declining quality of digital information resources: A case for copyleft. Development and Comp. Systems 0404008, EconWPA (April 2004)
Google Scholar
Rullani, F.: Dragging developers towards the core. how the free/libre/open source software community enhances developers’ contribution. LEM Papers Series 2006/22, Laboratory of Economics and Management (LEM), Sant’Anna School of Advanced Studies, Pisa, Italy (September 2006)
Google Scholar
ELRA: European Language Resources Association: Catalogue of Language Resources (2004–2007), http://catalog.elra.info/
LDC: The Language Data Consortium Corpus Catalog (1992–2007), http://www.ldc.upenn.edu/Catalog/
HLT-Agency: Centrale voor Taal- en Spraaktechnologie, TST-centrale (2007), http://www.tst.inl.nl/producten/
MAPtask: HCRC Map Task Corpus (1992–2007), http://www.hcrc.ed.ac.uk/maptask/
Blache, P., Rauzy, S., Ferré, G.: An XML Coding Scheme for Multimodal Corpus Annotation. In: Proceedings of Corpus Linguistics (2007)
Google Scholar
Bertrand, R.: Corpus d’interactions dilogales, CID (2007), http://crdo.fr/voir_depot.php?langue=en&id=27
CRDO: Licences (2008), http://crdo.up.univ-aix.fr/phpwiki/index.php?pagename=Licences
CGN: The Spoken Dutch Corpus project (2006), http://www.tst.inl.nl/cgndocs/doc_English/topics/index.htm
SMIL: W3C Synchronized Multimedia Integration Language (2008), http://www.w3.org/AudioVideo/
Ide, N., Romary, L.: Outline of the international standard linguistic annotation framework. In: Proceedings of the ACL 2003 workshop on Linguistic annotation, Morristown, NJ, USA, Association for Computational Linguistics, pp. 1–5 (2003)
Google Scholar
Schmidt, T., Chiarcos, C., Lehmberg, T., Rehm, G., Witt, A., Hinrichs, E.: Avoiding data graveyards: From heterogeneous data collected in multiple research projects to sustainable linguistic resources. In: Proceedings of the E-MELD 2006 Workshop on Digital Language Documentation: Tools and Standards: The State of the Art, Lansing, Michigan (2006)
Google Scholar
Rehm, G., Witt, A., Hinrichs, E., Reis, M.: Sustainability of annotated resources in linguistics. In: Proceedings of Digital Humanities 2008, Oulu, Finland, pp. 27–29 (2008)
Google Scholar
IDABC: European Interoperability Framework for Pan-European eGovernmentservices (2004), http://europa.eu.int/idabc/en/document/3761
Ken Coar: The Open Source Definition, Annotated (2006), http://www.opensource.org/docs/definition.php
Schmidt, T., Duncan, S., Ehmer, O., Hoyt, J., Kipp, M., Loehr, D., Magnusson, M., Rose, T., Sloetjes, H.: An exchange format for multimodal annotations. In: (ELRA), E.L.R.A. (ed.) Proceedings of the Sixth International Language Resources and Evaluation (LREC 2008), Marrakech, Morocco (May 2008)
Google Scholar
Carletta, J., Isard, A., Isard, S., Kowtko, J., Doherty-Sneddon, G., Anderson, A.: The reliability of a dialogue structure coding scheme. Computational Linguistics 23, 13–31 (1997)
Google Scholar
Core, M., Allen, J.: Coding dialogs with the damsl annotation scheme. In: AAAI Fall Symposium on Communicative Action in Humans and Machines, pp. 28–35 (1997)
Google Scholar
ELAN: ELAN is a professional tool for the creation of complex annotations on video and audio resources (2002–2007), http://www.lat-mpi.eu/tools/elan/
Caspers, J.: Local speech melody as a limiting factor in the turn-taking system in dutch. Journal of Phonetics 31(2), 251–276 (2003)
Article Google Scholar
Wesseling, W., van Son, R.J.J.H.: Early Preparation of Experimentally Elicited Minimal Responses. In: Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue, pp. 11–18 (2005)
Google Scholar
Mengel, A., Heid, U.: Enhancing reusability of speech corpora by hyperlinked query output. In: Proceedings of EUROSPEECH 1999, Budapest, pp. 2703–2706 (1999)
Google Scholar
Cassidy, S.: Compiling multi-tiered speech databases into the relational model: Experiments with the EMU system. In: Proceedings of EUROSPEECH 1999, Budapest, pp. 2239–2242 (1999)
Google Scholar
Van Son, R., Binnenpoorte, D., van den Heuvel, H., Pols, L.: The IFA corpus: a phonemically segmented Dutch Open Source speech database. In: Proceedings of EUROSPEECH 2001, Aalborg, pp. 2051–2054 (2001)
Google Scholar
Van Son, R., Pols, L.: Structure and access of the open source IFA Corpus. In: Proceedings of the IRCS workshop on Linguistic Databases, Philadelphia, pp. 245–253 (2001)
Google Scholar
R Core Team: The R Project for Statistical Computing (1998–2008), http://www.r-project.org/
IMDI: ISLE Meta Data Initiative (1999–2007), http://www.mpi.nl/IMDI/
WIPO: Berne Convention for the Protection of Literary and Artistic Works (1979), http://www.wipo.int/treaties/en/ip/berne/index.html
WIPO: 5: International Treaties and Conventions on Intellectual Property. In: WIPO Handbook on Intellectual Property: Policy, Law and Use, 2nd edn., pp. 237–364. WIPO (2004), http://www.wipo.int/about-ip/en/iprm/ (Date of access: March 2008)
Maurer, S.M., Hugenholtz, P.B., Onsrud, H.J.: Europe’s database experiment. Science 294, 789–790 (2001)
Google Scholar
Kienle, H.M., German, D., Tilley, S., Müller, H.A.: Intellectual property aspects of web publishing. In: SIGDOC 2004: Proceedings of the 22nd annual international conference on Design of communication, pp. 136–144. ACM, New York (2004)
Google Scholar
EC: First evaluation of Directive 96/9/EC on the legal protection of databases, DG Internal Market and Services Working Paper (2005), http://europa.eu.int/comm/internal_market/copyright/docs/databases/evaluation_report_en.pdf
FSF: GNU General Public License, version 2 (1991), http://www.gnu.org/licenses/old-licenses/gpl-2.0.html
IDABC: European Union Public Licence, EUPL v.1.0 (2008), http://ec.europa.eu/idabc/eupl
ten Bosch, L., Oostdijk, N., Boves, L.: On temporal aspects of turn taking in conversational dialogues. Speech Communication 47(1-2), 80–86 (2005)
Article Google Scholar
: Community cleverness required. Nature 455(7209), 1 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

ACLC/IFA, University of Amsterdam, The Netherlands
R. J. J. H. van Son & Wieneke Wesseling
SPEX/CLST, Radboud University Nijmegen, The Netherlands
Eric Sanders & Henk van den Heuvel

Authors

R. J. J. H. van Son
View author publications
You can also search for this author in PubMed Google Scholar
Wieneke Wesseling
View author publications
You can also search for this author in PubMed Google Scholar
Eric Sanders
View author publications
You can also search for this author in PubMed Google Scholar
Henk van den Heuvel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Deutsches Forschungszentrum für künstliche Intelligenz (DFKI), Campus D3.2, 66123, Saarbrücken, Germany
Michael Kipp
Laboratoire d’Informatique pour la Mécanique et les Sciences de l’Ingénieur (LIMSI-CNRS), BP 133, 91403, Orsay Cedex, France
Jean-Claude Martin
Faculty of Humanities, Centre for Language Technology, University of Copenhagen, Njalsgade 140-142, 2300, Copenhagen, Denmark
Patrizia Paggio
Computer Science, Human Media Interaction, University of Twente, PO Box 217, 7500, Enschede, AE, The Netherlands
Dirk Heylen

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

van Son, R.J.J.H., Wesseling, W., Sanders, E., van den Heuvel, H. (2009). Promoting free Dialog Video Corpora: The IFADV Corpus Example. In: Kipp, M., Martin, JC., Paggio, P., Heylen, D. (eds) Multimodal Corpora. MMCorp 2008. Lecture Notes in Computer Science(), vol 5509. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04793-0_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-04793-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04792-3
Online ISBN: 978-3-642-04793-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics