Abstract
Research into spoken language has become more visual over the years. Both fundamental and applied research have progressively included gestures, gaze, and facial expression. Corpora of multi-modal conversational speech are rare and frequently difficult to use due to privacy and copyright restrictions. In contrast, Free-and-Libre corpora would allow anyone to add incremental annotations and improvements, distributing the cost of construction and maintenance. A freely available annotated corpus is presented with high quality video recordings of face-to-face conversational speech. An effort has been made to remove copyright and use restrictions. Annotations have been processed into RDBMS tables that allow SQL queries and direct connections to statistical software. A few simple examples are presented to illustrate the use of a database of annotated speech. From our experiences we would like to advocate the formulation of “best practices” for both legal handling and database storage of recordings and annotations.
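To illustrate what storing annotations in relational tables makes possible, the following is a minimal sketch and not the corpus's actual schema. The table name `annotation`, its columns, the tier name `gaze`, and the toy rows are all assumptions made for this example. An in-memory SQLite database stands in for the RDBMS.

```python
import sqlite3

# Hypothetical schema: one row per annotated interval on a tier.
# Table and column names are illustrative, not the corpus's real layout.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE annotation (
        recording TEXT,
        speaker   TEXT,
        tier      TEXT,
        t_start   REAL,
        t_end     REAL,
        label     TEXT
    )
    """
)

# Toy rows invented for the example (times in seconds).
rows = [
    ("dlg01", "A", "gaze", 0.00, 1.50, "g"),
    ("dlg01", "A", "gaze", 2.00, 2.75, "g"),
    ("dlg01", "B", "gaze", 0.50, 1.00, "g"),
    ("dlg01", "B", "words", 0.50, 0.90, "ja"),
]
conn.executemany("INSERT INTO annotation VALUES (?, ?, ?, ?, ?, ?)", rows)


def gaze_time_per_speaker(connection):
    """Return {speaker: total seconds covered by gaze-tier intervals}."""
    query = """
        SELECT speaker, SUM(t_end - t_start)
        FROM annotation
        WHERE tier = 'gaze'
        GROUP BY speaker
        ORDER BY speaker
    """
    return dict(connection.execute(query).fetchall())


totals = gaze_time_per_speaker(conn)
print(totals)
```

Once annotations are in this form, a single aggregate query replaces a custom script over annotation files, and the same result set can be read directly by statistical software through its database interface.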
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
van Son, R.J.J.H., Wesseling, W., Sanders, E., van den Heuvel, H. (2009). Promoting free Dialog Video Corpora: The IFADV Corpus Example. In: Kipp, M., Martin, JC., Paggio, P., Heylen, D. (eds) Multimodal Corpora. MMCorp 2008. Lecture Notes in Computer Science, vol 5509. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04793-0_2
DOI: https://doi.org/10.1007/978-3-642-04793-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04792-3
Online ISBN: 978-3-642-04793-0
eBook Packages: Computer Science (R0)