Abstract
When designing multimodal interaction environments, a corpus of multimodal sentences is essential for carrying out the various tasks of multimodal interaction. In the last decade, several researchers have addressed the creation of multimodal corpora for English, French, and various other languages. However, an analysis of these corpora reveals a clear lack of multimodal corpora for Italian. This paper describes the process of building an Italian multimodal corpus. Starting from the manual analysis of multimedia dialogues, the process extracts different kinds of multimodal data, i.e. speech and gestures, which are used to generate grammar rules and to train the multimodal interpreter, thereby setting up the framework for building the corpus. This framework is then used to semi-automatically annotate multimodal information, such as syntactic roles and semantics, on new dialogues to be included in the corpus.
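As a rough illustration of the kind of record such a corpus might hold, the sketch below models a multimodal sentence as a list of speech and gesture elements, each with optional syntactic-role and semantics annotations. The abstract does not specify a schema, so every name here (`ModalElement`, `MultimodalSentence`, the time fields, the gesture label, the role tag) is a hypothetical assumption rather than the authors' format.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ModalElement:
    """One unit of a multimodal sentence (hypothetical schema)."""
    modality: str                          # "speech" or "gesture"
    content: str                           # spoken word or gesture label
    start: float                           # start time in seconds (assumed field)
    end: float                             # end time in seconds (assumed field)
    syntactic_role: Optional[str] = None   # filled in during annotation
    semantics: Optional[str] = None        # filled in during annotation


@dataclass
class MultimodalSentence:
    """A sentence made of speech and gesture elements."""
    elements: List[ModalElement] = field(default_factory=list)

    def by_time(self) -> List[ModalElement]:
        # Order elements by start time, then end time.
        return sorted(self.elements, key=lambda e: (e.start, e.end))

    def unannotated(self) -> List[ModalElement]:
        # Elements that still lack a syntactic role.
        return [e for e in self.elements if e.syntactic_role is None]


def build_example() -> MultimodalSentence:
    # Invented example: the user says "chiama questo" ("call this")
    # while pointing at an on-screen item.
    return MultimodalSentence([
        ModalElement("gesture", "pointing:contact_icon", 0.5, 1.0),
        ModalElement("speech", "chiama", 0.0, 0.4, syntactic_role="verb"),
        ModalElement("speech", "questo", 0.5, 0.9),
    ])


if __name__ == "__main__":
    sentence = build_example()
    for element in sentence.by_time():
        print(element.modality, element.content, element.syntactic_role)
```

In a semi-automatic workflow of the kind the abstract outlines, the elements returned by `unannotated()` would be the ones still awaiting labels from the interpreter or from a human annotator.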
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Caschera, M.C., D’Ulizia, A., Ferri, F., Grifoni, P. (2014). An Italian Multimodal Corpus: The Building Process. In: Meersman, R., et al. On the Move to Meaningful Internet Systems: OTM 2014 Workshops. OTM 2014. Lecture Notes in Computer Science, vol 8842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45550-0_57
DOI: https://doi.org/10.1007/978-3-662-45550-0_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45549-4
Online ISBN: 978-3-662-45550-0
eBook Packages: Computer Science, Computer Science (R0)