Engineering Variance: Software Techniques for Scalable, Customizable, and Reusable Multimodal Processing

Latoschik, Marc Erich; Fischbach, Martin

doi:10.1007/978-3-319-07233-3_29

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8510)

Included in the following conference series: International Conference on Human-Computer Interaction

Abstract

This article describes four software techniques that enhance the overall quality of multimodal processing software and account for concurrency and for variance due to individual characteristics and cultural context. First, the processing steps are decentralized and distributed using the actor model. Second, functor objects decouple domain- and application-specific operations from universal processing methods. Third, domain-specific languages (DSLs) are provided inside specialized feature processing units to define the necessary algorithms in a human-readable, comprehensible format. Fourth, constituents of the DSLs (including the functors) are semantically grounded in a common ontology that supports syntactic and semantic correctness checks as well as code generation. These techniques provide scalable, customizable, and reusable technical solutions for recurring multimodal processing tasks.
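
The first two techniques can be illustrated with a minimal sketch. It is not the authors' implementation and does not reflect any API of their framework: the names Actor, Threshold, Collect, and run_pipeline, as well as the sample format with "token" and "confidence" fields, are assumptions made for this example. Each processing step runs as a small actor with its own mailbox and thread, and the domain-specific operation is passed in as a functor (a callable object), so the generic message loop never changes.

```python
import queue
import threading
from typing import Any, Callable, List, Optional


class Actor:
    """Hypothetical minimal actor: a mailbox plus one worker thread."""

    _STOP = object()  # sentinel message that ends the worker loop

    def __init__(self, step: Callable[[Any], Any],
                 downstream: Optional["Actor"] = None) -> None:
        self._step = step              # functor holding the domain-specific logic
        self._downstream = downstream  # next actor in the pipeline, if any
        self._mailbox: "queue.Queue[Any]" = queue.Queue()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def tell(self, message: Any) -> None:
        """Asynchronously deliver a message to this actor."""
        self._mailbox.put(message)

    def stop(self) -> None:
        """Process everything queued so far, then shut the actor down."""
        self._mailbox.put(Actor._STOP)
        self._thread.join()

    def _run(self) -> None:
        # Universal processing method: identical for every actor.
        while True:
            message = self._mailbox.get()
            if message is Actor._STOP:
                break
            result = self._step(message)
            if result is not None and self._downstream is not None:
                self._downstream.tell(result)


class Threshold:
    """Functor: pass a sample on only if its confidence is high enough."""

    def __init__(self, minimum: float) -> None:
        self.minimum = minimum

    def __call__(self, sample: dict) -> Optional[dict]:
        return sample if sample["confidence"] >= self.minimum else None


class Collect:
    """Functor: record the token of every sample that arrives."""

    def __init__(self) -> None:
        self.items: List[str] = []

    def __call__(self, sample: dict) -> None:
        self.items.append(sample["token"])


def run_pipeline(samples: List[dict], minimum: float) -> List[str]:
    """Wire a filter actor to a collector actor and feed it the samples."""
    sink = Collect()
    collector = Actor(sink)
    filter_actor = Actor(Threshold(minimum), downstream=collector)
    for sample in samples:
        filter_actor.tell(sample)
    filter_actor.stop()  # the filter finishes forwarding before it exits
    collector.stop()     # then the collector drains what it received
    return sink.items
```

Changing the behavior for a different user group or application then only means passing a different functor, for example Threshold(0.8) instead of Threshold(0.5), while the Actor class stays untouched. The DSL and ontology techniques are outside the scope of this sketch.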

Author information

Authors and Affiliations

HCI group, University of Würzburg, Germany
Marc Erich Latoschik & Martin Fischbach

Editor information

Editors and Affiliations

The Open University of Japan, 2-11 Wakaba, Mihama-ku, 261-8586, Chiba-shi, Japan
Masaaki Kurosu

Cite this paper

Latoschik, M.E., Fischbach, M. (2014). Engineering Variance: Software Techniques for Scalable, Customizable, and Reusable Multimodal Processing. In: Kurosu, M. (eds) Human-Computer Interaction. Theories, Methods, and Tools. HCI 2014. Lecture Notes in Computer Science, vol 8510. Springer, Cham. https://doi.org/10.1007/978-3-319-07233-3_29

DOI: https://doi.org/10.1007/978-3-319-07233-3_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07232-6
Online ISBN: 978-3-319-07233-3
eBook Packages: Computer Science, Computer Science (R0)
