Proposal of MMI-API and Library for JavaScript

Katsurada, Kouichi; Kikuchi, Taiki; Iribe, Yurie; Nitta, Tsuneo

doi:10.1007/978-3-642-29934-6_49

Kouichi Katsurada⁶,
Taiki Kikuchi⁶,
Yurie Iribe⁶ &
…
Tsuneo Nitta⁶

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 14))

1157 Accesses

Abstract

This paper proposes a multimodal interaction API (MMI-API) and a library for the development of web-based multimodal applications. The API and library enable us to embed synchronized multiple inputs/outputs into an application, as well as to specify concrete speech inputs/outputs and actions of dialogue agents. Because the API and the library are provided for JavaScript, which is a commonly used web-development language, they can be executed on general web browsers without having to install special add-ons. The users can therefore experience multimodal interaction simply by accessing a web site from their web browsers. In addition to presenting an outline of the API and the library, we offer a practical example of the use of the multimodal interaction system, as applied to an English pronunciation training application for Japanese students.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

XHTML+Voice, http://www.w3.org/TR/xhtml+voice/
Katsurada, K., Nakamura, Y., Yamada, H., Nitta, T.: XISL: A Language for Describing Multimodal Interaction Scenarios. In: Proc. of ICMI 2003, pp. 281–284 (2003)
Google Scholar
Wang, K.: SALT: A spoken language interface for web-based multimodal dialog systems. In: Proc. of InterSpeec 2002, pp. 2241–2244 (2002)
Google Scholar
SMIL, http://www.w3.org/AudioVideo/
Tsutsui, T., Saeyor, S., Ishizuka, M.: MPML: A Multimodal Presentation Markup Language with Character Agent Control Functions. In: Proc. WebNet 2000 World Conf. on the WWW and Internet (2000)
Google Scholar
Hayashi, Ueda, Kurihara: TVML (TV program Making Language) - Automatic TV Program Generation from Text-based Script. In: ACM Multimedia 1997 State of the Art Demos (1997)
Google Scholar
http://www.microsoft.com/products/msagent/main.aspx
Nishimura, Y., Minotsu, S., Dohi, H., Ishizuka, M., Nakano, M., Funakoshi, K., Takeuchi, J., Hasegawa, Y., Tsujino, H.: A markup language for describing interactive humanoid robot presentations. In: Proc. of IUI 2007, pp. 333–336 (2007)
Google Scholar
JSON, http://www.json.org/index.html
Kawahara, T., Kobayashi, T., Takeda, K., Minematsu, N., Itou, K., Yamamoto, M., Yamada, A., Utsuro, T., Shikano, K.: Sharable software repository for Japanese large vocabulary continuous speech recognition. In: Proc. ICSLP 1998, pp. 3257–3260 (1998)
Google Scholar
Aques Talk, http://www.a-quest.com/aquestalk/
Mori, T., Iribe, Y., Katsurada, K., Nitta, T.: Real-time Visualization of English Pronunciation on an IPA Vowel-Chart Based on Articulatory Feature Extraction. IPSJ SIG Technical Report 89-15 (2011) (in Japanese)
Google Scholar

Download references

Author information

Authors and Affiliations

Toyohashi University of Technology, Toyohashi, Aichi, 441-8580, Japan
Kouichi Katsurada, Taiki Kikuchi, Yurie Iribe & Tsuneo Nitta

Authors

Kouichi Katsurada
View author publications
You can also search for this author in PubMed Google Scholar
Taiki Kikuchi
View author publications
You can also search for this author in PubMed Google Scholar
Yurie Iribe
View author publications
You can also search for this author in PubMed Google Scholar
Tsuneo Nitta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kouichi Katsurada .

Editor information

Editors and Affiliations

Graduate School of Information Science, Nagoya University, Furo-cho, Nagoya, 464-8601, Japan
Toyohide Watanabe
Graduate School of Information,, Production & Systems (IPS), Waseda University, Hibikino 2-7, Kitakyushu, 808-0135, Japan
Junzo Watada
Nagoya Institute of Technology, Gokiso-cho, Showa-ku, Nagoya, 466-8555, Japan
Naohisa Takahashi
KES International, PO Box 2115, Shoreham-by-Sea, BN43 9AF, United Kingdom
Robert J. Howlett
, School of Electrical and Information, University of South Australia, Mawson Lakes Campus, Adelaide, SA 5095, South Australia, Australia
Lakhmi C. Jain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Katsurada, K., Kikuchi, T., Iribe, Y., Nitta, T. (2012). Proposal of MMI-API and Library for JavaScript. In: Watanabe, T., Watada, J., Takahashi, N., Howlett, R., Jain, L. (eds) Intelligent Interactive Multimedia: Systems and Services. Smart Innovation, Systems and Technologies, vol 14. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29934-6_49

Download citation

DOI: https://doi.org/10.1007/978-3-642-29934-6_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29933-9
Online ISBN: 978-3-642-29934-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics