Abstract
Embodied conversational agents (ECAs) are computer-generated, human-like characters that interact with human users in face-to-face conversations. ECA is a powerful tool for representing cultural differences and is suitable for interactive training or edutainment systems. This article presents preliminary results from the development of a culture-adaptive virtual tour guide agent for serving Japanese, Croatian, and general Western users by displaying appropriate verbal and non-verbal behaviors. It is being implemented in Generic ECA Framework, a modular framework for developing ECAs. Dividing the ECA functions into reusable and loosely coupled modules minimizes the effort required to implement additional behavior and facilitates incremental scale up of the system.






Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
A.L.I.C.E. AI Foundation (2005) Artificial Intelligence Markup Language (AIML). http://www.alicebot.org
Baylor AL, Rosenberg-Kima RB, Plant EA (2006) Interface agents as social models: The impact of appearance on females’ attitude toward engineering. In: Conference on human factors in computing systems (CHI’06), Montreal
Cerekovic A, Huang HH, Furukawa T, Yamaoka Y, Pandzic IS, Nishida T, Nakano Y (2008) Implementing a multiparty support in a tour guide system with an embodied conversational agent (ECA). In: The eNTERFACE’08 international workshop on multimodal interfaces. Orsay, France
Costa A, Pickering MJ, Sorace A (2008) Alignment in second language dialogue. Lang Cogn Process 23(4):528–556
de Rosis F, Pelachaud C, Poggi I (2004) Transcultural believability in embodied agents. A matter of consistent adaption. In: Agent culture: human–agent interaction in a multicultural world. Lawrence Erlbaum Associates, London, pp 75–105
Gratch J, Marsella S (2004) A domain-independent framework for modeling emotion. J Cogn Syst Res 5:269–306
Hall ET (1992) Beyond culture. Peter Smith Publisher, Gloucester
Hamiru.aqui (2004) 70 Japanese gestures—no language communication. IBC Publishing, Westminster
Hoya Corp (2008) Pentax VoiceText text-to-speech engine. http://voice.pentax.jp/
Huang HH, Cerekovic A, Tarasenko K, Levacic V, Zoric G, Treumuth M, Pandzic IS, Nakano Y, Nishida T (2006) An agent based multicultural user interface in a customer service application. In: The eNTERFACE’06 international workshop on multimodal interfaces. Dubrovnik, Croatia
Huang HH, Inoue T, Cerekovic A, Nakano Y, Pandzic IS, Nishida T (2007) A quiz game console based on a generic embodied conversational agent framework. In: Seventh international conference on intelligent virtual agents (IVA’07). Paris, France, pp 383–384
Huang HH, Cerekovic A, Nakano Y, Pandzic IS, Nishida T (2008a) The design of a generic framework for integrating ECA components. In: Padgham L, Parkes D, Muller JP (eds) The 7th international conference of autonomous agents and multiagent systems (AAMAS’08), Inesc-Id, Estoril, Portugal, pp 128–135
Huang HH, Cerekovic A, Tarasenko K, Levacic V, Zoric G, Pandzic IS, Nakano Y, Nishida T (2008b) An agent based multicultural tour guide system with nonverbal user interface. Int J Multimodal User Interfaces 1(1):41–48
Huang HH, Furukawa T, Ohashi H, Ohmoto Y, Nishida T (2008c) Toward a virtual quiz agent who interacts with user groups. In: The 7th international workshop on social intelligence design (SID’08). Puerto Rico
Iacobelli F, Cassell J (2007) Ethnic identity and engagement in embodied conversational agents. In: Proceedings of the 7th international conference on intelligent virtual agents (IVA’07). Springer, Paris, pp 57–63
Intel Corp (2006) Open computer vision library (OpenCV) 1.0. http://sourceforge.net/projects/opencvlibrary/
Ipsic S, Zanert J, Ipsic I (2003) Speech recognition of Croatian and Slovenian weather forecast. In: Proceedings of 4th EURASIP conference, France, pp 637–642
Isbister K (2004) Building bridges through the unspoken: embodied agents to facilitate intercultural communication. In: Agent culture: human–agent interaction in a multicultural world. Lawrence Erlbaum Associates, London, pp 233–244
Johnson WL, Vilhjalmsson H, Marsella S (2005) Serious games for language learning: How much game, how much AI? In: Proceedings of the 12th international conference on artificial intelligence in education. Amsterdam, The Netherlands
Johnston M, Bangalore S (2000) Finite-state multimodal parsing and understanding. In: Proceedings of the 18th conference on computational linguistics. Saarbrucken, Germany
Kato H (2006) Artoolkit. http://artoolkit.sourceforge.net/
Larsson S, Traum DR (2000) Information state and dialogue management in the trindi dialogue move engine toolkit. Natural Language Engineering, Cambridge University Press 6(3–4):323–340
mind.makersorg (2005) OpenAIR protocol specification 1.0. http://www.mindmakers.org/openair/airPage.jsp
Nakano Y, Okamoto M, Kawahara D, Li Q, Nishida T (2004) Converting text into agent animations: assigning gestures to text. In: Proceedings of the human language technology conference (HLT-NAACL’04). ACL Press, Prague
Nass C, Isbister K, Lee EJ (2000) Truth is beauty, researching embodied conversational agents. In: Embodied conversational agents. The MIT Press, Cambridge, pp 374–402
Omron Corp (2008) OKAO vision. http://www.omron.com/rd/coretech/vision/okao.html
Peic R (2003) A speech recognition algorithm based on the features of Croatian language. In: Proceedings of the 4th EURASIP conference. Dubrovnik, Croatia, pp 613–618
Pickering MJ, Garrod S (2004) Toward a mechanistic psychology of dialogue. Behav Brain Sci 27:169–226
Pickering MJ, Garrod S (2006) Alignment as the basis for successful communication. Res Lang Comput 4:203–228
Rehm M, Andre E, Bee N, Endrass B, Wissner M, Nakano Y, Nishida T, Huang HH (2007a) The CUBE-G approach—coaching culture-specific nonverbal behavior by virtual agents. In: The 38th conference of the international simulation and gaming association (ISAGA). Nijmegen, New Zealand
Rehm M, Bee N, Endrass B, Wissner M, Andre E (2007b) Too close for comfort? In: Proceedings of the international workshop on human-centered multimedia, ACM Multimedia
Rehm M, Nakano Y, Andre E, Nishida T (2008a) Culture-specific first meeting encounters between virtual agents. In: Prendinger H, Lester J, Ishizuka M (eds) Proceedings of the 8th international conference on intelligent virtual agents (IVA’08). Tokyo, Japan, pp 223–236
Rehm M, Gruneberg F, Nakano Y, Lipi AA, Yamaoka Y, Huang HH (2008b) Creating a standardized corpus of multimodal interactions for enculturating conversational interfaces. In: Workshop on enculturating conversational interfaces by socio-cultural as-pects of communication, 2008 international conference on intelligent user interfaces (IUI2008). Canary Islands, Spain
Solomon S, van Lent M, Core M, Carpenter P, Rosenberg M (2008) A language for modeling cultural norms, biases and stereotypes for human behavior models. In: Proceedings of the 17th conference on behavior representation in modeling and simulation (BRIMS’08)
Thiebaux M, Marshall AN, Marsella S, Kallmann M (2008) Smartbody: Behavior realization for embodied conversational agents. In: The 7th international conference of autonomous agents and multiagent systems (AAMAS’08), Estoril, Portugal
Traum D, Larsson S (2003) The information state approach to dialogue management. In: Smith R, van Kuppevelt J (eds) Current and new directions in discourse and dialogue. Kluwer, Dordrecht, pp 325–353
Traum D, Roque A, Georgiou ALP, Gerten J, Martinovski B, Narayanan S, Robinson S, Vaswani A (2007) Hassan: a virtual human for tactical questioning. In: The 8th SIGdial workshop on discourse and dialogue, Antwerp, Belgium
Visage Technologies AB (2008) Visage|SDK. http://www.visagetechnologies.com
W3C (2004) Emma: extensible multimodal annotation markup language. http://www.w3.org/TR/emma/
Young PA (2008) Integrating culture in the design of ICTS. Br J Educational Technol 39(1):6–17
Zoric G, Pandzic IS (2005) A real-time language independent lip synchronization method using a genetic algorithm. In: the Proceedings of ICME’05, Amsterdam, The Netherlands
Acknowledgments
We thank Kateryna Tarasenko, Vjekoslav Levacic, Goranka Zoric, and Margus Treumuth for their contributions to this project during the eNTERFACE’06 summer workshop, and Takuya Furukawa and Yuji Yamaoka for their contributions to this project during the eNTERFACE’08 summer workshop. We also thank Tsuyoshi Masuda for his contribution in the application for experiencing the cross-cultural differences in gestures.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Huang, HH., Cerekovic, A., Pandzic, I.S. et al. Toward a multi-culture adaptive virtual tour guide agent with a modular approach. AI & Soc 24, 225–235 (2009). https://doi.org/10.1007/s00146-009-0213-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00146-009-0213-6