Methods of Efficiently Constructing Text-Dialogue-Agent System Using Existing Anime Character

Ishii, Ryo; Higashinaka, Ryuichiro; Mitsuda, Koh; Katayama, Taichi; Mizukami, Masahiro; Tomita, Junji; Kawabata, Hidetoshi; Yamaguchi, Emi; Adachi, Noritake; Aono, Yushi

doi:10.1007/978-3-030-60152-2_25

Ryo Ishii¹⁶,
Ryuichiro Higashinaka¹⁶,
Koh Mitsuda¹⁶,
Taichi Katayama¹⁶,
Masahiro Mizukami¹⁷,
Junji Tomita¹⁶,
Hidetoshi Kawabata¹⁸,
Emi Yamaguchi¹⁸,
Noritake Adachi¹⁸ &
…
Yushi Aono¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12427))

Included in the following conference series:

International Conference on Human-Computer Interaction

3312 Accesses
1 Altmetric

Abstract

Many surely dream of being able to chat with his/her favorite anime characters from an early age. To make such a dream possible, we propose an efficient method for constructing a system that enables users to text chat with existing anime characters. We tackled two research problems to generate verbal and nonverbal behaviors for a text-chat agent system of an existing character. In the generation of verbal behavior, it is a major issue to be able to generate utterance text that reflects the personality of existing characters in response to any user questions. For this problem, we propose the use role play-based question-answering to efficiently collect high-quality paired data of user’s questions and system’s answers reflecting the personality of an anime character. We also propose a new utterance generation method that uses a neural translation model with the collected data. Rich and natural expressions of nonverbal behavior greatly enhance the appeal of agent systems. However, not all existing anime characters move as naturally and as diversely as humans. Therefore, we propose a method that can automatically generate whole-body motion from spoken text in order to make it so that anime characters have human-like and natural movements. In addition to these movements, we try to add a small amount of characteristic movement on a rule basis to reflect personality. We created a text-dialogue agent system of a popular existing anime character using our proposed generation methods. As a result of a subjective evaluation of the implemented system, our models for generating verbal and nonverbal behavior improved the impression of the agent’s responsiveness and reflected the personality of the character. In addition, generating characteristic motions with a small amount of on the basis of heuristic rules was not effective, but rather the character generated by our generation model that reflects the average motion of persons had more personality. Therefore, our proposed methods for generating verbal and nonverbal behaviors and the construction method will greatly contribute to the realization of text-dialogue-agent systems of existing characters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://www.nicovideo.jp/.
2.
https://lucene.apache.org/.

References

Fuchi, T., Takagi, S.: Japanese morphological analyzer using word cooccurrence -JTAG. In: International Conference on Computational Linguistics, pp. 409–413 (1998)
Google Scholar
Higashinaka, R., et al.: Towards an open-domain conversational system fully based on natural language processing. In: International Conference on Computational Linguistics, pp. 928–939 (2014)
Google Scholar
Higashinaka, R., Sadamitsu, K., Saito, K., Kobayashi, N.: Question answering technology for pinpointing answers to a wide range of questions. NTT Tech. Rev. 11(7) (2013)
Google Scholar
Imamura, K.: Analysis of Japanese dependency analysis of semi-spoken words by series labeling. In: Proceedings of the Annual Meeting of the Association for Natural Language Processing, pp. 518–521 (2007)
Google Scholar
Ishi, C.T., Haas, J., Wilbers, F.P., Ishiguro, H., Hagita, N.: Analysis of head motions and speech, and head motion control in an android. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 548–553 (2007)
Google Scholar
Ishi, C.T., Ishiguro, H., Hagita, N.: Head motion during dialogue speech and nod timing control in humanoid robots. In: ACM/IEEE International Conference on Human-Robot Interaction, pp. 293–300 (2010)
Google Scholar
Kadono, Y., Takase, Y., Nakano, Y.I.: Generating iconic gestures based on graphic data analysis and clustering. In: The Eleventh ACM/IEEE International Conference on Human Robot Interaction, HRI 2016, Piscataway, NJ, USA, pp. 447–448. IEEE Press (2016)
Google Scholar
Leuski, A., Patel, R., Traum, D., Kennedy, B.: Building effective question answering characters. In: Proceedings of the SIGDIAL, pp. 18–27 (2009)
Google Scholar
Lohse, M., Rothuis, R., Gallego-Pérez, J., Karreman, D.E., Evers, V.: Robot gestures make difficult tasks easier: the impact of gestures on perceived workload and task performance. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI 2014, pp. 1459–1466. ACM, New York (2014)
Google Scholar
McNeill, D.: Hand and Mind: What Gestures Reveal About Thought. University of Chicago, Chicago Press (1996)
Google Scholar
Meguro, T., Higashinaka, R., Minami, Y., Dohsaka, K.: Controlling listening-oriented dialogue using partially observable Markov decision processes. In: International Conference on Computational Linguistics, pp. 761–769 (2010)
Google Scholar
Van Ments, M.: The Effective Use of Role Play: Practical Techniques for Improving Learning. Kogan Page Publishers, London (1999)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Proceedings of the NIPS, pp. 3111–3119 (2013)
Google Scholar
Miyazaki, C., Hirano, T., Higashinaka, R., Matsuo, Y.: Towards an entertaining natural language generation system: linguistic peculiarities of Japanese fictional characters. In: Proceedings of the SIGDIAL, pp. 319–328 (2016)
Google Scholar
Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. CoRR, abs/1503.03832 (2015)
Google Scholar
Sekine, S., Sudo, K., Nobata, C.: Extended named entity hierarchy. In: Proceedings of the LREC (2002)
Google Scholar
Vinyals, O., Le, Q.: A neural conversational model. arXiv preprint arXiv:1506.05869 (2015)
Wittenburg, P., Brugman, H., Russel, A., Klassmann, A., Sloetjes, H : Elan a professional framework for multimodality research. In: International Conference on Language Resources and Evaluation (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

NTT Media Intelligence Laboratories, NTT Corporation, 1-1, Hikari-no-oka, Yokosuka-shi, Kanagawa, Japan
Ryo Ishii, Ryuichiro Higashinaka, Koh Mitsuda, Taichi Katayama, Junji Tomita & Yushi Aono
NTT Communication Science Laboratories, NTT Corporation, 2-4, Hikaridai, Seika-cho, “Keihanna Science City”, Kyoto, Japan
Masahiro Mizukami
DWANGO Co., Ltd., Kabukiza Tower, 4-12-15 Ginza, Chuo-ku, Tokyo, Japan
Hidetoshi Kawabata, Emi Yamaguchi & Noritake Adachi

Authors

Ryo Ishii
View author publications
You can also search for this author in PubMed Google Scholar
Ryuichiro Higashinaka
View author publications
You can also search for this author in PubMed Google Scholar
Koh Mitsuda
View author publications
You can also search for this author in PubMed Google Scholar
Taichi Katayama
View author publications
You can also search for this author in PubMed Google Scholar
Masahiro Mizukami
View author publications
You can also search for this author in PubMed Google Scholar
Junji Tomita
View author publications
You can also search for this author in PubMed Google Scholar
Hidetoshi Kawabata
View author publications
You can also search for this author in PubMed Google Scholar
Emi Yamaguchi
View author publications
You can also search for this author in PubMed Google Scholar
Noritake Adachi
View author publications
You can also search for this author in PubMed Google Scholar
Yushi Aono
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ryo Ishii .

Editor information

Editors and Affiliations

University of Crete and Foundation for Research and Technology – Hellas (FORTH), Heraklion, Crete, Greece
Constantine Stephanidis
University of Central Florida, Orlando, FL, USA
Gavriel Salvendy
University of West Florida, Pensacola, FL, USA
June Wei
Tokyo University of Science, Tokyo, Japan
Sakae Yamamoto
Tokyo City University, Tokyo, Japan
Hirohiko Mori
Towson University, Towson, MD, USA
Gabriele Meiselwitz
Missouri University of Science and Technology, Rolla, MO, USA
Fiona Fui-Hoon Nah
Missouri University of Science and Technology, Rolla, MO, USA
Keng Siau

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ishii, R. et al. (2020). Methods of Efficiently Constructing Text-Dialogue-Agent System Using Existing Anime Character. In: Stephanidis, C., et al. HCI International 2020 – Late Breaking Papers: Interaction, Knowledge and Social Media. HCII 2020. Lecture Notes in Computer Science(), vol 12427. Springer, Cham. https://doi.org/10.1007/978-3-030-60152-2_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-60152-2_25
Published: 27 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60151-5
Online ISBN: 978-3-030-60152-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics