Skip to main content

A Granular Architecture for Dynamic Realtime Dialogue

  • Conference paper
Intelligent Virtual Agents (IVA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5208))

Included in the following conference series:

Abstract

We present a dialogue architecture that addresses perception, planning and execution of multimodal dialogue behavior. Motivated by realtime human performance and modular architectural principles, the architecture is full-duplex (“open-mic”); prosody is continuously analyzed and used for mixed-control turntaking behaviors (reactive and deliberative) and incremental utterance production. The architecture is fine-grain and highly expandable; we are currently applying it in more complex multimodal interaction and dynamic task environments. We describe here the theoretical underpinnings behind the architecture, compare it to prior efforts, discuss the methodology and give a brief overview of its current runtime characteristics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Moore, R.K.: PRESENCE: A Human-Inspired Architecture for Speech-Based Human-Machine Interaction. IEEE Transactions on Computers 56(9), 1176–1188 (2007)

    Article  Google Scholar 

  2. Raux, A., Eskenazi, M.: A Multi-Layer Architecture for Semi-Synchronous Event-Driven Dialogue Management, ASRU, Japan, 514–519 (2007)

    Google Scholar 

  3. Allen, J., Ferguson, G., Stent, A.: An Architecture for More Realistic Conversational Systems. In: IUI, pp. 14–17. ACM Press, Santa Fe (2001)

    Google Scholar 

  4. O’Connell, D.C., Kowal, S., Kaltenbacher, E.: Turn-Taking: A Critical Analysis of the Research Tradition. Journal of Psycholinguistic Research 19(6), 345–373 (1990)

    Article  Google Scholar 

  5. Thórisson, K.R., Benko, H., Arnold, A., Abramov, D., Maskey, S., Vaseekaran, A.: Constructionist Design Methodology for Interactive Intelligences. A.I. Magazine. American Association for Artificial Intelligence 25(4), 77–90 (2004)

    Google Scholar 

  6. Thórisson, K.R.: A Mind Model for Multimodal Communicative Creatures and Humanoids.   International J. Appl. Artif. Intell. 13(4-5), 449–486 (1999)

    Article  Google Scholar 

  7. Thórisson, K.R.: Natural Turn-Taking Needs No Manual: Computational Theory and Model, from Perception to Action. In: Granström, B., House, D., Karlsson, I. (eds.) Multimodality in Language and Speech Systems, pp. 173–207. Kluwer Academic Publishers, Dordrecht (2002)

    Google Scholar 

  8. Jonsdottir, G.R., Thórisson, K.R., Nivel, E.: Learning Smooth, Human-Like Turntaking in Realtime Dialogue. In: Prendinger, H., Lester, J., Ishizuka, M. (eds.) IVA 2008. LNCS (LNAI), vol. 5208, Springer, Heidelberg (2008)

    Google Scholar 

  9. Pan, S., McKeown, K.R.: Integrating Language Generation with Speech Synthesis in a Concept to Speech System. In: Proceedings of the ACL Workshop on Concept to Speech Generation Systems. ACL/EACL (1997)

    Google Scholar 

  10. Grote, B., Hagen, E., Teich, E.: Matchmaking: Dialogue Modeling and Speech Generation Meet. In: Proceedings of the 1996 International Workshop on Natural Language Generation, Herstmonceux, England, pp. 171–180 (1996)

    Google Scholar 

  11. Wilson, M., Wilson, T.P.: An oscillator model of the timing of turn-taking. Psychonomic Bulletin and Review 12(6), 957–968 (2005)

    Google Scholar 

  12. Sacks, H., Schegloff, E.A., Jefferson, G.A.: A Simplest Systematics for the Organization of Turn-Taking in Conversation. Language 50, 696–735 (1974)

    Article  Google Scholar 

  13. Thórisson, K.R.: Modeling Multimodal Communication as a Complex System. In: Wachsmuth, I., Lenzen, M., Knoblich, G. (eds.) Modeling Communication with Robots and Virtual Humans, pp. 143–168. Springer, New York (2008)

    Chapter  Google Scholar 

  14. Simon, H.A.: Can there be a science of complex systems? In: Bar-Yam, Y. (ed.) Unifying themes in complex systems: Proceedings from the International Conference on Complex Systems, pp. 4–14. Perseus Press, Cambridge (1999)

    Google Scholar 

  15. Thórisson, K.R., List, T., Pennock, C., DiPirro, J.: Whiteboards: Scheduling Blackboards for Semantic Routing of Messages & Streams. Proceedings of AAAI 2005, AAAI Technical Report WS-05-08, 8-15 (2005)

    Google Scholar 

  16. Thórisson, K.R.: Integrated A.I. Systems. Minds & Machines 17, 11–25 (2007)

    Article  Google Scholar 

  17. Scwabacher, M., Gelsey, A.: Multi-Level Simulation and Numerical Optimization of Complex Engineering Designs. In: 6th AIAA/NASA/USAF Multidisciplinary Analysis & Optmization Symposium, Bellevue, WA, AIAA-96-4021 (1996)

    Google Scholar 

  18. Schaffner, K.F.: Reduction: the Cheshire cat problem and a return to roots. Synthese 151(3), 377–402 (2006)

    Article  Google Scholar 

  19. Ng-Thow-Hing, V., List, T., Thórisson, K.R., Lim, J., Wormer, J.: Design and Evaluation of Communication Middleware in a Distributed Humanoid Robot Architecture. In: IROS 2007 Workshop Measures and Procedures for the Evaluation of Robot Architectures and Middleware, San Diego, California, 29 October - 2 November (2007)

    Google Scholar 

  20. Gaud, N., Gechter, F., Galland, S., Koukam, A.: Holonic Multiagent Multilevel Simulation Application to Real-time Pedestrians Simulation in Urban Environment. In: Proceedings of IJCAI 2007, pp. 1275–1280 (2007)

    Google Scholar 

  21. Arbib, M.A.: Levels of Modeling of Visually Guided Behavior (with peer commentary and author’s response). Behavioral and Brain Sciences 10, 407–465 (1987)

    Article  Google Scholar 

  22. Bonaiuto, J., Thórisson, K.R.: Towards a Neurocognitive Model of Realtime Turntaking in Face-to-Face Dialogue. In: Wachsmuth, I., Lenzen, M., Knoblich, G. (eds.) Embodied Communication in Humans and Machines. Oxford University Press, U.K (2008)

    Google Scholar 

  23. Thórisson, K.R., Jonsdottir, G.R., Nivel, E.: Methods for Complex Single-Mind Architecture Designs. In: Proc. AAMAS, Portugal (June 2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Helmut Prendinger James Lester Mitsuru Ishizuka

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Thórisson, K.R., Jonsdottir, G.R. (2008). A Granular Architecture for Dynamic Realtime Dialogue. In: Prendinger, H., Lester, J., Ishizuka, M. (eds) Intelligent Virtual Agents. IVA 2008. Lecture Notes in Computer Science(), vol 5208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85483-8_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85483-8_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85482-1

  • Online ISBN: 978-3-540-85483-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics