skip to main content
10.1145/2933242.2933262acmconferencesArticle/Chapter ViewAbstractPublication PageseicsConference Proceedingsconference-collections
research-article

A language-based model for specifying and staging mixed-initiative dialogs

Published:21 June 2016Publication History

ABSTRACT

Specifying and implementing flexible human-computer dialogs, such as those used in kiosks, is complex because of the numerous and varied directions in which each user might steer a dialog. The objective of this research is to improve dialog specification and implementation. To do so we developed a model for specifying and staging mixed-initiative dialogs. The model involves a dialog authoring notation, based on concepts from programming languages, for specifying a variety of unsolicited reporting, mixed-initiative dialogs in a concise representation that serves as a design for dialog implementation. Guided by this foundation, we built a dialog staging engine which operationalizes dialogs specified in this notation. The model, notation, and engine help automate the engineering of mixed-initiative dialog systems. These results also provide a proof-of-concept for dialog specification and implementation from the perspective of theoretical programming languages. The ubiquity of dialogs in domains such as travel, education, and health care with the increased use of interactive voice-response systems and virtual environments provide a fertile landscape for further investigation of these results.

References

  1. J. F. Allen. 1999. Mixed-Initiative Interaction. IEEE Intelligent Systems 14, 5 (1999), 14--16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. F. Baader and T. Nipkow. 1999. Term Rewriting and All That. Cambridge University Press, Cambridge, UK. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. T. Ball, C. Colby, P. Danielsen, L. J. Jagadeesan, R. Jagadeesan, K. Läufer, P. Mataga, and K. Rehor. 2000. Sisl: Several Interfaces, Single Logic. International Journal of Speech Technology 3, 2 (2000), 93--108.Google ScholarGoogle ScholarCross RefCross Ref
  4. F. Bechet, G. Riccardi, and D. Hakkani-Tür. 2004. Mining Spoken Dialogue Corpora for System Evaluation and Modeling. In Proceedings of the Association for Computational Linguistics (ACL) Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Stroudsburg, PA, 134--141.Google ScholarGoogle Scholar
  5. D. Bohus and A. I. Rudnicky. 2003. RavenClaw: Dialog management using hierarchical task decomposition and an expectation agenda. In Proceedings of the Sixth Annual INTERSPEECH Conference. International Speech Communication Association.Google ScholarGoogle Scholar
  6. D. Bohus and A. I. Rudnicky. 2009. The RavenClaw Dialog Management Framework: Architecture and Systems. Computer Speech and Language 23, 3 (2009), 332--361. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Capra, M. Narayan, S. Perugini, N. Ramakrishnan, and M. A. Pérez-Quiñones. 2003. The Staging Transformation Approach to Mixing Initiative. In Working Notes of the IJCAI 2003 Workshop on Mixed-Initiative Intelligent Systems, G. Tecuci (Ed.). AAAI/MIT Press, Menlo Park, CA, 23--29.Google ScholarGoogle Scholar
  8. J. Chu-Carroll. 2000. MIMIC: An adaptive mixed initiative spoken dialogue system for information queries. In Proceedings of the Sixth Conference on Applied Natural Language Processing (ANLC). Association for Computational Linguistics, Stroudsburg, PA, 97--104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Dix, J. Finlay, G. D. Abowd, and R. Beale. 2010. Human-Computer Interaction (third ed.). Prentice Hall, Harlow, England, Chapter 16: Dialog Notations and Design. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Dubrow. Seven Cyberlearning Technologies Transforming Education. Huffington Post, 6 April 2015. Available: http://www.huffingtonpost.com/aaron-dubrow/7-cyberlearning-technolog_b_6988976.html {Last accessed: 6 May 2016}.Google ScholarGoogle Scholar
  11. J. Feng, D. Hakkani-Tür, G. Di Fabbrizio, M. Gilbert, and M. Beutnagel. 2006. Webtalk: Towards Automatically Building Spoken Dialog Systems Through Mining Websites. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE Computer Society Press, Los Alamitos, CA, 573--576.Google ScholarGoogle Scholar
  12. R. Freedman. 2000. Using a Reactive Planner as the Basis for a Dialogue Agent. In Proceedings of the Thirteenth International Florida Artificial Intelligence Research Society Conference. 203--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. D. P. Friedman and M. Wand. 2008. Essentials of Programming Languages (third ed.). MIT Press, Cambridge, MA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Glass and S. Seneff. 2003. Flexible and Personalizable Mixed-initiative Dialogue Systems. In Proceedings of the North American Chapter of the Association for Computational Linguistics (ACL): Human Language Technologies (NAACL-HLT) Workshop on Research Directions in Dialogue Processing. Association for Computational Linguistics, Stroudsburg, PA, 19--21. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. P. Graunke, R. Findler, S. Krishnamurthi, and M. Felleisen. 2001. Automatically Restructuring Programs for the Web. In Proceedings of the Sixteenth IEEE International Conference on Automated Software Engineering (ASE). 211--222. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Green. 1986. A Survey of Three Dialogue Models. ACM Transactions on Graphics 5, 3 (1986), 244--275. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C. I. Guinn. 1999. Evaluating Mixed-initiative Dialog. IEEE Intelligent Systems 14, 5 (1999), 21--23.Google ScholarGoogle Scholar
  18. E. Hagen and B. Grote. 1997. Planning Efficient Mixed-initiative Dialogue. In Proceedings of the Association for Computational Linguistics (ACL) Conference on Interactive Spoken Dialog Systems on Bringing Speech and NLP Together in Real Applications (ISDS). Association for Computational Linguistics, Stroudsburg, PA, 53--56. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. Haller and S. McRoy (Eds.). 1997. Proceedings of the AAAI Spring Symposium on Computational Models for Mixed Initiative Interaction. Number SS-97-04. AAAI Press, Menlo Park, CA.Google ScholarGoogle Scholar
  20. J. Hochberg, N. Kambhatla, and S. Roukos. 2002. A Flexible Framework for Developing Mixed-initiative Dialog Systems. In Proceedings of the Third Association for Computational Linguistics (ACL) SIGDIAL Workshop on Discourse and Dialogue. Association for Computational Linguistics, Stroudsburg, PA, 60--63. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. B. Inouye. 2004. Minimizing the Length of Non-mixed Initiative Dialogs. In Proceedings of the Association for Computational Linguistics (ACL) Workshop on Student Research. Association for Computational Linguistics, Stroudsburg, PA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. N. D. Jones. 1996. An Introduction to Partial Evaluation. Comput. Surveys 28, 3 (1996), 480--503. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. P. Jordan, M. Ringenberg, and B. Hall. 2006. Rapidly Developing Dialogue Systems that Support Learning Studies. In Proceedings of Intelligent Tutoring Systems (ITS) Workshop on Teaching with Robots, Agents, and NLP. 1--8.Google ScholarGoogle Scholar
  24. A. Kobsa. 1988. User Models and Dialog Models: United They Stand. Computational Linguistics 14, 3 (1988), 91--94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. D. L. Kreher and D. R. Stinson. 1999. Combinatorial Algorithms: Generation, Enumeration, and Search. CRC Press, Boca Raton, FL.Google ScholarGoogle Scholar
  26. C. Lee, S. Jung, K. Kim, D. Lee, and G. G. Lee. 2010. Recent approaches to dialog management for spoken dialog systems. Journal of Computing Science and Engineering 4, 1 (2010), 1--22.Google ScholarGoogle ScholarCross RefCross Ref
  27. A. Leuski and D. Traum. 2011. NPCEditor: Creating Virtual Human Dialogue Using Information Retrieval Techniques. AI Magazine 32, 2 (2011), 42--56.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. S. N. Malkov. 2010. Customizing a Functional Programming Language for Web Development. Computer Languages, Systems and Structures 36, 4 (2010), 345--351. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. T. Misu, K. Georgila, A. Leuski, and D. Traum. 2012. Reinforcement Learning of Question-answering Dialogue Policies for Virtual Museum Guides. In Proceedings of the Thirteenth Annual Meeting of the Special Interest Group on Discourse and Dialogue. Association for Computational Linguistics, Stroudsburg, PA, 84--93. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Narayan, C. Williams, S. Perugini, and N. Ramakrishnan. 2004. Staging Transformations for Multimodal Web Interaction Management. In Proceedings of the Thirteenth International ACM World Wide Web Conference (WWW), M. Najork and C. E. Wills (Eds.). ACM Press, New York, NY, 212--223. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. H. Partsch and R. Steinbrüggen. 1983. Program Transformation Systems. Comput. Surveys 15, 3 (1983), 199--236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. M. A. Pérez-Quiñones. 1996. Conversational Collaboration in User-initiated Interruption and Cancellation Requests. Ph.D. dissertation. The George Washington University.Google ScholarGoogle Scholar
  33. J. Polifroni, G. Chung, and S. Seneff. 2003. Towards the Automatic Generation of Mixed-Initiative Dialogue Systems from Web Content. In Proceedings of the Eighth European Conference on Speech Communication and Technology (EUROSPEECH). International Speech Communication Association, 193--196.Google ScholarGoogle Scholar
  34. D. Quan, D. Huynh, D. R. Karger, and R. Miller. 2003. User Interface Continuations. In Proceedings of the Sixteenth Annual ACM Symposium on User Interface Software and Technology (UIST). ACM Press, New York, NY, 145--148. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. C. Queinnec. 2000. The Influence of Browsers on Evaluators or, Continuations to Program Web Servers. In Proceedings of the Fifth ACM SIGPLAN International Conference on Functional Programming (ICFP). ACM Press, New York, NY, 23--33. Also appears in ACM SIGPLAN Notices, 35(9), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. N. Ramakrishnan, R. Capra, and M. A. Pérez-Quiñones. 2002. Mixed-Initiative Interaction = Mixed Computation. In Proceedings of the ACM SIGPLAN Workshop on Partial Evaluation and Semantics-Based Program Manipulation (PEPM), P. Thiemann (Ed.). ACM Press, New York, NY, 119--130. Also appears in ACM SIGPLAN Notices, 37(3), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. F. Ricca and P. Tonella. 2001. Web Application Slicing. In Proceedings of the International Conference on Software Maintenance (ICSM). IEEE Computer Society Press, Los Alamitos, CA, 148--157. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. F. Ricca, P. Tonella, and I. D. Baxter. 2001. Restructuring Web Applications via Transformation Rules. In Proceedings of the First International Workshop on Source Code Analysis and Manipulation (SCAM). IEEE Computer Society Press, Los Alamitos, CA, 150--160.Google ScholarGoogle Scholar
  39. A. Rudnicky, E. Thayer, P. Constantinides, C. Tchou, R. Stern, K. Lenzo, W. Xu, and A. Oh. 1999. Creating natural dialogs in the Carnegie Mellon communicator system. In Proceedings of the Sixth European Conference on Speech Communication and Technology (EUROSPEECH). International Speech Communication Association.Google ScholarGoogle Scholar
  40. A. Rudnicky and W. Xu. 1999. An agenda-based dialog management architecture for spoken language systems. IEEE Automatic Speech Recognition and Understanding Workshop 13, 4 (1999).Google ScholarGoogle Scholar
  41. R. C. Schank and R. P. Abelson. 1977. Scripts, Plans, Goals and Understanding: an Inquiry into Human Knowledge Structures. L. Erlbaum, Hillsdale, NJ.Google ScholarGoogle Scholar
  42. D. Stallard. 2001. Dialogue management in the Talk'n'Travel system. In Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE Computer Society Press, Los Alamitos, CA, 235--239.Google ScholarGoogle ScholarCross RefCross Ref
  43. M. Walker, L. Hirschman, and J. Aberdeen. 2000. Evaluation for DARPA communicator spoken dialogue systems. In Proceedings Second International Conference on Language Resources and Evaluation. European Language Resources Association.Google ScholarGoogle Scholar
  44. M. Walker and S. Whittaker. 1990. Mixed-initiative in dialogue: An investigation into discourse segmentation. In Proceedings of the Twenty-eighth Annual Meeting on Association for Computational Linguistics (ACL). Association for Computational Linguistics, Stroudsburg, PA, 70--78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. W. Wong, L. Cavedon, J. Thangarajah, and L. Padgham. 2012. Mixed-initiative Conversational System Using Question-answer Pairs Mined from the Web. In Proceedings of the Twenty-first ACM International Conference on Information and Knowledge Management (CIKM). ACM Press, New York, NY, 2707--2709. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. X. Yao, E. Tosch, G. Chen, E. Nouri, R. Artstein, A. Leuski, K. Sagae, and D. Traum. 2012. Creating conversational characters using question generation tools. Dialogue and Discourse 3, 2 (2012), 125--146.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A language-based model for specifying and staging mixed-initiative dialogs

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader