Abstract
We present an approach to mixed initiative dialogue in acoustic user interfaces to databases. First, we discuss how we distinguish between initiative and control in mixed initiative information retrieval dialogue and how the notions of taking, keeping, and relinquishing initiative and control are reflected in our approach. Based on this discussion, we develop a dialogue planning algorithm. This algorithm distinguished between resources and routines and between the type and the content of an utterance; type and content are calculated separately by routines that reason on the resources – a dialogue model, a dialogue history, and an application description. Through this division we achieve a dialogue where the system adapts to the user's attempts at changing the direction of a dialogue. Finally, we argue that automatic segmentation of the dialogue and automatic tracking of initiative and control is inherent to our approach.
Similar content being viewed by others
References
Allen, J. and C. Perrault: 1980, Analyzing intention in utterances. Artificial Intelligence 15(3), 143–178.
Austin, J.: 1962, How to Do Things with Words. London: Oxford University Press.
Baekgaard, P., O. Bernsen, T. Brøndsted, P. Dalsgaard, H. Dybkjær, L. Dybkjær, J. Kristiansen, L. Larsen, B. Lindberg, B. Maegaard, B. Music, L. Offersgaard, and C. Povlsen: 1994, The danish spoken language dialogue project - A general overview. In: Proc. ESCA Wshp. on Spoken Dialogue Systems; Theories and Applications. pp. 89–92.
Ball, J. E. and D. T. Ling: 1994, Spoken language processing in the persona conversational assistant. In: Proc. ESCA Wshp. on Spoken Dialogue Systems; Theories and Applications. pp. 109–112.
Bennacef, S., F. Néel, and H. Maynard: 1995, An oral dialogue model based on speech acts categorization. In: Proc. ESCA Wshp. on Spoken Dialogue Systems; Theories and Applications. pp. 237–240.
Bilange, E.: 1991, A task independent oral dialogue model. In: Proc. of 5th Euro. Conf. of the ACL. pp. 83–87.
Blomberg, M., R. Carlson, K. Elenius, B. Granström, J. Gustafson, S. Hunnicutt, R. Lindell, and L. Neovius: 1993, An experimental dialogue system: WAXHOLM. In: Proc. European Conf. on Speech Communication and Technology (Eurospeech'93). pp. 1867–1870.
Caminero-Gil, J., J. Alvarez-Cercadillo, C. Crespo-Casas, and D. Tapias-Merino: 1996, Data-driven discourse modeling for semantic interpretation. In: Proc. of 1996 Intl. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'96). pp. 401–404.
Carberry, S.: 1990, Plan Recognition in Natural Language Dialogue. Cambridge, MA. London, England: MIT Press.
Chu-Carroll, J. and M. Brown: 1998, An evidential model for tracking initiative in collaborative dialogue interactions. User Modeling and User-Adapted Interaction. 8(3–4) pp. 215–254.
Chu-Carroll, J. and S. Carberry: 1995, Response generation in collaborative negotiation. In: Proc. of the 33th Annual Meeting of the ACL. Also available as http://xxx.lanl.gov/cmp-lg/9505001.
Cohen, P.: 1998, Dialogue modeling. In: R. Cole, J. Mariani, H. Uszkoreit, G. Varile, A. Zaenen, and A. Zampolli (eds.): Survey of the State of the Art in Human Language Technology. Cambridge University Press, Cambridge, Chapt. 6.3. Also available at http://www.cse.ogi.edu/CSLU/ HLTsurvey/.
Cohen, R., C. Allaby, C. Cumbaa, M. Fitzgerald, K. Ho, B. Hui, C. Latulipe, F. Lu, N. Moussa, D. Pooley, A. Qian, and S. Siddiqi: 1998, What is initiative?. User Modeling and User-Adapted Interaction. 8(3–4) pp. 171–214.
Cristea, D. and B.Webber: 1997, Expectations in incremental discourse. In: Proc. of the 35th Annual Meeting of the ACL and the 8th Conf. of the European ACL. pp. 88–95.
Dahlbäck, N. and A. Jönsson: 1992, An empirically based computationally tractable dialogue model. In: Proc. of the 14th Annual Conference of the Cognitive Science Society (COGSCI-92). Bloomington, Indiana.
EAGLES: 1997, Handbook of Standards and Resources for Spoken Language Systems. http://coral.lili.uni-bielefeld.de/EAGLES/eagbook/eagbook.html.
Eckert, W., G. Fink, A. Kießling, R. Kompe, T. Kuhn, F. Kummert, M. Mast, H. Niemann, E. Nöth, R. Prechtel, S. Rieck, G. Sagerer, A. Scheuer, G. Schukat-Talamazzini, and B. Seestaedt: 1992, EVAR: Ein sprachverstehendes Dialogsystem. In: KONVENS 92. pp. 49–58.
Feldes, S., G. Fries, E. Hagen, and A.Wirth: 1998, A novel service creation environment for speechenabled database access. In: Proc. of 4th IEEE Workshop on Interactive Voice Technology for Telecommunications Applications (IVT-TA'98).
Glass, J., G. Flammia, D. Goodine, M. Phillips, J. Polifroni, S. Sakai, S. Seneff, and V. Zue: 1995, Multilingual spoken-language understanding in theMIT voyager system. Speech Communication 17(1–2), 1–18.
Goddeau, D., H. Meng, J. Polifroni, S. Seneff, and S. Busayapongchai: 1996, A form-based dialogue manager for spoken language applications. In: Proc. of the 1996 Intl. Conf. on Spoken Language Processing (ICSLP'96).
Green, N. and S. Carberry: 1999, A computational mechanism for initiative in answer generation'. User Modeling and User-Adapted Interaction.This issue.
Grice, H.: 1975, Logic and conversation. In: P. Cole and J. Morgan (eds.): Syntax and Semantics, Vol. 3: Speech Acts. Academic Press, pp. 41–58.
Grosz, B. and C. Sidner: 1986, Attention, intentions, and the structure of discourse. Computational Linguistics 12(3), 175–204.
Hagen, E. and B. Grote: 1997, Generating efficient mixed initiative dialogue. In: Proc. ACL Workshop Interactive Spoken Dialog Systems: Bringing Speech and NLP Together in Real Applications. pp. 53–56.
Hagen, E. and A. Stein: 1996, Automatic generation of a complex dialogue history. In: Proc. 11th Canadian Conference on Artificial Intelligence (AI96). pp. 84–96.
Jönsson, A.: 1993, A dialogue manager using initiative-response units and distributed control. In: Proc. of 6th Euro. Conf. of the ACL.
Jordan, P. and B. Di Eugenio: 1997, Control and initiative in collaborative problem solving dialogues. In: Working Notes of the AAAI-97 Spring Symposium on Computational Models for Mixed Initiative Interaction. pp. 81–84.
Kamp, H. and U. Reyle: 1993, From Discourse to Logic, vol. 42 of Studies in Linguistics and Philosophy. Kluwer Academic Publisher, Dordrecht.
Kaspar, B., G. Fries, K. Schuhmacher, and A. Wirth: 1995, Faust - A directory assistance demonstrator. In: Proc. European Conf. on Speech Communication and Technology (Eurospeech'93). pp. 1161–1164.
Kaspar, B., K. Schuhmacher, and S. Feldes: 1997, Barge-in revised. In: Proc. European Conf. on Speech Communication and Technology (Eurospeech'97).
Lambert, L. and S. Carberry: 1991, A tripartite plan-based model of dialogue. In: Proc. of the 29th Annual Meeting of the ACL. pp. 47–54.
Lester, J., B. Stone, and G. Stelling: 1999, Lifelike pedagogical agents for mixed-initiative problem solving in constructivist learning environments. User Modeling and User-Adapted Interaction.This issue.
Litman, D. and J. Allen: 1987, A plan recognition model for subdialogues in conversations. Cognitive Science 11, 163–200.
Mann, W. and S. Thompson: 1988, Rhetorical structure theory: Toward a functional theory of text organization. Text 8(3), 243–281.
Naito, M., S. Kuroiwa, K. Takeda, and S. Y. F. Yato: 1994, A real-time speech dialogue system for a voice activated telephone extension service. In: Proc. ESCA Wshp. on Spoken Dialogue Systems; Theories and Applications. pp. 129–132.
O'Donnell, M.: 1990, A dynamic model of exchange. Word 41(3), 293–327.
Oerder, M. and H. Aust: 1993, A realtime prototype of an automatic inquiry system. In: Proc. of the 1994 Intl. Conf. on Spoken Language Processing (ICSLP'94). pp. 703–706.
Peckham, J.: 1993, A new generation of spoken dialogue systems: Results and lessons from the SUNDIAL project. In: Proc. European Conf. on Speech Communication and Technology (Eurospeech'93).
Polanyi, R. and R. Scha: 1984, A syntactic approach to discourse semantics. In: Proc. of the 10th Intl. Conf. on Computational Linguistics (COLING'84). pp. 413–419.
Reichman, R.: 1985, Getting Computers to Talk Like You and Me. Cambridge, MA: MIT Press.
Sadek, M.: 1990, Logical task modelling for man-machine dialogue. In: Proc. of the Natl. Conf. on Artificial Intelligence (AAAI'90). pp. 970–975.
Sadek, M., Bretier, V. Cadoret, A. Cozannet, P. Dupont, A. Ferrieux, and F. Panaget: 1994, A cooperative spoken dialogue system based on a rational agent model: A first implementation on the AGS application. In: Proc. ESCA Wshp. on Spoken Dialogue Systems; Theories and Applications. pp. 145–148.
Sinclair, J. and R. Coulthard: 1975, Towards an Analysis of Discourse: The English Used by Teachers and Pupils. London: Oxford University Press.
Sitter, S. and A. Stein: 1992, Modelling the illocutionary aspects of information-seeking dialogues. Information Processing and Management 8(2), 165–180.
Smith, R. and D. Hipp: 1994, Spoken Natural Language Dialog Systems; A Practical Approach. New York, Oxford: Oxford University Press.
Spiegel, M. and C. Kamm: 1997, Special issue on interactive voice technology for telecommunication applications (IVTTA'96). Speech Communication 23(1–2).
Stein, A., J. A. Gulla, and U. Thiel: 1999, User tailored planning of mixed inititative information seeking dialogues. User Modeling and User-Adapted Interaction.This issue.
Traum, D. and E. Hinkelman: 1992, Conversation acts in task-oriented spoken dialogue. Computational Intelligence 8(3), 575–599.
Walker, M., D. Litman, C. Kamm, and A. Abella: 1997, PARADISE: A framework for evaluating spoken dialogue agents. In: Proc. of the 35th Annual Meeting of the ACL and the 8th Conf. of the European ACL. pp. 271–280.
Walker, M. and S. Whittaker: 1990, Mixed initiative in dialogue: An investigation into discourse segmentation. In: Proc. of the 28th Annual Meeting of the ACL. pp. 70–78.
Whittaker, S. andD. Attwater: 1994, Advanced speech applications - The integration of speech technology into complex services. In: Proc. ESCA Wshp. on Spoken Dialogue Systems; Theories and Applications. pp. 113–116.
Whittaker, S. and P. Stenton: 1988, Cues and control in expert-client dialogues. In: Proc. of the 26th Annual Meeting of the ACL. pp. 123–130.
Young, S., A. Hauptmann, W. Ward, E. Smith, and P. Werner: 1990, High level knowledge sources in usable speech recognition systems. In: A. Waibel and K. Lee (eds): Readings in Speech Recognition. San Mateo, CA: Morgan Kaufman, pp. 538–549.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Hagen, E. An Approach to Mixed Initiative Spoken Information Retrieval Dialogue. User Modeling and User-Adapted Interaction 9, 167–213 (1999). https://doi.org/10.1023/A:1008300826159
Issue Date:
DOI: https://doi.org/10.1023/A:1008300826159