A Wizard-of-Oz System Evaluation Study

  • Conference paper
Text, Speech and Dialogue (TSD 2007)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4629)

Abstract

To evaluate the performance of the dialogue-manager component of a Slovenian and Croatian spoken dialogue system under development, two Wizard-of-Oz experiments were performed. The two experimental settings differed only in how dialogue management was carried out: in the first experiment it was performed by a human (the wizard), and in the second by the newly implemented dialogue-manager component. The data from both experiments were evaluated with the PARADISE evaluation framework, a potential general methodology for evaluating and comparing different versions of spoken-language dialogue systems. The study finds a remarkable difference in the performance functions depending on whether different sums of satisfaction measures, or even individual scores, are taken as the target to be predicted. It also shows that the recently introduced database parameters are indispensable when evaluating information-providing dialogue systems, and it confirms the dialogue manager’s cooperativity, subject to the incorporated knowledge representation.
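
The abstract does not spell out the performance function. In the original PARADISE proposal (Walker et al., 1997), performance is modelled as Performance = α·N(κ) − Σ wᵢ·N(cᵢ), where N is z-score normalization, κ measures task success, the cᵢ are dialogue cost measures, and the weights are obtained by multiple linear regression against user satisfaction. The following is a minimal sketch of that fitting step, not the authors' implementation. The parameter names (`kappa`, `turns`), all numbers, and the choice to normalize the satisfaction target as well are assumptions made purely for illustration.

```python
from statistics import mean, pstdev

def zscore(values):
    """Normalize to zero mean and unit standard deviation (the N(.) above).
    Assumes the values are not all identical."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]

def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def fit_performance(predictors, satisfaction):
    """Least-squares weights for N(satisfaction) ~ sum_i w_i * N(predictor_i).
    Cost measures are expected to receive negative weights."""
    names = list(predictors)
    cols = [zscore(predictors[name]) for name in names]
    y = zscore(satisfaction)
    # Normal equations (X^T X) w = X^T y; no intercept, since every column is centered.
    xtx = [[sum(p * q for p, q in zip(ci, cj)) for cj in cols] for ci in cols]
    xty = [sum(p * q for p, q in zip(ci, y)) for ci in cols]
    return dict(zip(names, solve(xtx, xty)))

# Hypothetical per-dialogue measurements (invented for this sketch).
dialogues = {
    "kappa": [0.2, 0.5, 0.9, 0.4, 0.8, 0.6],  # task success
    "turns": [14, 10, 6, 12, 9, 7],           # a cost measure
}
satisfaction_sum = [18, 25, 36, 22, 31, 30]   # sum of questionnaire scores

print(fit_performance(dialogues, satisfaction_sum))
```

Refitting with a different target, for example a single questionnaire score instead of the sum, generally yields different weights, which is the kind of sensitivity the abstract reports.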

Editor information

Václav Matoušek, Pavel Mautner

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hajdinjak, M., Mihelič, F. (2007). A Wizard-of-Oz System Evaluation Study. In: Matoušek, V., Mautner, P. (eds) Text, Speech and Dialogue. TSD 2007. Lecture Notes in Computer Science, vol 4629. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74628-7_69

  • DOI: https://doi.org/10.1007/978-3-540-74628-7_69

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74627-0

  • Online ISBN: 978-3-540-74628-7

  • eBook Packages: Computer Science (R0)
