Skip to main content

Performance of an Ad-hoc User Simulation in a Formative Evaluation of a Spoken Dialog System

  • Conference paper
  • First Online:
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems Workshop

Abstract

This paper addresses the performance of a user simulator in producing interactions with a spoken dialog system which include usability problems. Simple and general models of user behavior and speech understanding performance, as they are known in early design stages, are shown to detect up to 85% of the usability problems found in a real user test. Data can be reordered to allow a quick manual inspection of the test result. Thus, in an iterative design process small iterations, each incorporating knowledge gained from simulations, are recommended to efficiently improve the system design.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ai, H. and Weng, F. (2008) User Simulation as Testing for Spoken Dialog Systems, in: Proc. of SIGdial 2008, Columbus, Ohio, pp. 164–171.

    Google Scholar 

  2. Araki, M. and Doshita, S. (1997) Automatic Evaluation Environment for Spoken Dialogue Systems, in: Proc. of ECAI ’96, Springer, London, UK, pp. 183–194.

    Google Scholar 

  3. Chung, G. (2004) Developing a flexible spoken dialog system using simulation, in: Proc. of ACL ’04, Barcelona, Spain.

    Google Scholar 

  4. Cohen, M. H., Giangola, J. P., Balogh, J. (2004) Voice User Interface Design, Addison-Wesley, Boston, USA.

    Google Scholar 

  5. Desurvire, H. W. (1994) Faster, Cheaper!! Are Usability Inspection Methods as Effective as Empirical Testing? In: J. Nielsen and R. L. Mack (eds), Usability Inspection Methods, John Wiley and Sons, New York, NY, USA.

    Google Scholar 

  6. Eckert, W., Levin, E., Pieraccini, R. (1997) User Modeling for Spoken Dialogue System Evaluation, in: Proc. of ASRU 1997, Santa Barbara, CA, USA.

    Google Scholar 

  7. Engelbrecht, K.-P., Quade, M., Möller, S. (2009) Analysis of a New Simulation Approach to Dialogue System Evaluation. Speech Communication, 51, pp. 1234–1252.

    Article  Google Scholar 

  8. Ito, A., Shimada, K., Suzuki, M., Makino, S. (2006) A User Simulator Based on VoiceXML for Evaluation of Spoken Dialog Systems, in: Proc. of Interspeech 2006, Pittsburgh, PA, USA.

    Google Scholar 

  9. ITU-T Recommendation P.851 (2003) Subjective Quality Evaluation of Telephone Services Based on Spoken Dialogue Systems. International Telecommunication Union, Geneva, Switzerland.

    Google Scholar 

  10. ITU-T Supplement 24 to P-Series Recommendations (2005) Parameters Describing the Interaction with Spoken Dialogue Systems. International Telecommunication Union, Geneva, Switzerland.

    Google Scholar 

  11. Kieras, D. E. (2003) Model-based Evaluation. In: Jacko, J. and Sears, A. (eds), The Human-Computer Interaction Handbook, Erlbaum, Mahwah, NJ, USA, pp. 1191–1208.

    Google Scholar 

  12. López-Cózar, R., de la Torre, A., Segura, J. C., Rubio, A. J. (2003) Assessment of dialogue systems by means of a new simulation technique. Speech Communication, 40(3), pp. 387–407.

    Article  Google Scholar 

  13. López-Cózar, R., Callejas, Z., McTear, M. (2006) Testing the Performance of Spoken Dialogue Systems by Means of an Artificially Simulated User. Artificial Intelligence Review 26, pp. 291–323.

    Article  Google Scholar 

  14. López-Cózar, R., Espejo, G., Callejas, Z., Gutiérrez, A., Griol, D. (2009) Assessment of Spoken Dialogue Systems by Simulating Different Levels of User Cooperativeness, in: Proc. of the IWSDS 2009, Kloster Irsee, Germany.

    Google Scholar 

  15. Möller, S. (2005) Quality of Telephone-based Spoken Dialog Systems. Springer, New York, USA.

    Google Scholar 

  16. Nielsen, J. (1993) Usability Engineering, Academic Press, San Diego, CA.

    MATH  Google Scholar 

  17. Schatzmann, J., Thomson, B., Weilhammer, K., Ye, H., Young, S. (2007) Agenda-based User Simulation for Boot-strapping a POMDP Dialogue System, in: Proc. of HLT/NAACL, Rochester, NY, USA.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Klaus-Peter Engelbrecht .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media, LLC

About this paper

Cite this paper

Engelbrecht, KP., Schmidt, S., Möller, S. (2011). Performance of an Ad-hoc User Simulation in a Formative Evaluation of a Spoken Dialog System. In: Delgado, RC., Kobayashi, T. (eds) Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems Workshop. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1335-6_27

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-1335-6_27

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-1334-9

  • Online ISBN: 978-1-4614-1335-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics