An Assessment Framework for DialPort

Lee, Kyusong; Zhao, Tiancheng; Ultes, Stefan; Rojas-Barahona, Lina; Pincus, Eli; Traum, David; Eskenazi, Maxine

doi:10.1007/978-3-319-92108-2_10

Kyusong Lee³⁵,
Tiancheng Zhao³⁵,
Stefan Ultes³⁶,
Lina Rojas-Barahona³⁶,
Eli Pincus³⁷,
David Traum³⁷ &
…
Maxine Eskenazi³⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 510))

948 Accesses

Abstract

Collecting a large amount of real human-computer interaction data in various domains is a cornerstone in the development of better data-driven spoken dialog systems. The DialPort project is creating a portal to collect a constant stream of real user conversational data on a variety of topics. In order to keep real users attracted to DialPort, it is crucial to develop a robust evaluation framework to monitor and maintain high performance. Different from earlier spoken dialog systems, DialPort has a heterogeneous set of spoken dialog systems gathered under one outward-looking agent. In order to access this new structure, we have identified some unique challenges that DialPort will encounter so that it can appeal to real users and have created a novel evaluation scheme that quantitatively assesses their performance in these situations. We look at assessment from the point of view of the system developer as well as that of the end user.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Lee K, Zhao T, Du Y, Cai E, Lu A, Pincus E, Traum D, Ultes S, Barahona LMR, Gasic M et al (2017) Dialport, gone live: an update after a year of development. In: Proceedings of the 18th annual SIGdial meeting on discourse and dialogue, pp 170–173
Google Scholar
Liu CW, Lowe R, Serban IV, Noseworthy M, Charlin L, Pineau J (2016) How not to evaluate your dialogue system: an empirical study of unsupervised evaluation metrics for dialogue response generation. arXiv preprint arXiv:1603.08023
Mrkšić N, Séaghdha DO, Thomson B, Gašić M, Su PH, Vandyke D, Wen TH, Young S (2015) Multi-domain dialog state tracking using recurrent neural networks. arXiv preprint arXiv:1506.07190
Pincus E, Traum D (2016) Towards automatic identification of effective clues for team word-guessing games. In: Proceedings of the language resources and evaluation conference (LREC). European Language Resources Association, Portoro, Slovenia, pp 2741–2747
Google Scholar
Raux A, Eskenazi M (2009) A finite-state turn-taking model for spoken dialog systems. In: Proceedings of human language technologies: the 2009 annual conference of the north american chapter of the association for computational linguistics. Association for Computational Linguistics, pp 629–637
Google Scholar
Vinyals O, Le Q (2015) A neural conversational model. arXiv preprint arXiv:1506.05869
Walker MA, Litman DJ, Kamm CA, Abella A (1997) Paradise: a framework for evaluating spoken dialogue agents. In: Proceedings of the eighth conference on European chapter of the association for computational linguistics. Association for Computational Linguistics, pp 271–280
Google Scholar
Williams JD, Young S (2007) Partially observable markov decision processes for spoken dialog systems. Comput Speech Lang 21(2):393–422
Article Google Scholar
Zhao T, Eskenazi M, Lee K (2016) Dialport: a general framework for aggregating dialog systems. EMNLP 2016:32
Google Scholar
Zhao T, Lee K, Eskenazi M (2016) Dialport: connecting the spoken dialog research community to real user data. In: 2016 IEEE workshop on spoken language technology
Google Scholar
Zhao T, Lee K, Eskenazi M (2016) The dialport portal: grouping diverse types of spoken dialog systems. In: Workshop on Chatbots and conversational agents
Google Scholar

Download references

Author information

Authors and Affiliations

Carnegie Mellon University, Pittsburgh, PA, USA
Kyusong Lee, Tiancheng Zhao & Maxine Eskenazi
Cambridge University, Cambridge, UK
Stefan Ultes & Lina Rojas-Barahona
University of Southern California, Los Angeles, CA, USA
Eli Pincus & David Traum

Authors

Kyusong Lee
View author publications
You can also search for this author in PubMed Google Scholar
Tiancheng Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Ultes
View author publications
You can also search for this author in PubMed Google Scholar
Lina Rojas-Barahona
View author publications
You can also search for this author in PubMed Google Scholar
Eli Pincus
View author publications
You can also search for this author in PubMed Google Scholar
David Traum
View author publications
You can also search for this author in PubMed Google Scholar
Maxine Eskenazi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kyusong Lee .

Editor information

Editors and Affiliations

Language Technologies Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
Maxine Eskenazi
LIMSI-CNRS, Sorbonne University, Paris, France
Laurence Devillers
LIMSI-CNRS, Paris-Saclay University, Orsay, France
Joseph Mariani

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Lee, K. et al. (2019). An Assessment Framework for DialPort. In: Eskenazi, M., Devillers, L., Mariani, J. (eds) Advanced Social Interaction with Agents . Lecture Notes in Electrical Engineering, vol 510. Springer, Cham. https://doi.org/10.1007/978-3-319-92108-2_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-92108-2_10
Published: 02 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92107-5
Online ISBN: 978-3-319-92108-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics