A robust system for human-machine dialogue in telephony-based applications

Albesano, D.; Baggia, P.; Danieli, M.; Gemello, R.; Gerbino, E.; Rullent, C.

doi:10.1007/BF02208822

Published: December 1997

Volume 2, pages 101–111 (1997)

International Journal of Speech Technology

Abstract

This paper presents a real-time system for human-machine spoken dialogue on the telephone in task-oriented domains. The system has been tested in a large trial with inexperienced users and it has proved robust enough to allow spontaneous interactions even for people with poor recognition performance. The robust behaviour of the system has been achieved by combining the use of specific language models during the recognition phase of analysis, the tolerance toward spontaneous speech phenomena, the activity of a robust parser, and the use of pragmatic-based dialogue knowledge. This integration of the different modules allows the system to deal with partial or total breakdowns at other levels of analysis. We report the field trial data of the system with respect to speech recognition metrics of word accuracy and sentence understanding rate, time-to-completion, time-to-acquisition of crucial parameters, and degree of success of the interactions in providing the speakers with the information they required. The evaluation data show that most of the subjects were able to interact fruitfully with the system. These results suggest that the design choices made to achieve robust behaviour are a promising way to create usable spoken language telephone systems.
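
The abstract names word accuracy as one of its recognition metrics without defining it. A common definition is one minus the word-level edit distance (substitutions, deletions, and insertions) divided by the number of reference words. The sketch below assumes that standard definition; the paper may use a variant. The function name and the example utterances are hypothetical and are not taken from the field trial.

```python
def word_accuracy(reference: str, hypothesis: str) -> float:
    """Word accuracy = 1 - (S + D + I) / N under the standard definition.

    Assumption: this is the usual edit-distance-based metric, not
    necessarily the exact scoring procedure used in the paper.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr

    errors = prev[-1]
    return 1.0 - errors / len(ref)


# Hypothetical utterance pair: one deleted word and one substituted word.
score = word_accuracy("i want to go to rome", "i want go to roma")
print(round(score, 3))
```

With insertions counted, this value can fall below zero for very noisy hypotheses, which is why it is usually reported alongside a sentence-level measure such as the sentence understanding rate mentioned above.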

Author information

Authors and Affiliations

CSELT-Centro Studi E Laboratori Telecomunicazioni, via G. Reiss Romoli, 274, I-10148, Torino, Italy
D. Albesano, P. Baggia, M. Danieli, R. Gemello, E. Gerbino & C. Rullent

About this article

Cite this article

Albesano, D., Baggia, P., Danieli, M. et al. A robust system for human-machine dialogue in telephony-based applications. Int J Speech Technol 2, 101–111 (1997). https://doi.org/10.1007/BF02208822

Received: 07 August 1996
Accepted: 27 May 1997
Issue Date: December 1997
DOI: https://doi.org/10.1007/BF02208822
