Call classification using recurrent neural networks, support vector machines and finite state automata

Garfield, Sheila; Wermter, Stefan

doi:10.1007/s10115-005-0198-5

Call classification using recurrent neural networks, support vector machines and finite state automata

Regular Paper
Published: 18 July 2005

Volume 9, pages 131–156, (2006)
Cite this article

Knowledge and Information Systems Aims and scope Submit manuscript

Sheila Garfield¹ &
Stefan Wermter¹

125 Accesses
3 Altmetric
Explore all metrics

Abstract

Our objective is spoken-language classification for helpdesk call routing using a scanning understanding and intelligent-system techniques. In particular, we examine simple recurrent networks, support-vector machines and finite-state transducers for their potential in this spoken-language-classification task and we describe an approach to classification of recorded operator-assistance telephone utterances. The main contribution of the paper is a comparison of a variety of techniques in the domain of call routing. Support-vector machines and transducers are shown to have some potential for spoken-language classification, but the performance of the neural networks indicates that a simple recurrent network performs best for helpdesk call routing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automated Voice Recognition of Emotions Through the Use of Neural Networks

Language in Artificial Neural Network Training

Building an Intelligent Call Distributor

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Artificial Intelligence

References

Allen J, Ferguson G, Ringger EK et al (2001) Dialogue Systems: From Theory to Practice in TRAINS-96. In: Dale R, Moisl H, Somers H (eds) Handbook of natural language processing. Marcel Dekker, New York, pp 347–376
Allen J, Ferguson G, Stent A (2001) An architecture for more realistic conversational systems. In: Proceedings of intelligent user interfaces (IUI-01), Santa Fe, NM
Arai K, Wright JH, Riccardi G et al (1999) Grammar fragment acquisition using syntactic and semantic clustering. Speech Commun 27:43–62
Google Scholar
Attwater D, Edgington M, Durston P et al (2000) Practical issues in the application of speech technology to network and customer service applications. Speech Commun 31:279–291
Google Scholar
Brill E, Florian R, Henderson JC et al (1998) Beyond N-Grams: Can linguistic sophistication improve language modeling? In: Boitet C, Whitelock P (eds) Proceedings of the thirty-sixth annual meeting of the association for computational linguistics and seventeenth international conference on computational linguistics, Morgan Kaufmann Publishers, San Francisco, CA, pp 186–190
Burges CJ (1998) A tutorial on support vector machines for pattern recognition. Data Mining Knowl Discov 2:121–167
Google Scholar
Burton-Roberts N (1986) Analysing sentences. An introduction to English syntax. Longman Group UK Ltd, England
Carpenter B, Chu-Carroll J (1998) Natural language call routing: a robust self-organizing approach. ICSLP 98, Sydney, pp 2059–2062
Google Scholar
Chapelle O, Vapnik V (2000) Model selection for support vector machines. In: Solla S, Leen T, Muller K-R (eds) Advances in neural information processing systems, vol 12. MIT Press, Cambridge, MA
Charniak E (1993) Statistical language learning. MIT Press, Cambridge, MA
Chou W, Zhou Q, Kuo H-K J et al (2000) Natural language call steering for service applications. In: Proceedings of the international conference on spoken language processing, Beijing, China
Chu-Carroll J, Carpenter B (1998) Dialogue management in vector-based call routing. COLING-ACL98, pp 256–262
Chu-Carroll J, Carpenter B (1999) Vector-based natural language call routing. J Comput Ling 25(3):361–388
Google Scholar
Durston PJ, Farrell M, Attwater D et al (2001) OASIS natural language call steering trial. In: Proceedings of Eurospeech, vol 2, pp 1323–1326
Edgington M, Attwater D, Durston P (1999) OASIS—a framework for spoken language call steering. In: Proceedings of Eurospeech '99, Budapest Hungary, pp 923–926
Elman JL, Bates EA, Johnson MH et al (1996) Rethinking innateness. MIT Press, Cambridge, MA
Elman JL (1991) Distributed representations simple recurrent networks and grammatical structure. Mach Learn 7:195–225
Google Scholar
Elman JL (1990) Finding structure in time. Cognitive Sci 14:179–211
Article Google Scholar
Feng J, Williams P (2001) The generalization error of the symmetric and scaled support vector machines. IEEE Trans on Neural Net 12(5):1255–1260
Google Scholar
Ferguson G, Allen JF (1998) TRIPS: an integrated intelligent problem-solving assistant. In: Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98), Maddison, WI, pp 567–573
Forman G (2002) Choose your words carefully: an empirical study of feature selection metrics or text classification. In: Proceedings of the 13th European conference on machine learning ECML '02 and 6th European conference on principles and practice of knowledge discovery in databases PKDD, Helsinki, Finland
Garner PN (1997) On topic identification and dialogue move recognition. Comput Speech Lang 11:275–306
Article Google Scholar
Glass1999 Glass JR (1999) Challenges for spoken dialogue systems. In: Proceedings of IEEE ASRU Workshop, Key
Gorin AL, Riccardi G, Wright JH (1997) How may I help you? Speech Commun 23:113–127
Google Scholar
Gorin AL, Wright JH, Riccardi G et al (2000) Semantic information processing of spoken language. In: Proceedings of 2000 International ATR Workshop on Multilingual Speech Communication, Kyoto Japan, October 2000, pp 13–16
Gunn S (1998) Support vector machines for classification and regression. ISIS Technical Report
Harman D (1995) Overview of the fourth text retrieval conference. In: Proceedings of TREC
Joachims T (1999) Making large-scale SVM learning practical. In: Schölkopf B, Burges C, Smola A (eds) Advances in kernel methods: support vector learning. MIT Press, Cambridge, MA
Joachims T (2000) Estimating the generalization performance of an SVM efficiently. In: Proceedings of International Conference on Machine Learning
Joachims T (2002) Learning to classify text using support vector machines. Kluwer Academic Publishers, Boston, MA
Jordan M (1986) Attractor dynamics and parallelism in a connectionist sequential machine. In: Proceedings of the Eighth Annual Conference of the Cognitive Science Society, Amherst, pp 531–546
Jurafsky D, Martin JH (2000) Speech and language processing. Prentice Hall, Upper Saddle River, NJ
Landauer TK, Foltz PW, Laham D (1998) An introduction to latent semantic analysis. Discourse Process 25:259–284
Google Scholar
LeCun Y, Bottou L, Orr G et al (1998) Efficient backprop. In: Orr G, Muller K (eds) Neural networks: tricks of the trade, Springer, Berlin Heidelberg New York
McDonough J, Ng K, Jeanrenaud P et al (1994) Approaches to topic identification on the switchboard corpus. In: Proceedings of IEEE international conference on acoustics speech and signal processing, Adelaide, Australia, pp 385–388
McTear MF (2000) Intelligent interface technology: from theory to reality? Interact Comput 12:323–336
Google Scholar
McTear MF (2002) Spoken dialogue technology: enabling the conversational user interface. ACM Comput Surv 34(1):90–169
Article Google Scholar
Moghaddam B, Yang M-H (2001) Sex with support vector machines. In: Leen TK, Dietterich TG, Tresp V (eds) Advances in neural information processing systems, vol 13. MIT Press, Cambridge, MA, pp 960–966
Opper M, Urbanczik R (2001) Universal learning curves of support vector machines. Phys Rev Lett 86(19):4410–4413
Article Google Scholar
Platt JC (1998) Sequential minimal optimization: a fast algorithm for training support vector machines. Technical Report MSR-TR-98-14
Google Scholar
Pyka C (1992) Management of hypotheses in an integrated speech-language architecture. In: Proceedings of 10th European conference on artificial intelligence
Roche E, Schabes Y (1997) Finite-state language processing. MIT, London
Salton G, McGill M (1983) Introduction to modern information retrieval. McGraw Hill, New York
Schölkopf B (1998) SVMs-a practical consequence of learning theory. IEEE Intell Syst pp: 18–21
Google Scholar
Schölkopf B, Burges C, Vapnik V (1995) Extracting support data for a given task. In: Fayyad UM, Uthurusamy R (eds) Proceedings first international conference on knowledge discovery and data mining AAAI, Press, Menlo Park, CA, pp 252–257
Stitson MO, Weston JAE, Gammerman A et al (1996) Theory of support vector machines. Technical Report CSD-TR-96-17
Stolcke A, Shriberg E, Bates R et al (1998) Dialog act modeling for conversational speech. In: Proceedings of AAAI-98 spring symposium on applying machine learning to discourse processing
CJ (1979) Information retrieval, 2nd edn. Butterworths, London
Vapnik VN (1995) The nature of statistical learning theory. Springer Verlag, Berlin Heidelberg New York
Wermter S (1995) Hybrid connectionist natural language processing. Chapman and Hall Thomson International, London, UK
Wermter S, Panchev C, Arevian G (1999) Hybrid neural plausibility networks for news agents. In: Proceedings of the National Conference on Artificial Intelligence, Orlando, FL
Wermter S, Weber V (1997) SCREEN: learning a flat syntactic and semantic spoken language analysis. J Artif Intell Res 6(1):35–85
Google Scholar
Wermter S (2000) Neural fuzzy preference integration using neural preference Moore machines. Int J Neural Syst 10(4):287–309
MATH Google Scholar
Wermter S (1999) Preference Moore machines for neural fuzzy integration. In: Proceedings of the international joint conference on artificial intelligence, Stockholm, pp 840–845
Wermter S, Panchev C, Houlsby J (1999) Language disorders in the brain: distinguishing aphasia forms with recurrent networks. In: Proceedings of AAAI 99 conference workshop on neuroscience and neural computation, Orlando, FL, pp 93–98
Yang Y-J, Chien L-F, Lee L-S (2002) Speaker intention modeling for large vocabulary Mandarin spoken dialogues. ICSLP '96, vol 2, pp 713–716
Young SR, Hauptmann AG, Ward WH et al (1989) High level knowledge sources in usable speech recognition systems. Commun ACM 32:183–194
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Sunderland, School of Computing and Technology, St. Peter's Way, Sunderland, SR6 0DD, United Kingdom
Sheila Garfield & Stefan Wermter

Authors

Sheila Garfield
View author publications
You can also search for this author inPubMed Google Scholar
Stefan Wermter
View author publications
You can also search for this author inPubMed Google Scholar

Additional information

Sheila Garfield received a BSc (Hons) in computing from the University of Sunderland in 2000 where, as part of her programme of study, she completed a project associated with aphasic language processing. She received her PhD from the same university, in 2004, for a programme of work connected with hybrid intelligent systems and spoken-language processing. In her PhD thesis, she collaborated with British Telecom and suggested a novel hybrid system for call routing. Her research interests are natural language processing, hybrid systems, intelligent systems.

Stefan Wermter holds the Chair in Intelligent Systems and is leading the Intelligent Systems Division at the University of Sunderland, UK. His research interests are intelligent systems, neural networks, cognitive neuroscience, hybrid systems, language processing and learning robots. He has a diploma from the University of Dortmund, Germany, an MSc from the University of Massachusetts, USA, and a PhD in habilitation from the University of Hamburg, Germany, all in Computer Science. He was a Research Scientist at Berkeley, CA, before joining the University of Sunderland. Professor Wermter has written edited, or contributed to 8 books and published about 80 articles on this research area.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Garfield, S., Wermter, S. Call classification using recurrent neural networks, support vector machines and finite state automata. Knowl Inf Syst 9, 131–156 (2006). https://doi.org/10.1007/s10115-005-0198-5

Download citation

Received: 29 January 2004
Revised: 05 January 2005
Accepted: 09 January 2005
Published: 18 July 2005
Issue Date: February 2006
DOI: https://doi.org/10.1007/s10115-005-0198-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Call classification using recurrent neural networks, support vector machines and finite state automata

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Automated Voice Recognition of Emotions Through the Use of Neural Networks

Language in Artificial Neural Network Training

Building an Intelligent Call Distributor

Explore related subjects

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now