Abstract
Our objective is spoken-language classification for helpdesk call routing using a scanning understanding and intelligent-system techniques. In particular, we examine simple recurrent networks, support-vector machines and finite-state transducers for their potential in this spoken-language-classification task and we describe an approach to classification of recorded operator-assistance telephone utterances. The main contribution of the paper is a comparison of a variety of techniques in the domain of call routing. Support-vector machines and transducers are shown to have some potential for spoken-language classification, but the performance of the neural networks indicates that a simple recurrent network performs best for helpdesk call routing.
Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Allen J, Ferguson G, Ringger EK et al (2001) Dialogue Systems: From Theory to Practice in TRAINS-96. In: Dale R, Moisl H, Somers H (eds) Handbook of natural language processing. Marcel Dekker, New York, pp 347–376
Allen J, Ferguson G, Stent A (2001) An architecture for more realistic conversational systems. In: Proceedings of intelligent user interfaces (IUI-01), Santa Fe, NM
Arai K, Wright JH, Riccardi G et al (1999) Grammar fragment acquisition using syntactic and semantic clustering. Speech Commun 27:43–62
Attwater D, Edgington M, Durston P et al (2000) Practical issues in the application of speech technology to network and customer service applications. Speech Commun 31:279–291
Brill E, Florian R, Henderson JC et al (1998) Beyond N-Grams: Can linguistic sophistication improve language modeling? In: Boitet C, Whitelock P (eds) Proceedings of the thirty-sixth annual meeting of the association for computational linguistics and seventeenth international conference on computational linguistics, Morgan Kaufmann Publishers, San Francisco, CA, pp 186–190
Burges CJ (1998) A tutorial on support vector machines for pattern recognition. Data Mining Knowl Discov 2:121–167
Burton-Roberts N (1986) Analysing sentences. An introduction to English syntax. Longman Group UK Ltd, England
Carpenter B, Chu-Carroll J (1998) Natural language call routing: a robust self-organizing approach. ICSLP 98, Sydney, pp 2059–2062
Chapelle O, Vapnik V (2000) Model selection for support vector machines. In: Solla S, Leen T, Muller K-R (eds) Advances in neural information processing systems, vol 12. MIT Press, Cambridge, MA
Charniak E (1993) Statistical language learning. MIT Press, Cambridge, MA
Chou W, Zhou Q, Kuo H-K J et al (2000) Natural language call steering for service applications. In: Proceedings of the international conference on spoken language processing, Beijing, China
Chu-Carroll J, Carpenter B (1998) Dialogue management in vector-based call routing. COLING-ACL98, pp 256–262
Chu-Carroll J, Carpenter B (1999) Vector-based natural language call routing. J Comput Ling 25(3):361–388
Durston PJ, Farrell M, Attwater D et al (2001) OASIS natural language call steering trial. In: Proceedings of Eurospeech, vol 2, pp 1323–1326
Edgington M, Attwater D, Durston P (1999) OASIS—a framework for spoken language call steering. In: Proceedings of Eurospeech '99, Budapest Hungary, pp 923–926
Elman JL, Bates EA, Johnson MH et al (1996) Rethinking innateness. MIT Press, Cambridge, MA
Elman JL (1991) Distributed representations simple recurrent networks and grammatical structure. Mach Learn 7:195–225
Elman JL (1990) Finding structure in time. Cognitive Sci 14:179–211
Feng J, Williams P (2001) The generalization error of the symmetric and scaled support vector machines. IEEE Trans on Neural Net 12(5):1255–1260
Ferguson G, Allen JF (1998) TRIPS: an integrated intelligent problem-solving assistant. In: Proceedings of the Fifteenth National Conference on Artificial Intelligence (AAAI-98), Maddison, WI, pp 567–573
Forman G (2002) Choose your words carefully: an empirical study of feature selection metrics or text classification. In: Proceedings of the 13th European conference on machine learning ECML '02 and 6th European conference on principles and practice of knowledge discovery in databases PKDD, Helsinki, Finland
Garner PN (1997) On topic identification and dialogue move recognition. Comput Speech Lang 11:275–306
Glass1999 Glass JR (1999) Challenges for spoken dialogue systems. In: Proceedings of IEEE ASRU Workshop, Key
Gorin AL, Riccardi G, Wright JH (1997) How may I help you? Speech Commun 23:113–127
Gorin AL, Wright JH, Riccardi G et al (2000) Semantic information processing of spoken language. In: Proceedings of 2000 International ATR Workshop on Multilingual Speech Communication, Kyoto Japan, October 2000, pp 13–16
Gunn S (1998) Support vector machines for classification and regression. ISIS Technical Report
Harman D (1995) Overview of the fourth text retrieval conference. In: Proceedings of TREC
Joachims T (1999) Making large-scale SVM learning practical. In: Schölkopf B, Burges C, Smola A (eds) Advances in kernel methods: support vector learning. MIT Press, Cambridge, MA
Joachims T (2000) Estimating the generalization performance of an SVM efficiently. In: Proceedings of International Conference on Machine Learning
Joachims T (2002) Learning to classify text using support vector machines. Kluwer Academic Publishers, Boston, MA
Jordan M (1986) Attractor dynamics and parallelism in a connectionist sequential machine. In: Proceedings of the Eighth Annual Conference of the Cognitive Science Society, Amherst, pp 531–546
Jurafsky D, Martin JH (2000) Speech and language processing. Prentice Hall, Upper Saddle River, NJ
Landauer TK, Foltz PW, Laham D (1998) An introduction to latent semantic analysis. Discourse Process 25:259–284
LeCun Y, Bottou L, Orr G et al (1998) Efficient backprop. In: Orr G, Muller K (eds) Neural networks: tricks of the trade, Springer, Berlin Heidelberg New York
McDonough J, Ng K, Jeanrenaud P et al (1994) Approaches to topic identification on the switchboard corpus. In: Proceedings of IEEE international conference on acoustics speech and signal processing, Adelaide, Australia, pp 385–388
McTear MF (2000) Intelligent interface technology: from theory to reality? Interact Comput 12:323–336
McTear MF (2002) Spoken dialogue technology: enabling the conversational user interface. ACM Comput Surv 34(1):90–169
Moghaddam B, Yang M-H (2001) Sex with support vector machines. In: Leen TK, Dietterich TG, Tresp V (eds) Advances in neural information processing systems, vol 13. MIT Press, Cambridge, MA, pp 960–966
Opper M, Urbanczik R (2001) Universal learning curves of support vector machines. Phys Rev Lett 86(19):4410–4413
Platt JC (1998) Sequential minimal optimization: a fast algorithm for training support vector machines. Technical Report MSR-TR-98-14
Pyka C (1992) Management of hypotheses in an integrated speech-language architecture. In: Proceedings of 10th European conference on artificial intelligence
Roche E, Schabes Y (1997) Finite-state language processing. MIT, London
Salton G, McGill M (1983) Introduction to modern information retrieval. McGraw Hill, New York
Schölkopf B (1998) SVMs-a practical consequence of learning theory. IEEE Intell Syst pp: 18–21
Schölkopf B, Burges C, Vapnik V (1995) Extracting support data for a given task. In: Fayyad UM, Uthurusamy R (eds) Proceedings first international conference on knowledge discovery and data mining AAAI, Press, Menlo Park, CA, pp 252–257
Stitson MO, Weston JAE, Gammerman A et al (1996) Theory of support vector machines. Technical Report CSD-TR-96-17
Stolcke A, Shriberg E, Bates R et al (1998) Dialog act modeling for conversational speech. In: Proceedings of AAAI-98 spring symposium on applying machine learning to discourse processing
CJ (1979) Information retrieval, 2nd edn. Butterworths, London
Vapnik VN (1995) The nature of statistical learning theory. Springer Verlag, Berlin Heidelberg New York
Wermter S (1995) Hybrid connectionist natural language processing. Chapman and Hall Thomson International, London, UK
Wermter S, Panchev C, Arevian G (1999) Hybrid neural plausibility networks for news agents. In: Proceedings of the National Conference on Artificial Intelligence, Orlando, FL
Wermter S, Weber V (1997) SCREEN: learning a flat syntactic and semantic spoken language analysis. J Artif Intell Res 6(1):35–85
Wermter S (2000) Neural fuzzy preference integration using neural preference Moore machines. Int J Neural Syst 10(4):287–309
Wermter S (1999) Preference Moore machines for neural fuzzy integration. In: Proceedings of the international joint conference on artificial intelligence, Stockholm, pp 840–845
Wermter S, Panchev C, Houlsby J (1999) Language disorders in the brain: distinguishing aphasia forms with recurrent networks. In: Proceedings of AAAI 99 conference workshop on neuroscience and neural computation, Orlando, FL, pp 93–98
Yang Y-J, Chien L-F, Lee L-S (2002) Speaker intention modeling for large vocabulary Mandarin spoken dialogues. ICSLP '96, vol 2, pp 713–716
Young SR, Hauptmann AG, Ward WH et al (1989) High level knowledge sources in usable speech recognition systems. Commun ACM 32:183–194
Author information
Authors and Affiliations
Additional information
Sheila Garfield received a BSc (Hons) in computing from the University of Sunderland in 2000 where, as part of her programme of study, she completed a project associated with aphasic language processing. She received her PhD from the same university, in 2004, for a programme of work connected with hybrid intelligent systems and spoken-language processing. In her PhD thesis, she collaborated with British Telecom and suggested a novel hybrid system for call routing. Her research interests are natural language processing, hybrid systems, intelligent systems.
Stefan Wermter holds the Chair in Intelligent Systems and is leading the Intelligent Systems Division at the University of Sunderland, UK. His research interests are intelligent systems, neural networks, cognitive neuroscience, hybrid systems, language processing and learning robots. He has a diploma from the University of Dortmund, Germany, an MSc from the University of Massachusetts, USA, and a PhD in habilitation from the University of Hamburg, Germany, all in Computer Science. He was a Research Scientist at Berkeley, CA, before joining the University of Sunderland. Professor Wermter has written edited, or contributed to 8 books and published about 80 articles on this research area.
Rights and permissions
About this article
Cite this article
Garfield, S., Wermter, S. Call classification using recurrent neural networks, support vector machines and finite state automata. Knowl Inf Syst 9, 131–156 (2006). https://doi.org/10.1007/s10115-005-0198-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-005-0198-5