Collectives of Term Weighting Methods for Natural Language Call Routing

Sergienko, Roman; Gasanova, Tatiana; Semenkin, Eugene; Minker, Wolfgang

doi:10.1007/978-3-319-26453-0_6

Roman Sergienko⁵,
Tatiana Gasanova⁵,
Eugene Semenkin⁶ &
…
Wolfgang Minker⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 370))

871 Accesses
3 Citations

Abstract

The paper presents the investigation of collectives of term weighting methods for natural language call routing. The database consists of user utterances recorded in English language from caller interactions with commercial automated agents. Utterances from this database are labelled by experts and divided into 20 classes. Seven different unsupervised and supervised term weighting methods were tested and compared with each other for classification with k-NN. Also a novel feature extraction method based on terms belonging to classes was applied. After that different combinations of term weighting methods were formed as collectives and used for meta-classification with rule induction. The numerical experiments have shown that the combination of two best term weighting methods (Term Relevance Ratio and Confident Weights) increases classification effectiveness in comparison with the best individual term weighting method significantly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bengio, Y., Schwenk, H., Senecal, J.-S., Morin, F., and Gauvain, J.-L.: Neural probabilistic language models. In: Innovations in Machine Learning, 137–186 (2006)
Google Scholar
Cohen, W.: Fast effective rule induction. In: Proceedings of the Twelfth International Conference on Machine Learning, Lake Tahoe, California (1995)
Google Scholar
Collobert, R., Weston, J.: A unified architecture for natural language processing: Deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning, 160–167 (2008)
Google Scholar
Debole, F., Sebastiani, F.: Supervised term weighting for automated text categorization. Text mining and its applications, Springer, Berlin Heidelberg, 81–97 (2004)
Google Scholar
Gasanova, T., Sergienko, R., Minker, W., Semenkin, E., Zhukov, E.: A semi-supervised approach for natural language call routing. In: Proceedings of the SIGDIAL 2013 Conference, 344–348 (2013)
Google Scholar
Gasanova, T., Sergienko, R., Akhmedova, S., Semenkin, E., Minker, W.: Opinion mining and topic categorization with novel term weighting. In: Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, Association for Computational Linguistics, Baltimore, Maryland, USA, 84–89 (2014)
Google Scholar
Gasanova, T., Sergienko, R., Semenkin, E., Minker, W.: Dimension Reduction with Coevolutionary Genetic Algorithm for Text Classification. In: Proceedings of the 11th International Conference on Informatics in Control, Automation and Robotics (ICINCO), Vienna University of Technology, Austria, Vol. 1, 215–222 (2014)
Google Scholar
Huang, F., Yates, A.: Distributional representations for handling sparsity in supervised sequencelabeling. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: ACL, Vol. 1, 495–503 (2009)
Google Scholar
Ishibuchi, H., Nakashima, T., Murata, T.: Performance evaluation of fuzzy classifier systems for multidimensional pattern classification problems. IEEE Trans. Syst. Man Cybern. Part B: Cybern. 29(5), 601–618 (1999)
Article Google Scholar
Joachims, T.: Learning to classify text using support vector machines: methods, theory and algorithms. Kluwer Academic Publishers, Berlin (2002)
Google Scholar
Ko, Y.: A study of term weighting schemes using class information for text classification. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1029–1030 (2012)
Google Scholar
Koo, T., Carreras, X., Collins, M.: Simple semisupervised dependency parsing. ACL, 595–603 (2008)
Google Scholar
Lan, M., Tan, C.L., Su, J., Lu, Y.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)
Article Google Scholar
Miller, S., Guinness, J., Zamanian, A.: Name tagging with word clusters and discriminative training. HLT-NAACL 4, 337–342 (2004)
Google Scholar
Mnih, A. Hinton, G.: Three new graphical models for statistical language modelling. In: Proceedings of the 24th International Conference on Machine Learning, 641–648 (2007)
Google Scholar
Ratinov, L., Roth, D.: Design challenges and misconceptions in named entity recognition. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning, ACL, 147–155 (2009)
Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inform. Process. manage. 24(5), 513–523 (1988)
Article Google Scholar
Sebastiani, F.: Machine learning in automated text categorization. ACM Comput. Surv. 34(1), 1–47 (2002)
Article Google Scholar
Shafait, F., Reif, M., Kofler, C., Breuel, T.M.: Pattern recognition engineering. RapidMiner Community Meeting and Conference, 9 (2010)
Google Scholar
Schwenk, H. Gauvain, J.-L.: Connectionist language modeling for large vocabulary continuous speech recognition. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vol. 1 (2002)
Google Scholar
Soucy, P., Mineau, G.W.: Beyond TFIDF weighting for text categorization in the Vector space model. In: Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI 2005), 1130–1135 (2005)
Google Scholar
Wilson, E.B.: Probable inference, the law of succession, and statistical inference. J. Am. Stat. Assoc. 22(158), 209–212 (1927)
Article Google Scholar
Xu, H., Li, C.: A Novel term weighting scheme for automated text Categorization. Intelligent Systems Design and Applications (2007)
Google Scholar
Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. ICML 9, 412–420 (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Ulm University, Albert-Einstein-Allee 43, 89081, Ulm, Germany
Roman Sergienko, Tatiana Gasanova & Wolfgang Minker
Siberian State Aerospace University, Krasnoyarskiy Rabochiy Avenue 31, 660014, Krasnoyarsk, Russian Federation
Eugene Semenkin

Authors

Roman Sergienko
View author publications
You can also search for this author in PubMed Google Scholar
Tatiana Gasanova
View author publications
You can also search for this author in PubMed Google Scholar
Eugene Semenkin
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Minker
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Roman Sergienko .

Editor information

Editors and Affiliations

Polytechnic Institute of Setúbal INSTICC, Setúbal, Portugal
Joaquim Filipe
Ford Research & Adv. Engineering, Dearborn, Michigan, USA
Oleg Gusikhin
Univ Paris-Est Creteil (UPEC), Créteil, France
Kurosh Madani
Carleton University, Mechanical and Aerospace Engineering, Ottawa, Ontario, Canada
Jurek Sasiadek

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sergienko, R., Gasanova, T., Semenkin, E., Minker, W. (2016). Collectives of Term Weighting Methods for Natural Language Call Routing. In: Filipe, J., Gusikhin, O., Madani, K., Sasiadek, J. (eds) Informatics in Control, Automation and Robotics. Lecture Notes in Electrical Engineering, vol 370. Springer, Cham. https://doi.org/10.1007/978-3-319-26453-0_6

Download citation

DOI: https://doi.org/10.1007/978-3-319-26453-0_6
Published: 27 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26451-6
Online ISBN: 978-3-319-26453-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics