Abstract
Although there are many text classification techniques depending on vector spaces, it is difficult to detect the meaning relating to the user’s intention (complaint, encouragement, request, invitation, etc.). The intention to be discussed in this study is very useful for understanding focus points in conversation. This paper presents a technique of determining the speaker’s intention for sentences in conversation. The intention association expressions are introduced and the formal rule descriptions with weight using these expressions are defined to build intention classification knowledge. A deterministic multi-attribute pattern-matching algorithm is used to determine the intention class efficiently. From simulation results for 681 E-mail messages of 5,859 sentences, the multi-attribute pattern matching algorithm is about 44.5 times faster than Aho and Corasick method. The precision and recall of intention classification of sentences are 91%, 95%. Precision and recall of the classification of each mail are 88%, 89%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aho, A.V., Corasick, M.J.: Efficient string matching: An aid to bibliographic search. Communications of the ACM 18(6), 333–340 (1975)
Aoe, J.: An Efficient Implementation of Static String Pattern Matching Machines. IEEE Trans. Softw. Engr. SE-15(8), 1010–1016 (1989)
Aoe, J.: An efficient digital search algorithm by using a double-array structure. IEEE Trans. Softw. Engr. SE-15(9), 1066–1077 (1989)
Atlam, E.-S., Fuketa, M., Morita, K., Aoe, J.: Documents similarity measurement using field association terms. An International Journal of Information Processing and Management 39(6), 809–824 (2003)
Atlam, E.-S., Morita, K., Fuketa, M., Aoe, J.: A new method for selecting English field association terms of compound words and its knowledge representation. An International Journal of Information Processing and Management 38(6), 807–821 (2002)
Fuketa, M., Lee, S., Tsuji, T., Okada, M., Aoe, J.: A document classification method by using field association words. An International Journal of Information Sciences 126(1), 57–70 (2000)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Kwon, O., Lee, J.: Text categorization based on k-nearest neighbor approach for Web site classification. An International Journal of Information Processing and Management 39(1), 25–44 (2003)
Kwon, O., Jung, S., Lee, J., Lee, G.: Evaluation of Category Features and Text Structural Information on a Text Categorization Using Memory Based Reasoning. Paper presented at the Proceedings of the 18th international conference on computer processing of oriental languages (ICCPOL 1999), vol. 1, pp. 153–158 (1999)
Mani, I.: Automatic Summarization. John Benjamins Publishing Company, Amsterdam (2001)
Salton, G., McGill, M.J.: Introduction of modern information retrieval. McGraw-Hill, New York (1983)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fuketa, M. et al. (2005). A New Technique of Determining Speaker’s Intention for Sentences in Conversation. In: Khosla, R., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2005. Lecture Notes in Computer Science(), vol 3684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11554028_85
Download citation
DOI: https://doi.org/10.1007/11554028_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28897-8
Online ISBN: 978-3-540-31997-9
eBook Packages: Computer ScienceComputer Science (R0)