Abstract
This paper proposes a new method for analyzing textual data. The method deals with items of textual data, where each item includes various viewpoints and each viewpoint is regarded as a class. The method inductively acquires classification models for 2-class classification tasks from items labeled by multiple classes. The method infers classes of new items by using these models. Lastly, the method extracts important expressions from new items in each class and extracts characteristic expressions by comparing the frequency of expressions. This paper applies the method to questionnaire data described by guests at a hotel and verifies its effect through numerical experiments.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cardoso-Cachopo, A., Oliveira, A.L.: An Empirical Comparison of Text Categorization Methods. In: Proceedings of the 10th International Symposium on String Processing and Information Retrieval, pp. 183–196 (2003)
Feldman, R., Hirsh, H.: Mining Text using Keyword Distributions. Journal of Intelligent Information Systems 10, 281–300 (1998)
Hearst, M.A.: Untangling Text Data Mining. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (1999)
Hsu, C.-W., Chang, C.-C., Lin, C.-J.: A Practical Guide to Support Vector Classification, http://www.csie.ntu.edu.tw/~cjlin/libsvm/
Ichimura, Y., Nakayama, Y., Miyoshi, M., Akahane, T., Sekiguchi, T., Fujiwara, Y.: Text Mining System for Analysis of a Salesperson’s Daily Reports. In: Proceedings of Pacific Association for Computational Linguistics 2001, pp. 127–135 (2001)
Ittycheriah, A., Franz, M., Zhu, W.-J., Ratnaparkhi, A.: IBM’s Statistical Question Answering System. In: Proceedings of the 8th Text Retrieval Conference (2000)
Manevitz, L.M., Yousef, M.: One-Class SVMs for Document Classification. Journal of Machine Learning Research 2, 139–154 (2001)
Sakurai, S., Ichimura, Y., Suyama, A., Orihara, R.: Acquisition of a Knowledge Dictionary for a Text Mining System using an Inductive Learning Method. In: IJCAI 2001 Workshop on Text Learning: Beyond Supervision, pp. 45–52 (2001)
Tan, P.-N., Blau, H., Harp, S., Goldman, R.: Textual Data Mining of Service Center Call Records. In: Proceedings of the 6th International Conference on Knowledge Discovery and Data Mining, pp. 417–423 (2000)
Tellex, S., Katz, B., Lin, J., Fernandes, A.: Quantitative Evaluation of Passage Retrieval Algorithms for Question Answering. In: Proceedings of the 26th Annual Conference ACM SIGIR Conference on Research and Development in Information Retrieval (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sakurai, S., Goh, C., Orihara, R. (2005). Analysis of Textual Data with Multiple Classes. In: Hacid, MS., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds) Foundations of Intelligent Systems. ISMIS 2005. Lecture Notes in Computer Science(), vol 3488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11425274_12
Download citation
DOI: https://doi.org/10.1007/11425274_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25878-0
Online ISBN: 978-3-540-31949-8
eBook Packages: Computer ScienceComputer Science (R0)