Abstract
Opinions always carry important information of texts, but comparative sentence is a common way to express opinions. We describe how to recognize comparative sentences from Chinese text documents by combining rule-based methods and statistical methods as well as analyze the performance of these methods. The method firstly normalizes the corpus and Chinese word segmentation, and then gets the broad extraction results by using comparative words, sentence structure templates and dependency relation analysis. Finally we take CSR, comparative words and statistical feature words as classification features of SVM to accurately identify comparative sentences in the broad extraction results. The experiments with COAE 2013’s test data show that our approach provides better performance than the baselines and most systems reported at CCIR 2013.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kawahara, D., Inui, K., Kurohashi, S.: Identifying contradictory and contrastive relations between statements to outline web information on a given topic. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 534–542. Association for Computational Linguistics (2010)
Jindal, N., Liu, B.: Identifying comparative sentences in text documents. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 244–251. ACM (2006)
Jindal, N., Liu, B.: Mining comparative sentences and relations. In: Proceedings of the National Conference on Artificial Intelligence. AAAI Press, MIT Press, Menlo Park, Cambridge (1999); 21(2), 1331 (2006)
Sun, J.T., Wang, X., Shen, D., et al.: CWS: A comparative web search system. In: Proceedings of the 15th International Conference on World Wide Web, pp. 467–476. ACM (2006)
Luo, G., Tang, C., Tian, Y.: Answering relationship queries on the web. In: Proceedings of the 16th International Conference on World Wide Web, pp. 561–570. ACM (2007)
Feldman, R., Fresko, M., Goldenberg, J., et al.: Extracting product comparisons from discussion boards. In: Seventh IEEE International Conference on Data Mining, ICDM 2007, pp. 469–474. IEEE (2007)
Li, S., Lin, C.Y., Song, Y.I., et al.: Comparable entity mining from comparative questions. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 650–658 (2010)
Huang, X.J., Wan, X.J., Yang, J.W., et al.: Learning to Identify Chinese Comparative Sentences. Journal of Chinese Information Processing 22(5), 30–38 (2008)
Huang, G.H., Yao, T.F., Liu, Q.S.: Mining Chinese Comparative Sentences and Relations Based on CRF algorithm. Application Research of Computers 27(6) (2010)
Zhang, R., Jin, Y.: Identification and Transformation of Comparative Sentences in Patent Chinese-English Machine Translation. In: 2012 International Conference on Asian Language Processing (IALP), pp. 217–220. IEEE (2012)
Che, J.: A brief analysis of comparative sentences in modern Chinese. Journal of Hubei Normal University (Philosophy and Social Science) 3 (2005)
Li, J.: The study of comparative sentence and comparison identification. Chongqing University (2011)
Song, R., Lin, H.-F., Chang, F.-Y.: Chinese Comparative Sentences Identification and Comparative Relations Extraction. Journal of Chinese Information Processing 23(2), 102–107 (2009)
Hu, B.-S., Wang, D.-L., Yu, G., et al.: An Answer Extraction Algorithm Based on Syntax Structure Feature Parsing and Classification. Chinese Journal of Computers 31(4), 662–676 (2008)
Liu, W., Yan, H.-L., Xiao, J.-G., et al.: Solution for Automatic Web Review Extraction. Journal of Software 21(12), 3220–3236 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, Q., Huang, H., Zhang, C., Chen, Z., Chen, J. (2013). Chinese Comparative Sentence Identification Based on the Combination of Rules and Statistics. In: Motoda, H., Wu, Z., Cao, L., Zaiane, O., Yao, M., Wang, W. (eds) Advanced Data Mining and Applications. ADMA 2013. Lecture Notes in Computer Science(), vol 8347. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-53917-6_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-53917-6_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-53916-9
Online ISBN: 978-3-642-53917-6
eBook Packages: Computer ScienceComputer Science (R0)