Abstract
In this paper, we present a novel approach of constructing an opinion-element collocation dataset for Chinese language. The opinion-element collocation is a collocation whose composition words contain opinion/sentiment element. The dataset is useful for opinion mining task in many aspects. A search engine is used as a fundamental tool mainly because it could help us to seek both domain-specific and domain-independent collocation pairs, and at the same time, an ontology is used as a resource because it can offer rich semantic information to help us to classify collocations into domain-specific or domain-independent type. The tool and resource are combined to build a smart system that can automatically crawl data from the Internet and analyze extracted collocations. In order to ensure the quality of extracted collocations, we evaluate it manually. The experimental results on the COAE2008’s public corpus have proved the success of this approach on the four domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kim, S.M., Hovy, E.: Determining the Sentiment of Opinions. In: 20th International Conference on Computational Linguistics (COLING 2004), pp. 1367–1373. ACL, Stroudsburg (2004)
Matsumoto, S., Takamura, H., Okumura, M.: Sentiment Classification Using Word Sub-sequences and Dependency Sub-trees. In: Ho, T.-B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 301–311. Springer, Heidelberg (2005)
Popescu, A.M., Etzioni, O.: Extracting Product Features and Opinions from Reviews. In: Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT/EMNLP 2005), pp. 339–346. ACL, Stroudsburg (2005)
Kim, S.M., Hovy, E.: Identifying and Analyzing Judgment Opinions. In: Main Conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL 2006), pp. 200–207. ACL, Stroudsburg (2006)
Liu, B., Hu, M.Q., Cheng, J.S.: Opinion Observer: Analyzing and Comparing Opinions on the Web. In: 14th International Conference on World Wide Web (WWW 2005), pp. 342–351. ACM, New York (2005)
Pang, B., Lee, L.L.: Opinion Mining and Sentiment Analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)
Kobayashi, N., Inui, K., Matsumoto, Y.: Extracting Aspect-Evaluation and Aspect-of Relations in Opinion Mining. In: 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL 2007), pp. 1065–1074. ACL, Stroudsburg (2007)
Lou, D.C., Yao, T.F.: Semantic Polarity Analysis and Opinion Mining on Chinese Review Sentences. Journal of Computer Application 26, 2622–2625 (2006) (in Chinese)
Zhang, J.F., Zhang, Q., Wu, L.D., Huang, X.J.: Subjective Relation Extraction in Chinese Opinion Mining. Journal of Chinese Information Processing 22, 55–59 (2008) (in Chinese)
Chen, Q.Z., Liu, Q.S., Yao, T.F.: Topic and Sentiment Relation Extraction on Chinese Opinioned Texts. In: 5th China National Conference on Information Retrieval (CCIR 2009), pp. 505–512. CIPSC, Beijing (2009) (in Chinese)
Chen, M.S., Yao, T.F.: Combining Dependency Parsing with Shallow Semantic Analysis for Chinese Opinion-element Relation Extraction. In: 4th International Universal Communication Symposium, pp. 299–305. IEEE Press, New York (2010)
Zengin, B.: Benefit of Google Search Engine in Learning and Teaching Collocations. Journal of Educational Research 34, 151–166 (2009)
Etzioni, O., Banko, M., Soderland, S., Weld, D.S.: Open Information Extraction from the Web. Communication of the ACM 51, 68–74 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yao, T., Chen, M. (2013). Constructing Chinese Opinion-Element Collocation Dataset Using Search Engine and Ontology. In: Ji, D., Xiao, G. (eds) Chinese Lexical Semantics. CLSW 2012. Lecture Notes in Computer Science(), vol 7717. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36337-5_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-36337-5_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36336-8
Online ISBN: 978-3-642-36337-5
eBook Packages: Computer ScienceComputer Science (R0)