The Constitution of a Fine-Grained Opinion Annotated Corpus on Weibo

  • Conference paper
Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD 2016, CCL 2016)

Abstract

Sentiment analysis on social media, represented by Weibo, is one of the most active research problems in NLP, and a comprehensive, systematic, fine-grained annotated corpus plays a significant role in it. In this paper, considering the characteristics of Weibo, we focus on the construction of a fine-grained, hierarchical opinion-annotated corpus and design a set of labelling specifications. We manually annotate the opinion sentences, some of which contain hidden opinions that can be useful for implicit sentiment analysis. We then perform fine-grained aspect extraction, annotating opinion triples of the form <object, attribute, polarity> for aspect-level sentiment research. Moreover, we establish an evaluation method for the fine-grained aspect extraction task, which has been applied in evaluations for years. The corpus was used in the COAE2015 task, and it will be a useful resource for related research on social media sentiment analysis.
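The triple format and its evaluation can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, since this page does not give the paper's definitions: the class and field names, the polarity labels, and in particular the `coverage` measure (longest common substring length divided by the longer string's length) are hypothetical. The measure is chosen so that a coverage of 1 reduces to a full match.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OpinionTriple:
    """One <object, attribute, polarity> annotation (field names are illustrative)."""
    obj: str
    attribute: str
    polarity: str  # label set such as "POS"/"NEG" is assumed, not from the paper


def coverage(gold: str, pred: str) -> float:
    """Assumed measure: longest common substring length / longer string length."""
    if not gold or not pred:
        return 1.0 if gold == pred else 0.0
    best = 0
    prev = [0] * (len(pred) + 1)  # common-suffix lengths for the previous gold char
    for g in gold:
        cur = [0] * (len(pred) + 1)
        for j, p in enumerate(pred, start=1):
            if g == p:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best / max(len(gold), len(pred))


def matches(gold: OpinionTriple, pred: OpinionTriple, threshold: float = 1.0) -> bool:
    """A predicted triple matches if polarity agrees and both spans reach the threshold."""
    return (gold.polarity == pred.polarity
            and coverage(gold.obj, pred.obj) >= threshold
            and coverage(gold.attribute, pred.attribute) >= threshold)


def precision_recall_f1(gold, pred, threshold=1.0):
    """Greedy one-to-one matching of predicted triples against gold triples."""
    remaining = list(gold)
    hits = 0
    for p in pred:
        for g in remaining:
            if matches(g, p, threshold):
                remaining.remove(g)  # each gold triple can be matched only once
                hits += 1
                break
    precision = hits / len(pred) if pred else 0.0
    recall = hits / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


gold = [OpinionTriple("BMW", "engine noise", "NEG")]
pred = [OpinionTriple("BMW", "noise", "NEG")]
strict = precision_recall_f1(gold, pred, threshold=1.0)   # full match required
lenient = precision_recall_f1(gold, pred, threshold=0.4)  # partial overlap accepted
```

With the threshold at 1.0 the partially overlapping attribute is rejected, while a lower threshold accepts it; the threshold value itself is a free parameter in this sketch.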

Notes

  1. http://www.autohome.com.cn

  2. The coverage match is equivalent to a full match when the coverage is 1.

Acknowledgement

The authors would like to thank all the students who participated in labelling the corpus for their hard work, including Zhao Celi, Zhang Jin, Xu Chaoyi, Guo Xiaomin, Zhang Jun, Li Min, Qiao Pei, Mu Wanqing, Wang Jia, Wang Jie and Lv Ying. We also thank all the anonymous reviewers for their valuable comments and suggestions, which have significantly improved the quality and presentation of this paper. This work was supported by the National High-Tech Research and Development Program (863 Program) (2015AA011808); the National Natural Science Foundation of China (61432011, 61573231, 61175067, 61272095, U1435212); the Shanxi Province Returned Overseas Research Project (2013-014); and the Shanxi Province Science and Technology Basic Condition Platform Construction (2015091001-0102).

Author information

Corresponding author

Correspondence to Wang Suge.

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Jian, L., Yang, L., Suge, W. (2016). The Constitution of a Fine-Grained Opinion Annotated Corpus on Weibo. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD 2016, CCL 2016. Lecture Notes in Computer Science, vol 10035. Springer, Cham. https://doi.org/10.1007/978-3-319-47674-2_20

  • DOI: https://doi.org/10.1007/978-3-319-47674-2_20

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-47673-5

  • Online ISBN: 978-3-319-47674-2

  • eBook Packages: Computer Science, Computer Science (R0)
