Lexicon-Based Sentiment Analysis on Topical Chinese Microblog Messages

  • Conference paper
Semantic Web and Web Science

Part of the book series: Springer Proceedings in Complexity (SPCOM)

Abstract

Microblogging is a popular form of social media in which people express their opinions and sentiments on social topics. The Chinese microblogging service Weibo has become a prominent medium in Chinese society. People are eager to know others’ attitudes towards social events, so sentiment analysis of topical microblog messages is important. In this paper we introduce a lexicon-based sentiment analysis method. We construct a Weibo lexicon of representative topical words and out-of-vocabulary (OOV) words, which are usually informal and do not appear in formal dictionaries. In addition, we use a propagation algorithm to automatically assign sentiment polarity scores to the discovered words. These scores reflect the Weibo context more closely, since words may take on new or even opposite polarities relative to their formal meanings. Evaluations on classification tasks show that our method is effective at recognizing the subjectivity and sentiment of Weibo sentences, and that the Weibo lexicon improves classification performance.
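
The abstract does not specify the propagation algorithm, so the following is only a minimal sketch of one common approach, assuming a weighted word graph (for example, built from co-occurrence) and a few seed words with known polarity. The function names `propagate_polarity` and `sentence_polarity`, the `damping` parameter, the edge weights, and the romanized example words are all hypothetical illustrations, not the authors' method or data.

```python
from collections import defaultdict


def propagate_polarity(edges, seeds, iterations=20, damping=0.8):
    """Spread seed polarity scores over a weighted, undirected word graph.

    edges: dict mapping (word_a, word_b) -> positive similarity weight
    seeds: dict mapping word -> fixed polarity in [-1, 1]
    Returns a dict mapping every word to a polarity score.
    """
    # Build a symmetric adjacency map from the edge list.
    neighbors = defaultdict(dict)
    for (a, b), weight in edges.items():
        neighbors[a][b] = weight
        neighbors[b][a] = weight

    # Unknown words start neutral; seed words keep their given polarity.
    scores = {word: 0.0 for word in neighbors}
    scores.update(seeds)

    for _ in range(iterations):
        updated = {}
        for word in scores:
            if word in seeds:
                updated[word] = seeds[word]  # seeds stay clamped
                continue
            total = sum(neighbors[word].values())
            # Weighted average of neighbor scores, shrunk by the damping factor.
            average = sum(w * scores[n] for n, w in neighbors[word].items()) / total
            updated[word] = damping * average
        scores = updated
    return scores


def sentence_polarity(tokens, lexicon):
    """Classify a tokenized sentence by summing the lexicon scores of its words."""
    total = sum(lexicon.get(token, 0.0) for token in tokens)
    if total > 0:
        return "positive"
    if total < 0:
        return "negative"
    return "neutral"


# Hypothetical toy graph: formal seed words linked to informal OOV words.
seed_words = {"good": 1.0, "bad": -1.0}
word_edges = {
    ("good", "geili"): 1.0,  # assumed informal positive word
    ("bad", "keng"): 1.0,    # assumed informal negative word
}
weibo_lexicon = propagate_polarity(word_edges, seed_words)
```

In this sketch, a sentence containing only words outside the lexicon falls back to "neutral", which loosely corresponds to treating it as non-subjective; a real system would need word segmentation and a more careful subjectivity test.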

This work was supported by the Natural Science Foundation (60903107, 61073071), the National High Technology Research and Development (863) Program (2011AA01A207), and the Research Fund for the Doctoral Program of Higher Education of China (20090002120005). The work was carried out at the NUS-Tsinghua EXtreme search centre (NExT).

Acknowledgements

The NExT search center is supported by the Singapore National Research Foundation and Interactive Digital Media R&D Program Office, MDA, under research grant (WBS: R-252-300-001-490).

Author information

Corresponding author

Correspondence to Anqi Cui.

Copyright information

© 2013 Springer Science+Business Media New York

About this paper

Cite this paper

Cui, A., Zhang, H., Liu, Y., Zhang, M., Ma, S. (2013). Lexicon-Based Sentiment Analysis on Topical Chinese Microblog Messages. In: Li, J., Qi, G., Zhao, D., Nejdl, W., Zheng, HT. (eds) Semantic Web and Web Science. Springer Proceedings in Complexity. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6880-6_29

  • DOI: https://doi.org/10.1007/978-1-4614-6880-6_29

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-6879-0

  • Online ISBN: 978-1-4614-6880-6

  • eBook Packages: Computer Science, Computer Science (R0)
