Skip to main content

Lifelong Learning for Cross-Domain Vietnamese Sentiment Classification

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9795))

Abstract

This paper proposes an improvement to lifelong learning for cross-domain sentiment classification. Lifelong learning is to retain knowledge from past learning tasks to improve the learning task on a new domain. In this paper, we will discuss how bigram and bag-of-bigram features integrated into a lifelong learning system can help improve the performance of sentiment classification on both Vietnamese and English. Also, pre-processing techniques specifically for our cross-domain, Vietnamese dataset will be discussed. Experimental results show that our method achieves improvements over prior systems and its potential for cross-domain sentiment classification.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://tiki.vn/.

References

  1. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2, 1–135 (2008)

    Article  Google Scholar 

  2. Liu, B.: Sentiment analysis and opinion mining. Synth. Lect. Hum. Lang. Technol. 5, 1–167 (2012)

    Article  Google Scholar 

  3. Chen, Z., Ma, N., Liu, B.: Lifelong learning for sentiment classification. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing: Short Papers, Beijing, China, vol. 2, pp. 750–756. Association for Computational Linguistics (2015)

    Google Scholar 

  4. Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22, 1345–1359 (2010)

    Article  Google Scholar 

  5. Raina, R., Battle, A., Lee, H., Packer, B., Ng, A.Y.: Self-taught learning: transfer learning from unlabeled data. In: Proceedings of the 24th International Conference on Machine Learning, ICML 2007, pp. 759–766. ACM, New York (2007)

    Google Scholar 

  6. Wang, S., Manning, C.: Baselines and bigrams: simple, good sentiment and topic classification. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers, Jeju Island, Korea, vol. 2, pp. 90–94. Association for Computational Linguistics (2012)

    Google Scholar 

  7. Chen, Z., Liu, B.: Mining topics in documents: standing on the shoulders of big data. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1116–1125. ACM (2014)

    Google Scholar 

  8. Chen, Z., Liu, B.: Topic modeling using topics from many domains, lifelong learning and big data. In: Jebara, T., Xing, E.P. (eds.) Proceedings of the 31st International Conference on Machine Learning (ICML 2014), pp. 703–711. JMLR Workshop and Conference Proceedings (2014)

    Google Scholar 

  9. Ruvolo, P., Eaton, E.: Scalable lifelong learning with active task selection. In: AAAI Spring Symposium: Lifelong Machine Learning (2013)

    Google Scholar 

  10. Finkel, J.R., Manning, C.D.: Hierarchical Bayesian domain adaptation. In: Proceedings of the Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2009, Stroudsburg, PA, USA, pp. 602–610. Association for Computational Linguistics (2009)

    Google Scholar 

  11. Chen, M., Weinberger, K.Q., Blitzer, J.: Co-training for domain adaptation. In: Advances in Neural Information Processing Systems, pp. 2456–2464 (2011)

    Google Scholar 

  12. Kumar, A., Saha, A., Daume, H.: Co-regularization based semi-supervised domain adaptation. In: Advances in Neural Information Processing Systems, pp. 478–486 (2010)

    Google Scholar 

  13. Huang, F., Yates, A.: Exploring representation-learning approaches to domain adaptation. In: Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing, pp. 23–30. Association for Computational Linguistics (2010)

    Google Scholar 

  14. Yang, H., Callan, J., Si, L.: Knowledge transfer and opinion detection in the TREC 2006 blog track. In: TREC (2006)

    Google Scholar 

  15. Blitzer, J., Dredze, M., Pereira, F., et al.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: ACL, vol. 7, pp. 440–447 (2007)

    Google Scholar 

  16. Pan, S.J., Ni, X., Sun, J.T., Yang, Q., Chen, Z.: Cross-domain sentiment classification via spectral feature alignment. In: Proceedings of the 19th International Conference on World Wide Web, pp. 751–760. ACM (2010)

    Google Scholar 

  17. Li, S., Zong, C.: Multi-domain sentiment classification. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, pp. 257–260. Association for Computational Linguistics (2008)

    Google Scholar 

  18. Duyen, N.T., Bach, N.X., Phuong, T.M.: An empirical study on sentiment analysis for Vietnamese. In: Advanced Technologies for Communications (ATC), pp. 309–314 (2014)

    Google Scholar 

  19. Bach, N.X., Phuong, T.M.: Leveraging user ratings for resource-poor sentiment classification. Procedia Comput. Sci. 60, 322–331 (2015). Proceedings of the 19th Annual Conference on Knowledge-Based and Intelligent Information Engineering Systems, KES 2015, Singapore, September 2015

    Article  Google Scholar 

  20. Zhang, R., Tran, T.: An information gain-based approach for recommending useful product reviews. Knowl. Inf. Syst. 26, 419–434 (2011)

    Article  Google Scholar 

  21. Kieu, B.T., Pham, S.B.: Sentiment analysis for Vietnamese. In: 2010 Second International Conference on Knowledge and Systems Engineering (KSE), pp. 152–157. IEEE (2010)

    Google Scholar 

  22. Dien, D., Thuy, V.: A maximum entropy approach for Vietnamese word segmentation. In: 2006 International Conference on Research, Innovation and Vision for the Future, pp. 248–253 (2006)

    Google Scholar 

  23. Blitzer, J., Dredze, M., Pereira, F.: Biographies, bollywood, boom-boxes and blenders: domain adaptation for sentiment classification. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, Prague, Czech Republic, pp. 440–447. Association for Computational Linguistics (2007)

    Google Scholar 

  24. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  25. Vu, X.-S., Song, H.-J., Park, S.-B.: Building a Vietnamese SentiWordNet using Vietnamese electronic dictionary and string kernel. In: Kim, Y.S., Kang, B.H., Richards, D. (eds.) PKAW 2014. LNCS, vol. 8863, pp. 223–235. Springer, Heidelberg (2014)

    Google Scholar 

Download references

Acknowledgments

This research is supported by research funding from Honors Program, University of Science, Vietnam National University - Ho Chi Minh City.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Quang-Vinh Ha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Ha, QV., Nguyen-Hoang, BD., Nghiem, MQ. (2016). Lifelong Learning for Cross-Domain Vietnamese Sentiment Classification. In: Nguyen, H., Snasel, V. (eds) Computational Social Networks. CSoNet 2016. Lecture Notes in Computer Science(), vol 9795. Springer, Cham. https://doi.org/10.1007/978-3-319-42345-6_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-42345-6_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-42344-9

  • Online ISBN: 978-3-319-42345-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics