Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis

Tan, Songbo; Cheng, Xueqi; Wang, Yuefen; Xu, Hongbo

doi:10.1007/978-3-642-00958-7_31

Songbo Tan¹⁹,
Xueqi Cheng¹⁹,
Yuefen Wang²⁰ &
…
Hongbo Xu¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5478))

Included in the following conference series:

European Conference on Information Retrieval

3824 Accesses
71 Citations

Abstract

In the community of sentiment analysis, supervised learning techniques have been shown to perform very well. When transferred to another domain, however, a supervised sentiment classifier often performs extremely bad. This is so-called domain-transfer problem. In this work, we attempt to attack this problem by making the maximum use of both the old-domain data and the unlabeled new-domain data. To leverage knowledge from the old-domain data, we proposed an effective measure, i.e., Frequently Co-occurring Entropy (FCE), to pick out generalizable features that occur frequently in both domains and have similar occurring probability. To gain knowledge from the new-domain data, we proposed Adapted Naïve Bayes (ANB), a weighted transfer version of Naive Bayes Classifier. The experimental results indicate that proposed approach could improve the performance of base classifier dramatically, and even provide much better performance than the transfer-learning baseline, i.e. the Naïve Bayes Transfer Classifier (NTBC).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Whitelaw, C., Garg, N., Argamon, S.: Using appraisal groups for sentiment analysis. In: CIKM (2005)
Google Scholar
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up? Sentiment classification using machine learning techniques. In: EMNLP 2002 (2002)
Google Scholar
Aue, A., Gamon, M.: Customizing Sentiment Classifiers to New Domains: a Case Study. In: RANLP 2005 (2005)
Google Scholar
Tan, S., Wu, G., Tang, H., Cheng, X.: A novel scheme for domain-transfer problem in the context of sentiment analysis. In: CIKM 2007 (2007)
Google Scholar
Nigam, K., McCallum, A., Thrun, S., Mitchell, T.: Learning to classify text from labeled and unlabeled documents. In: AAAI 1998 (1998)
Google Scholar
Joachims, T.: Transductive inference for text classification using support vector machines. In: ICML (1999)
Google Scholar
Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Article MathSciNet Google Scholar
Zhang, H.: Chinese Lexical Analysis Using Hierarchical Hidden Markov Model. In: The Second SIGHAN workshop affiliated with 41st ACL (2003)
Google Scholar
DaumeIII, H., Marcu, D.: Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research 26, 101–126 (2006)
MathSciNet MATH Google Scholar
Jiang, J., Zhai, C.: A Two-Stage Approach to Domain Adaptation for Statistical Classifiers. In: CIKM 2007 (2007)
Google Scholar
Dai, W., Xue, G., Yang, Q., Yu, Y.: Transferring Naive Bayes Classifiers for Text Classification. In: AAAI 2007 (2007)
Google Scholar
McCallum, A., Nigam, K.: A Comparison of Event Models for Naive Bayes Text Classification. In: AAAI/ICML Workshop on Learning for Text Categorization (1998)
Google Scholar
Wilson, T., Wiebe, J., Hwa, R.: Recognizing Strong and Weak Opinion Clauses. Computational Intelligence 22(2), 73–99 (2006)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Key Laboratory of Network, Institute of Computing Technology, China
Songbo Tan, Xueqi Cheng & Hongbo Xu
Information Center, Chinese Academy of Geological Sciences, China
Yuefen Wang

Authors

Songbo Tan
View author publications
You can also search for this author in PubMed Google Scholar
Xueqi Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Yuefen Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hongbo Xu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Université de Toulouse - IRIT,, 118 Route de Narbonne,, 31062,, Toulouse Cedex 4,, France
Mohand Boughanem
Laboratoire d’Informatique de Grenoble, BP 53,, Université Joseph Fourier,, 38041, Grenoble Cedex 9,, France
Catherine Berrut
Université de Toulouse - IRIT,, 118 Route de Narbonne,, 31062, Toulouse Cedex 4,, France
Josiane Mothe & Chantal Soule-Dupuy &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tan, S., Cheng, X., Wang, Y., Xu, H. (2009). Adapting Naive Bayes to Domain Adaptation for Sentiment Analysis. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds) Advances in Information Retrieval. ECIR 2009. Lecture Notes in Computer Science, vol 5478. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00958-7_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-00958-7_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00957-0
Online ISBN: 978-3-642-00958-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics