Skip to main content

Clustering Sentiment Phrases in Product Reviews by Constrained Co-clustering

  • Conference paper
  • First Online:
  • 2305 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9362))

Abstract

Clustering sentiment phrases in product reviews is convenient for us to get the most important information about one product directly through thousands of reviews. There are mainly two components in a sentiment phrase, the aspect word and the opinion word. We need to cluster these two parts simultaneously. Although several methods have been proposed to cluster words or phrases, limited work has been done on clustering two-dimensional sentiment phrases. In this paper, we apply a two-sided hidden Markov random field (HMRF) model on this task. We use the approach of constrained co-clustering with some priori knowledge, in a semi-supervised setting. Experimental results on sentiment phrases extracted from about 0.7 million mobile phone reviews show that this method is promising for this task and our method outperforms baselines remarkably.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Yutaka, M., Takeshi, S., Koki, U., and Mitsuru, I.: Graph-based word clustering using a web search engine. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 542–550 (2006)

    Google Scholar 

  2. Lin, D., Wu, X.: Phrase clustering for discriminative learning. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP (ACL-IJCNLP), pp. 1030–1038 (2009)

    Google Scholar 

  3. Zhai, Z., Liu, B., Xu, H., Jia, P.: Clustering product features for opinion mining. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining, pp. 347–354 (2011)

    Google Scholar 

  4. Zhao, L., Huang, M., Chen, H., Cheng, J., Zhu, X.: Clustering aspect-related phrases by leveraging sentiment distribution consistency. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1614–1623 (2014)

    Google Scholar 

  5. Li, H., Abe, N.: Word clustering and disambiguation based on co-occurrence data. In: Proceedings of the 17th International Conference on Computational Linguistics, pp. 749–755 (1998)

    Google Scholar 

  6. Song, Y., Pan, S., Liu, S.: Constrained co-clustering for textual documents. In: Proceedings of the 24th AAAI conference on Artificial Intelligence, pp. 581–586 (2010)

    Google Scholar 

  7. Song, Y., Pan, S., Liu, S., Wei, F., Zhou, M.X., Qian, W.: Constrained text coclustering with supervised and unsupervised constraints. IEEE Trans. Knowl. Data Eng. 25(6), 1227–1239 (2013)

    Article  Google Scholar 

  8. Dhillon, I.S., Mallela, S., Modha, D. S.: Information-Theoretical Coclustering. In: Proceedings of the Ninth ACM SIGKDD Int’l Conf. Knowledge Discovery and Data Mining (KDD 2003), pp. 89–98 (2003)

    Google Scholar 

  9. Wagner, S., Wagner, D.: Comparing clusterings - an overview. Technical report 2006-04, Faculty of Informatics, Universitat Karlsruhe (TH) (2006)

    Google Scholar 

  10. Basu, S., Bilenko, M., Mooney, R. J.: A probabilistic framework for semi-supervised clustering. In: Proceedings of the 10th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 59–68 (2004)

    Google Scholar 

  11. Wang, F., Li, T., Zhang, C.: Semi-supervised clustering via matrix factorization. In: Proceedings of SIAM Int’l Conf. Data. Mining (SDM), pp. 1–12 (2008)

    Google Scholar 

  12. Ding, C., Li, T., Peng, W., Park, H.: Orthogonal nonnegative matrix trifactorizations for clustering. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 126–135 (2006)

    Google Scholar 

  13. Dhillio, I.S.: Co-clustering documents and words using bipartite spectral graph partitioning. In: Proceedings of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2001), pp. 269–274 (2001)

    Google Scholar 

  14. Shi, X., Fan, W., Yu, P.S.: Efficient semi supervised spectral co-clustering with constraints. In: Proceedings of IEEE 10th International Conf. Data Mining (ICMD), pp. 1043–1048 (2010)

    Google Scholar 

  15. Matsuo, Y., Sakaki, T., Uchiyama, k., Ishizuka, M.: Graph based word clustering using web search engine. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 542–550 (2006)

    Google Scholar 

  16. SanJuan, E., Fidelia I.: Phrase clustering without document context. In: Proceedings of the 28th European Conference on Information Retrieval, pp. 494–497 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yujie Cao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Cao, Y., Huang, M., Zhu, X. (2015). Clustering Sentiment Phrases in Product Reviews by Constrained Co-clustering. In: Li, J., Ji, H., Zhao, D., Feng, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2015. Lecture Notes in Computer Science(), vol 9362. Springer, Cham. https://doi.org/10.1007/978-3-319-25207-0_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25207-0_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25206-3

  • Online ISBN: 978-3-319-25207-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics