Skip to main content

Grouping Product Aspects from Short Texts Using Multiple Classifiers

  • Conference paper
  • First Online:
  • 1506 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9418))

Abstract

In this paper we present and evaluate a classification model to group product aspects from short user comments, found as pros and cons in consumer review websites. Because of the distinct vocabulary used by consumers to describe the same aspects of a product, it is necessary to group pros and cons to support consumers’ decision making. For this purpose we propose a supervised classification model, consisting of an ensemble classifier that combines a main text classifier (e.g. Naive Bayes) and several string-based classifiers. Furthermore we make use of WordNet as a domain independent ontology to detect semantically related words. Experimental results using pros and cons from five heterogeneous product groups show, that the proposed method outperforms existing approaches to group pros and cons from short texts. We also found that the reusable short comments from our sample follow a power law distribution, that is usually present in social tagging systems.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://tartarus.org/martin/PorterStemmer, last access: 03/15/2015.

  2. 2.

    http://www.buzzillions.com/about, last access: 03/16/2015.

  3. 3.

    http://www.cs.waikato.ac.nz/ml/weka/ (last access: 03/16/2015).

  4. 4.

    http://weka.sourceforge.net/doc.dev/weka/classifiers/Classifier.html (last access: 03/16/2015).

References

  1. Carenini, G., Ng, R.T., Zwart, E.: Extracting knowledge from evaluative text. In: Proceedings of the 3rd International Conference on Knowledge Capture. pp. 11–18. ACM (2005)

    Google Scholar 

  2. Chen, X., Li, L., Xu, G., Yang, Z., Kitsuregawa, M.: Recommending related microblogs: a comparison between topic and WordNet based approaches. In: Proceedings of the 26th AAAI Conference on Artificial Intelligence, pp. 2417–2418 (2012)

    Google Scholar 

  3. Dellschaft, K., Staab, S.: An epistemic dynamic model for tagging systems. In: Proceedings of the 19th Conference on Hypertext and Hypermedia, pp. 71–80. ACM (2008)

    Google Scholar 

  4. Guo, H., Zhu, H., Guo, Z., Zhang, X., Su, Z.: Product feature categorization with multilevel latent semantic association. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management, pp. 1087–1096. ACM (2009)

    Google Scholar 

  5. Gupta, M., Li, R., Yin, Z., Han, J.: Survey on social tagging techniques. ACM SIGKDD Explor. Newslett. 12(1), 58–72 (2010)

    Article  Google Scholar 

  6. Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the 10th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 168–177 (2004)

    Google Scholar 

  7. Islam, A., Inkpen, D.: Semantic text similarity using corpus-based word similarity and string similarity. ACM Trans. Knowl. Disc. Data (TKDD) 2(2), 10:1–10:25 (2008)

    Google Scholar 

  8. Kailer, D., Mandl, P., Schill, A.: Supporting customers’ decision making with rated tags. In: Proceedings of the 16th International Conference on Electronic Commerce, pp. 33–40. ACM (2014)

    Google Scholar 

  9. Kittler, J., Hatef, M., Duin, R.P., Matas, J.: On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3), 226–239 (1998)

    Article  Google Scholar 

  10. Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: Proceedings of the 14th International Conference on World Wide Web, pp. 342–351. ACM (2005)

    Google Scholar 

  11. Lu, Y., Zhai, C., Sundaresan, N.: Rated aspect summarization of short comments. In: Proceedings of the 18th International Conference on World Wide Web, pp. 131–140. ACM (2009)

    Google Scholar 

  12. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to information retrieval. Cambridge University Press (2008)

    Google Scholar 

  13. Miller, G.A.: WordNet: a lexical database for english. Commun. ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  14. Popescu, A.M., Etzioni, O.: Extracting product features and opinions from reviews. In: Kao, A., Poteet, S.R. (eds.) Natural Language Processing and Text Mining, pp. 9–28. Springer, London (2007)

    Chapter  Google Scholar 

  15. Xu, L., Krzyzak, A., Suen, C.Y.: Methods of combining multiple classifiers and their applications to handwriting recognition. IEEE Trans. Syst. Man Cybern. 22(3), 418–435 (1992)

    Article  Google Scholar 

  16. Zhai, Z., Liu, B., Xu, H., Jia, P.: Grouping product features using semi-supervised learning with soft-constraints. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 1272–1280. ACL, August 2010

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel Kailer .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Kailer, D., Mandl, P., Schill, A. (2015). Grouping Product Aspects from Short Texts Using Multiple Classifiers. In: Wang, J., et al. Web Information Systems Engineering – WISE 2015. WISE 2015. Lecture Notes in Computer Science(), vol 9418. Springer, Cham. https://doi.org/10.1007/978-3-319-26190-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26190-4_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26189-8

  • Online ISBN: 978-3-319-26190-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics