Skip to main content

Sentiment Classification of Reviews on Automobile Websites by Combining Word2Vec and Dependency Parsing

  • Conference paper
  • First Online:
Smart Computing and Communication (SmartCom 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10699))

Included in the following conference series:

Abstract

The online product reviews become one of the most useful and vast information sources for guiding customers’ decisions and helping the companies improve the quality of the products and services. Therefore, It is valuable to automatically identify sentiments from comment texts, which is concerned with the Sentiment Classification. In this paper, we propose a novel machine learning-based method called ADSSR to classify the sentiments of reviews on popular automobile websites in China. We extract the features based on dependency parsing which can reveal the syntactic structure of the sentence, to avoid obtaining the same vectors for sentences that contain the same words but a different grammatical structure. In order to reduce the dimensionality of the feature vectors and keep the contributions of the low-frequency words, we obtain the distributed vectors learned by Word2Vec and group the semantic similar words in a cluster through the K-means to obtain the pairs of each word and its corresponding cluster, and then replace every word with its corresponding cluster label. Experiments show the efficiency of the proposed sentiment classification method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.autohome.com.cn.

  2. 2.

    http://www.bitauto.com.

  3. 3.

    http://www.pcauto.com.cn/.

  4. 4.

    http://www.xcar.com.cn/.

  5. 5.

    https://www.python.org/.

  6. 6.

    https://github.com/HIT-SCIR/ltp.

  7. 7.

    http://www.ltp-cloud.com/intro/.

  8. 8.

    http://radimrehurek.com/gensim/.

  9. 9.

    http://scikit-learn.org/stable/.

References

  1. Zong, C.: Statistics Natural Language Processing. Tsinghua University Press, Beijing (2008)

    Google Scholar 

  2. Tesnière, L.: Eléments de syntaxe structurale. Librairie C. Klincksieck (1959)

    Google Scholar 

  3. Mikolov, T., et al.: Efficient estimation of word representations in vector space. Computer Science (2013)

    Google Scholar 

  4. Bo, P., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Acl-02 Conference on Empirical Methods in Natural Language Processing, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  5. Li, S., et al.: Active learning for imbalanced sentiment classification. In: Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 139–148. Association for Computational Linguistics (2012)

    Google Scholar 

  6. Turney, P.D.: Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In: Meeting on Association for Computational Linguistics, pp. 417–424. Association for Computational Linguistics (2002)

    Google Scholar 

  7. Agarwal, A., et al.: Sentiment analysis of Twitter data. In: The Workshop on Languages in Social Media, pp. 30–38. Association for Computational Linguistics (2011)

    Google Scholar 

  8. Dai, X., Prout, B.: Unlock big data emotions: weighted word embeddings for sentiment classification. In: IEEE International Conference on Big Data, pp. 3833–3838. IEEE (2017)

    Google Scholar 

  9. Zhai, Z., et al.: Exploiting effective features for Chinese sentiment classification. Expert Syst. Appl. 38(8), 9139–9146 (2011)

    Article  Google Scholar 

  10. Abbasi, A., et al.: Selecting attributes for sentiment classification using feature relation networks. IEEE Trans. Knowl. Data Eng. 23(3), 447–462 (2011)

    Article  Google Scholar 

  11. Quan, C., Wei, X., Ren, F.: Combine sentiment lexicon and dependency parsing for sentiment classification. In: IEEE/SICE International Symposium on System Integration, pp. 100–104. IEEE (2013)

    Google Scholar 

  12. Guo, F.L., Zhou, G.: Research on micro-blog sentiment orientation analysis based on improved dependency parsing. In: International Conference on Consumer Electronics, Communications and Networks, pp. 546–550. IEEE (2014)

    Google Scholar 

  13. Li, L.: Identification of informative reviews enhanced by dependency parsing and sentiment analysis. In: IEEE International Conference on Computer Communication and the Internet. IEEE (2016)

    Google Scholar 

  14. Xue, B., Fu, C., Zhan, S.: A study on sentiment computing and classification of Sina Weibo with Word2vec. In: IEEE International Congress on Big Data, pp. 358–363. IEEE (2014)

    Google Scholar 

  15. Su, Z., et al.: Chinese sentiment classification using a neural network tool Word2vec. In: International Conference on Multisensor Fusion and Information Integration for Intelligent Systems, pp. 1–6. IEEE (2014)

    Google Scholar 

  16. Lu, X., et al.: An approach to sentiment analysis of short Chinese texts based on SVMs. In: Control Conference, pp. 9115–9120. IEEE (2015)

    Google Scholar 

  17. Zharmagambetov, A.S., Pak, A.A.: Sentiment analysis of a document using deep learning approach and decision trees. In: Twelve International Conference on Electronics Computer and Computation, pp. 1–4. IEEE (2016)

    Google Scholar 

  18. Wang, C., Huo, L.: Research on sentence sensitivity analysis based on dependency parsing. World Academic Publishing

    Google Scholar 

  19. Mikolov, T., et al.: Distributed representations of words and phrases and their compositionality. In: International Conference on Neural Information Processing Systems, pp. 3111–3119. Curran Associates Inc. (2013)

    Google Scholar 

  20. Takamura, H., Inui, T., Okumura, M.: Extracting semantic orientations of words using spin model.In: Meeting on Association for Computational Linguistics, pp. 133–140. Association for Computational Linguistics (2005)

    Google Scholar 

  21. Taboada, M., et al.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011)

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China under Grant No. 61601046 and No. 61171098, and is partially supported by the 111 Project of China under Grant No. B08004, and EU FP7 IRSES Mobile Cloud Project under Grant No. 612212.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Feifei Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, F., Wei, F., Yu, K., Wu, X. (2018). Sentiment Classification of Reviews on Automobile Websites by Combining Word2Vec and Dependency Parsing. In: Qiu, M. (eds) Smart Computing and Communication. SmartCom 2017. Lecture Notes in Computer Science(), vol 10699. Springer, Cham. https://doi.org/10.1007/978-3-319-73830-7_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73830-7_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73829-1

  • Online ISBN: 978-3-319-73830-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics