Skip to main content

Software Sentiment Analysis Using Machine Learning with Different Word-Embedding

  • Conference paper
  • First Online:
Book cover Computational Science and Its Applications – ICCSA 2022 Workshops (ICCSA 2022)

Abstract

Software sentiment analysis has applications in numerous software engineering tasks ranging from code suggestions to evaluating app reviews which help to save the development team valuable time and increase productivity. In recent years, sentiment analysis has been used to study the emotional state of developers through sources like commit messages. State-of-the-art sentiment analysis techniques have been employed to accomplish these tasks with varying results. The goal of this paper is to provide a comparison between the performance of various models for possible applications of sentiment analysis in software engineering. We have used three different datasets to account for the possible applications: JIRA, AppReviews, and StackOverflow. In this work, six word embedding techniques have been applied on above datasets to represent the text as n-dimensional vectors. To handle the skewed distribution of classes present in the data, we have employed two class balancing techniques in the form of SMOTE and Borderline-SMOTE. The resulting data is subjected to six feature selection techniques, and finally, the sentiment of the text is classified using 14 different classifiers. The experimental results suggest that some models are very successful in accurately classifying the sentiment of the text, whereas choosing the wrong combination of ML techniques can lead to disappointing performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Biswas, E., Vijay-Shanker, K., Pollock, L.: Exploring word embedding techniques to improve sentiment analysis of software engineering texts. In: 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), pp. 68–78 (2019)

    Google Scholar 

  2. Islam, M.R., Zibran, M.F.: Leveraging automated sentiment analysis in software engineering. In: 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 203–214 (2017)

    Google Scholar 

  3. Jindal, R., Malhotra, R., Jain, A.: Software defect prediction using neural networks. In: Proceedings of 3rd International Conference on Reliability, Infocom Technologies and Optimization, pp. 1–6. IEEE (2014)

    Google Scholar 

  4. Jurado, F., Rodriguez, P.: Sentiment analysis in monitoring software development processes: an exploratory case study on github’s project issues. J. Syst. Softw. 104, 82–89 (2015)

    Article  Google Scholar 

  5. Kumar, L., Kumar, M., Murthy, L.B., Misra, S., Kocher, V., Padmanabhuni, S.: An empirical study on application of word embedding techniques for prediction of software defect severity level. In: 2021 16th Conference on Computer Science and Intelligence Systems (FedCSIS), pp. 477–484. IEEE (2021)

    Google Scholar 

  6. Lin, B., Zampetti, F., Bavota, G., Di Penta, M., Lanza, M., Oliveto, R.: Sentiment analysis for software engineering: how far can we go? In: Proceedings of the 40th International Conference on Software Engineering, pp. 94–104. ICSE 2018, Association for Computing Machinery, New York, NY, USA (2018)

    Google Scholar 

  7. Sari, G.I.P., Siahaan, D.O.: An attribute selection for severity level determination according to the support vector machine classification result. In: Proceedings of the 1st International Conference on Information Systems for Business Competitiveness (ICISBC) (2011)

    Google Scholar 

Download references

Acknowledgements

This research is funded by TestAIng Solutions Pvt. Ltd.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sanidhya Vijayvargiya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Mula, V.K.C., Vijayvargiya, S., Kumar, L., Samant, S.S., Murthy, L.B. (2022). Software Sentiment Analysis Using Machine Learning with Different Word-Embedding. In: Gervasi, O., Murgante, B., Misra, S., Rocha, A.M.A.C., Garau, C. (eds) Computational Science and Its Applications – ICCSA 2022 Workshops. ICCSA 2022. Lecture Notes in Computer Science, vol 13381. Springer, Cham. https://doi.org/10.1007/978-3-031-10548-7_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-10548-7_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-10547-0

  • Online ISBN: 978-3-031-10548-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics