Skip to main content

Advertisement

Log in

Sentiment analysis algorithm based on word embedding in text mining

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

This study proposes a sentiment analysis algorithm model that combines word embedding and machine learning algorithms to address issues such as data shortage, low accuracy, and limited coverage. The proposed model uses the continuous bag-of-words word embedding method for word analysis and incorporates convolutional neural network, long and short-term memory neural network model, and attention mechanism to enhance the traditional word embedding method.The research results obtained using the English review dataset and the hotel review dataset, which contain an average of 10,000 entries, show that in the performance comparison of hotel review sentiment analysis tests, the highest value of the model loss function used in this study is 0.92. The accuracy, recall, and F1 value of the model are 90.56%, 91.65%, and 92.36%, respectively, which are 5.89%, 7.00%, and 6.74% higher than the lowest values in the convolutional neural network, long short term memory neural network, and convolutional long short term memory neural network. In the performance comparison for different emotional tendencies, the algorithm model used in this study achieved various indicator values of 94.68%, 93.86%, and 94.68% in sentiment analysis, respectively. The model with attention mechanism has a higher accuracy, with an accuracy of 96.4%. This shows that the algorithmic model used in the study is able to improve the accuracy and effectiveness of sentiment analysis to some extent. The research model is particularly suitable for sentiment analysis tasks in scenarios such as product reviews and hotel evaluations on e-commerce platforms, and can extract key emotional information from massive user reviews. This provides strong support for user sentiment analysis and data-driven decision-making in industries such as e-commerce and tourism.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Data availability

Data will be available on a reasonable request.

References

  • Attieh, J., & Tekli, J. (2023). Supervised term-category feature weighting for improved text classification. Knowledge-Based Systems, 261(6), 1–17.

    Google Scholar 

  • Cardone, B., Martino, F. D., & Senatore, S. (2021). Improving the emotion-based classification by exploiting the fuzzy entropy in FCM clustering. International Journal of Intelligent Systems, 36(11), 6944–6967.

    Article  Google Scholar 

  • Chen, X., Zhang, W., Xu, X., & Cao, W. (2022). A public and large-scale expert information fusion method and its application: Mining public opinion via sentiment analysis and measuring public dynamic reliability. Information Fusion, 78(8), 71–85.

    Article  Google Scholar 

  • Chen, Z., Li, X., Wang, M., & Yang, S. (2020). Domain sentiment dictionary construction and optimization based on multi-source information fusion. Intelligent Data Analysis, 24(2), 229–251.

    Article  Google Scholar 

  • Choudhuri, S., Adeniye, S., & Sen, A. (2023). Distribution alignment using complement entropy objective and adaptive consensus-based label refinement for partial domain adaptation. Artificial Intelligence and Applications, 1(1), 43–51.

    Article  Google Scholar 

  • Gao, Z., Jiang, L., Xia, X., Lo, D., & Grundy, J. (2020). Checking smart contracts with structural code embedding. IEEE Transactions on Software Engineering, 47(12), 2874–2891.

    Article  Google Scholar 

  • Jain, A., Nandi, B. P., Gupta, C., & Tayal, D. K. (2020). Senti-NSetPSO: Large-sized document-level sentiment analysis using Neutrosophic Set and particle swarm optimization. Soft Computing, 24(1), 3–15.

    Article  Google Scholar 

  • Ji, Z., Yang, Y., Wang, F., Xu, L., & Hu, X. (2020). Feature encoding with hybrid heterogeneous structure model for image classification. IET Image Processing, 14(10), 2166–2174.

    Article  Google Scholar 

  • Li, M. Y., Zhao, X. J., Zhang, L., & Ye, X. (2020b). Method for product selection considering consumer’s expectations and online reviews. Kybernetes, 50(9), 2488–2520.

    Article  Google Scholar 

  • Li, Y., Zhang, K., Wang, J., & Gao, X. (2020a). A cognitive brain model for multimodal sentiment analysis based on attention neural networks. Neurocomputing, 430(2), 159–173.

    Google Scholar 

  • Lim, S., Prade, H., & Richard, G. (2021). Classifying and completing word analogies by machine learning. International Journal of Approximate Reasoning, 132(1), 1–25.

    Article  MathSciNet  Google Scholar 

  • Mohamed, F. S. (2021). Detecting cyberbullying across social media platforms in Saudi Arabia using sentiment analysis: A case study. The Computer Journal, 7, 1787–1794.

    Google Scholar 

  • Musa, I. H., Xu, K., Liu, F., & Zamit, I. (2020). A cross-lingual sentiment topic model evolution over time. Intelligent Data Analysis, 24(2), 253–266.

    Article  Google Scholar 

  • Pandiaraj, A., Sundar, C., & Pavalarajan, S. (2021). Sentiment analysis on newspaper article reviews: Contribution towards improved rider optimization-based hybrid classifier. Kybernetes, 51(1), 348–382.

    Google Scholar 

  • Pindado, E., & Barrena, R. (2020). Using Twitter to explore consumers’ sentiments and their social representations towards new food trends. British Food Journal, 123(3), 1060–1082.

    Article  Google Scholar 

  • Rahmani, S., Hosseini, S., Zall, R., & Kangavari, M. R. (2023). Transfer-based adaptive tree for multimodal sentiment analysis based on user latent aspects. Knowledge-Based Systems, 261(5), 1–16.

    Google Scholar 

  • Rehioui, H., & Idrissi, A. (2020). New clustering algorithms for Twitter sentiment analysis. IEEE Systems Journal, 14(1), 530–537.

    Article  Google Scholar 

  • Sharan, B., & Jain, T. (2020). Spectral analysis-based fault diagnosis algorithm for 3-phase passive rectifiers in renewable energy systems. IET Power Electronics, 13(16), 3818–3829.

    Article  Google Scholar 

  • Trillo, J. R., Herrera-Viedma, E., Morente-Molinera, J. A., & Cabrerizo, F. J. (2023). A large scale group decision making system based on sentiment analysis cluster. Information Fusion, 91(5), 633–643.

    Article  Google Scholar 

  • Vo, A. D., Nguyen, Q. P., & Ock, C. Y. (2020). Semantic and syntactic analysis in learning representation based on a sentiment analysis model. Applied Intelligence, 50(3), 663–680.

    Article  Google Scholar 

  • Yang, Z., Ouyang, T., Fu, X., & Peng, X. (2020). A decision-making algorithm for online shopping using deep-learning–based opinion pairs mining and q-rung orthopair fuzzy interaction Heronian mean operators. International Journal of Intelligent Systems, 35(5), 783–825.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qiong Hu.

Ethics declarations

Competing interests

The author reports there are no competing interests to declare.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Hu, Q. Sentiment analysis algorithm based on word embedding in text mining. Int J Speech Technol 28, 141–151 (2025). https://doi.org/10.1007/s10772-025-10170-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10772-025-10170-8

Keywords