Abstract
This study proposes a sentiment analysis algorithm model that combines word embedding and machine learning algorithms to address issues such as data shortage, low accuracy, and limited coverage. The proposed model uses the continuous bag-of-words word embedding method for word analysis and incorporates convolutional neural network, long and short-term memory neural network model, and attention mechanism to enhance the traditional word embedding method.The research results obtained using the English review dataset and the hotel review dataset, which contain an average of 10,000 entries, show that in the performance comparison of hotel review sentiment analysis tests, the highest value of the model loss function used in this study is 0.92. The accuracy, recall, and F1 value of the model are 90.56%, 91.65%, and 92.36%, respectively, which are 5.89%, 7.00%, and 6.74% higher than the lowest values in the convolutional neural network, long short term memory neural network, and convolutional long short term memory neural network. In the performance comparison for different emotional tendencies, the algorithm model used in this study achieved various indicator values of 94.68%, 93.86%, and 94.68% in sentiment analysis, respectively. The model with attention mechanism has a higher accuracy, with an accuracy of 96.4%. This shows that the algorithmic model used in the study is able to improve the accuracy and effectiveness of sentiment analysis to some extent. The research model is particularly suitable for sentiment analysis tasks in scenarios such as product reviews and hotel evaluations on e-commerce platforms, and can extract key emotional information from massive user reviews. This provides strong support for user sentiment analysis and data-driven decision-making in industries such as e-commerce and tourism.










Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Data availability
Data will be available on a reasonable request.
References
Attieh, J., & Tekli, J. (2023). Supervised term-category feature weighting for improved text classification. Knowledge-Based Systems, 261(6), 1–17.
Cardone, B., Martino, F. D., & Senatore, S. (2021). Improving the emotion-based classification by exploiting the fuzzy entropy in FCM clustering. International Journal of Intelligent Systems, 36(11), 6944–6967.
Chen, X., Zhang, W., Xu, X., & Cao, W. (2022). A public and large-scale expert information fusion method and its application: Mining public opinion via sentiment analysis and measuring public dynamic reliability. Information Fusion, 78(8), 71–85.
Chen, Z., Li, X., Wang, M., & Yang, S. (2020). Domain sentiment dictionary construction and optimization based on multi-source information fusion. Intelligent Data Analysis, 24(2), 229–251.
Choudhuri, S., Adeniye, S., & Sen, A. (2023). Distribution alignment using complement entropy objective and adaptive consensus-based label refinement for partial domain adaptation. Artificial Intelligence and Applications, 1(1), 43–51.
Gao, Z., Jiang, L., Xia, X., Lo, D., & Grundy, J. (2020). Checking smart contracts with structural code embedding. IEEE Transactions on Software Engineering, 47(12), 2874–2891.
Jain, A., Nandi, B. P., Gupta, C., & Tayal, D. K. (2020). Senti-NSetPSO: Large-sized document-level sentiment analysis using Neutrosophic Set and particle swarm optimization. Soft Computing, 24(1), 3–15.
Ji, Z., Yang, Y., Wang, F., Xu, L., & Hu, X. (2020). Feature encoding with hybrid heterogeneous structure model for image classification. IET Image Processing, 14(10), 2166–2174.
Li, M. Y., Zhao, X. J., Zhang, L., & Ye, X. (2020b). Method for product selection considering consumer’s expectations and online reviews. Kybernetes, 50(9), 2488–2520.
Li, Y., Zhang, K., Wang, J., & Gao, X. (2020a). A cognitive brain model for multimodal sentiment analysis based on attention neural networks. Neurocomputing, 430(2), 159–173.
Lim, S., Prade, H., & Richard, G. (2021). Classifying and completing word analogies by machine learning. International Journal of Approximate Reasoning, 132(1), 1–25.
Mohamed, F. S. (2021). Detecting cyberbullying across social media platforms in Saudi Arabia using sentiment analysis: A case study. The Computer Journal, 7, 1787–1794.
Musa, I. H., Xu, K., Liu, F., & Zamit, I. (2020). A cross-lingual sentiment topic model evolution over time. Intelligent Data Analysis, 24(2), 253–266.
Pandiaraj, A., Sundar, C., & Pavalarajan, S. (2021). Sentiment analysis on newspaper article reviews: Contribution towards improved rider optimization-based hybrid classifier. Kybernetes, 51(1), 348–382.
Pindado, E., & Barrena, R. (2020). Using Twitter to explore consumers’ sentiments and their social representations towards new food trends. British Food Journal, 123(3), 1060–1082.
Rahmani, S., Hosseini, S., Zall, R., & Kangavari, M. R. (2023). Transfer-based adaptive tree for multimodal sentiment analysis based on user latent aspects. Knowledge-Based Systems, 261(5), 1–16.
Rehioui, H., & Idrissi, A. (2020). New clustering algorithms for Twitter sentiment analysis. IEEE Systems Journal, 14(1), 530–537.
Sharan, B., & Jain, T. (2020). Spectral analysis-based fault diagnosis algorithm for 3-phase passive rectifiers in renewable energy systems. IET Power Electronics, 13(16), 3818–3829.
Trillo, J. R., Herrera-Viedma, E., Morente-Molinera, J. A., & Cabrerizo, F. J. (2023). A large scale group decision making system based on sentiment analysis cluster. Information Fusion, 91(5), 633–643.
Vo, A. D., Nguyen, Q. P., & Ock, C. Y. (2020). Semantic and syntactic analysis in learning representation based on a sentiment analysis model. Applied Intelligence, 50(3), 663–680.
Yang, Z., Ouyang, T., Fu, X., & Peng, X. (2020). A decision-making algorithm for online shopping using deep-learning–based opinion pairs mining and q-rung orthopair fuzzy interaction Heronian mean operators. International Journal of Intelligent Systems, 35(5), 783–825.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The author reports there are no competing interests to declare.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Hu, Q. Sentiment analysis algorithm based on word embedding in text mining. Int J Speech Technol 28, 141–151 (2025). https://doi.org/10.1007/s10772-025-10170-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10772-025-10170-8