Abstract
Binary sentiment analysis uses sentiment dictionaries, TF-IDF, word2vec, and BERT to convert text documents such as product and movie reviews into vectors. Dimensionality reduction by feature selection can effectively reduce the complexity of sentiment analysis. Existing feature selection methods put all samples together and ignore the difference in the feature representation between different categories. For binary sentiment analysis, there are some reviews with uncertain sentiment polarity, three-way decision divides samples into positive (POS) region, negative (NEG) region, and uncertain region (UNC). The model based on the three-way decision is beneficial to process the UNC and improve the effect of binary sentiment analysis. However, how to obtain the optimal feature representation in certain regions respectively to process the uncertain samples is a challenge. In this paper, a classified feature representation three-way decision model is proposed to obtain the optimal feature representation of the positive and negative domains for sentiment analysis. In the positive domain and the negative domain, m- and n-layer feature representations are obtained. The optimal layer with the best performance is selected as the optimal feature representation. The POS region and the NEG region in the testing set are processed by the optimal feature representation, the UNC region is processed by the original feature representation. Experiments on IMDB and Amazon show that the performance of our proposed method in terms of classification accuracy in sentiment analysis is significantly higher than that of the chi-square, principal component analysis, and mutual information methods.






Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ahmad S R, Bakar A A, Yaakub M R (2019) A review of feature selection techniques in sentiment analysis. In: Intelligent data analysis, vol 1, pp 159–189
Mehta P, Chandra S (2019) NICFS: A novel feature selection method applied to lexicon based sentiment analysis. Intell Decis Technol 13(1):41–48
Devlin J, Chang M, Lee K, Toutanova K (2019) BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In: north american chapter of the association for computational linguistics. pp 4171–4186
Tommasel A, Godoy D (2018) A Social-aware online short-text feature selection technique for social media. Information Fusion
Wang Z, Lin Z (2020) Optimal feature selection for Learning-Based algorithms for sentiment classification. Cogn Comput 12(1):238–248
Madasu A, Sivasankar E (2020) Efficient Feature Selection techniques for Sentiment Analysis. Multimed Tools Appl 79(9):6313–6335
Kumar H M K, Harish B S (2019) A new feature selection method for sentiment analysis in short text. J Intell Syst 29(1):1122–1134
Tripathy A, Agrawal A, Rath S K (2016) Classification of sentiment reviews using n-gram machine learning approach. Expert Syst Appl 57:117–126
Barkha B, Sangeet S (2019) Hybrid attribute based sentiment classification of online reviews for consumer intelligence. Appl Intell 49(1):137–149
Al-Sharuee M T, Liu F, Pratama M (2020) Sentiment analysis: dynamic and temporal clustering of product reviews. Appl Intell:1–20
Huiping C, Lidan W, Shukai D (2016) Sentiment classification model based on word embedding and CNN. Application Research of Computers
Dey R, Hong Y (2018) CompNet: Complementary Segmentation Network for Brain MRI Extraction. In: medical image computing and computer assisted intervention, pp 628– 636
Sabour S, Frosst N, Hinton GE (2017) Dynamic Routing Between Capsules. arXiv Computer Vision and Pattern Recognition
Hochreiter S, Schmidhuber J (1997) Long Short-Term memory. Neural Comput 9(8):1735–1780
Tang D, Qin B, Feng X, Liu T Effective LSTMs for Target-Dependent Sentiment Classification. arXiv Computation and Language
Chen L -C, Lee C -M, Chen M -Y (2020) Exploration of social media for sentiment analysis using deep learning. Soft Comput 24(11):8187–8197. https://doi.org/10.1007/s00500-019-04402-8
Yao Y (2009) Three-Way Decision: An Interpretation of Rules in Rough Set Theory. Rough Sets and Knowledge Technology, 4th International Conference, RSKT 2009, Gold Coast. Proceedings. Springer, Berlin
Fujita H, Gaeta A, Loia V, Orciuoli F (2020) Hypotheses analysis and assessment in counterterrorism activities: a method based on OWA and fuzzy probabilistic rough sets. IEEE Trans Fuzzy Syst 28 (5):831–845. https://doi.org/10.1109/TFUZZ.2019.2955047
Yiyu Y (2018) Three-way decision and granular computing. Int J Approx Reason 103:107–123
Yao Y (2019) Tri-level thinking: models of three-way decision. Int J Mach Learn Cybern 1(5)
Yao Y, Wang S, Deng X (2017) Constructing shadowed sets and three-way approximations of fuzzy sets. Inf Sci:132–153
Zhao X, Miao D, Fujita H (2020) Variable-precision three-way concepts in L-contexts. Int J Approx Reason 130:107– 125
Yang X, Zhang Y, Fujita H, Liu D, Li T (2020) Local temporal-spatial multi-granularity learning for sequential three-way granular computing. Inf Sci 541:75–97. https://doi.org/10.1016/j.ins.2020.06.020
Luo J, Fujita H, Yao Y, Qin K (2020) On modeling similarity and three-way decision under incomplete information in rough set theory. Knowl Based Syst 191:105251. https://doi.org/10.1016/j.knosys.2019.105251
Yang D, Deng T, Fujita H (2020) Partial-overall dominance three-way decision models in interval-valued decision systems. Int J Approx Reason 126:308–325. https://doi.org/10.1016/j.ijar.2020.08.014
Yue X, Chen Y, Miao D, Fujita H (2020) Fuzzy neighborhood covering for three-way classification. Inf Sci 507:795–808. https://doi.org/10.1016/j.ins.2018.07.065
Li Y, Zhang L, Xu Y, Yao Y, Lau R, Wu Y (2017) Enhancing binary classification by modeling uncertain boundary in Three-Way decisions. IEEE Trans Knowl Data Eng:1–1
Qiao J, Qing Hu B (2018) On transformations from semi-three-way decision spaces to three-way decision spaces based on triangular norms and triangular conorms. Information Ences S0020025517305911
Li H, Zhang L, Zhou X, Huang B (2017) Cost-sensitive sequential three-way decision modeling using a deep neural network. Int J Approx Reason 85:68–78
Abdel-Basset M, Manogaran G, Mohamed M, Chilamkurti N (2018) Three-way decisions based on neutrosophic sets and AHP-QFD framework for supplier selection problem. Futur Gener Comput Syst 89(DEC.):19–30
Hu F, Wang L, Zhou Y (2018) An oversampling method for imbalance data based on Three-Way decision model. Acta Electron Sin 46(1):135–144
Afridi M K, Azam N, Yao J T, Alanazi E (2018) A three-way clustering approach for handling missing data using GTRS. Int J Approx Reason:11–24
Jiang C, Wu J, Li Z (2018) Adaptive thresholds determination for saving cloud energy using three-way decisions. Cluster Computing
Fujita H, Gaeta A, Loia V, Orciuoli F (2019) Resilience analysis of critical infrastructures: a cognitive approach based on granular computing. IEEE Trans Cybern 49(5):1835–1848. https://doi.org/10.1109/TCYB.2018.2815178
Yao Y (2008) A unified framework of granular computing. Wiley
Yang X, Li T, Fujita H, Liu D, Yao Y (2017) A unified model of sequential three-way decisions and multilevel incremental processing. Knowl-Based Syst 134:172–188
Chen J, Zhang Y P, Zhao S (2016)
Wang T, Li H, Zhang L, Zhou X, Huang B (2020) A three-way decision model based on cumulative prospect theory. Inf Sci 519:74–92. https://doi.org/10.1016/j.ins.2020.01.030
Li H, Zhang L, Huang B, Zhou X (2019) Cost-Sensitive Dual-Bidirectional Linear discriminant analysis. Inf Sci 510:283–303
Chung J, Gulcehre C, Cho K, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv Neural and Evolutionary Computing
Harish B S, Revanasiddappa M B (2017) A Comprehensive Survey on various Feature Selection Methods to Categorize Text Documents. Int J Comput Appl 164(8):1–7
Yang Y (1997) A Comparative Study on Feature Selection in Text Categorization. In: Proceedings of Int Conference on Machine Learning
Abbasi A, Chen H, Salem A (2008) Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. ACM Trans Inf Syst 26(3):12
Rasool A, Tao R, Kamyab A (2020) GAWA–A feature selection method for hybrid sentiment classification. IEEE Access:8
Madasu A, Elango S (2020) Efficient feature selection techniques for sentiment analysis. Multimed Tools Appl 79(9-10):6313–6335
Wang Z, Lin Z (2020) Optimal feature selection for Learning-Based algorithms for sentiment classification. Cogn Comput 12(1):238–248
Gokalp O, Tasci E, Ugur A (2020) A novel wrapper feature selection algorithm based on iterated greedy metaheuristic for sentiment classification. Expert Syst Appl 146:113176-
Yao Y, Zhang X (2017) Class-specific attribute reducts in rough set theory. Inf Sci:601–618
Zhang L, Zhang B (2014) Quotient Space Based Problem Solving.
Ling Z, Bo Z (2003) Theory of Fuzzy Quotient Space (Methods of Fuzzy Granular Computing). Journal of Software
Chen J, Xu Y (2020) AH3: An Adaptive Hierarchical Feature Representation Model for Three-Way Decision Boundary Processing[J]. International Journal of Advanced Research
Acknowledgements
This work was supported by the Major Program of the National Social Science Foundation of China (Grant No. 18ZDA032), the National Natural Science Foundation of China (Grant No. 61876001), and the China Scholarship Council.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, J., Chen, Y., He, Y. et al. A classified feature representation three-way decision model for sentiment analysis. Appl Intell 52, 7995–8007 (2022). https://doi.org/10.1007/s10489-021-02809-1
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-021-02809-1