Abstract
In recent years, since technology has revolutionized human life, plenty of data and information are published on the Internet each day. These shared data contain sentiments (i.e., positive, neutral or negative) toward various topics. Hence, there is a growing need for some techniques and tools which make anticipations about the sentiment of documents. Some sentiment analysis tools extract sentiments for words or sentences separately; moreover, these tools might generate noisy results. Moreover, some documents may contain noisy sentences(not relevant sentences to the context of the document). Therefore, we present some fusion methods to fuse all separated sentiments to have a noise robust value as the whole document sentiment. The proposed robust sentiment analysis tool is called RSF. Initially, the sentiment of sentences are extracted by CoreNLP; afterward, due to the histogram of extracted sentence sentiments, documents are divided into two separated groups. The first group consists of documents which have one main concept and some noisy signals. On the other hand, the second group consists of documents with two main concepts. Afterward, the separated sentiments are fused by using non-linear and linear operator with different loss functions. Moreover, an approach is proposed to evaluate the different loss functions utilized in the fusion step. This approach is based on spaCy and neural network which calculate the train and test errors of different fusion methods to determine the most efficient loss function. Eventually, a news corpus of English news of ISNA is introduced in this paper.
Similar content being viewed by others
References
Aggarwal CC, Zhai C (2012) Mining text data. Springer, New York. https://doi.org/10.1007/978-3-319-14142-8_13
Ashkezari-Toussi S, Sadoghi-Yazdi H (2019) Robust diffusion LMS over adaptive networks. Signal Process 158:201–209. https://doi.org/10.1016/j.sigpro.2019.01.004
Boiy E, Moens M-F (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retr 12(5):526–558. https://doi.org/10.1007/s10791-008-9070-z
Braunstein SL (1992) How large a sample is needed for the maximum likelihood estimator to be approximately gaussian?. J Phys A Math Gen 25(13):3813. http://stacks.iop.org/0305-4470/25/i=13/a=027
Cai X, Hu S, Lin X (2012) Feature extraction using restricted boltzmann machine for stock price prediction. In: 2012 IEEE international conference on computer science and automation engineering (CSAE), IEEE. https://doi.org/10.1109/csae.2012.6272913
Čapek D, Metscher BD, Müller GB (2002) Thumbs up or thumbs down?: semantic orientation applied to unsupervised classication of reviews. In: Proceedings of the 40th annual meeting on association for computational linguistics 322, 1–12. https://doi.org/10.1002/jez.b.22545
Chatzakou D, Vakali A (2015) Harvesting opinions and emotions from social media textual resources. IEEE Internet Comput 19:46–50. https://doi.org/10.1109/mic.2015.28
Choi JD, Tetreault JR, Stent A (2015) It depends: Dependency parser comparison using a web-based evaluation tool. In: ACL
Dasarathy B (1997) Sensor fusion potential exploitation-innovative architectures and illustrative applications. Proc IEEE 85(1):24–38. https://doi.org/10.1109/5.554206
Deonna J, Teroni F (2012) The emotions: a philosophical introduction, Vol. 69 Routledge
Durrant-Whyte H (1988) Sensor models and multisensor integration. Int J Robot Res 7(6):97–113. https://doi.org/10.1177/027836498800700608
Durrant-whyte H, Stevens M Data fusion in decentralized sensing networks, Information Fusion - INFFUS
Duan X, Banchs RE, Zhang M, Li H, Kumara A (2016) Proceedings of news 2016 the sixth named entities workshop
Gan Q, Harris C (2001) Comparison of two measurement fusion methods for kalman-filter-based multisensor data fusion. IEEE Trans Aerosp Electron Syst 37 (1):273–279. https://doi.org/10.1109/7.913685
Gao L, Guo Z, Zhang H, Xu X, Shen HT (2017) Video captioning with attention-based LSTM and semantic consistency. IEEE Trans Multimed 19(9):2045–2055. https://doi.org/10.1109/tmm.2017.2729019
Greene D, Cunningham P (2006) Practical solutions to the problem of diagonal dominance in kernel document clustering. In: Proceedings of the 23rd international conference on machine learning, ICML ’06. https://doi.org/10.1145/1143844.1143892. ACM, New York, pp 377–384
Honnibal M, Johnson M (2015) An improved non-monotonic transition system for dependency parsing. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, pp 1373–1378. https://aclweb.org/anthology/D/D15/D15-1162
Hussein DME-DM A survey on sentiment analysis challenges., King Suad University. https://doi.org/10.1016/j.jksues.2016.04.002
Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’04. https://doi.org/10.1145/1014052.1014073. ACM, New York, pp 168–177
Intarapaiboon P (2015) A framework for text classification using intuitionistic fuzzy sets. https://doi.org/10.1007/978-3-662-47200-2_78
Kadhim AI, Cheah Y-N, Ahamed NH Text document preprocessing and dimension reduction techniques for text document clustering. https://doi.org/10.1109/icaiet.2014.21
Liang H, Sun X, Sun Y, Gao Y (2017) Text feature extraction based on deep learning: a review. EURASIP Journal on Wireless Communications and Networking, 1. https://doi.org/10.1186/s13638-017-0993-1
Liu B (2012) Sentiment analysis and opinion mining, Morgan & Claypool Publishers
Luo R, Yih C-C, Su KL (2002) Multisensor fusion and integration: approaches, applications, and future research directions. IEEE Sensors J 2(2):107–119. https://doi.org/10.1109/jsen.2002.1000251
Manyika J, Durrant-Whyte H (1994) Data Fusion and Sensor Management: A Decentralized Information-Theoretic Approach (Ellis Horwood Series in Electrical and Electronic Engineering), Prentice Hall. https://www.amazon.com/Data-Fusion-Sensor-Management-Information-Theoretic/dp/0133031322?SubscriptionId=AKIAIOBINVZYXZQZ2U3A&tag=chimbori05-20&linkCode=xm2&camp=2025&creative=165953&creativeASIN=0133031322
Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. A in Shams Eng J 5:1093–1113. https://doi.org/10.1016/j.asej.2014.04.011
Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - Volume 10, EMNLP ’02. https://doi.org/10.3115/1118693.1118704. Association for Computational Linguistics, Stroudsburg, pp 79–86
Pradhan L, Taneja NA, Dixit C, Suhag M (2017) Comparison of text classifiers on news articles. Procedia Technology 66:S104. https://doi.org/10.1016/j.jasi.2017.08.330
Qin ZL (2013) Sparse automatic encoder application in text categorization research. In: Science technology and engineering
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing. reprinted in (26). MIT Press, Cambridge, pp 318–362
Serrano-Guerrero J, Olivas JA, Romero FP, Herrera-Viedma E (2015) Sentiment analysis: a review and comparative analysis of web services. Inf Sci 311:18–38. https://doi.org/10.1016/j.ins.2015.03.040
Shaikh MA, Prendinger H, Mitsuru I (2007) Assessing sentiment of text by semantic dependency and contextual valence analysis. In: Proceedings of the 2nd international conference on affective computing and intelligent interaction, ACII ’07. Springer, Berlin, pp 191–202. https://doi.org/10.1007/978-3-540-74889-2_18
Shing J, Jang R (1993) Anfis: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23:665–685
Shopon M, Mohammed N, Abedin MA (2016) Bangla handwritten digit recognition using autoencoder and deep convolutional neural network. In: 2016 international workshop on computational intelligence (IWCI), IEEE. https://doi.org/10.1109/iwci.2016.7860340
Snyder B, Barzilay R (2007) Multiple aspect ranking using the good grief algorithm, In: Proceedings of the human language technology conference of the north american chapter of the association of computational linguistics (HLT-NAACL, pp. 300–307
Socher R, Perelygin A, Wu JY, Chuang J, Manning CD, Ng AY, Potts C Recursive deep models for semantic compositionality over a sentiment treebank
Song J, Yang Y, Huang Z, Shen HT, Luo J (2013) Effective multiple feature hashing for large-scale near-duplicate video retrieval. IEEE Trans Multimed 15(8):1997–2008. https://doi.org/10.1109/tmm.2013.2271746
Song J, Guo Y, Gao L, Li X, Hanjalic A, Shen HT (2018) From deterministic to generative: Multimodal stochastic RNNs for video captioning. IEEE Trans Neural Netw Learn Syst 2018:1–12. https://doi.org/10.1109/tnnls.2018.2851077
Sun X, Ni Y (2006) Recurrent neural network with kernel feature extraction for stock prices forecasting. In: 2006 international conference on computational intelligence and security, IEEE. https://doi.org/10.1109/iccias.2006.294269
Sun X, Li C, Xu W, Ren F (2014) Chinese microblog sentiment classification based on deep belief nets with extended multi-modality features. In: 2014 IEEE international conference on data mining workshop, IEEE. https://doi.org/10.1109/icdmw.2014.101
Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Conference on language resources and evaluation (LREC), pp 427–432
Takamura H, Inui T, Okumura M (2005) Extracting semantic orientations of words using spin model. In: Proceedings of the 43rd annual meeting on association for computational linguistics, ACL ’05. https://doi.org/10.3115/1219840.1219857. Association for Computational Linguistics, Stroudsburg, pp 133–140
Tellez ES, Miranda-Jiménez S, Graff M, Moctezuma D, Siordia OS, Villaseñor EA (2017) A case study of spanish text transformations for twitter sentiment analysis. Expert System with Application 81:457–471. https://doi.org/10.1016/j.eswa.2017.03.071
Titov I, McDonald R (2008) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings of ACL-08: HLT. http://www.aclweb.org/anthology/P08-1036. Association for Computational Linguistics, Columbus, pp 308–316
Vaseghi SV (2006) Advanced digital signal processing and noise reduction. Wiley, New York
Vilares D, Alonso MA, Gómez-Rodríguez C (2017) Supervised sentiment analysis in multilingual environments. Inf Process Manag 53:595–607. https://doi.org/10.1016/j.ipm.2017.01.004
Wang Z, Ziou D, Armenakis C, Li D, Li Q (2005) A comparative analysis of image fusion methods. IEEE Trans Geosci Remote Sens 43(6):1391–1402. https://doi.org/10.1109/tgrs.2005.846874
Wei W, Gulla JA (2010) Sentiment learning on product reviews via sentiment ontology tree. In: Proceedings of the 48th annual meeting of the association for computational linguistics, ACL ’10. Association for Computational Linguistics, Stroudsburg, pp 404–413. http://dl.acm.org/citation.cfm?id=1858681.1858723
Wiebe J, Mihalcea R (2006) Word sense and subjectivity. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the association for computational linguistics, ACL-44. https://doi.org/10.3115/1220175.1220309. Association for Computational Linguistics, Stroudsburg, pp 1065–1072
Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the conference on human language technology and empirical methods in natural language processing, HLT ’05. Association for Computational Linguistics, Stroudsburg, pp 347–354. https://doi.org/10.3115/1220575.1220619
Xiong Y, Wang D, Zhang Y, Feng S, Wang G (2014) Multimodal data fusion in text-image heterogeneous graph for social media recommendation. In: Web-Age information management, Springer International Publishing, pp 96–99. https://doi.org/10.1007/978-3-319-08010-9_12
Zhang S, He F, Ren W, Yao J Joint learning of image detail and transmission map for single image dehazing, The Visual Computer. https://doi.org/10.1007/s00371-018-1612-9
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kamel, M., Siuky, F.N. & Yazdi, H.S. Robust sentiment fusion on distribution of news. Multimed Tools Appl 78, 21917–21942 (2019). https://doi.org/10.1007/s11042-019-7505-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-7505-8