Skip to main content
Log in

Robust sentiment fusion on distribution of news

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In recent years, since technology has revolutionized human life, plenty of data and information are published on the Internet each day. These shared data contain sentiments (i.e., positive, neutral or negative) toward various topics. Hence, there is a growing need for some techniques and tools which make anticipations about the sentiment of documents. Some sentiment analysis tools extract sentiments for words or sentences separately; moreover, these tools might generate noisy results. Moreover, some documents may contain noisy sentences(not relevant sentences to the context of the document). Therefore, we present some fusion methods to fuse all separated sentiments to have a noise robust value as the whole document sentiment. The proposed robust sentiment analysis tool is called RSF. Initially, the sentiment of sentences are extracted by CoreNLP; afterward, due to the histogram of extracted sentence sentiments, documents are divided into two separated groups. The first group consists of documents which have one main concept and some noisy signals. On the other hand, the second group consists of documents with two main concepts. Afterward, the separated sentiments are fused by using non-linear and linear operator with different loss functions. Moreover, an approach is proposed to evaluate the different loss functions utilized in the fusion step. This approach is based on spaCy and neural network which calculate the train and test errors of different fusion methods to determine the most efficient loss function. Eventually, a news corpus of English news of ISNA is introduced in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Notes

  1. http://nlp.stanford.edu/sentiment/treebank.html

  2. https://spacy.io

  3. http://www.isna.ir

  4. http://www.nltk.org

  5. http://www.cs.cornell.edu/people/pabo/movie-review-da

  6. https://textblob.readthedocs.io/en/dev/

  7. https://www.crummy.com/software/BeautifulSoup

References

  1. Aggarwal CC, Zhai C (2012) Mining text data. Springer, New York. https://doi.org/10.1007/978-3-319-14142-8_13

    Book  Google Scholar 

  2. Ashkezari-Toussi S, Sadoghi-Yazdi H (2019) Robust diffusion LMS over adaptive networks. Signal Process 158:201–209. https://doi.org/10.1016/j.sigpro.2019.01.004

    Article  Google Scholar 

  3. Boiy E, Moens M-F (2009) A machine learning approach to sentiment analysis in multilingual web texts. Inf Retr 12(5):526–558. https://doi.org/10.1007/s10791-008-9070-z

    Article  Google Scholar 

  4. Braunstein SL (1992) How large a sample is needed for the maximum likelihood estimator to be approximately gaussian?. J Phys A Math Gen 25(13):3813. http://stacks.iop.org/0305-4470/25/i=13/a=027

    Article  MathSciNet  MATH  Google Scholar 

  5. Cai X, Hu S, Lin X (2012) Feature extraction using restricted boltzmann machine for stock price prediction. In: 2012 IEEE international conference on computer science and automation engineering (CSAE), IEEE. https://doi.org/10.1109/csae.2012.6272913

  6. Čapek D, Metscher BD, Müller GB (2002) Thumbs up or thumbs down?: semantic orientation applied to unsupervised classication of reviews. In: Proceedings of the 40th annual meeting on association for computational linguistics 322, 1–12. https://doi.org/10.1002/jez.b.22545

  7. Chatzakou D, Vakali A (2015) Harvesting opinions and emotions from social media textual resources. IEEE Internet Comput 19:46–50. https://doi.org/10.1109/mic.2015.28

    Article  Google Scholar 

  8. Choi JD, Tetreault JR, Stent A (2015) It depends: Dependency parser comparison using a web-based evaluation tool. In: ACL

  9. Dasarathy B (1997) Sensor fusion potential exploitation-innovative architectures and illustrative applications. Proc IEEE 85(1):24–38. https://doi.org/10.1109/5.554206

    Article  Google Scholar 

  10. Deonna J, Teroni F (2012) The emotions: a philosophical introduction, Vol. 69 Routledge

  11. Durrant-Whyte H (1988) Sensor models and multisensor integration. Int J Robot Res 7(6):97–113. https://doi.org/10.1177/027836498800700608

    Article  Google Scholar 

  12. Durrant-whyte H, Stevens M Data fusion in decentralized sensing networks, Information Fusion - INFFUS

  13. Duan X, Banchs RE, Zhang M, Li H, Kumara A (2016) Proceedings of news 2016 the sixth named entities workshop

  14. Gan Q, Harris C (2001) Comparison of two measurement fusion methods for kalman-filter-based multisensor data fusion. IEEE Trans Aerosp Electron Syst 37 (1):273–279. https://doi.org/10.1109/7.913685

    Article  Google Scholar 

  15. Gao L, Guo Z, Zhang H, Xu X, Shen HT (2017) Video captioning with attention-based LSTM and semantic consistency. IEEE Trans Multimed 19(9):2045–2055. https://doi.org/10.1109/tmm.2017.2729019

    Article  Google Scholar 

  16. Greene D, Cunningham P (2006) Practical solutions to the problem of diagonal dominance in kernel document clustering. In: Proceedings of the 23rd international conference on machine learning, ICML ’06. https://doi.org/10.1145/1143844.1143892. ACM, New York, pp 377–384

  17. Honnibal M, Johnson M (2015) An improved non-monotonic transition system for dependency parsing. In: Proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, pp 1373–1378. https://aclweb.org/anthology/D/D15/D15-1162

  18. Hussein DME-DM A survey on sentiment analysis challenges., King Suad University. https://doi.org/10.1016/j.jksues.2016.04.002

  19. Hu M, Liu B (2004) Mining and summarizing customer reviews. In: Proceedings of the tenth ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’04. https://doi.org/10.1145/1014052.1014073. ACM, New York, pp 168–177

  20. Intarapaiboon P (2015) A framework for text classification using intuitionistic fuzzy sets. https://doi.org/10.1007/978-3-662-47200-2_78

  21. Kadhim AI, Cheah Y-N, Ahamed NH Text document preprocessing and dimension reduction techniques for text document clustering. https://doi.org/10.1109/icaiet.2014.21

  22. Liang H, Sun X, Sun Y, Gao Y (2017) Text feature extraction based on deep learning: a review. EURASIP Journal on Wireless Communications and Networking, 1. https://doi.org/10.1186/s13638-017-0993-1

  23. Liu B (2012) Sentiment analysis and opinion mining, Morgan & Claypool Publishers

  24. Luo R, Yih C-C, Su KL (2002) Multisensor fusion and integration: approaches, applications, and future research directions. IEEE Sensors J 2(2):107–119. https://doi.org/10.1109/jsen.2002.1000251

    Article  Google Scholar 

  25. Manyika J, Durrant-Whyte H (1994) Data Fusion and Sensor Management: A Decentralized Information-Theoretic Approach (Ellis Horwood Series in Electrical and Electronic Engineering), Prentice Hall. https://www.amazon.com/Data-Fusion-Sensor-Management-Information-Theoretic/dp/0133031322?SubscriptionId=AKIAIOBINVZYXZQZ2U3A&tag=chimbori05-20&linkCode=xm2&camp=2025&creative=165953&creativeASIN=0133031322

  26. Medhat W, Hassan A, Korashy H (2014) Sentiment analysis algorithms and applications: a survey. A in Shams Eng J 5:1093–1113. https://doi.org/10.1016/j.asej.2014.04.011

    Article  Google Scholar 

  27. Pang B, Lee L, Vaithyanathan S (2002) Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing - Volume 10, EMNLP ’02. https://doi.org/10.3115/1118693.1118704. Association for Computational Linguistics, Stroudsburg, pp 79–86

  28. Pradhan L, Taneja NA, Dixit C, Suhag M (2017) Comparison of text classifiers on news articles. Procedia Technology 66:S104. https://doi.org/10.1016/j.jasi.2017.08.330

    Google Scholar 

  29. Qin ZL (2013) Sparse automatic encoder application in text categorization research. In: Science technology and engineering

  30. Rumelhart DE, Hinton GE, Williams RJ (1986) Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (eds) Parallel distributed processing. reprinted in (26). MIT Press, Cambridge, pp 318–362

  31. Serrano-Guerrero J, Olivas JA, Romero FP, Herrera-Viedma E (2015) Sentiment analysis: a review and comparative analysis of web services. Inf Sci 311:18–38. https://doi.org/10.1016/j.ins.2015.03.040

    Article  Google Scholar 

  32. Shaikh MA, Prendinger H, Mitsuru I (2007) Assessing sentiment of text by semantic dependency and contextual valence analysis. In: Proceedings of the 2nd international conference on affective computing and intelligent interaction, ACII ’07. Springer, Berlin, pp 191–202. https://doi.org/10.1007/978-3-540-74889-2_18

  33. Shing J, Jang R (1993) Anfis: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23:665–685

    Article  Google Scholar 

  34. Shopon M, Mohammed N, Abedin MA (2016) Bangla handwritten digit recognition using autoencoder and deep convolutional neural network. In: 2016 international workshop on computational intelligence (IWCI), IEEE. https://doi.org/10.1109/iwci.2016.7860340

  35. Snyder B, Barzilay R (2007) Multiple aspect ranking using the good grief algorithm, In: Proceedings of the human language technology conference of the north american chapter of the association of computational linguistics (HLT-NAACL, pp. 300–307

  36. Socher R, Perelygin A, Wu JY, Chuang J, Manning CD, Ng AY, Potts C Recursive deep models for semantic compositionality over a sentiment treebank

  37. Song J, Yang Y, Huang Z, Shen HT, Luo J (2013) Effective multiple feature hashing for large-scale near-duplicate video retrieval. IEEE Trans Multimed 15(8):1997–2008. https://doi.org/10.1109/tmm.2013.2271746

    Article  Google Scholar 

  38. Song J, Guo Y, Gao L, Li X, Hanjalic A, Shen HT (2018) From deterministic to generative: Multimodal stochastic RNNs for video captioning. IEEE Trans Neural Netw Learn Syst 2018:1–12. https://doi.org/10.1109/tnnls.2018.2851077

    Google Scholar 

  39. Sun X, Ni Y (2006) Recurrent neural network with kernel feature extraction for stock prices forecasting. In: 2006 international conference on computational intelligence and security, IEEE. https://doi.org/10.1109/iccias.2006.294269

  40. Sun X, Li C, Xu W, Ren F (2014) Chinese microblog sentiment classification based on deep belief nets with extended multi-modality features. In: 2014 IEEE international conference on data mining workshop, IEEE. https://doi.org/10.1109/icdmw.2014.101

  41. Taboada M, Anthony C, Voll K (2006) Methods for creating semantic orientation dictionaries. In: Conference on language resources and evaluation (LREC), pp 427–432

  42. Takamura H, Inui T, Okumura M (2005) Extracting semantic orientations of words using spin model. In: Proceedings of the 43rd annual meeting on association for computational linguistics, ACL ’05. https://doi.org/10.3115/1219840.1219857. Association for Computational Linguistics, Stroudsburg, pp 133–140

  43. Tellez ES, Miranda-Jiménez S, Graff M, Moctezuma D, Siordia OS, Villaseñor EA (2017) A case study of spanish text transformations for twitter sentiment analysis. Expert System with Application 81:457–471. https://doi.org/10.1016/j.eswa.2017.03.071

    Article  Google Scholar 

  44. Titov I, McDonald R (2008) A joint model of text and aspect ratings for sentiment summarization. In: Proceedings of ACL-08: HLT. http://www.aclweb.org/anthology/P08-1036. Association for Computational Linguistics, Columbus, pp 308–316

  45. Vaseghi SV (2006) Advanced digital signal processing and noise reduction. Wiley, New York

    Google Scholar 

  46. Vilares D, Alonso MA, Gómez-Rodríguez C (2017) Supervised sentiment analysis in multilingual environments. Inf Process Manag 53:595–607. https://doi.org/10.1016/j.ipm.2017.01.004

    Article  Google Scholar 

  47. Wang Z, Ziou D, Armenakis C, Li D, Li Q (2005) A comparative analysis of image fusion methods. IEEE Trans Geosci Remote Sens 43(6):1391–1402. https://doi.org/10.1109/tgrs.2005.846874

    Article  Google Scholar 

  48. Wei W, Gulla JA (2010) Sentiment learning on product reviews via sentiment ontology tree. In: Proceedings of the 48th annual meeting of the association for computational linguistics, ACL ’10. Association for Computational Linguistics, Stroudsburg, pp 404–413. http://dl.acm.org/citation.cfm?id=1858681.1858723

  49. Wiebe J, Mihalcea R (2006) Word sense and subjectivity. In: Proceedings of the 21st international conference on computational linguistics and the 44th annual meeting of the association for computational linguistics, ACL-44. https://doi.org/10.3115/1220175.1220309. Association for Computational Linguistics, Stroudsburg, pp 1065–1072

  50. Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of the conference on human language technology and empirical methods in natural language processing, HLT ’05. Association for Computational Linguistics, Stroudsburg, pp 347–354. https://doi.org/10.3115/1220575.1220619

  51. Xiong Y, Wang D, Zhang Y, Feng S, Wang G (2014) Multimodal data fusion in text-image heterogeneous graph for social media recommendation. In: Web-Age information management, Springer International Publishing, pp 96–99. https://doi.org/10.1007/978-3-319-08010-9_12

  52. Zhang S, He F, Ren W, Yao J Joint learning of image detail and transmission map for single image dehazing, The Visual Computer. https://doi.org/10.1007/s00371-018-1612-9

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hadi Sadoghi Yazdi.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kamel, M., Siuky, F.N. & Yazdi, H.S. Robust sentiment fusion on distribution of news. Multimed Tools Appl 78, 21917–21942 (2019). https://doi.org/10.1007/s11042-019-7505-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-7505-8

Keywords

Navigation