Abstract
In the digital era, the importance of extracting the hidden sentiments from user reviews plays a prominent role, to increase the profitability of an organization. The interest among, the research community in Sentiment Analysis (SA) has grown exponentially. But there are enormous challenges still being faced in the field of SA namely: Identification of sarcasm/Irony/Conditional/Modifier statements present in the review, Identification of Aspects and sentiment word as a pair (Data Transformation), Rating the recognized Aspects towards predicting the overall aggregated sentiment, Analyzing and designing issues towards implementing the parallel Aspect Level sentiment. In the present research work, We have addressed each of this challenges using a serial hybridization model, Where, the output of each step, is input to the following stage. First, towards identification sarcasm. In which, the dictionary is updated with the set of sentiment words by manually crafted rules. Next, to mitigate the discovery of sentiment and aspect word pair. In which, Latent Dirichlet Allocation (LDA), Gibbs sampling techniques are used. Next, to present the result of sentiment analysis as the overall rating of data considered, Latent Aspect Rating Regression (LARR) model is proposed (Data Presentation). Finally, addressed the designing issues (deciding numbers of mappers and reducers needed) towards implementing the parallel Aspect Level sentiment Analysis with the objective of improving the resource utilization in Big Data clusters. This work can help the researchers doing research in the field of speech recognition, development of recommended systems. The evaluation Metric used in estimating the performance of each step in our research are F-score, Rand Index, Classification accuracy and Root Mean Absolute Error (RMAE), Throughput. The findings of our research work help the customer to directly use the result obtained from the proposed model in the form of Aspect level rating.
References
Bafna K, Toshniwal D (2013) Feature based summarization of customers’ reviews of online products. Procedia Computer Science 22:142–151
Balazs JA, Velásquez JD (2016) Opinion mining and information fusion: a survey. Information Fusion 27:95–110
Bamman D, Smith NA (2015) Contextualized sarcasm detection on twitter. International Conference on Web and Social Media 2:15
Basha SM, Rajput DS (2017) Evaluating the impact of feature selection on overall performance of sentiment analysis. In: Proceedings of the 2017 International Conference on Information Technology. ACM, p 96–102
Basha SM, Rajput DS (2018) Sentiment analysis: using artificial neural fuzzy inference system. In: Handbook of research on pattern engineering system development for big data analytics. IGI Global, p 130–152
Basha SM, Rajput DS, Poluru RK, Bhushan SB, Basha SAK (2018) Evaluating the performance of supervised classification models: decision tree and Naïve Bayes using KNIME. International Journal of Engineering & Technology 7(4.5):248–253
Basha SM, Rajput DS, Vandhan V (2018) Impact of gradient ascent and boosting algorithm in classification. International Journal of Intelligent Engineering and Systems (IJIES) 11(1):41–49
Basha SM, Rajput DS, Iyengar NCS (2019) Conceptual approach to predict loan defaults using decision trees. In: Sentiment analysis and knowledge discovery in contemporary business. IGI Global, p 148–161
Bouk SH, Ahmed SH, Kim D, Song H (2017) Named-data-networking-based ITS for smart cities. IEEE Commun Mag 55(1):105–111
Bulkowski BJ, Srinivasan V (2018) Data distribution across nodes of a distributed database base system. United States patent application US 15/488,511
Buschmeier K, Cimiano P, Klinger R (2014) An impact analysis of features in a classification approach to irony detection in product reviews. In: Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, p 42–49
Chan H (2004) ACE: an emergent algorithm for highly uniform cluster formation. In: European workshop on wireless sensor networks. Berlin, p 154–171
Charalampakis B, Spathis D, Kouslis E, Kermanidis K (2016) A comparison between semi-supervised and supervised text mining techniques on detecting irony in Greek political tweets. Eng Appl Artif Intell 51:50–57
Dupuy C, Bach F (2017) Online but accurate inference for latent variable models with local Gibbs sampling. J Mach Learn Res 18(1):4581–4625
Ganesan K, Zhai C (2012) Opinion-based entity ranking. Inf Retr 15(2):116–150
Guo Y, Barnes SJ, Jia Q (2017) Mining meaning from online ratings and reviews: tourist satisfaction analysis using latent Dirichlet allocation. Tour Manag 59:467–483
Hai Z, Cong G, Chang K, Liu W, Cheng P (2014) Coarse-to-fine review selection via supervised joint aspect and sentiment model. In: Proceedings of the 37th International ACM SIGIR conference on Research & development in information retrieval. ACM, p 617–626
Hai Z, Chang K, Kim JJ, Yang CC (2014) Identifying features in opinion mining via intrinsic and extrinsic domain relevance. IEEE Trans Knowl Data Eng 26(3):623–634
Hai Z, Cong G, Chang K, Cheng P, Miao C (2017) Analyzing sentiments in one go: a supervised joint topic modeling approach. IEEE Trans Knowl Data Eng 29(6):1172–1185
Hirschberg J, Manning CD (2015) Advances in natural language processing. Science 349(6245):261–266
Hox JJ, Moerbeek M, Van de Schoot R (2017) Multilevel analysis: techniques and applications. Routledge
Huang S, Niu Z, Shi C (2014) Automatic construction of domain-specific sentiment lexicon based on constrained label propagation. Knowl-Based Syst 56:191–200
Hutchins B (2011) The acceleration of media sports culture: twitter, telepresence and online messaging. Inf Commun Soc 14(2):237–257
Ibrahim IA, Bassiouni M (2017) Improving MapReduce performance with progress and feedback based speculative execution. 2017 IEEE International Conference on InSmart Cloud (SmartCloud). IEEE, p 120–125
Joshi A, Bhattacharyya P, Carman MJ (2017) Automatic sarcasm detection: a survey. ACM Computing Surveys (CSUR) 50(5):73
Kaplan AM, Haenlein M (2010) Users of the world, unite! The challenges and opportunities of social media. Business horizons 53(1):59–68
Karmiloff-Smith A (2018) Précis of beyond modularity: a developmental perspective on cognitive science. In: Thinking developmentally from constructivism to neuroconstructivism. Routledge, p 64–94
Kim Y, Shin H (2017) Finding sentiment dimension in vector space of movie reviews: an unsupervised approach. Journal of Cognitive Science 18(1):85–101
Lau RY, Zhang W, Xu W (2018) Parallel aspect-oriented sentiment analysis for sales forecasting with big data. Prod Oper Manag 27(10):1775–1794
Liao X, Qin Z, Ding L (2017) Data embedding in digital images using critical functions. Signal Process Image Commun 58:146–156
Liao X, Guo S, Yin J, Wang H, Li X, Sangaiah AK (2017) New cubic reference table based image steganography. Multimed Tools Appl 1–18
Liu Z, Wang Y, Dontcheva M, Hoffman M, Walker S, Wilson A (2017) Patterns and sequences: interactive exploration of clickstreams to understand common visitor paths. IEEE Trans Vis Comput Graph 23(1):321–330
Mathwick C, Mosteller J (2017) Online reviewer engagement: a typology based on reviewer motivations. J Serv Res 20(2):204–218
McGranahan N, Swanton C (2017) Clonal heterogeneity and tumor evolution: past, present, and the future. Cell 168(4):613–628
Miura Y, Sakaki S, Hattori K, Ohkuma T (2014) TeamX: a sentiment analyzer with enhanced lexicon mapping and weighting scheme for unbalanced data. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), p 628–632
Morad TY (2006) Performance, power efficiency and scalability of asymmetric cluster chip multiprocessors. IEEE Comput Archit Lett 5(1):14–17
Papadimitriou D, Koutrika G, Velegrakis Y, Mylopoulos J (2017) Finding related forum posts through content similarity over intention-based segmentation. IEEE Trans Knowl Data Eng 29(9):1860–1873
Rajadesingan A, Zafarani R, Liu H (2015) Sarcasm detection on twitter: a behavioral modeling approach. In: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining. ACM, p 97–106
Reyes A, Rosso P, Veale T (2013) A multidimensional approach for detecting irony in twitter. Lang Resour Eval 47(1):239–268
Saif H, He Y, Fernandez M, Alani H (2016) Contextual semantics for sentiment analysis of twitter. Inf Process Manag 52(1):5–19
Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 1:1–1
Smith AN, Fischer E, Yongjian C (2012) How does brand-related user-generated content differ across YouTube, Facebook, and twitter? J Interact Mark 26(2):102–113
Soleymani M, Garcia D, Jou B, Schuller B, Chang SF (2017) Pantic M. a survey of multimodal sentiment analysis. Image Vis Comput 65:3–14
Tang J, Wu W, Qin X, Feng Y (2016) Structural analysis methods for differential algebraic equations via fixed-point iteration. J Comput Theor Nanosci 13(10):7705–7711
Tzoreff E, Weiss AJ (2017) Expectation-maximization algorithm for direct position determination. Signal Process 133:32–39
Venkatesh S (1997) A steady-state throughput analysis of cluster tools: dual-blade versus single-blade robots. IEEE Trans Semicond Manuf 10(4):418–424
Wang P, Xu B, Xu J, Tian G, Liu CL, Hao H (2016) Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification. Neurocomputing 174:806–814
You L, Tunçer B (2016) Exploring public sentiments for livable places based on a crowd-calibrated sentiment analysis mechanism. In: 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, p 693–700
You L, Tunçer B (2016) Exploring the utilization of places through a scalable “Activities in Places” analysis mechanism. 2016 IEEE International Conference on In Big Data (Big Data). IEEE, p 3563–3572
You L, Tunçer B, Xing H (2018) Harnessing multi-source data about public sentiments and activities for informed design. IEEE Trans Knowl Data Eng
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Basha, S.M., Rajput, D.S. A roadmap towards implementing parallel aspect level sentiment analysis. Multimed Tools Appl 78, 29463–29492 (2019). https://doi.org/10.1007/s11042-018-7093-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-7093-z