skip to main content
10.1145/3291280.3291795acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiaitConference Proceedingsconference-collections
research-article

Sentiment Analysis Based on MapReduce: A survey

Published: 10 December 2018 Publication History

Abstract

Sentiment analysis is the process of analyzing people's sentiments, opinions, evaluations and emotions by studying their written text. It attracts the interest of many researchers, since it is useful for many applications, ranging from decision making to product evaluation to mention a few. Sentiment analysis can be conducted using machine-learning techniques, lexicon-based techniques or hybrid techniques that combines both. As people are more reliant on social networks such as Twitter, this has become a valuable source for sentiment analysis. However, the existence of big data frameworks require adaptation of these techniques to run within such frameworks. This paper reviews sentiment analysis techniques, focusing on the MapReduce-based analysis techniques. We found that the Naïve Bayes algorithm was the most used machine learning technique for extracting sentiments from big datasets because of its high accuracy rates. However, the dictionary-based techniques achieved better results in terms of execution time.

References

[1]
T. White, Hadoop: The Definitive Guide, 4th Edition, California, USA: O'Reilly Media, 2015.
[2]
B. Marr, "/big-data-," 30 9 2015. {Online}. Available: https://www.forbes.com/sites/bernardmarr/2015/09/30/big-data-20-mind-boggling-facts-everyone-must-read/#4d9f09f717b1.
[3]
B. Liu, E. Blasch, Y. Chen, D. Shen and G. Chen, "Scalable Sentiment Classification for Big Data Analysis Using Naïve Bayes Classifier," in 2013 IEEE International Conference on Big Data, USA, 2013.
[4]
Y. Madani, M. Erritali and J. Bengourram, "Sentiment analysis using semantic similarity and Hadoop MapReduce," Knowledge and Information Systems, pp. 1--24, 2018.
[5]
H. Kaur, V. Mangat and N. Nidhi, "A Survey of Sentiment Analysis techniques," International conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud), pp. 921--925, 2017.
[6]
M. Edison and A. Aloysius, "Concepts and Methods of Sentiment Analysis on Big Data," International Journal of Innovative Research in Science, Engineering and Technology, vol. 5, no. 9, pp. 16288--16296, 2016.
[7]
V. Patel, G. Prabhu and K. Bhowmick, "A Survey of Opinion Mining and Sentiment Analysis," International Journal of Computer Applications, vol. 131, no. 1, pp. 24--27, 2015.
[8]
Y. Madani, J. Bengourram, M. Erritali, B. Hssina and M. Birjali, "Adaptive e-learning using Genetic Algorithm and Sentiments Analysis in a Big Data System," (IJACSA) International Journal of Advanced Computer Science and Applications, vol. 8, no. 8, pp. 394--403, 2017.
[9]
W. Medhat, A. Hassan and H. Korashy, "Sentiment analysis algorithms and applications: A survey," Ain Shams Engineering Journal, vol. 5, no. 4, p. 1093--1113.
[10]
M. Biltawi, W. Etaiwi, S. Tedmori, A. Hudaib and A. Awajan, "Sentiment Classification Techniques For Arabic Language: A Survey," in 2016 7th International Conference on Information and Communication Systems (ICICS), Irbid, Jordan, 2016.
[11]
Y. Madani, E. Mohammed and B. Jamaa, "A Parallel Semantic Sentiment Analysis," in 2017 3rd International Conference of Cloud Computing Technologies and Applications (CloudTech), Rabat, 2017.
[12]
O. Appel, F. Chiclana, J. Carter and H. Fujita, "A hybrid approach to the sentiment analysis problem at the sentence level," Knowledge-Based Systems, pp. 110--124, 2016.
[13]
M. Biltawi, G. Al-Naymat and S. Tedmori, "Arabic Sentiment Classification: A Hybrid Approach," in 2017 International Conference on New Trends in Computing Sciences, Amman, Jordan, 2017.
[14]
M. Shiha and S. Ayvaz, "The effects of emoji in sentiment analysis," International Journal of Computer and Electrical Engineering, vol. 9, no. 1, pp. 360--369, 2017.
[15]
M. Huq, A. Ali and A. Rahman, "Sentiment analysis on Twitter data using KNN and SVM," (IJACSA) International Journal of Advanced Computer Science and Applications, vol. 8, no. 6, pp. 19--25, 2017.
[16]
Z. Jianqiang and G. Xiaolin, "Comparison Research on Text Pre-processing Methods on Twitter Sentiment Analysis," in IEEE Access, 2017.
[17]
E. Haddia, X. Liu and Y. Shi, "The role of text pre-processing in sentiment analysis," in Information Technology and Quantitative Management (ITQM2013), 2013.
[18]
S. Hassan, F. Miriam, H. Yulan and A. Harith, "On stopwords, filtering and data sparsity for sentiment analysis of Twitter," LREC 2014, Ninth International Conference on Language Resources and Evaluation, p. 810--817, 2014.
[19]
T. Singh and M. Kumari, "Role of Text Pre-processing in Twitter Sentiment Analysis," Procedia Computer Science, pp. 549--554, 2016.
[20]
W. Sharif, N. Samsudin, M. Deris and R. Nase, "Effect of Negation in Sentiment Analysis," in 2016 Sixth International Conference on Innovative Computing Technology (INTECH), Dublin, Ireland, 2016.
[21]
H. Saif, Y. He, M. Fernandez and H. Alani, "Semantic Patterns for Sentiment Analysis of Twitter," in International Semantic Web Conference, 2014.
[22]
S. Tartir and I. Abdul-Nabi, "Semantic Sentiment Analysis in Arabic Social Media," in Journal of King Saud University --- Computer and Information Sciences, 2017.
[23]
M. Biltawi, W. Etaiwi, S. Tedmori and A. Shaout, "Fuzzy based Sentiment Classification in the Arabic Language," in Intelligent Systems Conference 2018, London, UK, 2018.
[24]
V. Chauhan and A. Shukla, "Sentimental Analysis of Social Networks using MapReduce and Big Data Technologies," IJCSN International Journal of Computer Science and Network, vol. 6, no. 2, pp. 120--130, 2017.
[25]
H. Parveen and S. Pandey, "Sentiment Analysis on Twitter Data-set using Naive Bayes Algorithm," in 2016 2nd International Conference on Applied and Theoretical Computing and Communication Technology (iCATccT), Bangalore, India, 2016.
[26]
I. Ha, B. Back and B. Ahn, "MapReduce Functions to Analyze Sentiment Information from Social Big Data," International Journal of Distributed Sensor Networks, pp. 1--11, 2015.
[27]
R. Ramesh, G. Divya, D. Divya, M. Kurian and V. Vishnuprabha, "Big Data Sentiment Analysis using Hadoop," IJIRST ---International Journal for Innovative Research in Science & Technology, vol. 1, no. 1, pp. 92--96, 2015.
[28]
C. González, J. García-Nieto, I. Navas-Delgado and J. Aldana-Monte, "A Fine Grain Sentiment Analysis with Semantics in Tweets," International Journal of Interactive Multimedia and Artificial Intelligence, vol. 3, no. 6, pp. 22--28, 2016.
[29]
V. N. Khuc, C. Shivade, R. Ramnath and J. Ramanathan, "Towards building large-scale distributed systems for Twitter," Proceedings of the 27th Annual ACM Symposium on Applied Computing (SAC '12), p. 459--464, March 2012.

Cited By

View all
  • (2024)TOOL SUPPORT FOR DATA-DRIVEN SERVICE INNOVATION: A SYSTEMATIC LITERATURE REVIEWInternational Journal of Innovation Management10.1142/S136391962430004628:07n08Online publication date: 26-Sep-2024
  • (2024)ML-based Expert Products Scoring System2024 Progress in Applied Electrical Engineering (PAEE)10.1109/PAEE63906.2024.10701451(1-5)Online publication date: 24-Jun-2024
  • (2022)A Comparative Study Of Sentiment Analysis For Big Data On Hadoop2022 International Telecommunications Conference (ITC-Egypt)10.1109/ITC-Egypt55520.2022.9855697(1-7)Online publication date: 26-Jul-2022
  • Show More Cited By

Index Terms

  1. Sentiment Analysis Based on MapReduce: A survey
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Other conferences
          IAIT '18: Proceedings of the 10th International Conference on Advances in Information Technology
          December 2018
          145 pages
          ISBN:9781450365680
          DOI:10.1145/3291280
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          In-Cooperation

          • KMUTT: King Mongkut's University of Technology Thonburi

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 10 December 2018

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Big Data
          2. Dictionary Based Analysis
          3. Machine Learning
          4. MapReduce Framework
          5. Naïve Bayes
          6. Sentiment Analysis

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          IAIT 2018

          Acceptance Rates

          IAIT '18 Paper Acceptance Rate 20 of 47 submissions, 43%;
          Overall Acceptance Rate 20 of 47 submissions, 43%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)7
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 02 Mar 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)TOOL SUPPORT FOR DATA-DRIVEN SERVICE INNOVATION: A SYSTEMATIC LITERATURE REVIEWInternational Journal of Innovation Management10.1142/S136391962430004628:07n08Online publication date: 26-Sep-2024
          • (2024)ML-based Expert Products Scoring System2024 Progress in Applied Electrical Engineering (PAEE)10.1109/PAEE63906.2024.10701451(1-5)Online publication date: 24-Jun-2024
          • (2022)A Comparative Study Of Sentiment Analysis For Big Data On Hadoop2022 International Telecommunications Conference (ITC-Egypt)10.1109/ITC-Egypt55520.2022.9855697(1-7)Online publication date: 26-Jul-2022
          • (2019)On using MapReduce to scale algorithms for Big Data analytics: a case studyJournal of Big Data10.1186/s40537-019-0269-16:1Online publication date: 30-Nov-2019

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media