Abstract
Satisfaction Detection is one of the most common issues that impact the business world. So, this study aims to suggest an application that detects the Satisfaction tone that leads to customer happiness for Big Data that came out from online businesses on social media, in particular, Facebook and Twitter, by using two famous methods, machine learning and deep learning (DL) techniques.There is a lack of datasets that are involved with this topic. Therefore, we have collected the dataset from social media. We have simplified the concept of Big Data analytics for business on social media using three of the most famous Natural Language Processing tools, stemming, normalization, and stop word removal. To evaluate the performance of the classifiers, we calculated F1-measure, Recall, and Precision measures. The result showed superiority for the Random Forest classifier the highest value of F1-measure with (99.1%). The best result achieved without applying pre-processing techniques, through Support Vector Machine with F1-measure (93.4%). On the other hand, we apply DL techniques, and we apply the feature extraction method, which includes Word Embedding and Bag of Words on the dataset. The results showed superiority for the Deep Neural Networks DNN algorithm.
Similar content being viewed by others
Data availability
The data set used in the work will be available upon request
References
Assad, C.A.R.: Building a generic big data analytics framework for e-commerce in developing countries. Int. J. Digit. Inf. Wirel. Commun. 8(4), 280–288 (2018)
Zakir, J., Seymour, T., Berg, K.: Big data analytics. Issues Inf. Syst. 16(2), 81–90 (2015)
Sahatiya, P.: Big data analytics on social media data: a literature review. Int. Res. J. Eng. Technol. 5(2), 189–192 (2018)
Duwairi, R.M., Qarqaz, I.: Arabic sentiment analysis using supervised classification. In: 2014 International Conference on Future Internet of Things and Cloud, pp. 579–583. IEEE, Barcelona (2014)
Alomari, K.M., ElSherif, H.M., Shaalan, K.: Arabic tweets sentimental analysis using machine learning. In: International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, pp. 602–610. Springer, Cham (2017)
Hussien, W.A., Tashtoush, Y.M., Al-Ayyoub, M., Al-Kabi, M.N.: Are emoticons good enough to train emotion classifiers of Arabic tweets? In: 2016 7th International Conference on Computer Science and Information Technology (CSIT), pp. 1–6. Amman, IEEE (2016)
Hadi, W.: Classification of Arabic social media data. Adv. Comput. Sci. Technol. 8(1), 29–34 (2015)
Alabbas, W., Al Khateeb, H.M., Mansour, A., Epiphaniou, G., Frommholz, I.: Classification of colloquial Arabic tweets in real-time to detect high-risk floods. In: International Conference On Social Media, Wearable And Web Analytics (Social Media), pp. 1–8. IEEE, London (2017)
Al-Harbi, W.A., Emam, A.: Effect of Saudi dialect preprocessing on Arabic sentiment analysis. Int. J. Adv. Comput. Technol. 4(6), 91–99 (2015)
Al-Khatib, A., El-Beltagy, S.R.: Emotional tone detection in Arabic tweets. In: International Conference on Computational Linguistics and Intelligent Text Processing, pp. 105–114. Springer, Cham (2017)
Abdullah, M., AlMasawa, M., Makki, I., Alsolmi, M., Mahrous, S.: Emotions extraction from Arabic tweets. Int. J. Comput. Appl. 42(7), 661–675 (2020)
Elbes, M., Aldajah, A., Sadaqa, O.: P-stemmer or NLTK stemmer for Arabic text classification? In: 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS), pp. 516–520. IEEE, Granada (2019)
Abu Maria, E., Abu Maria, K., Alia, M.: Smart agents in the business information system. ICIC Express Lett. 13, 921–929 (2019)
Rabie, O., Sturm, C.: Feel the heat: emotion detection in Arabic social media content. In: The International Conference on Data Mining, Internet Computing, and Big Data (BigData2014), pp. 37–49. Kuala Lumpur, Citeseer (2014)
DoniaGamal, M.A., El-Horbaty, E.-S.M., Salem, A.-B.: Opinion mining for Arabic dialects on twitter. Egypt. Comput. Sci. J. 42(4), 52–61 (2018)
Elghazaly, T., Mahmoud, A., Hefny, H. A.: “Political sentiment analysis using twitter data,” in Proceedings of the International Conference on Internet of things and Cloud Computing, pp. 1–5 (2016)
Barakat, S., Elrashidy, N., Elawady, R., et al.: “Sentimentanalysis for Arabic and English Datasets,” (2019)
Salima, B., Fatiha, B., Ghalem, B.: Sentiment analysis of Arabic tweets: opinion target extraction. In: International Symposium on Modelling and Implementation of Complex Systems, pp. 158–167. Springer, Cham (2018)
Al-Horaibi, L., Khan, M.B.: Sentiment analysis of Arabic tweets using text mining techniques. In: First International Workshop on Pattern Recognition. International Society for Optics and Photonics, Bellingham (2016)
El-Makky, N., Nagi, K., El-Ebshihy, A., Apady, E., Hafez, O., Mostafa, S., Ibrahim, S.: Sentiment analysis of colloquial Arabic tweets. In: ASE BigData, SocialInformatics, PASSAT, BioMedCom: Conference, pp. 1–9. Harvard University, Cambridge (2014)
Kanan, T., Aldaaja, A., Hawashin, B.: Cyber-bullying and cyber-harassment detection using supervised machine learning techniques in Arabic social media contents. J. Internet Technol. 21(5), 1409–1421 (2020)
Gamal, D., Alfonse, M., El-Horbaty, E.-S.M., Salem, A.-B.M.: Twitter benchmark dataset for Arabic sentiment analysis. Int. J. Mod. Educ. Comput. Sci. 11(1), 33 (2019)
Sawhney, R., Manchanda, P., Singh, R., Aggarwal, S.: “A computational approach to feature extraction for identification of suicidal ideation in tweets,” in Proceedings of ACL 2018, Student Research Workshop, pp. 91–98 (2018)
Bhargava, S., Choudhary, S.: “Behavioral analysis of depressed sentimental over twitter: based on supervised machine learning approach,” in Proceedings of 3rd International Conference on Internet of Things and Connected Technologies (ICIoTCT), pp. 26–27 (2018)
O’dea, B., Wan, S., Batterham, P.J., Calear, A.L., Paris, C., Christensen, H.: Detecting suicidality on twitter. Internet Interv. 2(2), 183–188 (2015)
Tadesse, M.M., Lin, H., Xu, B., Yang, L.: Detection of depression-related posts in reddit social media forum. IEEE Access 7, 44883–44893 (2019)
Vioules, M.J., Moulahi, B., Azé, J., Bringay, S.: Detection of suicide-related posts in twitter data streams. IBM J. Res. Dev. 62(1), 7–12 (2018)
Boukil, S., Biniz, M., El Adnani, F., Cherrat, L., El Moutaouakkil, A.E.: Arabic text classification using deep learning technics. Int. J. Grid Distrib. Comput. 11(9), 103–114 (2018)
Heikal, M., Torki, M., El-Makky, N.: Sentiment analysis of Arabic tweets using deep learning. Proc. Comput. Sci. 142, 114–122 (2018)
Ismail, N. H., Liu, N., Du, M., He, Z., Hu, X.: “Using deep neural network to identify cancer survivors living with post-traumatic stress disorder on social media.” in SEPDA@ ISWC, pp. 48–52 (2019)
Rieder, B.: Studying Facebook via data extraction: the Netvizz application. In: Proceedings of the 5th Annual ACM Web Science Conference, pp. 346–355. ACM, New York (2013)
“khoja-stemmer-command-line/stopwords.txt at master. motazsaad/khoja-stemmer-command-line . github,” https://github.com/motazsaad/khoja-stemmer-command-line/blob/master/stopwords.txt. Accessed on 28 Dec 2021
Kanan, T., Sadaqa, O., Almhirat, A., Kanan, E.: Arabic light stemming: A comparative study between p-stemmer, khoja stemmer, and light10 stemmer. In: 2019 Sixth International Conference on Social Networks Analysis, Management and Security (SNAMS), pp. 511–515. IEEE, Granada (2019)
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)
Mohammad, A.H., Alwada’n, T., Al-Momani, O.: Arabic text categorization using support vector machine, naïve bayes and neural network. GSTF J. Comput. 5(1), 1–8 (2016)
El Kourdi, M., Bensaid, A., Rachidi, T.-E.: “Automatic arabic document categorization based on the naïve bayes algorithm,” in Proceedings of the Workshop on Computational Approaches to Arabic Script-Based Languages, pp. 51–58 (2004)
Dasarathy, B.V.: Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Comput. Soc. Tutor. 17(3), 441–458 (1991)
Lukasik, M., Bontcheva, K., Cohn, T., Zubiaga, A., Liakata, M., Procter, R.: “Using gaussian processes for rumour stance classification in social media,” arXiv preprint arXiv:1609.01962, (2016)
Quinlan, R.: C4.5: Programs for Machine Learning. Elsevier, Amsterdam (1993)
Freund, Y., Mason, L.: The alternating decision tree learning algorithm. icml 99, 124–133 (1999)
Zhang, L., Wang, S., Liu, B.: Deep learning for sentiment analysis: a survey. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 8(4), e1253 (2018)
Graves, A., Mohamed, A.-R., Hinton, G.: Speech recognition with deep recurrent neural networks. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6645–6649. IEEE, Vancouver (2013)
Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: “How transferable are features in deep neural networks?” arXiv preprint arXiv:1411.1792, (2014)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM SIGKDD Explor. Newsl. 11(1), 10–18 (2009)
Funding
This work was supported by the Hashemite University and AL Zaytoonah University of Jordan.
Author information
Authors and Affiliations
Contributions
All Authors worked in an equivalent load at all stages to produce this research
Corresponding author
Ethics declarations
Conflict of interest
The authors have not disclosed any competing interests.
Informed consent
I have read and I understand the journal information and have agreed to all mentioned terms and conditions.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kanan, T., Mughaid, A., Al-Shalabi, R. et al. Business intelligence using deep learning techniques for social media contents. Cluster Comput 26, 1285–1296 (2023). https://doi.org/10.1007/s10586-022-03626-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-022-03626-y