Abstract
Twitter is a social media platform that has grown rapidly. With millions of Twitter messages sent every day, developing new techniques for detecting spammers has become increasingly important, because legitimate users are affected by spam in the form of unwanted URLs, irrelevant messages, and similar content. Another active research topic is sentiment analysis, which is based on the individual tweets sent by users and on opinion mining of customer reviews. Sentiment analysis most commonly relies on natural language processing. Text collected from users’ tweets is analyzed through opinion mining and automatic sentiment analysis oriented toward a ternary classification: “positive,” “neutral,” and “negative.” Because tweets are short and unstructured and contain misspellings, slang, and abbreviations, determining the sentiment of Twitter data is particularly challenging. In this paper, we collected 600 million public tweets using a URL-based security tool and applied feature generation for sentiment analysis. The ternary classification is carried out after a preprocessing step, and a sentiment result is obtained for the tweets sent by users. We use a hybridization technique that combines two optimization algorithms, particle swarm optimization and a genetic algorithm, with one machine learning classifier, a decision tree, to improve the classification accuracy of sentiment analysis. The results are compared with previous work, and the proposed method gives better results than the other classifiers considered.
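To make the preprocessing challenge concrete, the following is a minimal sketch of tweet normalization of the kind the abstract alludes to (URLs, misspellings by letter repetition, slang, and abbreviations). It is an illustration under stated assumptions, not the authors' pipeline: the function name `preprocess`, the regular expressions, and the tiny `ABBREVIATIONS` table are all hypothetical, and the particle swarm, genetic algorithm, and decision tree stages are not shown.

```python
import re
from typing import List

# Hypothetical abbreviation table for illustration only; the resources
# actually used in the paper are not described in the abstract.
ABBREVIATIONS = {"u": "you", "gr8": "great", "thx": "thanks"}

URL_RE = re.compile(r"https?://\S+|www\.\S+")   # links
MENTION_RE = re.compile(r"@\w+")                # @user mentions
REPEAT_RE = re.compile(r"(.)\1{2,}")            # 3+ repeats of one character


def preprocess(tweet: str) -> List[str]:
    """Normalize a raw tweet into lowercase word tokens."""
    text = URL_RE.sub(" ", tweet)            # drop URLs
    text = MENTION_RE.sub(" ", text)         # drop user mentions
    text = text.replace("#", " ")            # keep the hashtag word, drop '#'
    text = REPEAT_RE.sub(r"\1\1", text.lower())  # "sooooo" -> "soo"
    tokens = re.findall(r"[a-z0-9']+", text)     # simple word tokenizer
    # Expand abbreviations that appear in the (assumed) table.
    return [ABBREVIATIONS.get(tok, tok) for tok in tokens]
```

Tokens produced this way could then feed a feature-generation step and a ternary classifier; how the paper performs those steps is not specified in the abstract.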
Ethics declarations
Conflict of interest
This statement certifies that all authors have seen and approved the submitted manuscript. We warrant that the article is the authors’ original work, has not been published previously, and is not under consideration for publication elsewhere. On behalf of all co-authors, the corresponding author bears full responsibility for the submission. The authors declare that there is no conflict of interest.
Cite this article
Nagarajan, S.M., Gandhi, U.D. Classifying streaming of Twitter data based on sentiment analysis using hybridization. Neural Comput & Applic 31, 1425–1433 (2019). https://doi.org/10.1007/s00521-018-3476-3
DOI: https://doi.org/10.1007/s00521-018-3476-3