Abstract
In this work, we conduct a study where we compare the usage of a single classifier and the usage of a majority vote system composed of multiple classifiers. Each classifier was created using machine learning techniques and trained with real data. For the domain, we considered textual expressions present on online social networks, which can be volatile in characters count. This work seeks to prove two hypothesis: (1) the usage of a vote module that considers the output of an odd number of classifiers, will address the advantages characteristics of each classifier while mitigating their disadvantages; (2) the usage of a vote system will enable classifiers to correctly classify new labels that were not defined in the training process (like classifying neutral sentiment in addition to positive and negative). Our vote module is composed by a Naïve Bayes, a Logistic Regression, and a Support Vector Machine classifier. The tests that we conducted consider the online social textual content that varies in the character counting and our results suggests that there is no need for a vote system when considering the online social content, like comments, that is typically informal and do not surpass the count of 500 characters.
This work has been supported by FCT - Fundação para a Ciência e Tecnologia within the Project Scope: UIDB/04728/2020.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Barbosa, R., Santos, R.: Online social networks as sensors in smart environments. In: 2016 Global Information Infrastructure and Networking Symposium (GIIS). IEEE, October 2016. https://doi.org/10.1109/giis.2016.7814950
Barbosa, R., Santos, R.: Layer by layer: a multidimensional approach to online social profiles. In: 2017 Computing Conference. IEEE, July 2017. https://doi.org/10.1109/sai.2017.8252202
Barbosa, R., Santos, R.: Multidimensional approach to online interest networks. In: DEStech Transactions on Computer Science and Engineering (CMEE), January 2017. https://doi.org/10.12783/dtcse/cmee2016/5372
Bespalov, D., Bai, B., Qi, Y., Shokoufandeh, A.: Sentiment classification based on supervised latent n-gram analysis. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management. CIKM 2011, pp. 375–382. ACM, New York (2011). https://doi.org/10.1145/2063576.2063635
Catal, C., Nangir, M.: A sentiment classification model based on multiple classifiers. Appl. Soft Comput. 50, 135–141 (2017). https://doi.org/10.1016/j.asoc.2016.11.022
Celli, F.: Adaptive Personality Recognition from Text. LAP Lambert Academic Publishing (2013). https://www.ebook.de/de/product/20375194/fabio_celli_adaptive_personality_recognition_from_text.html
Chopra, D., Joshi, N., Mathur, I.: Mastering Natural Language Processing with Python. Packt Publishing, Birmingham (2016)
Harrison: Positive and negative short reviews dataset, May 2013. https://pythonprogramming.net/static/downloads/short_reviews/
Jackson, D.: Know your limit: The ideal length of every social media post. https://sproutsocial.com/insights/social-media-character-counter//
Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)
Koppel, M., Schler, J.: The importance of neutral examples for learning sentiment. Comput. Intell. 22(2), 100–109 (2006). https://doi.org/10.1111/j.1467-8640.2006.00276.x. https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1467-8640.2006.00276.x
Kotzias, D., Denil, M., De Freitas, N., Smyth, P.: From group to individual labels using deep features. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 597–606. ACM (2015)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing. EMNLP 2002, vol. 10, pp. 79–86. Association for Computational Linguistics, Stroudsburg (2002). https://doi.org/10.3115/1118693.1118704
Patodkar, V., I.R., S.: Twitter as a corpus for sentiment analysis and opinion mining. IJARCCE 5, 320–322 (2016). https://doi.org/10.17148/IJARCCE.2016.51274
Platt, J.C.: Advances in Kernel Methods: Support Vector Learning. The MIT Press, Cambridge (1998). https://www.amazon.com/Advances-Kernel-Methods-Support-Learning/dp/0262194163?SubscriptionId=0JYN1NVW651KCA56C102&tag=techkie-20&linkCode=xm2&camp=2025&creative=165953&creativeASIN=0262194163
Severyn, A., Moschitti, A.: Unitn: training deep convolutional neural network for Twitter sentiment classification. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp. 464–469 (2015)
Socher, R.,et al.: Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1631–1642 (2013)
Sotiropoulos, D.N., Pournarakis, D.E., Giaglis, G.M.: SVM-based sentiment classification: a comparative study against state-of-the-art classifiers. Int. J. Comput. Intell. Stud. 6(1), 52–67 (2017)
Tang, D., Wei, F., Qin, B., Liu, T., Zhou, M.: Coooolll: a deep learning system for twitter sentiment classification. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014), pp. 208–212 (2014)
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, Heidelberg (1998). https://www.amazon.com/Nature-Statistical-Learning-Theory/dp/0387945598?SubscriptionId=0JYN1NVW651KCA56C102&tag=techkie-20&linkCode=xm2&camp=2025&creative=165953&creativeASIN=0387945598
Wawre, S.V., Deshmukh, S.N.: Sentiment classification using machine learning techniques. Int. J. Sci. Res. (IJSR) 5(4), 819–821 (2016)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Barbosa, R., Santos, R., Novais, P. (2020). Should You Use a Vote Module for Sentiment Classification of Online Social Text?. In: De La Prieta, F., et al. Highlights in Practical Applications of Agents, Multi-Agent Systems, and Trust-worthiness. The PAAMS Collection. PAAMS 2020. Communications in Computer and Information Science, vol 1233. Springer, Cham. https://doi.org/10.1007/978-3-030-51999-5_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-51999-5_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51998-8
Online ISBN: 978-3-030-51999-5
eBook Packages: Computer ScienceComputer Science (R0)