Abstract
Nowadays, the exponential diffusion of information forces Machine Learning algorithms to take relations into account in addition to data, which are no longer independent. We propose a Bayesian ensemble learning methodology named Relational Bayesian Model Averaging (RBMA) which, in addition to a probabilistic ensemble voting, takes relations into account. We tested the RBMA on a benchmark dataset for Sentiment Analysis in social networks and we compared it with its previous non-relational variant and we show that the introduction of relations significantly improves the performance of classification. Moreover, we propose a model for making predictions when new data becomes available modifying and increasing the underneath graph of relations on which the RBMA was trained.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Weitz, S.: Search: How the Data Explosion Makes Us Smarter. GreenHouse Collection. Routledge, London (2014)
Fersini, E., Pozzi, F.A., Messina, E.: Approval network: a novel approach for sentiment analysis in social networks. World Wide Web 20(4), 831–854 (2016). https://doi.org/10.1007/s11280-016-0419-8
Lu, Q., Getoor, L.: Link-based classification. In: ICML 2003: Proceedings of the Twentieth International Conference on International Conference on Machine Learning, pp. 496–503 (2003)
Rokach, L.: Ensemble-based classifiers. Artif. Intell. Rev. 33(1), 1–39 (2010)
Fersini, E., Messina, E., Pozzi, F.A.: Sentiment analysis: Bayesian ensemble learning. Decis. Support Syst. 68, 26–38 (2014)
Nickel, M., Murphy, K., Tresp, V., Gabrilovich, E.: A review of relational machine learning for knowledge graphs. Proc. IEEE 104(1), 11–33 (2016)
Mooney, R.: Statistical relational learning and script induction for textual inference. Technical report AFRL-RI-RS-TR-2017-243, Air force research laboratory information dictorate (2017)
Ramanan, N., et al.: Structure learning for relational logistic regression: an ensemble approach. In: Proceedings of the Sixteenth International Conference on Principles of Knowledge Representation and Reasoning (KR 2018), pp. 661–662 (2018)
Gomes, H.M., Barddal, J.P., Enembreck, F., Bifet, A.: A survey on ensemble learning for data stream classification. ACM Computing Surveys 50(2), 1–36 (2017). Article 23
Sagi, O.: L-Rokach: ensemble learning: a survey. WIREs Data Min. Knowl. Discov. (2018). https://doi.org/10.1002/widm.1249
Preisach, C., Schmidt-Thieme, L.: Ensembles of relational classifiers. Knowl. Inf. Syst. 14(3), 249–272 (2008)
Li, Y., Zhong, S., Zhong, Q., Shi, K.: Lithium-ion battery state of health monitoring based on ensemble learning. IEEE Access 7, 8754–8762 (2019)
Bablani, A., Edla, D.R., Tripathi, D., Kuppili, V.: An efficient concealed information test: EEG feature extraction and ensemble classification for lie identification. Mach. Vis. Appl. 30(5), 813–832 (2019)
Alfred, R., Shin, K.K., Chin, K.O., Lau, H.K., Hijazi, M.H.A.: k-NN ensemble DARA approach to learning relational. In: Abawajy, J.H., Othman, M., Ghazali, R., Deris, M.M., Mahdin, H., Herawan, T. (eds.) Proceedings of the International Conference on Data Engineering 2015 (DaEng-2015). LNEE, vol. 520, pp. 203–212. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-1799-6_22
Hoeting, J.A., Madigan, D., Raftery, A.E., Volinsky, C.T.: Bayesian model averaging: a tutorial. Stat. Sci. 14(4), 382–401 (1999)
Wasserman, L.: Bayesian model selection and model averaging. J. Math. Psychol. 44, 92–107 (2000)
Raftery, A.E., Gneiting, T., Balabdaoui, F., Polakowski, M.: Using Bayesian model averaging to calibrate forecast ensembles. Mon. Weather Rev. 133(5), 1155–1174 (2005)
Pozzi, F.A., Fersini, E., Messina, E.: Bayesian model averaging and model selection for polarity classification. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds.) NLDB 2013. LNCS, vol. 7934, pp. 189–200. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38824-8_16
Pozzi, F.A., Maccagnola, D., Fersini, E., Messina, E.: Enhance user-level sentiment analysis on microblogs with approval relations. In: Baldoni, M., Baroglio, C., Boella, G., Micalizio, R. (eds.) AI*IA 2013. LNCS (LNAI), vol. 8249, pp. 133–144. Springer, Cham (2013). https://doi.org/10.1007/978-3-319-03524-6_12
Witten, I.H., Frank, E., Hall, M.A., Pal, C.J.: Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, Burlington (2016)
Jurafsky, D., Martin, J.H.: Speech and Language Processing. Series in Artificial Intelligence. Prentice Hall, Upper Saddle River (2008)
Macskassy, S.A., Provost, F.: Classification in networked data: a toolkit and a univariate case study. J. Mach. Learn. Res. 8, 935–983 (2007)
Swamynathan, M.: Mastering Machine Learning with Python in Six Steps: A Practical Implementation Guide to Predictive Data Analytics Using Python (2017)
Università degli Studî di Milano-Bicocca: The MIND laboratory. http://www.mind.disco.unimib.it/gallery/index.asp?cat=92&level=1&lang=en
Acknowledgements
This work has been partially funded by MISE (Ministero Italiano dello Sviluppo Economico) under the project “SMARTCAL – Smart Tourism in Calabria” (F/050142/01-03/x32).
Moreover, the authors are very grateful to Sofus A. Macskassy for his generosity in helping with the NetKit toolkit.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Baldi, M.M., Fersini, E., Messina, E. (2020). Relational Bayesian Model Averaging for Sentiment Analysis in Social Networks. In: Nicosia, G., et al. Machine Learning, Optimization, and Data Science. LOD 2020. Lecture Notes in Computer Science(), vol 12565. Springer, Cham. https://doi.org/10.1007/978-3-030-64583-0_27
Download citation
DOI: https://doi.org/10.1007/978-3-030-64583-0_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-64582-3
Online ISBN: 978-3-030-64583-0
eBook Packages: Computer ScienceComputer Science (R0)