Abstract
During disaster, detecting tweets related to the target event is a challenging task. Earthquake, floods, tsunami, etc., are the examples for target event. Prior to several studies have been made on earthquake detection. The event contains many categories (classes) of information such as resources, infrastructure damage and helping requests. Different organizations need different categories (classes) of information. There have been only a few studies on the detection of a certain kind of classes and how they are interrelated during the disaster. It is difficult to design features for discriminating and detecting specific classes. Hence, this paper focuses on detection of medical resource (requirement and availability) tweets class during disaster to help medical organizations and victims. For this purpose, the Majority Voting-based Ensemble method is proposed for the detection of medical resource tweets during a disaster. It uses informative features and is fed to various classifiers such as bagging, AdaBoost, gradient boost, random forest and SVM classifiers. The output of different classifiers is combined by majority voting to detect medical resource tweets during the disaster. The proposed informative features are tested on different classifiers such as bagging, AdaBoost, gradient boosting, random forest and SVM classifiers by using the real-time Nepal earthquake dataset. And the results are compared with standard baseline BOW model. The classifiers considered in this paper with the proposed informative features outperform BOW model. The dimensionality, sparsity and computational time for features are less in case of the proposed informative features as compared with BOW model. The proposed method outperforms the state -of the art for Nepal and Italy Earthquake datasets on different parameters. It detects 82.4% of tweets that are correctly related to medical resources during a disaster.
Similar content being viewed by others
References
Alam F, Imran M, Ofli F (2019) Crisisdps: crisis data processing services. In: Proceedings of the 16th international conference on information systems for crisis response and management (ISCRAM). ISCRAM Association, New York
Alpaydin E (2014) Introduction to machine learning. MIT Press, London
Bao Y, Yi C, Xue Y, Dong Y (2015) Precise modeling rumor propagation and control strategy on social networks. In: Przemyslaw K, Chawla NV (eds) Applications of social media and social network analysis. Springer, Berlin, pp 77–102
Basu M, Ghosh S, Jana A, Bandyopadhyay S, Singh R (2017a) Medical requirements during a natural disaster: a case study on Whatsapp chats among medical personnel during the 2015 Nepal earthquake. Disaster Med Public Health Preparedness 11(6):652–655
Basu M, Ghosh S, Jana A, Bandyopadhyay S, Singh R (2017b) Resource mapping during a natural disaster: a case study on the 2015 Nepal earthquake. Int J Disaster Risk Reduct 24:24–31
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Charitonidis C, Rashid A, Taylor PJ (2017) Predicting collective action from micro-blog data. In: Prediction and inference from social networks and social media. Springer, Berlin, pp 141–170
D’Andrea E, Ducange P, Lazzerini B, Marcelloni F (2015) Real-time detection of traffic from twitter stream analysis. IEEE Trans Intell Transp Syst 16(4):2269–2283
Dietterich TG et al (2000) Ensemble methods in machine learning. Mult Classif Syst 1857:1–15
Freund Y, Schapire RE (1995) A decision-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory. Springer, Berlin, pp 23–37
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 2001:1189–1232
Friedman JH, Hastie T, Tibshirani R (2001) The elements of statistical learning, vol 1. Springer, New York
Ghosh S, Desarkar MS (2018) Class specific TF-IDF boosting for short-text classification: application to short-texts generated during disasters. In: Companion proceedings of the web conference 2018. ACM, New York, pp 1629–1637
Ghosh S, Ghosh K (2016) Overview of the FIRE 2016 microblog track: information extraction from microblogs posted during disasters. In: Working notes of FIRE 2016—forum for Information Retrieval Evaluation, Kolkata, India, December 7–10, 2016, pp 56–61. http://ceur-ws.org/Vol-1737/T2-1.pdf
Ghosh S, Ghosh K, Chakraborty T, Ganguly D, Jones G, Moens MF (2017) First international workshop on exploitation of social media for emergency Reliefand preparedness (SMERP). In: Jose JM et al (eds) Proceedings of the 39th European conference on IR research. ECIR 2017, LNCS 10193, ECIR 2017. Springer, Berlin, pp 779–783 https://doi.org/10.1007/978-3-319-56608-5
Ho TK, Hull JJ, Srihari SN (1994) Decision combination in multiple classifier systems. IEEE Trans Pattern Anal Mach Intell 16(1):66–75
Imran M, Castillo C, Lucas J, Meier P, Vieweg S (2014) AIDR: artificial intelligence for disaster response. In: Proceedings of the 23rd international conference on world wide web. ACM, Berlin, pp 159–162
Imran M, Elbassuoni S, Castillo C, Diaz F, Meier P (2013a) Extracting information nuggets from disaster-related messages in social media. In: Iscram, pp 1–10
Imran M, Elbassuoni S, Castillo C, Diaz F, Meier P (2013b) Practical extraction of disaster-relevant information from social media. In: Proceedings of the 22nd international conference on world wide web. ACM, New York, pp 1021–1024
Imran M, Mitra P, Castillo C (2016) Twitter as a lifeline: human-annotated twitter corpora for NLP of crisis-related messages. Preprint, pp 1638–1643. arXiv:1605.05894
Janssens O, Van de Walle R, Van Hoecke S (2015) A learning based approach for real-time emotion classification of tweets. In: Applications of social media and social network analysis. Springer, Berlin, pp 125–142
Khosla P, Basu M, Ghosh K, Ghosh S (2017) Microblog retrieval for post-disaster relief: applying and comparing neural IR models. Preprint arXiv:1707.06112
Kibanov M, Stumme G, Amin I, Lee JG (2017) Mining social media to inform Peatland fire and haze disaster management. Soc Netw Anal Min 7(1):30
Kušen E, Strembeck M, Conti M (2018) Emotional valence shifts and user behavior on twitter, Facebook, and Youtube. In: IEEE/ACM international conference on advances in social networks analysis and mining. Springer, Berlin, pp 63–83
Liatsis P (2002) Recent trends in multimedia information processing. In: Proceedings of the 9th international workshop on systems, signals and image processing: Manchester Town Hall, UK, 7–8 November 2002. World Scientific, Singapore
Liu B (2007) Web data mining: exploring hyperlinks, contents, and usage data. Springer, Berlin
Madichetty S, Sridevi M (2019) Disaster damage assessment from the tweets using the combination of statistical features and informative words. Soc Netw Anal Min 9(1):42
Nazer TH, Morstatter F, Dani H, Liu H (2016) Finding requests in social media for disaster relief. In: 2016 IEEE/ACM international conference on, advances in social networks analysis and mining (ASONAM). IEEE, New York, pp 1410–1413
Nguyen DT, Mannai KAA, Joty S, Sajjad H, Imran M, Mitra P (2016) Rapid classification of crisis-related data on social networks using convolutional neural networks. Preprint arXiv:1608.03902
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830
Purohit H, Castillo C, Diaz F, Sheth A, Meier P (2013) Emergency-relief coordination on social media: automatically matching resource requests and offers. First Monday. https://doi.org/10.5210/fm.v19i1.4848
Purohit H, Castillo C, Pandey R (2020) Ranking and grouping social media requests for emergency services using serviceability model. Soc Netw Anal Min 10(1):1–17
Rudra K, Sharma A, Ganguly N, Ghosh S (2016) Characterizing communal microblogs during disaster events. In: 2016 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM). IEEE, New York, pp 96–99
Rudra K, Sharma A, Ganguly N, Ghosh S (2018a) Characterizing and countering communal microblogs during disaster events. IEEE Trans Comput Soc Syst 5(2):403–417
Rudra K, Ganguly N, Goyal P, Ghosh S (2018b) Extracting and summarizing situational information from the Twitter social media during disasters. ACM Trans Web (TWEB) 12(3):17
Sakaki T, Okazaki M, Matsuo Y (2013) Tweet analysis for real-time event detection and earthquake reporting system development. IEEE Trans Knowl Data Eng 25(4):919–931
Sreenivasulu M, Sridevi M (2017) Mining informative words from the tweets for detecting the resources during disaster. In: International conference on mining intelligence and knowledge exploration. Springer, Berlin, pp 348–358
Varga I, Sano M, Torisawa K, Hashimoto C, Ohtake K, Kawai T, Oh JH, De Saeger S (2013) Aid is out there: looking for help from tweets during a large scale disaster. ACL 1:1619–1629
Verma S, Vieweg S, Corvey WJ, Palen L, Martin JH, Palmer M, Schram A, Anderson KM (2011) Natural language processing to the rescue? Extracting “situational awareness” tweets during mass emergency. Citeseer, London, pp 385–392
Vieweg S, Castillo C, Imran M (2014) Integrating social media communications into the rapid assessment of sudden onset disasters. In: International conference on social informatics. Springer, Berlin, pp 444–461
Woods K, Kegelmeyer WP, Bowyer K (1997) Combination of multiple classifiers using local accuracy estimates. IEEE Trans Pattern Anal Mach Intell 19(4):405–410
Yadav M, Rahman Z (2016) The social role of social media: the case of Chennai rains-2015. Soc Netw Anal Min 6(1):101
Yin J, Lampert A, Cameron M, Robinson B, Power R (2012) Using social media to enhance emergency situation awareness. IEEE Intell Syst 27(6):52–59
Zhang D, Tsai JJ (2005) Machine learning applications in software engineering, vol 16. World Scientific, Singapore
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Madichetty, S., M, S. Identification of medical resource tweets using Majority Voting-based Ensemble during disaster. Soc. Netw. Anal. Min. 10, 66 (2020). https://doi.org/10.1007/s13278-020-00679-y
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-020-00679-y