research-article

Cross-lingual Perspectives about Crisis-Related Conversations on Twitter

Author:

Johnny Torres, Carmen VacaAuthors Info & Claims

WWW '19: Companion Proceedings of The 2019 World Wide Web Conference

Pages 255 - 261

https://doi.org/10.1145/3308560.3316799

Published: 13 May 2019 Publication History

Abstract

The role of social networks during natural disasters is becoming crucial to share relevant information and coordinate relief actions. With the reach of the social networks, any user around the world has the possibility of interact in crisis-events as these unfold. A large part of the information posted during a disaster uses the native language where the disaster occurred. However, there are also users from other parts of the world who can comment about the event, often in another language. In this work, we conducted a study of crisis-related tweets about the earthquake that occurred in Ecuador in April 2016. To that end, we introduce a new annotated dataset in both Spanish and English languages with approximately 8K tweets; half of them belong to conversations. We evaluate several neural architectures to identify crisis-related tweets in a multi-lingual setting, and we found that deep contextual multi-lingual embeddings outperform other strong baseline models. We then explore the type of conversations that occur from the perspective of different languages. The results show that certain types of conversations occur more in the native language and others in a foreign language. Conversations from foreign countries seek to gather situation awareness and give emotional support, while in the affected country the conversations aim mainly to humanitarian aid.

References

[1]

Adam Acar and Yuya Muraki. 2011. Twitter for crisis communication: lessons learned from Japan’s tsunami disaster. International Journal of Web Based Communities 7, 3(2011), 392–402.

Digital Library

[2]

Alan Akbik, Duncan Blythe, and Roland Vollgraf. 2018. Contextual String Embeddings for Sequence Labeling. In COLING 2018, 27th International Conference on Computational Linguistics. 1638–1649.

[3]

Firoj Alam, Shafiq Joty, and Muhammad Imran. 2018. Domain Adaptation with Adversarial Training and Graph Embeddings. arXiv preprint arXiv:1805.05151(2018).

[4]

Firoj Alam, Shafiq Joty, and Muhammad Imran. 2018. Graph Based Semi-Supervised Learning with Convolution Neural Networks to Classify Crisis Related Tweets. In Twelfth International AAAI Conference on Web and Social Media.

[5]

Steven Bird, Ewan Klein, and Edward Loper. 2009. Natural language processing with Python: analyzing text with the natural language toolkit. ” O’Reilly Media, Inc.”.

Digital Library

[6]

Gregoire Burel and Harith Alani. 2018. Crisis Event Extraction Service (CREES)-Automatic Detection and Classification of Crisis-related Content on Social Media. (2018).

[7]

Mark A Cameron, Robert Power, Bella Robinson, and Jie Yin. 2012. Emergency situation awareness from twitter for crisis management. In Proceedings of the 21st International Conference on World Wide Web. ACM, 695–698.

Digital Library

[8]

Carlos Castillo. 2016. Big crisis data: social media in disasters and time-critical situations. Cambridge University Press.

Digital Library

[9]

Joseph L Fleiss. 1971. Measuring nominal scale agreement among many raters.Psychological bulletin 76, 5 (1971), 378.

[10]

Jeremy Howard and Sebastian Ruder. 2018. Universal Language Model Fine-tuning for Text Classification. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1. 328–339.

[11]

Muhammad Imran, Carlos Castillo, Fernando Diaz, and Sarah Vieweg. 2015. Processing social media messages in mass emergency: A survey. ACM Computing Surveys (CSUR) 47, 4 (2015), 67.

Digital Library

[12]

Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier, and Sarah Vieweg. 2014. AIDR: Artificial intelligence for disaster response. In Proceedings of the 23rd International Conference on World Wide Web. ACM, 159–162.

Digital Library

[13]

Muhammad Imran, Prasenjit Mitra, and Carlos Castillo. 2016. Twitter as a lifeline: Human-annotated twitter corpora for NLP of crisis-related messages. arXiv preprint arXiv:1605.05894(2016).

[14]

Muhammad Imran, Prasenjit Mitra, and Jaideep Srivastava. 2016. Cross-language domain adaptation for classifying crisis-related short messages. In 13th International Conference on Information Systems for Crisis Response and Management, ISCRAM 2016. Information Systems for Crisis Response and Management, ISCRAM.

[15]

Prashant Khare, Grégoire Burel, Diana Maynard, and Harith Alani. 2018. Cross-Lingual Classification of Crisis Data. In International Semantic Web Conference. Springer, 617–633.

[16]

Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980(2014).

[17]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781(2013).

[18]

Tomas Mikolov, Edouard Grave, Piotr Bojanowski, Christian Puhrsch, and Armand Joulin. 2018. Advances in Pre-Training Distributed Word Representations. In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2018).

[19]

Fred Morstatter, Jürgen Pfeffer, Huan Liu, and Kathleen M Carley. 2013. Is the sample good enough? comparing data from twitter’s streaming api with twitter’s firehose. In Seventh international AAAI conference on weblogs and social media.

[20]

Dat Tien Nguyen, Kamla Al-Mannai, Shafiq R Joty, Hassan Sajjad, Muhammad Imran, and Prasenjit Mitra. 2017. Robust Classification of Crisis-Related Data on Social Networks Using Convolutional Neural Networks. In ICWSM. 632–635.

[21]

Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP). 1532–1543.

[22]

Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. Deep Contextualized Word Representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), Vol. 1. 2227–2237.

[23]

Anthony Rios and Ramakanth Kavuluru. 2018. Few-shot and zero-shot multi-label learning for structured label spaces. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 3132–3142.

[24]

Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2010. Earthquake shakes Twitter users: real-time event detection by social sensors. In Proceedings of the 19th international conference on World wide web. ACM, 851–860.

Digital Library

[25]

Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research 15, 1 (2014), 1929–1958.

Digital Library

[26]

Johnny Torres, Carmen Vaca, and Cristina L Abad. 2017. What Ignites a Reply?: Characterizing Conversations in Microblogs. In Proceedings of the Fourth IEEE/ACM International Conference on Big Data Computing, Applications and Technologies. ACM, 149–156.

Digital Library

[27]

István Varga, Motoki Sano, Kentaro Torisawa, Chikara Hashimoto, Kiyonori Ohtake, Takao Kawai, Jong-Hoon Oh, and Stijn De Saeger. 2013. Aid is out there: Looking for help from tweets during a large scale disaster. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1. 1619–1629.

[28]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems. 5998–6008.

Digital Library

[29]

Sudha Verma, Sarah Vieweg, William J Corvey, Leysia Palen, James H Martin, Martha Palmer, Aaron Schram, and Kenneth Mark Anderson. 2011. Natural Language Processing to the Rescue? Extracting” Situational Awareness” Tweets During Mass Emergency. In ICWSM. Barcelona, 385–392.

[30]

Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, and Ahmed Elgammal. 2018. A generative adversarial approach for zero-shot learning from noisy texts. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1004–1013.

Cited By

Mahindrakar SMondal TArosh SDhakne AChavan D(2024)Exploration of summary informativeness and topic richness through mining multilingual tweets: a study on turkey earthquake 2023Social Network Analysis and Mining10.1007/s13278-024-01386-814:1Online publication date: 5-Dec-2024
https://doi.org/10.1007/s13278-024-01386-8
Karimi YSquicciarini AWilson S(2022)Automated Detection of Doxing on TwitterProceedings of the ACM on Human-Computer Interaction10.1145/35551676:CSCW2(1-24)Online publication date: 11-Nov-2022
https://dl.acm.org/doi/10.1145/3555167
Ghosh SMaji SDesarkar MBaeza-Yates RWeller KPortela MSeneviratne OWeber IYasseri TBon ASrinivas SIbáñez L(2022)GNoM: Graph Neural Network Enhanced Language Models for Disaster Related Multilingual Text ClassificationProceedings of the 14th ACM Web Science Conference 202210.1145/3501247.3531561(55-65)Online publication date: 26-Jun-2022
https://dl.acm.org/doi/10.1145/3501247.3531561
Show More Cited By

Index Terms

Cross-lingual Perspectives about Crisis-Related Conversations on Twitter

Index terms have been assigned to the content through auto-classification.

Recommendations

Twitter under crisis: can we trust what we RT?
SOMA '10: Proceedings of the First Workshop on Social Media Analytics

In this article we explore the behavior of Twitter users under an emergency situation. In particular, we analyze the activity related to the 2010 earthquake in Chile and characterize Twitter in the hours and days following this disaster. Furthermore, we ...
Connected Through Crisis: Emotional Proximity and the Spread of Misinformation Online
CSCW '15: Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing

During crises, the ability to access relevant information is extremely important for those affected. Previous research shows that social media have become popular for rapid information exchange between members of the online community after crisis ...
Conversations on Twitter: structure, pace, balance
DYNAK'14: Proceedings of the 2nd International Conference on Dynamic Networks and Knowledge Discovery - Volume 1229

Twitter is both a micro-blogging service and a platform for public conversation. Direct conversation is facilitated in Twitter through the use of @'s (mentions) and replies. While the conversational element of Twitter is of particular interest to the ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '19: Companion Proceedings of The 2019 World Wide Web Conference

May 2019

1331 pages

ISBN:9781450366755

DOI:10.1145/3308560

Editors:
Ling Liu
Georgia Tech, USA
,
Ryen White
Microsoft Research, USA

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

IW3C2: International World Wide Web Conference Committee

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

WWW '19

WWW '19: The Web Conference

May 13 - 17, 2019

San Francisco, USA

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
245
Total Downloads

Downloads (Last 12 months)11
Downloads (Last 6 weeks)2

Reflects downloads up to 16 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Mahindrakar SMondal TArosh SDhakne AChavan D(2024)Exploration of summary informativeness and topic richness through mining multilingual tweets: a study on turkey earthquake 2023Social Network Analysis and Mining10.1007/s13278-024-01386-814:1Online publication date: 5-Dec-2024
https://doi.org/10.1007/s13278-024-01386-8
Karimi YSquicciarini AWilson S(2022)Automated Detection of Doxing on TwitterProceedings of the ACM on Human-Computer Interaction10.1145/35551676:CSCW2(1-24)Online publication date: 11-Nov-2022
https://dl.acm.org/doi/10.1145/3555167
Ghosh SMaji SDesarkar MBaeza-Yates RWeller KPortela MSeneviratne OWeber IYasseri TBon ASrinivas SIbáñez L(2022)GNoM: Graph Neural Network Enhanced Language Models for Disaster Related Multilingual Text ClassificationProceedings of the 14th ACM Web Science Conference 202210.1145/3501247.3531561(55-65)Online publication date: 26-Jun-2022
https://dl.acm.org/doi/10.1145/3501247.3531561
Sánchez CDiaz FShah CSuel TCastells PJones RSakai T(2021)Transfer Learning for the Multilingual and Multi-Domain Classification of Messages Relating to CrisesProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3404835.3463272(2708-2708)Online publication date: 11-Jul-2021
https://doi.org/10.1145/3404835.3463272

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents