Skip to main content

Tweet2Story: A Web App to Extract Narratives from Twitter

  • Conference paper
  • First Online:
Advances in Information Retrieval (ECIR 2022)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13186))

Included in the following conference series:

Abstract

Social media platforms are used to discuss current events with very complex narratives that become difficult to understand. In this work, we introduce Tweet2Story, a web app to automatically extract narratives from small texts such as tweets and describe them through annotations. By doing this, we aim to mitigate the difficulties existing on creating narratives and give a step towards deeply understanding the actors and their corresponding relations found in a text. We build the web app to be modular and easy-to-use, which allows it to easily incorporate new techniques as they keep getting developed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    in the sentence “Steve Jobs was the CEO of Apple”, the entity “Steve Jobs” fits the category of “person”.

  2. 2.

    the expression “last year” would be parsed as “2020” as of this writing.

  3. 3.

    in the sentence “Sally lives in Paris. She lives in France”, both “Sally” and “She” refer to the same entity and, therefore, belong to the same cluster.

  4. 4.

    in the sentence “Sally lives in Paris”, the event is expressed through the verb “lives”.

  5. 5.

    in the sentence “Sally lives in Paris” the triple “Sally - lives - in Paris” is categorized as a location triple.

  6. 6.

    https://github.com/LIAAD/Tweet2Story-demo.

  7. 7.

    http://tweet2story.inesctec.pt/.

  8. 8.

    https://brat.nlplab.org/standoff.html.

References

  1. MuckRack: The state of journalism 2021. MUCK RACK Blog, 15 March 2021. https://muckrack.com/blog/2021/03/15/state-of-journalism-2021

  2. Rudra, K., Goyal, P., Ganguly, N., Mitra, P., Imran, M.: Identifying sub-events and summarizing disaster-related information from microblogs. In: The 41st International ACM SIGIR Conference on Research and Development in Information Retrieval, ser. SIGIR 2018, pp. 265–274. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3209978.3210030

  3. El-Kassas, W.S., Salama, C.R., Rafea, A.A., Mohamed, H.K.: Automatic text summarization: a comprehensive survey. Expert Syst. Appl. 165, 113679 (2021). https://www.sciencedirect.com/science/article/pii/S0957417420305030

  4. Jorge, A., Campos, R., Jatowt, A., Nunes, S.: Information processing & management journal special issue on narrative extraction from texts (Text2Story). Inf. Process. Manag. 56, 1771–1774 (2019)

    Google Scholar 

  5. Campos, R., Jorge, A., Jatowt, A., Bhatia, S., Finlayson, M.: The 4\(^{th}\) international workshop on narrative extraction from texts: Text2Story 2021. In: Hiemstra, D., Moens, M.-F., Mothe, J., Perego, R., Potthast, M., Sebastiani, F. (eds.) ECIR 2021. LNCS, vol. 12657, pp. 701–704. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-72240-1_84

    Chapter  Google Scholar 

  6. Metilli, D., Bartalesi, V., Meghini, C.: Steps towards a system to extract formal narratives from text. In: Text2Story 2019 - Second Workshop on Narrative Extraction From Texts, Cologne, Germany, 14 April 2019, pp. 53–61. CEUR-WS.org, Aachen, DEU (2019)

    Google Scholar 

  7. Vargas, J.V.: Narrative information extraction with non-linear natural language processing pipelines. Ph.D. dissertation, Drexel University (2017)

    Google Scholar 

  8. Eisenberg, J.D.: Automatic extraction of narrative structure from long form text. Ph.D. dissertation, Florida International University (2018)

    Google Scholar 

  9. Honnibal, M., Montani, I.: spaCy 2: Natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing (2017). To appear

    Google Scholar 

  10. Bird, S., Loper, E.: NLTK: the natural language toolkit. In: Proceedings of the ACL Interactive Poster and Demonstration Sessions, pp. 214–217. Association for Computational Linguistics, Barcelona, July 2004. https://aclanthology.org/P04-3031

  11. Kocaman, V., Talby, D.: Spark NLP: natural language understanding at scale. CoRR, abs/2101.10848 (2021). arXiv:2101.10848

  12. Strotgen, J., Gertz, M.: Heideltime: high quality rule-based extraction and normalization of temporal expressions. In: Proceedings of the 5th International Workshop on Semantic Evaluation, ser. SemEval 2010, pp. 321–324. Association for Computational Linguistics, USA (2010)

    Google Scholar 

  13. Strotgen, J., Gertz, M.: Multilingual and cross-domain temporal tagging. Lang. Resour. Eval. 47(2), 269–298 (2013)

    Article  Google Scholar 

  14. Lee, K., He, L., Zettlemoyer, L.: Higher-order coreference resolution with coarse-to-fine inference. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pp. 687–692. Association for Computational Linguistics, New Orleans, June 2018. https://aclanthology.org/N18-2108

  15. Shi, P., Lin, J.: Simple Bert models for relation extraction and semantic role labeling. ArXiv, abs/1904.05255 (2019)

    Google Scholar 

  16. Gardner, M., et al.: Allennlp: a deep semantic natural language processing platform. CoRR, abs/1803.07640 (2018). arXiv:1803.07640

  17. Kamp, H., Genabith, J., Reyle, U.: Discourse representation theory. 11, 125–394 (2010). Springer

    Google Scholar 

  18. Letier, E., Kramer, J., Magee, J., Uchitel, S.: Monitoring and control in scenario- based requirements analysis. In: Proceedings of the 27th International Conference on Software Engineering, pp. 382–391. Association for Computing Machinery, June 2005

    Google Scholar 

Download references

Acknowledgements

Vasco Campos and Pedro Mota were financed by National Funds through the Portuguese funding agency, FCT - Fundação para a Ciência e a Tecnologia, within project UIDB/50014/2020 and LA/P/0063/2020. Ricardo Campos and Alípio Jorge were financed by the ERDF - European Regional Development Fund through the North Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 and by National Funds through the Portuguese funding agency, FCT - Fundação para a Ciência e a Tecnologia within project PTDC/CCI-COM/31857/2017 (NORTE-01-0145-FEDER-03185). This funding fits under the research line of the Text2Story project.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vasco Campos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Campos, V., Campos, R., Mota, P., Jorge, A. (2022). Tweet2Story: A Web App to Extract Narratives from Twitter. In: Hagen, M., et al. Advances in Information Retrieval. ECIR 2022. Lecture Notes in Computer Science, vol 13186. Springer, Cham. https://doi.org/10.1007/978-3-030-99739-7_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-99739-7_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-99738-0

  • Online ISBN: 978-3-030-99739-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics