Automatic generation of entity-oriented summaries for reputation management

Rodríguez-Vidal, Javier; Carrillo-de-Albornoz, Jorge; Amigó, Enrique; Plaza, Laura; Gonzalo, Julio; Verdejo, Felisa

doi:10.1007/s12652-019-01255-9

Automatic generation of entity-oriented summaries for reputation management

Original Research
Published: 26 February 2019

Volume 11, pages 1577–1591, (2020)
Cite this article

Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Javier Rodríguez-Vidal ORCID: orcid.org/0000-0002-9006-9639¹,
Jorge Carrillo-de-Albornoz¹,
Enrique Amigó¹,
Laura Plaza¹,
Julio Gonzalo¹ &
…
Felisa Verdejo¹

414 Accesses
4 Citations
5 Altmetric
Explore all metrics

Abstract

Producing online reputation summaries for an entity (company, brand, etc.) is a focused summarization task with a distinctive feature: issues that may affect the reputation of the entity take priority in the summary. In this paper we (i) present a new test collection of manually created (abstractive and extractive) reputation reports which summarize tweet streams for 31 companies in the banking and automobile domains; (ii) propose a novel methodology to evaluate summaries in the context of online reputation monitoring, which profits from an analogy between reputation reports and the problem of diversity in search; and (iii) provide empirical evidence that producing reputation reports is different from a standard summarization problem, and incorporating priority signals is essential to address the task effectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Fig. 2

Fig. 3

Fig. 4

Fig. 5

Fig. 6

Fig. 7

Fig. 8

Fig. 9

Tweet Stream Summarization for Online Reputation Management

Integrating learned and explicit document features for reputation monitoring in social media

Article 19 July 2019

RepLab: An Evaluation Campaign for Online Monitoring Systems

Notes

References

Alsaedi N, Burnap P, Rana OF (2016) Automatic summarization of real world events using twitter. In: Proceedings of the tenth international AAAI conference on web and social media. Cologne, pp 511–514
Amigó E, De Albornoz JC, Chugur I, Corujo A, Gonzalo J, Martín T, Meij E, De Rijke M, Spina D (2013a) Overview of replab 2013: evaluating online reputation monitoring systems. In: International conference of the cross-language evaluation forum for European languages. Springer, pp 333–352
Amigó E, Gonzalo J, Verdejo F (2013b) A general evaluation measure for document organization tasks. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval. ACM, pp 643–652
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Article Google Scholar
Chakrabarti D, Punera K (2011) Event summarization using tweets. ICWSM 11:66–73
Google Scholar
Cho SG, Kim SB (2015) Summarization of documents by finding key sentences based on social network analysis. In: International conference on industrial, engineering and other applications of applied intelligent systems. Springer, pp 285–292
Clarke CL, Kolla M, Cormack GV, Vechtomova O, Ashkan A, Büttcher S, MacKinnon I (2008) Novelty and diversity in information retrieval evaluation. In: Proceedings of the 31st annual international ACM SIGIR conference on research and development in information retrieval. ACM, pp 659–666
Cossu J-V, Bigot B, Bonnefoy L, Senay G (2014) Towards the improvement of topic priority assignment using various topic detection methods for e-reputation monitoring on twitter. In: International conference on applications of natural language to data bases/information systems. Springer, pp 154–159
De Maio C, Fenza G, Loia V, Parente M (2016) Time aware knowledge extraction for microblog summarization on twitter. Inf Fusion 28:60–74
Article Google Scholar
de Albornoz JC, Plaza L, Gervás P (2012) Sentisense: an easily scalable concept-based affective lexicon for sentiment analysis. In: LREC, pp 3562–3567
Duan Y, Chen Z, Wei F, Zhou M, Shum H-Y (2012) Twitter topic summarization by ranking tweets using social influence and content quality. Proc COLING 2012:763–780
Google Scholar
Erkan G, Radev DR (2004) Lexrank: Graph-based lexical centrality as salience in text summarization. J Artif Intell Res 22:457–479
Article Google Scholar
Fiszman M, Demner-Fushman D, Kilicoglu H, Rindflesch TC (2009) Automatic summarization of medline citations for evidence-based medical treatment: a topic-oriented evaluation. J Biomed Inform 42(5):801–813
Article Google Scholar
He R, Liu Y, Yu G, Tang J, Hu Q, Dang J (2017) Twitter summarization with social-temporal context. World Wide Web 20(2):267–290
Article Google Scholar
Inouye D, Kalita JK (2011) Comparing twitter summarization algorithms for multiple post summaries. In: Privacy, security, risk and trust (PASSAT) and 2011 IEEE 3rd international conference on social computing (SocialCom). IEEE, pp 298–306
Lin C-Y (2004) Rouge: a package for automatic evaluation of summaries. In: Proceedings of the workshop on text summarization branches out (WAS 2004). Association for Computational Linguistics, Barcelona, pp 74–81
Google Scholar
Litvak M, Last M, Kandel A (2013) Degext: a language-independent keyphrase extractor. J Ambient Intell Humaniz Comput 4(3):377–387
Article Google Scholar
Litvak M, Last M (2008) Graph-based keyword extraction for single-document summarization. In: Proceedings of the workshop on multi-source multilingual information extraction and summarization. Association for Computational Linguistics, pp 17–24
Litvak M, Vanetik N (2017) Query-based summarization using mdl principle. In: Proceedings of the multiling 2017 workshop on summarization and summary evaluation across source types and genres, pp 22–31
Liu X, Li Y, Wei F, Zhou M (2012) Graph-based multi-tweet summarization using social signals. Proc COLING 2012:1699–1714
Google Scholar
Louis A, Newman T (2012) Summarization of business-related tweets: a concept-based approach. In: Proceedings of COLING 2012: Posters, pp 765–774
Marujo L, Ribeiro R, de Matos DM, Neto JP, Gershman A, Carbonell J (2015) Extending a single-document summarizer to multi-document: a hierarchical approach. arXiv:1507.02907
Meena YK, Gopalani D (2015) Feature priority based sentence filtering method for extractive automatic text summarization. Proc Comput Sci 48:728–734
Article Google Scholar
Mei Q, Guo J, Radev D (2010) Divrank: the interplay of prestige and diversity in information networks. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1009–1018
Mihalcea R, Tarau P (2004) Textrank: bringing order into text. In: Proceedings of the 2004 conference on empirical methods in natural language processing
Mike T, Kevan B, Georgios P, Di C (2010) Sentiment in short strength detection informal text. JASIST 61(12):2544–2558
Article Google Scholar
Moffat A, Zobel J (2008) Rank-biased precision for measurement of retrieval effectiveness. ACM Trans Inf Syst (TOIS) 27(1):2
Article Google Scholar
Nastase V (2008) Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 763–772
Nguyen-Hoang T-A, Nguyen K, Tran Q-V (2012) Tsgvi: a graph-based summarization system for vietnamese documents. J Ambient Intell Humaniz Comput 3(4):305–313
Article Google Scholar
Pang B, Lee L et al (2008) Opinion mining and sentiment analysis. Found Trends Inf Retr 2(1–2):1–135
Article Google Scholar
Plaza L, Carrillo-de Albornoz J (2013) Evaluating the use of different positional strategies for sentence selection in biomedical literature summarization. BMC Bioinform 14(1):71
Article Google Scholar
Radev DR, Allison T, Blair-Goldensohn S, Blitzer J, Celebi A, Dimitrov S, Drabek E, Hakim A, Lam W, Liu D et al (2004) MEAD-a platform for multidocument multilingual text summarization. In: Proceedings of the fourth international conference on language resources and evaluation (LREC’04). European Language Resources Association (ELRA), Lisbon, Portugal
Google Scholar
Sarkar K, Saraf K, Ghosh A (2015) Improving graph based multidocument text summarization using an enhanced sentence similarity measure. In: Recent trends in information systems (ReTIS), 2015 IEEE 2nd international conference. IEEE, pp 359–365
Sharifi B, Hutton M-A, Kalita J (2010) Summarizing microblogs automatically. In: Human language technologies: The 2010 annual conference of the north American chapter of the association for computational linguistics. Association for Computational Linguistics, pp 685–688
Stone PJ, Dunphy DC, Smith MS (1966) The general inquirer: a computer approach to content analysis. MIT press, Cambridge
Google Scholar
Takamura H, Yokono H, Okumura M (2011) Summarizing a document stream. In: European conference on information retrieval. Springer, pp 177–188
Van Erp M, Schomaker L (2000) Variants of the borda count method for combining ranked classifier hypotheses. In: 7th international workshop on frontiers in handwriting recognition. Amsterdam learning methodology inspired by humans intelligence Bo Zhang, Dayong Ding. And Ling Zhang, Citeseer
Wu H, Gu Y, Sun S, Gu X (2016) Aspect-based opinion summarization with convolutional neural networks. In: Neural networks (IJCNN), 2016 international joint conference. IEEE, pp 3157–3163
Zhang H, Fiszman M, Shin D, Miller CM, Rosemblat G, Rindflesch TC (2011) Degree centrality for semantic abstraction summarization of therapeutic studies. J Biomed Inform 44(5):830–838
Article Google Scholar
Zhuang H, Rahman R, Hu X, Guo T, Hui P, Aberer K (2016) Data summarization with social contexts. In: Proceedings of the 25th ACM international on conference on information and knowledge management. ACM, pp 397–406

Download references

Acknowledgements

This research was partially supported by the Spanish Ministry of Science and Innovation (Vemodalen Project, TIN2015-71785-R) and UNED (project 2014V/PUNED/001).

Author information

Authors and Affiliations

UNED IR & NLP Group, Calle Juan del Rosal, 16, 28040, Madrid, Spain
Javier Rodríguez-Vidal, Jorge Carrillo-de-Albornoz, Enrique Amigó, Laura Plaza, Julio Gonzalo & Felisa Verdejo

Authors

Javier Rodríguez-Vidal
View author publications
You can also search for this author in PubMed Google Scholar
Jorge Carrillo-de-Albornoz
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Amigó
View author publications
You can also search for this author in PubMed Google Scholar
Laura Plaza
View author publications
You can also search for this author in PubMed Google Scholar
Julio Gonzalo
View author publications
You can also search for this author in PubMed Google Scholar
Felisa Verdejo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Javier Rodríguez-Vidal.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Example of a reputation summary

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rodríguez-Vidal, J., Carrillo-de-Albornoz, J., Amigó, E. et al. Automatic generation of entity-oriented summaries for reputation management. J Ambient Intell Human Comput 11, 1577–1591 (2020). https://doi.org/10.1007/s12652-019-01255-9

Download citation

Received: 17 September 2018
Accepted: 15 February 2019
Published: 26 February 2019
Issue Date: April 2020
DOI: https://doi.org/10.1007/s12652-019-01255-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic generation of entity-oriented summaries for reputation management

Abstract

Access this article

Similar content being viewed by others

Tweet Stream Summarization for Online Reputation Management

Integrating learned and explicit document features for reputation monitoring in social media

RepLab: An Evaluation Campaign for Online Monitoring Systems

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Example of a reputation summary

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Automatic generation of entity-oriented summaries for reputation management

Abstract

Access this article

Similar content being viewed by others

Tweet Stream Summarization for Online Reputation Management

Integrating learned and explicit document features for reputation monitoring in social media

RepLab: An Evaluation Campaign for Online Monitoring Systems

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix: Example of a reputation summary

Appendix: Example of a reputation summary

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation