NEO-CORTEX: A Performant User-Oriented Multi-Document Summarization System

Boudin, Florian; Torres Moreno, Juan Manuel

doi:10.1007/978-3-540-70939-8_49

Florian Boudin¹ &
Juan Manuel Torres Moreno¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4394))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

1531 Accesses
8 Citations

Abstract

This paper discusses an approach to topic-oriented multi-document summarization. It investigates the effectiveness of using additional information about the document set as a whole, as well as individual documents. We present NEO-CORTEX, a multi-document summarization system based on the existing CORTEX system. Results are reported for experiments with a document base formed by the NIST DUC-2005 and DUC-2006 data. Our experiments have shown that NEO-CORTEX is an effective system and achieves good performance on topic-oriented multi-document summarization task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Mani, I., Maybury, M.T.: Advances in Automatic Text Summarization. MIT Press, Cambridge (1999)
Google Scholar
Mani, I.: Automatic Summarization. John Benjamins, Amsterdam (2001)
Book MATH Google Scholar
Luhn, P.H.: Automatic creation of literature abstracts. IBM Journal of Research and Development, 155–164 (1958)
Google Scholar
Edmundson, H.P.: New Methods in Automatic Extracting. Journal of the ACM (JACM) 16(2), 264–285 (1969)
Article MATH Google Scholar
Paice, C.D.: Constructing literature abstracts by computer: techniques and prospects. Inf. Process. Manage. 26(1), 171–186 (1990)
Article Google Scholar
Mani, I., Bloedorn, E.: Machine Learning of Generic and User-Focused Summarization. Arxiv preprint cs (CL/9811006) (1998)
Google Scholar
Kupiec, J., Pedersen, J., Chen, F.: A trainable document summarizer. In: Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 68–73. ACM Press, New York (1995)
Google Scholar
Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, pp. 10–17 (1997)
Google Scholar
Stairmand, M.: A Computational Analysis of Lexical Cohesion with Applications in Information Retrieval. Unpublished PhD Thesis. UMIST Computational Linguistics Laboratory (1996)
Google Scholar
Mann, W., Thompson, S.: Rhetorical Structure Theory: A Theory of Text Organization. University of Southern California, Information Sciences Institute (1987)
Google Scholar
Torres-Moreno, J.-M., Velázquez-Morales, P., Meunier, J.G.: Cortex: un algorithme pour la condensation automatique de textes. ARCo 2, 365 (2001)
Google Scholar
Torres-Moreno, J.-M., Velázquez-Morales, P., Meunier, J.G.: Condensés de textes par des méthodes numériques. JADT 2, 723–734 (2002)
Google Scholar
Abdillahi, N., Nocera, P., Torres-Moreno, J.-M.: Boîtes à outils TAL pour les langues peu informatisées: Le cas du somali. JADT, 697–705 (2006)
Google Scholar
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Article Google Scholar
Salton, G.: In: Automatic text processing, Addison-Wesley Publishing, Reading (1989)
Google Scholar
Salton, G., McGill, M.: Introduction to modern information retrieval. Computer Science Series. McGraw-Hill, New York (1983)
MATH Google Scholar
Passonneau, R.J., Nenkova, A., McKeown, K., Sigleman, S.: Applying the Pyramid Method in DUC 2005. In: Proc. of DUC 2005 at the Human Language Technology Conf./Conf. on Empirical Methods in Natural Language Processing (HLT/EMNLP) (2005)
Google Scholar
Hovy, E., Lin, C.Y., Zhou, L.: Evaluating DUC 2005 using Basic Elements. In: Proc. of DUC 2005 at the Human Language Technology Conf./Conf. on Empirical Methods in Natural Language Processing (HLT/EMNLP) (2005)
Google Scholar
Lin, C.Y.: Rouge: A package for automatic evaluation of summaries. Technical report, Information Sciences Institute (2002)
Google Scholar
Favre, B., Béchet, F., Bellot, P., Boudin, F., El-Bèze, M., Gillard, L., Lapalme, G., Torres-Moreno, J.M.: The LIA-Thales summarization system at DUC-2006 (2006), http://www-nlpir.nist.gov/projects/duc/index.html

Download references

Author information

Authors and Affiliations

Laboratoire Informatique d’Avignon, BP 1228 F-84911 Avignon Cedex 9, France
Florian Boudin & Juan Manuel Torres Moreno

Authors

Florian Boudin
View author publications
You can also search for this author in PubMed Google Scholar
Juan Manuel Torres Moreno
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Boudin, F., Torres Moreno, J.M. (2007). NEO-CORTEX: A Performant User-Oriented Multi-Document Summarization System. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_49

Download citation

DOI: https://doi.org/10.1007/978-3-540-70939-8_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70938-1
Online ISBN: 978-3-540-70939-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics