Abstract
In this article, we present an information filtering method that selects, from a set of documents, the excerpts that are most significant in relation to a user profile. The method relies on both structured profiles and a topical analysis of documents. The topical analysis is also used to expand a profile in relation to a particular document by selecting the terms of the document that are closely linked to those of the profile. This expansion makes the selection of excerpts linked to a profile more reliable, and it also allows the selection of excerpts that may bring new and interesting information about the topics of the profile. The method was implemented in the REDUIT system, which was successfully evaluated for document filtering and passage extraction.
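As a rough illustration of the idea of document-specific profile expansion followed by excerpt selection, the following Python sketch may help. It is not the REDUIT implementation: it assumes a flat list of profile terms rather than a structured profile, treats each sentence as a segment, and substitutes a simple segment-level co-occurrence count for the topical analysis. The names `tokenize`, `expand_profile`, `select_excerpts`, `min_cooc`, `threshold`, and the `STOPWORDS` set are all hypothetical.

```python
from collections import Counter

# Illustrative stopword list (an assumption, not taken from the paper).
STOPWORDS = {"a", "an", "the", "in", "into", "of", "and", "to"}

def tokenize(text):
    """Lowercase, strip surrounding punctuation, drop stopwords."""
    words = (w.strip(".,;:!?()").lower() for w in text.split())
    return [w for w in words if w and w not in STOPWORDS]

def expand_profile(profile_terms, segments, min_cooc=2):
    """Add document terms that share at least `min_cooc` segments
    with a profile term (a stand-in for topical analysis)."""
    profile = set(profile_terms)
    cooc = Counter()
    for seg in segments:
        words = set(tokenize(seg))
        if words & profile:
            cooc.update(words - profile)
    return profile | {w for w, c in cooc.items() if c >= min_cooc}

def select_excerpts(profile_terms, segments, threshold=2):
    """Keep segments sharing at least `threshold` terms with the
    profile after it has been expanded for this document."""
    expanded = expand_profile(profile_terms, segments)
    return [s for s in segments
            if len(set(tokenize(s)) & expanded) >= threshold]

if __name__ == "__main__":
    doc = [
        "Solar panels convert sunlight into electricity.",
        "Solar farms store electricity in batteries.",
        "The football match ended in a draw.",
    ]
    print(select_excerpts(["solar"], doc))
```

In this toy document, the profile term "solar" is expanded with "electricity" because the two words share two segments, so both solar-related sentences pass the overlap threshold while the unrelated sentence does not.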
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Châar, S.L., Ferret, O., Fluhr, C. (2005). Filtering for Profile-Biased Multi-document Summarization. In: Losada, D.E., Fernández-Luna, J.M. (eds) Advances in Information Retrieval. ECIR 2005. Lecture Notes in Computer Science, vol 3408. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-31865-1_10
DOI: https://doi.org/10.1007/978-3-540-31865-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25295-5
Online ISBN: 978-3-540-31865-1
eBook Packages: Computer Science, Computer Science (R0)