Assessing the Impact of Lexical Chain Scoring Methods and Sentence Extraction Schemes on Summarization

Doran, William; Stokes, Nicola; Carthy, Joe; Dunnion, John

doi:10.1007/978-3-540-24630-5_77

William Doran⁵,
Nicola Stokes⁵,
Joe Carthy⁵ &
…
John Dunnion⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2945))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

968 Accesses
8 Citations

Abstract

We present a comparative study of lexical chain-based summarisation techniques. The aim of this paper is to highlight the effect of lexical chain scoring metrics and sentence extraction techniques on summary generation. We present our own lexical chain-based summarisation system and compare it to other chain-based summarisation systems. We also compare the chain scoring and extraction techniques of our system to those of several other baseline systems, including a random summarizer and one based on tf.idf statistics. We use a task-orientated summarisation evaluation scheme that determines summary quality based on TDT story link detection performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alemany, L., Fuentes, M.: Integrating Cohesion and Coherence for Text Summarization. In: Proceedings of the EACL Student Workshop (2003)
Google Scholar
Allan, J.: Introduction to Topic Detection and Tracking. In: Topic Detection and Tracking: Event-based Information Organization, pp. 1–16. Kluwer Academic Publishers, Dordrecht (2002)
Google Scholar
Barzilay, R., Elhadad, M.: Using Lexical Chains for Summarisation. In: ACL/EACL 1997 summarisation workshop, pp. 10–18 (1997)
Google Scholar
Brunn, M., Chali, Y., Pinchak, C.: Text summarisation using lexical chains. In: Workshop on Text Summarisation in conjunction with the ACM SIGIR Conference 2001, New Orleans, Louisiana (2001)
Google Scholar
DUC (2003), http://www-nlpir.nist.gov/projects/duc/
Green, S.: Automatically generating hypertext by computing semantic similarity, PhD thesis, University of Toronto (1997)
Google Scholar
Hearst, M.: Multi-paragraph segmentation of expository text. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pp. 9–16. Association for Computational Linguistics, Las Cruces (1994)
Chapter Google Scholar
Mani, I., House, D., Klein, G., Hirschman, L., Obrst, L., Firmin, T., Chrzanowski, M., Sundheim, B.: The TIPSTER SUMMAC text summarisation evaluation: Final report. MITRE Technical Report MTR 98w0000138, MITRE (1998)
Google Scholar
Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: 1990, Five papers on WordNet. Technical Report, Cognitive Science Laboratory (1990)
Google Scholar
Morris, J., Hirst, G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics 17(1), 21–43 (1991)
Google Scholar
Salton, G., Singhal, A., Mitra, M., Buckley, C.: Automatic text structuring and summarisation. Information Processing and Management 33(2), 193–208 (1997)
Article Google Scholar
Silber, G., McCoy, K.: Efficient Text Summarisation Using Lexical Chains. In: Proceedings of the ACM Conference on Intelligent User Interfaces, IUI 2000 (2000)
Google Scholar
Spark-Jones, K.: Factorial Summary Evaluation. In: Workshop on Text Summarisation in conjunction with the ACM SIGIR Conference 2001, New Orleans, Louisiana (2001)
Google Scholar
St. Onge, D.: Detection and Correcting Malapropisms with Lexical Chains, M.Sc Thesis, University of Toronto, Canada (1995)
Google Scholar
Stairmand, M.: A Computational Analysis of Lexical Cohesion with Applications in Information Retrieva, Ph.D. Dissertation, Center for Computational Linguistics, UMIST, Manchester (1996)
Google Scholar
Stokes, N., Carthy, J., Smeaton, A.F.: SeLeCT: A Lexical Chain-based News Story Segmentation System. To appear in the AI Communications Journal
Google Scholar
TDT Pilot Corpus, http://www.nist.gov/speech/tests/tdt/
van Rijsbergen, C.J.: Information Retrieval. Butterworths (1979)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University College, Dublin, Ireland
William Doran, Nicola Stokes, Joe Carthy & John Dunnion

Authors

William Doran
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Stokes
View author publications
You can also search for this author in PubMed Google Scholar
Joe Carthy
View author publications
You can also search for this author in PubMed Google Scholar
John Dunnion
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

National Polytechnic Institute, Center for Computing Research, 07738, Mexico City, México
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Doran, W., Stokes, N., Carthy, J., Dunnion, J. (2004). Assessing the Impact of Lexical Chain Scoring Methods and Sentence Extraction Schemes on Summarization. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_77

Download citation

DOI: https://doi.org/10.1007/978-3-540-24630-5_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21006-1
Online ISBN: 978-3-540-24630-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics