Abstract
We present a comparative study of lexical chain-based summarisation techniques. The aim of this paper is to highlight the effect of lexical chain scoring metrics and sentence extraction techniques on summary generation. We present our own lexical chain-based summarisation system and compare it to other chain-based summarisation systems. We also compare the chain scoring and extraction techniques of our system to those of several other baseline systems, including a random summarizer and one based on tf.idf statistics. We use a task-orientated summarisation evaluation scheme that determines summary quality based on TDT story link detection performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Alemany, L., Fuentes, M.: Integrating Cohesion and Coherence for Text Summarization. In: Proceedings of the EACL Student Workshop (2003)
Allan, J.: Introduction to Topic Detection and Tracking. In: Topic Detection and Tracking: Event-based Information Organization, pp. 1–16. Kluwer Academic Publishers, Dordrecht (2002)
Barzilay, R., Elhadad, M.: Using Lexical Chains for Summarisation. In: ACL/EACL 1997 summarisation workshop, pp. 10–18 (1997)
Brunn, M., Chali, Y., Pinchak, C.: Text summarisation using lexical chains. In: Workshop on Text Summarisation in conjunction with the ACM SIGIR Conference 2001, New Orleans, Louisiana (2001)
DUC (2003), http://www-nlpir.nist.gov/projects/duc/
Green, S.: Automatically generating hypertext by computing semantic similarity, PhD thesis, University of Toronto (1997)
Hearst, M.: Multi-paragraph segmentation of expository text. In: Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pp. 9–16. Association for Computational Linguistics, Las Cruces (1994)
Mani, I., House, D., Klein, G., Hirschman, L., Obrst, L., Firmin, T., Chrzanowski, M., Sundheim, B.: The TIPSTER SUMMAC text summarisation evaluation: Final report. MITRE Technical Report MTR 98w0000138, MITRE (1998)
Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: 1990, Five papers on WordNet. Technical Report, Cognitive Science Laboratory (1990)
Morris, J., Hirst, G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics 17(1), 21–43 (1991)
Salton, G., Singhal, A., Mitra, M., Buckley, C.: Automatic text structuring and summarisation. Information Processing and Management 33(2), 193–208 (1997)
Silber, G., McCoy, K.: Efficient Text Summarisation Using Lexical Chains. In: Proceedings of the ACM Conference on Intelligent User Interfaces, IUI 2000 (2000)
Spark-Jones, K.: Factorial Summary Evaluation. In: Workshop on Text Summarisation in conjunction with the ACM SIGIR Conference 2001, New Orleans, Louisiana (2001)
St. Onge, D.: Detection and Correcting Malapropisms with Lexical Chains, M.Sc Thesis, University of Toronto, Canada (1995)
Stairmand, M.: A Computational Analysis of Lexical Cohesion with Applications in Information Retrieva, Ph.D. Dissertation, Center for Computational Linguistics, UMIST, Manchester (1996)
Stokes, N., Carthy, J., Smeaton, A.F.: SeLeCT: A Lexical Chain-based News Story Segmentation System. To appear in the AI Communications Journal
TDT Pilot Corpus, http://www.nist.gov/speech/tests/tdt/
van Rijsbergen, C.J.: Information Retrieval. Butterworths (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Doran, W., Stokes, N., Carthy, J., Dunnion, J. (2004). Assessing the Impact of Lexical Chain Scoring Methods and Sentence Extraction Schemes on Summarization. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_77
Download citation
DOI: https://doi.org/10.1007/978-3-540-24630-5_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21006-1
Online ISBN: 978-3-540-24630-5
eBook Packages: Springer Book Archive