Synonyms
Definition
Text summarization is the process of distilling the most important information from a text to produce an abridged version for a particular task and user [9].
Historical Background
With more and more digitalized text being available, especially with the development of the Internet, people are being overwhelmed with data. How to help people effectively and efficiently capture the information from the data becomes extremely important. Many techniques have been proposed for this goal and text summarization is one of them.
Text summarization in some form has been in existence since the 1950s [8]. Two main influences have dominated the research in this area, as summarized by Mani in [10]. Work in library science, office automation, and information retrieval has resulted in a focus on methods for producing extracts from scientific papers, including the use of “shallow” linguistic analysis and the use of term statistics. The other influence has been...
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsRecommended Reading
Berry M.W., Dumais S.T., and O’Brien G.W. Using linear algebra for intelligent information retrieval. SIAM Rev., 37(4): 573–595, 1995.
Brandowa R., Mitzeb K., and Rauc L.F. Automatic condensation of electronic publications by sentence selection. Inform. Process. Manage., 41(6):675–685, 1995.
Conroy J.M. and O’leary D.P. Text summarization via hidden markov models. In Proc. 24th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001, pp. 406–407.
Goldstein J., Kantrowitz M., Mittal V., and Carbonell J. Summarizing text documents: sentence selection and evaluation metrics. In Proc. 22nd Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999, pp. 121–128.
Gong Y. and Liu X. Generic text summarization using relevance measure and latent semantic analysis. In Proc. 24th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2001, pp. 19–25.
Kupiec J., Pedersen J., and Chen F. A trainable document summarizer. In Proc. 18th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, 1995, pp. 68–73.
Lin C.-Y. and Hovy E. Automatic evaluation of summaries using n-gram co-occurrence statistics. In Proc. Human Lang. Tech. Conf. of the North American Chapter of Assoc. Comput. Linguistics, 2003, pp. 71–78.
Luhn H.P. The automatic creation of literature abstracts. IBM J. Res. Dev., 2(2), 1958.
Mani I. Advances in Automatic Text Summarization. MIT, Cambridge, MA, USA, 1999.
Mani I. Recent developments in text summarization. In Proc. 10th Int. Conf. on Information and Knowledge Management, pp. 529–531. 2001,
Marcu D. From discourse structures to text summaries. In Proc. ACL Workshop on Intelligent Scalable Text Summarization, 1997, pp. 82–88.
Mihalcea R. Language independent extractive summarization. In Proc. 20th National Conf. on AI and 17th Innovative Applications of AI Conf., 2005, pp. 1688–1689.
Shen D., Chen Z., Yang Q., Zeng H.-J., Zhang B., Lu Y., and Ma W.-Y. Web-page classification through summarization. In Proc. 30th Annual Int. ACM SIGIR Conf. on Research and Development in Information Retrieval, 2004, pp. 242–249.
Shen D., Sun J.-T., Li H., Yang Q., and Chen Z. Document summarization using conditional random fields. In Proc. 20th Int. Joint Conf. on AI, 2007, pp. 2862–2867.
Teufel S. and Moens M. Sentence extraction as a classification task. In Proc. ACL Workshop on Intelligent Text Summarization. 1997, pp. 58–65.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this entry
Cite this entry
Shen, D. (2009). Text Summarization. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_424
Download citation
DOI: https://doi.org/10.1007/978-0-387-39940-9_424
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-35544-3
Online ISBN: 978-0-387-39940-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering