Abstract
This paper examines the prospects of word extraction for text summarization based on combinatorial optimization. Word-based approaches are preferable to the commonly used sentence-based approach when highly compressed summaries are required. However, naively applying conventional methods to word extraction yields excessively fragmented summaries. We avoid this by restricting the number of fragments selected from each sentence to at most one when formulating the maximum coverage problem. Consequently, the method chooses only sub-sentences as fragments. Experiments show that our method matches the ROUGE scores of state-of-the-art systems without requiring any training or special parameters.
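The constraint described in the abstract can be illustrated with a small sketch. It is not the authors' formulation or solver: it assumes that a fragment is a contiguous span of words, that every distinct word has unit weight unless a weight is supplied, and that the length limit is a word count. It searches exhaustively, which is only practical for toy inputs. The names `candidate_fragments`, `best_summary`, `budget`, and `weights` are hypothetical.

```python
from itertools import product


def candidate_fragments(sentence):
    """Return every contiguous word span of a sentence, plus the empty choice."""
    words = sentence.split()
    spans = [()]  # the empty tuple means "select nothing from this sentence"
    for i in range(len(words)):
        for j in range(i + 1, len(words) + 1):
            spans.append(tuple(words[i:j]))
    return spans


def best_summary(sentences, budget, weights=None):
    """Pick at most one fragment per sentence, maximizing covered word weight.

    Assumption: coverage is the total weight of distinct lowercased words,
    and the summary may contain at most `budget` words in total.
    """
    weights = weights or {}
    best_score, best_choice = -1.0, ()
    # One slot per sentence enforces the "at most one fragment" restriction.
    for choice in product(*(candidate_fragments(s) for s in sentences)):
        if sum(len(fragment) for fragment in choice) > budget:
            continue
        covered = {w.lower() for fragment in choice for w in fragment}
        score = sum(weights.get(w, 1.0) for w in covered)
        if score > best_score:
            best_score, best_choice = score, choice
    return [" ".join(f) for f in best_choice if f], best_score


if __name__ == "__main__":
    docs = ["the cat sat on the mat", "the cat ate fish"]
    fragments, score = best_summary(docs, budget=4)
    print(fragments, score)
```

Because each sentence contributes a single slot to the search, the selected words from one sentence are always adjacent, which is what keeps the summary from fragmenting. A realistic system would replace the exhaustive loop with an integer programming or approximation method.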
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yasuda, N., Nishino, M., Hirao, T., Nagata, M. (2013). Sub-sentence Extraction Based on Combinatorial Optimization. In: Serdyukov, P., et al. Advances in Information Retrieval. ECIR 2013. Lecture Notes in Computer Science, vol 7814. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36973-5_91
DOI: https://doi.org/10.1007/978-3-642-36973-5_91
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-36972-8
Online ISBN: 978-3-642-36973-5
eBook Packages: Computer Science (R0)