Abstract
Query focused summarization is the task of producing a compressed text of original set of documents based on a query. Documents can be viewed as graph with sentences as nodes and edges can be added based on sentence similarity. Graph based ranking algorithms which use ‘Biased random surfer model’ like topic-sensitive LexRank have been successfully applied to query focused summarization. In these algorithms, random walk will be biased towards the sentences which contain query relevant words. Specifically, it is assumed that random surfer knows the query relevance score of the sentence to where he jumps. However, neighbourhood information of the sentence to where he jumps is completely ignored. In this paper, we propose look-ahead version of topic-sensitive LexRank. We assume that random surfer not only knows the query relevance of the sentence to where he jumps but he can also look N-step ahead from that sentence to find query relevance scores of future set of sentences. Using this look ahead information, we figure out the sentences which are indirectly related to the query by looking at number of hops to reach a sentence which has query relevant words. Then we make the random walk biased towards even to the indirect query relevant sentences along with the sentences which have query relevant words. Experimental results show 20.2% increase in ROUGE-2 score compared to topic-sensitive LexRank on DUC 2007 data set. Further, our system outperforms best systems in DUC 2006 and results are comparable to state of the art systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization, pp. 10–17 (1997)
Zhao, L., Wu, L., Huang, X.: Using query expansion in graph-based approach for query-focused multi-document summarization. Inf. Process. Manage. 45(1), 35–41 (2009)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1-7), 107–117 (1998)
Radev, D.R.: Lexrank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research 22 (2004)
Otterbacher, J., Erkan, G., Radev, D.R.: Using random walks for question-focused sentence retrieval. In: HLT 2005: Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 915–922. Association for Computational Linguistics, Morristown (2005)
Wan, X., Yang, J., Xiao, J.: Using cross-document random walks for topic-focused multi-document. In: WI 2006: Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 1012–1018. IEEE Computer Society, Washington, DC (2006)
Erkan, G.: Using biased random walks for focused summarization. In: Proceedings of the DUC 2006 Document Understanding Workshop, Brooklyn, NY, USA (2006)
Wan, X., Yang, J., Xiao, J.: Manifold-ranking based topic-focused multi-document summarization. In: IJCAI 2007: Proceedings of the 20th International Joint Conference on Artifical Intelligence, pp. 2903–2908. Morgan Kaufmann Publishers Inc., San Francisco (2007)
Pingali, P., Rahul, K., Varma, V.: Iiit hyderabad at duc 2007. In: Proceedings of the Document Understanding Conference. NIST, Rochester (2007)
Toutanova, K., Brockett, C., Gamon, M., Jagarlamudi, J., Suzuki, H., Vanderwende, L.: The pythy summarization system: Microsoft research at duc2007. In: DUC 2007: Document Understanding Conference, Rochester, NY, USA (2007)
Zhang, J., Cheng, X., Wu, G., Xu, H.: Adasum: an adaptive model for summarization. In: CIKM 2008: Proceeding of the 17th ACM Conference on Information and Knowledge Management, pp. 901–910. ACM, New York (2008)
Ouyang, Y., Li, W., Li, S., Lu, Q.: Applying regression models to query-focused multi-document summarization. Inf. Process. Manage (2010)
Ouyang, Y., Li, W., Lu, Q.: An integrated multi-document summarization approach based on word hierarchical representation. In: ACL-IJCNLP 2009: Proceedings of the ACL-IJCNLP 2009 Conference Short Papers, pp. 113–116. Association for Computational Linguistics, Morristown (2009)
Nastase, V.: Topic-driven multi-document summarization with encyclopedic knowledge and spreading activation. In: EMNLP 2008: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 763–772. Association for Computational Linguistics, Morristown (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Badrinath, R., Venkatasubramaniyan, S., Veni Madhavan, C.E. (2011). Improving Query Focused Summarization Using Look-Ahead Strategy. In: Clough, P., et al. Advances in Information Retrieval. ECIR 2011. Lecture Notes in Computer Science, vol 6611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20161-5_64
Download citation
DOI: https://doi.org/10.1007/978-3-642-20161-5_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20160-8
Online ISBN: 978-3-642-20161-5
eBook Packages: Computer ScienceComputer Science (R0)