Abstract
Because abstractive summarization is difficult, the great majority of past work on document summarization has been extractive. The recent success of the sequence-to-sequence framework, however, has made abstractive summarization viable: recurrent neural network models built on the attention-based encoder-decoder have achieved promising performance on short-text summarization tasks. Unfortunately, these attention encoder-decoder models often suffer from two undesirable shortcomings: they tend to generate repeated words or phrases, and they cannot handle out-of-vocabulary words appropriately. To address these issues, we propose adding an attention mechanism over the output sequence to avoid repetitive content, and we use the subword method to handle rare and unknown words. We applied our model to the public dataset provided by NLPCC 2017 Shared Task 3. The evaluation results show that our system achieved the best ROUGE performance among all participating teams and is competitive with state-of-the-art methods.
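The subword method the abstract refers to is byte-pair encoding (BPE) segmentation, which builds a vocabulary of frequent character sequences so that rare words decompose into known subword units. As an illustration of the general idea only (not the authors' implementation), a minimal sketch of learning BPE merges from a toy corpus:

```python
import re
from collections import Counter

def get_pair_counts(vocab):
    """Count adjacent symbol pairs across a (word -> frequency) vocabulary."""
    pairs = Counter()
    for word, freq in vocab.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(pair, vocab):
    """Replace every free-standing occurrence of the pair with its concatenation."""
    pattern = re.compile(r'(?<!\S)' + re.escape(' '.join(pair)) + r'(?!\S)')
    return {pattern.sub(''.join(pair), word): freq for word, freq in vocab.items()}

# Toy corpus: each word is a space-separated symbol sequence with an
# end-of-word marker </w>; frequencies are illustrative.
vocab = {'l o w </w>': 5, 'l o w e r </w>': 2,
         'n e w e s t </w>': 6, 'w i d e s t </w>': 3}

merges = []
for _ in range(10):                       # learn 10 merge operations
    pairs = get_pair_counts(vocab)
    if not pairs:
        break
    best = max(pairs, key=pairs.get)      # most frequent adjacent pair
    vocab = merge_pair(best, vocab)
    merges.append(best)

print(merges[:4])  # → [('e', 's'), ('es', 't'), ('est', '</w>'), ('l', 'o')]
```

At segmentation time, the learned merge list is applied in order to new words, so an out-of-vocabulary word such as "lowest" decomposes into known units ("low", "est</w>") instead of becoming a single unknown token.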
Acknowledgments
This work was supported by the National Natural Science Foundation of China (No. 61402191), the Specific Funding for Education Science Research by Self-determined Research Funds of CCNU from the Colleges’ Basic Research and Operation of MOE (No. CCNU16JYKX15), and the Thirteenth Five-Year Research Planning Project of the National Language Committee (No. WT135-11). We also thank Zhiwen Xie for helpful discussion.
Copyright information
© 2018 Springer International Publishing AG
Cite this paper
Hou, L., Hu, P., Bei, C. (2018). Abstractive Document Summarization via Neural Model with Joint Attention. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1