Unsupervised Joint Learning for Headline Generation and Discourse Structure of Reviews

Isonuma, Masaru; Mori, Junichiro; Sakata, Ichiro

doi:10.1007/978-3-030-39878-1_13

Masaru Isonuma²²,
Junichiro Mori^22,23 &
Ichiro Sakata²²

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1128))

Included in the following conference series:

Annual Conference of the Japanese Society for Artificial Intelligence

545 Accesses

Abstract

This is an extension from a selected paper from JSAI2019. Recently, using a large number of reference summaries, supervised neural summarization models have achieved success. However, such data is rare, and trained models cannot be shared across domains. As a solution for such a problem, we propose the first unsupervised end-to-end headline generation model for a single review. We assume that a review can be described as a discourse tree in which the headline is the root and the child sentences elaborate on their parent. By estimating the parent from their children recursively, our model induces the tree and generates the headline that describes the entire review. Through the evaluation of the generated headline on actual reviews, our model achieved competitive performance with supervised models, especially on relatively long reviews. In induced trees, we confirmed that the child sentences explain the parent in detail and the generated headlines abstract for the entire review.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

ElmNet: a benchmark dataset for generating headlines from Persian papers

Article 14 October 2021

Headline Generation with Recurrent Neural Network

SHEG: summarization and headline generation of news articles using deep learning

Article 23 July 2020

References

Bing, L., Li, P., Liao, Y., Lam, W., Guo, W., Passonneau, R.: Abstractive multi-document summarization via phrase selection and merging. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, vol. 1, pp. 1587–1597 (2015)
Google Scholar
Carenini, G., Cheung, J.C.K., Pauls, A.: Multi-document summarization of evaluative text. Comput. Intell. 29(4), 545–576 (2013)
Article MathSciNet Google Scholar
Chopra, S., Auli, M., Rush, A.M.: Abstractive sentence summarization with attentive recurrent neural networks. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 93–98 (2016)
Google Scholar
Chu, E., Liu, P.: MeanSum: a neural model for unsupervised multi-document abstractive summarization. In: Proceedings of the 36th International Conference on Machine Learning, vol. 97, pp. 1223–1232 (2019)
Google Scholar
Di Fabbrizio, G., Stent, A., Gaizauskas, R.: A hybrid approach to multi-document summarization of opinions in reviews. In: Proceedings of the 8th International Natural Language Generation Conference, pp. 54–63 (2014)
Google Scholar
Dohare, S., Gupta, V., Karnick, H.: Unsupervised semantic abstractive summarization. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Student Research Workshop, pp. 74–83 (2018)
Google Scholar
Erkan, G., Radev, D.R.: Lexpagerank: prestige in multi-document text summarization. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pp. 365–371 (2004)
Google Scholar
Fang, Y., Zhu, H., Muszyńska, E., Kuhnle, A., Teufel, S.: A proposition-based abstractive summariser. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics, pp. 567–578 (2016)
Google Scholar
Gerani, S., Mehdad, Y., Carenini, G., Ng, R.T., Nejat, B.: Abstractive summarization of product reviews using discourse structure. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 1602–1613 (2014)
Google Scholar
He, R., McAuley, J.: Ups and downs: modeling the visual evolution of fashion trends with one-class collaborative filtering. In: Proceedings of the 25th International Conference on World Wide Web, pp. 507–517 (2016)
Google Scholar
Hirao, T., Yoshida, Y., Nishino, M., Yasuda, N., Nagata, M.: Single-document summarization as a tree knapsack problem. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1515–1520 (2013)
Google Scholar
Isonuma, M., Fujino, T., Mori, J., Matsuo, Y., Sakata, I.: Extractive summarization using multi-task learning with document classification. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 2101–2110 (2017)
Google Scholar
Ji, Y., Smith, N.A.: Neural discourse structure for text categorization. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 996–1005 (2017)
Google Scholar
Joulin, A., Grave, E., Mikolov, P.B.T.: Bag of tricks for efficient text classification. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, vol. 2, pp. 427–431 (2017)
Google Scholar
Kikuchi, Y., Hirao, T., Takamura, H., Okumura, M., Nagata, M.: Single document summarization based on nested tree structure. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol. 2, pp. 315–320 (2014)
Google Scholar
Koo, T., Globerson, A., Carreras, X., Collins, M.: Structured prediction models via the matrix-tree theorem. In: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 141–150 (2007)
Google Scholar
Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In: Aggarwal, C., Zhai, C. (eds.) Mining Text Data, pp. 415–463. Springer, Boston (2012)
Chapter Google Scholar
Liu, Y., Lapata, M.: Learning structured text representations. Trans. Assoc. Comput. Linguist. 6, 63–75 (2018)
Article Google Scholar
Ma, S., Sun, X., Lin, J., Ren, X.: A hierarchical end-to-end model for jointly improving text summarization and sentiment classification. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence, pp. 4251–4257 (2018)
Google Scholar
Mann, W.C., Thompson, S.A.: Rhetorical structure theory: toward a functional theory of text organization. Text-Interdiscip. J. Study Discourse 8(3), 243–281 (1988)
Article Google Scholar
McAuley, J., Targett, C., Shi, Q., Van Den Hengel, A.: Image-based recommendations on styles and substitutes. In: Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 43–52 (2015)
Google Scholar
Miao, Y., Blunsom, P.: Language as a latent variable: discrete generative models for sentence compression. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 319–328 (2016)
Google Scholar
Mihalcea, R., Tarau, P.: Textrank: bringing order into texts. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pp. 404–411 (2004)
Google Scholar
Nallapati, R., Zhou, B., dos Santos, C., Gulcehre, C., Xiang, B.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, pp. 280–290 (2016)
Google Scholar
Paulus, R., Xiong, C., Socher, R.: A deep reinforced model for abstractive summarization. In: Proceedings of the 6th International Conference on Learning Representations (2018)
Google Scholar
Radev, D.R., Jing, H., Styś, M., Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manag. 40(6), 919–938 (2004)
Article Google Scholar
Rush, A.M., Chopra, S., Weston, J.: A neural attention model for abstractive sentence summarization. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 379–389 (2015)
Google Scholar
See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1073–1083 (2017)
Google Scholar
Tan, J., Wan, X., Xiao, J.: Abstractive document summarization with a graph-based attentional neural model. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, vol. 1, pp. 1171–1181 (2017)
Google Scholar
Tutte, W.T.: Graph Theory, vol. 21. Addison-Wesley, Boston (1984)
MATH Google Scholar
Wang, H., Ren, J.: A self-attentive hierarchical model for jointly improving text summarization and sentiment classification. In: Proceedings of the 10th Asian Conference on Machine Learning, pp. 630–645 (2018)
Google Scholar
Wang, L., Ling, W.: Neural network-based abstract generation for opinions and arguments. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 47–57 (2016)
Google Scholar
Yoshida, Y., Suzuki, J., Hirao, T., Nagata, M.: Dependency-based discourse parser for single-document summarization. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp. 1834–1839 (2014)
Google Scholar
Yu, N., Huang, M., Shi, Y., Zhu, X.: Product review summarization by exploiting phrase properties. In: Proceedings of the 26th International Conference on Computational Linguistics, pp. 1113–1124 (2016)
Google Scholar

Download references

Acknowledgements

This work was supported by CREST, JST, the New Energy and Industrial Technology Development Organization (NEDO) and Deloitte Tohmatsu Financial Advisory LLC.

Author information

Authors and Affiliations

The University of Tokyo, 3-7-1 Hongo, Bunkyo, Tokyo, Japan
Masaru Isonuma, Junichiro Mori & Ichiro Sakata
RIKEN, 1-4-1 Nihonbashi, Chuo, Tokyo, Japan
Junichiro Mori

Authors

Masaru Isonuma
View author publications
You can also search for this author in PubMed Google Scholar
Junichiro Mori
View author publications
You can also search for this author in PubMed Google Scholar
Ichiro Sakata
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Masaru Isonuma .

Editor information

Editors and Affiliations

Department of Systems Innovation, University of Tokyo, Tokyo, Japan
Yukio Ohsawa
Faculty of Business and Commerce, Kansai University, Osaka, Japan
Katsutoshi Yada
Nagoya Institute of Technology, Nagoya, Japan
Takayuki Ito
Graduate School of System Design, Tokyo Metropolitan University, Tokyo, Japan
Yasufumi Takama
Department of Information and Communication, Tokyo Metropolitan University, Tokyo, Japan
Eri Sato-Shimokawara
Faculty of Letters, Chiba University, Chiba, Japan
Akinori Abe
School of Engineering, The University of Tokyo, Tokyo, Japan
Junichiro Mori
Graduate School of Economics, Osaka University, Toyonaka, Osaka, Japan
Naohiro Matsumura

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Isonuma, M., Mori, J., Sakata, I. (2020). Unsupervised Joint Learning for Headline Generation and Discourse Structure of Reviews. In: Ohsawa, Y., et al. Advances in Artificial Intelligence. JSAI 2019. Advances in Intelligent Systems and Computing, vol 1128. Springer, Cham. https://doi.org/10.1007/978-3-030-39878-1_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-39878-1_13
Published: 04 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-39877-4
Online ISBN: 978-3-030-39878-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics