Top-down Text-Level Discourse Rhetorical Structure Parsing with Bidirectional Representation Learning

  • Regular Paper
  • Published in Journal of Computer Science and Technology

Abstract

Early studies on discourse rhetorical structure parsing mainly adopted bottom-up approaches, which limit the parsing process to local information. Although current top-down parsers capture global information better and have achieved notable success, the relative importance of local and global information differs across the levels of discourse parsing. This paper argues that combining local and global information is a more sensible strategy for discourse parsing. To support this claim, we introduce a top-down discourse parser with bidirectional representation learning. Existing corpora for Rhetorical Structure Theory (RST) are known to be quite limited in size, which makes discourse parsing challenging. To alleviate this problem, we leverage boundary features and a data augmentation strategy to tap the full potential of our parser. We adopt two evaluation methods, and experiments on the RST-DT corpus show that the performance gains of our parser stem primarily from the effective combination of local and global information, with the boundary features and the data augmentation strategy contributing further improvements. With gold-standard elementary discourse units (EDUs), our parser significantly outperforms the baseline systems in nuclearity detection while remaining competitive on the other three indicators (span, relation, and full). With automatically segmented EDUs, our parser still outperforms previous state-of-the-art work.
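As background for the top-down paradigm described above, the minimal Python sketch below illustrates how a generic top-down parser recursively selects split points to turn a sequence of EDUs into a binary rhetorical tree. It is not the parser proposed in this paper: the encoder outputs (edu_reprs) and the score_split and label_span functions are hypothetical placeholders for whatever split scorer and nuclearity/relation classifier a concrete model uses.

# A minimal, generic sketch of top-down split-point discourse parsing
# (illustrative only; not the model proposed in this paper).
from dataclasses import dataclass
from typing import Optional, Tuple

@dataclass
class Node:
    left: int                                  # index of the first EDU in the span
    right: int                                 # index one past the last EDU
    nuclearity: Optional[str] = None           # e.g., "NS", "SN", "NN"
    relation: Optional[str] = None             # e.g., "Elaboration"
    children: Optional[Tuple["Node", "Node"]] = None

def parse_span(edu_reprs, left, right, score_split, label_span):
    """Recursively build a binary rhetorical tree over EDUs [left, right).

    edu_reprs   -- per-EDU representations from some (unspecified) encoder
    score_split -- hypothetical scorer returning a split point k, left < k < right
    label_span  -- hypothetical classifier returning (nuclearity, relation)
                   for the two sub-spans produced by the split
    """
    if right - left == 1:
        return Node(left, right)               # a single EDU is a leaf
    k = score_split(edu_reprs, left, right)    # where to cut the current span
    nuc, rel = label_span(edu_reprs, left, k, right)
    left_child = parse_span(edu_reprs, left, k, score_split, label_span)
    right_child = parse_span(edu_reprs, k, right, score_split, label_span)
    return Node(left, right, nuc, rel, (left_child, right_child))

# Toy usage with dummy components: always split in the middle, always label "NN"/"Joint".
tree = parse_span([None] * 4, 0, 4,
                  lambda reprs, l, r: (l + r) // 2,
                  lambda reprs, l, k, r: ("NN", "Joint"))

In published top-down parsers, the split decision is often realized with a pointer-network-style attention over the encoded EDU sequence, and the label decision with a classifier over the representations of the two resulting sub-spans.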

Author information

Corresponding author

Correspondence to Fang Kong.

Supplementary Information

ESM 1 (PDF 147 kb)

About this article

Cite this article

Zhang, LY., Tan, X., Kong, F. et al. Top-down Text-Level Discourse Rhetorical Structure Parsing with Bidirectional Representation Learning. J. Comput. Sci. Technol. 38, 985–1001 (2023). https://doi.org/10.1007/s11390-022-1167-0
