Externally Controllable RNN for Implicit Discourse Relation Classification

Yue, Xihan; Fu, Luoyi; Wang, Xinbing

doi:10.1007/978-3-319-73618-1_14

Externally Controllable RNN for Implicit Discourse Relation Classification

Xihan Yue¹⁸,
Luoyi Fu¹⁸ &
Xinbing Wang¹⁸

Conference paper
First Online: 05 January 2018

3296 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10619))

Abstract

Without discourse connectives, recognizing implicit discourse relations is a great challenge and a bottleneck for discourse parsing. The key factor lies in proper representing the two discourse arguments as well as modeling their interactions. This paper proposes two novel neural networks, i.e., externally controllable LSTM (ECLSTM) and attention-augmented GRU (AAGRU), which can be stacked to incorporate arguments’ interactions into their representing process. The two networks are variants of Recurrent Neural Network (RNN) but equipped with externally controllable cells that their working processes can be dynamically regulated. ECLSTM is relatively conservative and easily comprehensible while AAGRU works better for small datasets. Multilevel RNN with smaller hidden state allows critical information to be gradually exploited, and thus enables our model to fit deeper structures with slightly increased complexity. Experiments on the Penn Discourse Treebank (PDTB) benchmark show that our method achieves significant performance gain over vanilla LSTM/CNN models and competitive with previous state-of-the-art models.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Marcu, D., Echihabi, A.: An unsupervised approach to recognizing discourse relations. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 368–375 (2002)
Google Scholar
Prasad, R., Dinesh, N., Lee, A., Miltsakaki, E., Robaldo, L., Joshi, A., Webber, B.: The Penn Discourse Treebank 2.0. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (2008)
Google Scholar
Pitler, E., Raghupathy, M., Mehta, H., Nenkova, A., Lee, A., Joshi, A.: Easily identifiable discourse relations. In: Proceedings of the 22nd International Conference on Computational Linguistics (2008)
Google Scholar
Pitler, E., Louis, A., Nenkova, A.: Automatic sense prediction for implicit discourse relations in text. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pp. 683–691 (2009)
Google Scholar
Lin, Z., Ng, H.T., Kan, M.Y.: PDTB-styled end-to-end discourse parser. Technical report, School of Computing, National University of Singapore (2010)
Google Scholar
Zhou, Z., Lan, M., Xu, Y., Niu, Z., Su, J., Tan, C.L.: Predicting discourse connectives for implicit discourse relation recognition. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 1507–1514 (2010)
Google Scholar
Park, J., Cardie, C.: Improving implicit discourse relation recognition through feature set optimization. In: Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 108–112 (2012)
Google Scholar
Rutherford, A.T., Xue, N.: Discovering implicit discourse relations through brown cluster pair representation and coreference patterns. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 645–654 (2014)
Google Scholar
Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S.J., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pp. 55–60 (2014)
Google Scholar
Pennington, J., Socher, R., Manning, C.D.: GloVe: Global vectors for word representation. In: Proceedings of the 2014 Conference Empiricial Methods in Natural Language Processing (2014)
Google Scholar
Wang, J., Lan, M.: A refined end-to-end discourse parser. In: Proceedings of the Nineteenth Conference on Computational Natural Language Learning: Shared Task, pp. 17–24 (2015)
Google Scholar
Braud, C., Denis, P.: Comparing word representations for implicit discourse relation classification. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2201–2211 (2015)
Google Scholar
Zhang, B., Su, J., Xiong, D., Lu, Y., Duan, H., Yao, J.: Shallow convolutional neural network for implicit discourse relation recognition. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2230–2235 (2015)
Google Scholar
Kingma, D.P., Adam, J.B.: A method for stochastic optimization. In: Proceedings of the 3rd International Conference for Learning Representations (2015)
Google Scholar
Chen, J., Zhang, Q., Liu, P., Qiu, X., Huang, X.: Implicit discourse relation detection via a deep architecture with gated relevance network. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1726–1735 (2016)
Google Scholar
Wu, C., Shi, X., Cheng, Y., Huang, Y., Su, J.: Bilingually-constrained synthetic data for implicit discourse relation recognition. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 2306–2312 (2016)
Google Scholar
Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E.: Hierarchical attention networks for document classification. In: Proceedings of NAACL Conference (2016)
Google Scholar
Liu, Y., Li, S., Zhang, X., Sui, Z.: Implicit discourse relation classification via multi-task neural networks. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, pp. 2750–2756 (2016)
Google Scholar
Liu, Y., Li, S.: Recognizing implicit discourse relations via repeated reading: neural networks with multi-level attention. In: Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 1224–1233 (2016)
Google Scholar
Qin, L., Zhang, Z., Zhao, H., Hu, Z., Xing, E.P.: Adversarial connective-exploiting networks for implicit discourse relation classification. In: Proceedings of ACL Conference (2017)
Google Scholar
Li, H., Zhang, J., Zong, C.: Implicit discourse relation recognition for english and Chinese with multiview modeling and effective representation learning. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 16, 19 (2017)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
Xihan Yue, Luoyi Fu & Xinbing Wang

Authors

Xihan Yue
View author publications
You can also search for this author in PubMed Google Scholar
Luoyi Fu
View author publications
You can also search for this author in PubMed Google Scholar
Xinbing Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinbing Wang .

Editor information

Editors and Affiliations

Fudan University, Shanghai, China
Xuanjing Huang
Singapore Management University, Singapore, Singapore
Jing Jiang
Peking University, Beijing, China
Dongyan Zhao
Peking University, Beijing, China
Yansong Feng
Soochow University, Suzhou, China
Yu Hong

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yue, X., Fu, L., Wang, X. (2018). Externally Controllable RNN for Implicit Discourse Relation Classification. In: Huang, X., Jiang, J., Zhao, D., Feng, Y., Hong, Y. (eds) Natural Language Processing and Chinese Computing. NLPCC 2017. Lecture Notes in Computer Science(), vol 10619. Springer, Cham. https://doi.org/10.1007/978-3-319-73618-1_14

Download citation

DOI: https://doi.org/10.1007/978-3-319-73618-1_14
Published: 05 January 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73617-4
Online ISBN: 978-3-319-73618-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics