Expanding Native Training Data for Implicit Discourse Relation Classification

Hong, Yu; Zhu, Shanshan; Yan, Weirong; Yao, Jianmin; Zhu, Qiaoming; Zhou, Guodong

doi:10.1007/978-3-662-45558-6_6

Part of the book series: Communications in Computer and Information Science (CCIS, volume 489)

Included in the following conference series:

Chinese National Conference on Social Media Processing

Abstract

Linguistically informed features have proven useful in classifying implicit discourse relations between adjacent text spans. However, state-of-the-art methods in this area suffer from either a sparse corpus of natively implicit relations or a counter-intuitive corpus of artificially implicit ones, and consequently from either insufficient or distorted training when automatically learning discriminative features. To overcome this problem, this paper proposes a semantic-frame-based vector model for the unsupervised acquisition of semantically and relationally parallel data, aiming to enlarge the natively implicit relation corpus and thereby improve training. Experiments on PDTB 2.0 show that using the acquired parallel corpus yields statistically significant improvements over using the prototypical corpus.
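The abstract does not give the details of the frame-based vector model, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It assumes each argument of a relation has already been mapped to a list of semantic frame labels by some frame-semantic parser, represents each argument as a bag-of-frames count vector, and lets an unlabeled argument pair inherit the relation label of its most similar seed pair when the cosine similarity of both arguments clears a threshold. The function names, the `min` combination of the two argument similarities, the threshold of 0.8, and the example frame labels are all hypothetical choices made for illustration.

```python
from collections import Counter
from math import sqrt


def frame_vector(frames):
    """Bag-of-frames count vector for one argument (assumed representation)."""
    return Counter(frames)


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pair_similarity(seed, cand):
    """Similarity of two argument pairs: both arguments must match,
    so the weaker of the two cosine scores is used (an assumption)."""
    sim1 = cosine(frame_vector(seed[0]), frame_vector(cand[0]))
    sim2 = cosine(frame_vector(seed[1]), frame_vector(cand[1]))
    return min(sim1, sim2)


def expand(seeds, candidates, threshold=0.8):
    """Label each candidate pair with the relation of its closest seed,
    keeping only candidates whose similarity reaches the threshold.

    seeds: list of (arg1_frames, arg2_frames, relation_label)
    candidates: list of (arg1_frames, arg2_frames)
    """
    expanded = []
    if not seeds:
        return expanded
    for cand in candidates:
        best = max(seeds, key=lambda s: pair_similarity(s, cand))
        if pair_similarity(best, cand) >= threshold:
            expanded.append((cand[0], cand[1], best[2]))
    return expanded


# Illustrative data only; the frame labels stand in for parser output.
seeds = [(["Causation", "Motion"], ["Cause_harm"], "Contingency")]
candidates = [
    (["Causation", "Motion"], ["Cause_harm"]),
    (["Commerce_buy"], ["Statement"]),
]
result = expand(seeds, candidates)
```

Here `result` keeps only the first candidate, which shares all its frames with the seed pair, and assigns it the seed's relation label; the second candidate shares no frames and is discarded. Taking the minimum of the two argument similarities is a deliberately strict choice, since a pair that matches on only one argument is unlikely to express the same relation.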

Author information

Authors and Affiliations

Key Laboratory of Natural Language Processing of Jiangsu Province, School of Computer Science and Technology, Soochow University, No.1 Shizi Street, Suzhou City, Jiangsu Province, China
Yu Hong, Shanshan Zhu, Weirong Yan, Jianmin Yao, Qiaoming Zhu & Guodong Zhou

Editor information

Editors and Affiliations

School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
Heyan Huang
Department of Computer Science and Technology, Harbin Institute of Technology, China
Ting Liu
Beijing Institute of Technology School of Computer Science, China
Hua-Ping Zhang
Department of Computer Science and Technology, Tsinghua University, China
Jie Tang

About this paper

Cite this paper

Hong, Y., Zhu, S., Yan, W., Yao, J., Zhu, Q., Zhou, G. (2014). Expanding Native Training Data for Implicit Discourse Relation Classification. In: Huang, H., Liu, T., Zhang, HP., Tang, J. (eds) Social Media Processing. SMP 2014. Communications in Computer and Information Science, vol 489. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45558-6_6

DOI: https://doi.org/10.1007/978-3-662-45558-6_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45557-9
Online ISBN: 978-3-662-45558-6
eBook Packages: Computer Science (R0)
