The Causal Strength Bank: A New Benchmark for Causal Strength Classification

Yuan, Xiaosong; Guan, Renchu; Zuo, Wanli; Zhang, Yijia

doi:10.1007/978-3-031-33374-3_9

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13935)

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

Abstract

Causal relation extraction is essential to causality discovery in natural language processing. Progress in model-driven causal relation extraction has stalled, so we turn to a data-driven approach. Richer causal information is needed because most current datasets label only the locations of causal entities or events, which may restrict the learning capacity of models. In this paper, we introduce a novel benchmark task, causal strength classification, and a corresponding dataset, the Causal Strength Bank (CSB). CSB consists of a Chinese dataset (C-CSB) and an English dataset (E-CSB), each of which combines causal strength, causal polarity, and causal entity annotations. To ensure credibility, we build the English corpus from four canonical English datasets and the Chinese corpus from cleaned Wikipedia passages. The corpus is then annotated and cross-checked by professional annotators in two stages to ensure the accuracy of CSB. We evaluate various baseline methods on CSB and show that causal strength information benefits causal relation extraction, demonstrating the value of the proposed dataset. Our dataset is available at https://github.com/yuanxs21/CSB-dataset.

Acknowledgements

This work is supported by the National Natural Science Foundation of China under Grant 61976103 and by the general foundation of the National University of Defense Technology under Grant ZK22-11.

Author information

Authors and Affiliations

College of Computer Science and Technology, Jilin University, Changchun, China
Xiaosong Yuan, Renchu Guan & Wanli Zuo
Key Laboratory of Symbolic Computation and Knowledge Engineering, Ministry of Education, Changchun, China
Xiaosong Yuan, Renchu Guan & Wanli Zuo
College of Electronic Countermeasures, National University of Defense Technology, Hefei, China
Yijia Zhang

Corresponding author

Correspondence to Yijia Zhang.

Editor information

Editors and Affiliations

Kyoto University, Kyoto, Japan
Hisashi Kashima
IBM Research, Thomas J. Watson Research Center, Yorktown Heights, NY, USA
Tsuyoshi Ide
National Chiao Tung University, Hsinchu, Taiwan
Wen-Chih Peng

About this paper

Cite this paper

Yuan, X., Guan, R., Zuo, W., Zhang, Y. (2023). The Causal Strength Bank: A New Benchmark for Causal Strength Classification. In: Kashima, H., Ide, T., Peng, WC. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2023. Lecture Notes in Computer Science, vol 13935. Springer, Cham. https://doi.org/10.1007/978-3-031-33374-3_9

DOI: https://doi.org/10.1007/978-3-031-33374-3_9
Published: 27 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33373-6
Online ISBN: 978-3-031-33374-3
eBook Packages: Computer Science, Computer Science (R0)
