Abstract
Event Detection (ED) is an essential and challenging task in Information Extraction (IE). Recent advances in neural networks make it possible to build reliable models without complicated feature engineering. Neural network-based models commonly regard event detection as a char-wise or word-wise labeling task, which suffers from the problems of long-distance information miss-capturing, discontinuous labeling errors, etc., between characters/words and event triggers, especially in languages without natural word delimiters such as Chinese. In this paper, we propose a novel multi-layer Residual and Gated-Based Atrous Convolution Neural Network (RG-ACNN) framework, which attempts to alleviate above-mentioned problems. Specifically, the ACNN is introduced in our model to expand the receptive field to obtain multi-scale context information to capture dependencies between long-distance information. While gated and residual mechanisms are both imported to ACNN to improve our networks’ capability of the information filtering and aggregation. Besides, RG-ACNN performs event detection in a char-wise paradigm, where a novel “head-tail dual-pointer” labeled strategy is used to overcome the incomplete continuous labeling problem. Experiments on the ACE2005-CN and several standard benchmark datasets show that RG-ACNN significantly outperforms state-of-the-art (SOTA) methods.
Similar content being viewed by others
Notes
B: Begin, I: Intermediate, E: End, S: Single, O: Other.
References
Ghaeini R, Fern XZ, Huang L, Tadepalli P (2016) Event nugget detection with forward-backward recurrent neural networks. In: The 54th annual meeting of the association for computational linguistics, vol 2, pp 369–337
Nguyen TH, Grishman T (2015) Event detection and domain adaptation with convolutional neural networks. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, vol 2, pp 365–371
Chen Y, Xu L, Liu K, Zeng D, Zhao J (2015) Event extraction via dynamic multi-pooling convolutional neural networks. In: Proceedings of the 53rd annual meeting of the association for computational linguistics and the 7th international joint conference on natural language processing, vol 1, pp 167–176
Liu H, Lu Y, Han X, Sun L (2018) Nugget proposal networks for chinese event detection. In: Proceedings of the 56th annual meeting of the association for computational linguistics vol 1, pp 1565–1574
Zeng Y, Yang H, Feng Y, Wang Z, Zhao D (2016) A convolution BiLSTM neural network model for Chinese event extraction. In: Natural Language Understanding and Intelligent Applications. pp 275–287
Chen Z, Ji H (2009) Language specific issue and feature exploration in Chinese event extraction. In: Proceedings of Human language technologies: the 2009 annual conference of the north american chapter of the association for computational linguistics. pp. 209–121
Gehring J, Auli M, Grangier D, Yarats D, Dauphin YN (2017) Convolutional sequence to sequence learning. In: Proceedings of the 34th international conference on machine learning. pp. 1243–1252
Chieu HL, Ng HT (2002) A maximum entropy approach to information extraction from semi-structured and free text. In: Eighteenth national conference on Artificial intelligence. American Association for Artificial Intelligence. pp 786–791
Ahn D (2006) The stages of event extraction. In: Proceedings of the workshop on annotating and reasoning about time and events. pp 1–8
Li Q, Ji H, Yu H, Li S (2014) Constructing information networks using one single model. In: Proceedings of the 2014 conference on empirical methods in natural language processing, pp 1846–1851
Hong Y, Zhang J, Ma B, Yao J, Zhao G, Zhu Q (2011) Using cross-entity inference to improve event extraction. In: Proceedings of the 49th annual meeting of the association for computational linguistics: human language technologies, vol 1, pp 1127–1136
Ji H, Grishman R (2008) Refining event extraction through cross-document inference. In: 46th annual meeting of the association for computational linguistics: human language technologies, ACL-08: HLT. pp 254–262
Liao S, Grishman R (2010) Using document level cross-event inference to improve event extraction. In: Proceedings of the 48th annual meeting of the association for computational linguistics, pp 789–797
Li P, Zhu Q, Zhu G (2013) Joint modeling of argument identification and role determination in Chinese event extraction with discourse-level information. In: Twenty-Third international joint conference on artificial intelligence, pp 2120–2126
Chen C, Ng V (2012) Joint modeling for Chinese event extraction with rich linguistic features. In: Proceedings of COLING, pp 529–544
Li P, Zhou G, Zhu Q, Hou L (2012) Employing compositional semantics and discourse consistency in Chinese event extraction. In: Proceedings of the 2012 joint conference on empirical methods in natural language processing and computational natural language learning, pp 1006–1016
Nguyen TH, Grishman R (2016) Modeling skip-grams for event detection with convolutional neural networks. In: Proceedings of the 2016 conference on empirical methods in natural language, pp 886–891
Nguyen TH, Cho K, Grishman R (2016) Joint event extraction via recurrent neural networks. In: Proceedings of the 2016 conference of the north American chapter of the association for computational linguistics: human language technologies, pp 300–309
He X, Li L (2017) Trigger detection based on bidirectional LSTM and two-stage method. J Chin Inf Process 31(6):147–154
Feng X, Qin B, Liu T (2018) A language-independent neural network for event detection. Sci China Inf Sci Sci China Inf Sci 61(9):092–106
Chen B, Zhou Y (2019) Event trigger word extraction based on convolutional bidirectional long short term memory network. Comput Eng 45(1):153–158
Wang K, Hong Y (2018) Combining context dependency and sentence semantic representation for event nugget detection. J Fornt Comput Sci Technol 12(3):423–431
Liu J, Chen Y, Liu K, Zhao J (2018) Event detection via gated multilingual attention mechanism. In: Thirty-Second AAAI conference on artificial intelligence
Li L, Wan J, Zheng J, Wang J (2018) Biomedical event extraction based on GRU integrating attention mechanism. BMC Bioinformatics 19(9):177
Ramamoorthy S, Murugan S (2018) An attentive sequence model for adverse drug event extraction from biomedical text
Zhang T, Ji H (2018) Event extraction with generative adversarial imitation learning
Mu X, Xu A (2019) A character-level BiLSTM-CRF model with multi-representations for chinese event detection. IEEE Access 7(2019):146524–146532
Song Y, Shi S, Li J, Zhang H (2018) Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: Proceedings of the 2018 conference of the north american chapter of the association for computational linguistics: human language technologies, vol 2, pp 175–180
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Polosukhin I (2017) Attention is all you need, Advances in neural information processing systems, pp 5998–6008
Yu F, Koltun V (2015) Multi-scale context aggregation by dilated convolutions
Wang P, Chen P, Yuan Y, Liu D, Huang Z, Hou X, Cottrell G (2018) Understanding convolution for semantic segmentation, 2018 IEEE winter conference on applications of computer vision (WACV), pp 1451–1460
Chang SY, Li B, Simko G, Sainath TN, Tripthi A (2018) A. vanden Oord and O. Vinyals, Temporal modeling using dilated convolution and gating for voice-activity-detection. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 5549–5553
Van den Oord A, Kalchbrenner N, Espeholt L, Vinyals O, Graves A (2016) Conditional image generation with PixelCNN decoders. In: Proceedings of the 30th international conference on neural information processing systems, pp 4790–4798
Dauphin YN, Fan A, Auli M, Grangier D (2017) Language modeling with gated convolutional networks. In: Proceedings of the 34th international conference on machine learning, vol 70, pp 933–941
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
You L, Liu K (2005) Building chinese framenet database. In: 2005 international conference on natural language processing and knowledge engineering, pp 301–306
Acknowledgements
This research is financially supported by The National Key Research and Development Program of China (Grant Number 2018YFC0807105), National Natural Science Foundation of China (Grant Number 61462073) and Science and Technology Committee of Shanghai Municipality (STCSM) (Under Grant Numbers 17DZ1101003, 18511106602 and 18DZ2252300).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, Z., Guo, Y. & Wang, J. Empower Chinese event detection with improved atrous convolution neural networks. Neural Comput & Applic 33, 5805–5820 (2021). https://doi.org/10.1007/s00521-020-05360-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-05360-1