skip to main content
short-paper

Advancing Chinese Event Detection via Revisiting Character Information

Published: 11 February 2022 Publication History

Abstract

Recently, character information has been successfully introduced into the encoder-decoder event detection model to relieve the trigger-word mismatch problem, thus achieving impressive results in the languages without natural delimiters (i.e., Chinese). However, it is introduced into the encoder or the decoder separately, which makes the advantage of character information not be captured and represented adequately for event detection. In this article, we proposed a novel method to model character information in both the encoding and decoding stages to advance the neural event detection model. In particular, the proposed method can encode both words and characters and predict their event types jointly and further leverage interactions between word and its characters to optimize the inference. Experimental results show that the proposed model outperforms previous event detection methods on the ACE2005 Chinese benchmark. We release our code at Github.

References

[1]
Deng Cai, Hai Zhao, Zhisong Zhang, Yuan Xin, Yongjian Wu, and Feiyue Huang. 2017. Fast and accurate neural word segmentation for chinese. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 608–615.
[2]
Chen Chen and Vincent Ng. 2012. Joint modeling for chinese event extraction with rich linguistic features. In Proceedings of the International Conference on Computational Linguistics (COLING’12). 529–544.
[3]
Yubo Chen, Liheng Xu, Kang Liu, Daojian Zeng, and Jun Zhao. 2015. Event extraction via dynamic multi-pooling convolutional neural networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, 167–176.
[4]
Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang, and Guoping Hu. 2020. Revisiting pre-trained models for Chinese natural language processing. In Proceedings of the Conference on Empirical Methods in Natural Language Processing: Findings. Association for Computational Linguistics, 657–668.
[5]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186.
[6]
Ning Ding, Ziran Li, Zhiyuan Liu, Haitao Zheng, and Zibo Lin. 2019. Event detection with trigger-aware lattice neural network. In Proceedings of the Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, 347–356.
[7]
Xiaocheng Feng, Lifu Huang, Duyu Tang, Heng Ji, Bing Qin, and Ting Liu. 2016. A language-independent neural network for event detection. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, 66–71.
[8]
Yu Hong, Jianfeng Zhang, Bin Ma, Jianmin Yao, Guodong Zhou, and Qiaoming Zhu. 2011. Using cross-entity inference to improve event extraction. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 1127–1136.
[9]
Guillaume Lample, Miguel Ballesteros, Sandeep Subramanian, Kazuya Kawakami, and Chris Dyer. 2016. Neural architectures for named entity recognition. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 260–270.
[10]
Peifeng Li, Guodong Zhou, Qiaoming Zhu, and Libin Hou. 2012. Employing compositional semantics and discourse consistency in Chinese event extraction. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, 1006–1016.
[11]
Peifeng Li, Qiaoming Zhu, Hongjun Diao, and Guodong Zhou. 2012. Joint modeling of trigger identification and event type determination in Chinese event extraction. In Proceedings of the 24th International Conference on Computational Linguistics. Association for Computational Linguistics, 1635–1652.
[12]
Qi Li, Heng Ji, and Liang Huang. 2013. Joint event extraction via structured prediction with global features. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 73–82. https://www.aclweb.org/anthology/P13-1008.
[13]
Shen Li, Zhe Zhao, Renfen Hu, Wensi Li, Tao Liu, and Xiaoyong Du. 2018. Analogical reasoning on chinese morphological and semantic relations. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 138–143.
[14]
Shasha Liao and Ralph Grishman. 2010. Using document level cross-event inference to improve event extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 789–797.
[15]
Hongyu Lin, Yaojie Lu, Xianpei Han, and Le Sun. 2018. Adaptive scaling for sparse detection in information extraction. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1033–1043.
[16]
Hongyu Lin, Yaojie Lu, Xianpei Han, and Le Sun. 2018. Nugget proposal networks for Chinese event detection. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1565–1574.
[17]
Xiao Liu, Zhunchen Luo, and Heyan Huang. 2018. Jointly multiple events extraction via attention-based graph information aggregation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1247–1256.
[18]
John E. Mitchell. 2002. Branch-and-cut algorithms for combinatorial optimization problems. Handbook of Applied Optimization, vol. 1. Oxford University Press, Oxford, UK, 65–77.
[19]
Thien Nguyen and Ralph Grishman. 2018. Graph convolutional networks with argument-aware pooling for event detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
[20]
Thien Huu Nguyen and Ralph Grishman. 2015. Event detection and domain adaptation with convolutional neural networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). Association for Computational Linguistics, 365–371.
[21]
Chuhan Wu, Fangzhao Wu, Tao Qi, Junxin Liu, Yongfeng Huang, and Xing Xie. 2020. Detecting entities of works for chinese chatbot. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 19, 6, Article 88 (Sept. 2020), 13 pages.
[22]
Wang Xu, Kehai Chen, and Tiejun Zhao. 2021. Document-Level relation extraction with reconstruction. Proc. AAAI Conf. Artif. Intelli. 35, 16 (May 2021), 14167–14175.
[23]
Bishan Yang and Tom Mitchell. 2017. Leveraging knowledge bases in LSTMs for improving machine reading. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1436–1446.
[24]
Jie Yang and Yue Zhang. 2018. NCRF++: An open-source neural sequence labeling toolkit. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics.
[25]
Matthew D. Zeiler. 2012. ADADELTA: An adaptive learning rate method. Computing Research Repository abs/1212.5701. http://arxiv.org/abs/1212.5701.
[26]
Ying Zeng, Honghui Yang, Yansong Feng, Zheng Wang, and Dongyan Zhao. 2016. A convolution BiLSTM neural network model for chinese event extraction. In Natural Language Understanding and Intelligent Applications. Springer International Publishing, 275–287.
[27]
Yue Zhang and Jie Yang. 2018. Chinese NER using lattice LSTM. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1554–1564.
[28]
Hai Zhao, Chang-Ning Huang, Mu Li, and Bao-Liang Lu. 2010. A unified character-based tagging framework for chinese word segmentation. ACM Trans. Asian Lang. Inf. Process. 9, 2, Article 5 (June 2010), 32 pages.

Cited By

View all
  • (2024)Dependency structure-enhanced graph attention networks for event detectionProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i17.29877(19098-19106)Online publication date: 20-Feb-2024
  • (2024)Enhancing Chinese Event Extraction with Event Trigger StructuresACM Transactions on Asian and Low-Resource Language Information Processing10.1145/366356723:7(1-18)Online publication date: 19-Jul-2024
  • (2024)SaLa: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language UnderstandingProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679676(3570-3580)Online publication date: 21-Oct-2024
  • Show More Cited By

Index Terms

  1. Advancing Chinese Event Detection via Revisiting Character Information

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian and Low-Resource Language Information Processing
    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 4
    July 2022
    464 pages
    ISSN:2375-4699
    EISSN:2375-4702
    DOI:10.1145/3511099
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 February 2022
    Accepted: 01 November 2021
    Revised: 01 August 2021
    Received: 01 March 2021
    Published in TALLIP Volume 21, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Event detection
    2. Chinese
    3. word and character
    4. integer linear programming

    Qualifiers

    • Short-paper
    • Refereed

    Funding Sources

    • National Natural Science Foundation of China

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)36
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Dependency structure-enhanced graph attention networks for event detectionProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i17.29877(19098-19106)Online publication date: 20-Feb-2024
    • (2024)Enhancing Chinese Event Extraction with Event Trigger StructuresACM Transactions on Asian and Low-Resource Language Information Processing10.1145/366356723:7(1-18)Online publication date: 19-Jul-2024
    • (2024)SaLa: Scenario-aware Label Graph Interaction for Multi-intent Spoken Language UnderstandingProceedings of the 33rd ACM International Conference on Information and Knowledge Management10.1145/3627673.3679676(3570-3580)Online publication date: 21-Oct-2024
    • (2024)More Than Syntaxes: Investigating Semantics to Zero-shot Cross-lingual Relation Extraction and Event Argument Role LabellingACM Transactions on Asian and Low-Resource Language Information Processing10.1145/358226123:5(1-21)Online publication date: 10-May-2024
    • (2023)Meta-ED: Cross-lingual Event Detection Using Meta-learning for Indian LanguagesACM Transactions on Asian and Low-Resource Language Information Processing10.1145/355534022:2(1-22)Online publication date: 21-Feb-2023
    • (2023)Modeling Character–Word Interaction via a Novel Mesh Transformer for Chinese Event DetectionNeural Processing Letters10.1007/s11063-023-11382-255:8(11429-11448)Online publication date: 11-Sep-2023

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media