research-article

Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification

Authors:

Zhifang SuiAuthors Info & Claims

CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management

Pages 1969 - 1978

https://doi.org/10.1145/3459637.3482236

Published: 30 October 2021 Publication History

Abstract

Few-Shot Event Classification (FSEC) aims at developing a model for event prediction, which can generalize to new event types with a limited number of annotated data. Existing FSEC studies have achieved high accuracy on different benchmarks. However, we find they suffer from trigger biases that signify the statistical homogeneity between some trigger words and target event types, which we summarize as trigger overlapping and trigger separability. The biases can result in context-bypassing problem, i.e., correct classifications can be gained by looking at only the trigger words while ignoring the entire context. Therefore, existing models can be weak in generalizing to unseen data in real scenarios. To further uncover the trigger biases and assess the generalization ability of the models, we propose two new sampling methods, Trigger-Uniform Sampling (TUS) and COnfusion Sampling (COS), for the meta tasks construction during evaluation. Besides, to cope with the context-bypassing problem in FSEC models, we introduce adversarial training and trigger reconstruction techniques. Experiments show these techniques help not only improve the performance, but also enhance the generalization ability of models.

Supplementary Material

MP4 File (CIKM-paper-87.mp4)

Video presentation.

Download
27.68 MB

References

[1]

Yonatan Belinkov, Adam Poliak, Stuart Shieber, Benjamin Van Durme, and Alexander Rush. 2019. Don't Take the Premise for Granted: Mitigating Artifacts in Natural Language Inference. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).

[2]

Zheng Cai, Lifu Tu, and Kevin Gimpel. 2017. Pay Attention to the Ending:Strong Neural Baselines for the ROC Story Cloze Task. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL).

[3]

Xin Cong, Shiyao Cui, Bowen Yu, Tingwen Liu, Yubin Wang, and Bin Wang. 2020. Few-Shot Event Detection with Prototypical Amortized Conditional Random Field. arXiv preprint arXiv:2012.02353 (2020). arxiv: 2012.02353

[4]

Shumin Deng, Ningyu Zhang, Jiaojian Kang, Yichi Zhang, Wei Zhang, and Huajun Chen. 2020. Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection. In Proceedings of the 13th International Conference on Web Search and Data Mining (WSDM). 9.

Digital Library

[5]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics.

[6]

Xiaoan Ding, Tianyu Liu, Baobao Chang, Zhifang Sui, and Kevin Gimpel. 2020. Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference. In Proceedings of the 2020conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online.

[7]

Shi Feng, Eric Wallace, and Jordan Boyd-Graber. 2019. Misleading Failures of Partial-input Baselines. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).

[8]

Xiaocheng Feng, Lifu Huang, Duyu Tang, Heng Ji, Bing Qin, and Ting Liu. 2016. A Language-Independent Neural Network for Event Detection. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). 66--71.

[9]

Tianyu Gao, Xu Han, Zhiyuan Liu, and Maosong Sun. 2019. Hybrid Attention-Based Prototypical Networks for Noisy Few-Shot Relation Classification. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) (Jul. 2019).

[10]

Reza Ghaeini, Xiaoli Fern, Liang Huang, and Prasad Tadepalli. 2016. Event Nugget Detection with Forward-Backward Recurrent Neural Networks. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers).

[11]

Ian J. Goodfellow, Jonathon Shlens, and Christian Szegedy. 2015. Explaining and Harnessing Adversarial Examples. In International Conference on Learning Representations (ICLR), Yoshua Bengio and Yann LeCun (Eds.).

[12]

Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, and Devi Parikh. 2017. Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]

Suchin Gururangan, Swabha Swayamdipta, Omer Levy, Roy Schwartz, Samuel Bowman, and Noah A. Smith. 2018. Annotation Artifacts in Natural Language Inference Data. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics.

[14]

Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In International Conference on Learning Representations (ICLR).

[15]

Viet Dac Lai, Franck Dernoncourt, and Thien Huu Nguyen. 2020a. Exploiting the Matching Information in the Support Set for Few Shot Event Classification. In Advances in Knowledge Discovery and Data Mining - 24th Pacific-Asia Conference (PAKDD), Hady W. Lauw, Raymond Chi-Wing Wong, Alexandros Ntoulas, Ee-Peng Lim, See-Kiong Ng, and Sinno Jialin Pan (Eds.).

[16]

Viet Dac Lai, Thien Huu Nguyen, and Franck Dernoncourt. 2020b. Extensively Matching for Few-shot Learning Event Detection. In Proceedings of the First Joint Workshop on Narrative Understanding, Storylines, and Events.

[17]

Omer Levy, Steffen Remus, Chris Biemann, and Ido Dagan. 2015. Do Supervised Distributional Methods Really Learn Lexical Inference Relations?. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics.

[18]

Hongyu Lin, Yaojie Lu, Xianpei Han, and Le Sun. 2018. Nugget Proposal Networks for Chinese Event Detection. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

[19]

Hongtao Liu, Peiyi Wang, Fangzhao Wu, Pengfei Jiao, Wenjun Wang, Xing Xie, and Yueheng Sun. 2019b. Reet: Joint relation extraction and entity typing via multi-task learning. In CCF International Conference on Natural Language Processing and Chinese Computing. Springer, 327--339.

Digital Library

[20]

Jian Liu, Yubo Chen, and Kang Liu. 2019a. Exploiting the ground-truth: An adversarial imitation based knowledge distillation approach for event detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33.

Digital Library

[21]

Shulin Liu, Yubo Chen, Kang Liu, and Jun Zhao. 2017. Exploiting argument information to improve event detection via supervised attention mechanisms. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers).

[22]

Tianyu Liu, Zheng Xin, Baobao Chang, and Zhifang Sui. 2020a. HypoNLI: Exploring the Artificial Patterns of Hypothesis-only Bias in Natural Language Inference. In Proceedings of the 12th Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France.

[23]

Tianyu Liu, Zheng Xin, Xiaoan Ding, Baobao Chang, and Zhifang Sui. 2020b. An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference. In Proceedings of the 24th Conference on Computational Natural Language Learning. Association for Computational Linguistics, Online.

[24]

Tom McCoy, Ellie Pavlick, and Tal Linzen. 2019. Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).

[25]

Sewon Min, Eric Wallace, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi, and Luke Zettlemoyer. 2019. Compositional Questions Do Not Necessitate Multi-hop Reasoning. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL).

[26]

Takeru Miyato, Andrew M. Dai, and Ian J. Goodfellow. 2017. Adversarial Training Methods for Semi-Supervised Text Classification. In International Conference on Learning Representations (ICLR).

[27]

Thien Nguyen and Ralph Grishman. 2018. Graph convolutional networks with argument-aware pooling for event detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.

[28]

Jeffrey Pennington, Richard Socher, and Christopher Manning. 2014. GloVe: Global Vectors for Word Representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP).

[29]

Roy Schwartz, Maarten Sap, Ioannis Konstas, Leila Zilles, Yejin Choi, and Noah A. Smith. 2017. The Effect of Different Writing Tasks on Linguistic Style: A Case Study of the ROC Story Cloze Task. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL).

[30]

Deven Santosh Shah, H. Andrew Schwartz, and Dirk Hovy. 2020. Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL).

[31]

Jake Snell, Kevin Swersky, and Richard Zemel. 2017. Prototypical Networks for Few-shot Learning. In Advances in Neural Information Processing Systems, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.).

Digital Library

[32]

Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research, Vol. 9 (2008).

[33]

Christopher Walker, Stephanie Strassel, Julie Medero, and Kazuaki Maeda. 2006. ACE 2005 Multilingual Training Corpus. In Philadelphia: Linguistic Data Consortium.

[34]

Peiyi Wang, Hongtao Liu, Fangzhao Wu, Jinduo Song, Hongyan Xu, and Wenjun Wang. 2019. REKA: Relation Extraction with Knowledge-Aware Attention. In China Conference on Knowledge Graph and Semantic Computing. Springer, 62--73.

[35]

Xiaozhi Wang, Ziqi Wang, Xu Han, Wangyi Jiang, Rong Han, Zhiyuan Liu, Juanzi Li, Peng Li, Yankai Lin, and Jie Zhou. 2020. MAVEN: A Massive General Domain Event Detection Dataset. In Proceedings of the 2020conference on Empirical Methods in Natural Language Processing (EMNLP).

[36]

Runxin Xu, Tianyu Liu, Lei Li, and Baobao Chang. 2021. Document-level Event Extraction via Heterogeneous Graph-based Interaction Model with a Tracker. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL).

[37]

Shuang Zeng, Runxin Xu, Baobao Chang, and Lei Li. 2020. Double Graph Based Reasoning for Document-level Relation Extraction. In Proceedings of the 2020conference on Empirical Methods in Natural Language Processing (EMNLP).

Cited By

Yang ZLiu YOuyang CZhao SZhu C(2024)Improving Few-Shot Named Entity Recognition with Causal InterventionsBig Data Mining and Analytics10.26599/BDMA.2024.90200527:4(1375-1395)Online publication date: Dec-2024
https://doi.org/10.26599/BDMA.2024.9020052
Chen GCheng XChen JShe XQin JChen J(2024)Event assigning based on hierarchical features and enhanced association for Chinese mayor's hotlineComputational Intelligence10.1111/coin.1262640:1Online publication date: 4-Jan-2024
https://doi.org/10.1111/coin.12626
Wang SZheng JChen WCai FLuo XFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)MultiPLe: Multilingual Prompt Learning for Relieving Semantic Confusions in Few-shot Event DetectionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614984(2676-2685)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3614984
Show More Cited By

Index Terms

Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Information extraction
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
    2. Machine learning approaches
      1. Neural networks

Recommendations

Exploiting the Matching Information in the Support Set for Few Shot Event Classification
Advances in Knowledge Discovery and Data Mining
Abstract
The existing event classification (EC) work primarily focuses on the traditional supervised learning setting in which models are unable to extract event mentions of new/unseen event types. Few-shot learning has not been investigated in this area ...
Transductive Event Classification through Heterogeneous Networks
WebMedia '17: Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web

Events can be defined as "something that occurs at specific place and time associated with some specific actions". In general, events extracted from news articles and social networks are used to map the information from web to the various phenomena that ...
TaxonPrompt: Taxonomy-aware curriculum prompt learning for few-shot event classification
Abstract
Event classification (EC) aims to assign the event labels to unlabeled sentences and tends to struggle in real-world applications when only a few annotated samples are available. Previous studies have mainly focused on using meta-learning to ...
Highlights
- We designed a taxonomy-aware event classification framework for overcoming the classification bottleneck brought by insufficient data volume.
- We apply a prompt-based method for few-shot event classification, which does not require ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management

October 2021

4966 pages

ISBN:9781450384469

DOI:10.1145/3459637

General Chairs:
Gianluca Demartini
The University of Queensland, Australia
,
Guido Zuccon
The University of Queensland, Australia
,
Program Chairs:
J. Shane Culpepper
RMIT University, Australia
,
Zi Huang
The University of Queensland, Australia
,
Hanghang Tong
University of Illinois at Urbana-Champaign, USA

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 October 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Key Research and Development Program of China
NSFC
National Science Foundation of China under Grant

Conference

CIKM '21

Sponsor:

CIKM '21: The 30th ACM International Conference on Information and Knowledge Management

November 1 - 5, 2021

Queensland, Virtual Event, Australia

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
182
Total Downloads

Downloads (Last 12 months)9
Downloads (Last 6 weeks)1

Reflects downloads up to 25 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Yang ZLiu YOuyang CZhao SZhu C(2024)Improving Few-Shot Named Entity Recognition with Causal InterventionsBig Data Mining and Analytics10.26599/BDMA.2024.90200527:4(1375-1395)Online publication date: Dec-2024
https://doi.org/10.26599/BDMA.2024.9020052
Chen GCheng XChen JShe XQin JChen J(2024)Event assigning based on hierarchical features and enhanced association for Chinese mayor's hotlineComputational Intelligence10.1111/coin.1262640:1Online publication date: 4-Jan-2024
https://doi.org/10.1111/coin.12626
Wang SZheng JChen WCai FLuo XFrommholz IHopfgartner FLee MOakes MLalmas MZhang MSantos R(2023)MultiPLe: Multilingual Prompt Learning for Relieving Semantic Confusions in Few-shot Event DetectionProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614984(2676-2685)Online publication date: 21-Oct-2023
https://dl.acm.org/doi/10.1145/3583780.3614984
Lixi CJianping LJialan LJia CChangrun C(2023)A Review of Continual Relation Extraction2023 20th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)10.1109/ICCWAMTIP60502.2023.10387017(1-6)Online publication date: 15-Dec-2023
https://doi.org/10.1109/ICCWAMTIP60502.2023.10387017
Xia Z(2023)Two-way Prototypical Network Based on Word Embedding Mixup for Few-shot Event Detection2023 4th International Conference on Computer Engineering and Application (ICCEA)10.1109/ICCEA58433.2023.10135439(46-51)Online publication date: 7-Apr-2023
https://doi.org/10.1109/ICCEA58433.2023.10135439
Li XLi XZhao MYang MYu RYu MYu J(2023)CLINER: exploring task-relevant features and label semantic for few-shot named entity recognitionNeural Computing and Applications10.1007/s00521-023-09285-336:9(4679-4691)Online publication date: 16-Dec-2023
https://dl.acm.org/doi/10.1007/s00521-023-09285-3

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten