DOI: 10.1145/3488560.3498450

Pretraining Multi-modal Representations for Chinese NER Task with Cross-Modality Attention

Published: 15 February 2022

Abstract

Named Entity Recognition (NER) aims to identify pre-defined entities in unstructured text. Compared with English NER, Chinese NER faces additional challenges: ambiguity in entity boundary recognition, because there are no explicit delimiters between Chinese characters, and the out-of-vocabulary (OOV) problem caused by rare Chinese characters. Moreover, previous studies ignore two important features specific to the Chinese language: glyphs and phonetics, both of which carry rich semantic information. To overcome these issues by exploiting the linguistic potential of Chinese as a logographic language, we present MPM-CNER (Multi-modal Pretraining Model for Chinese NER), which learns multi-modal representations of Chinese semantics, glyphs, and phonetics via four pretraining tasks: Radical Consistency Identification (RCI), Glyph Image Classification (GIC), Phonetic Consistency Identification (PCI), and Phonetic Classification Modeling (PCM). A novel cross-modality attention mechanism then fuses these multi-modal features for further improvement. Experimental results show that our method outperforms state-of-the-art baselines on four benchmark datasets, and an ablation study verifies the effectiveness of the pretrained multi-modal representations.

Supplementary Material

MP4 File (WSDM22-fp396_DOI_10_1145_3488560_3498450.mp4)
This video is a presentation of the paper "Pretraining Multi-modal Representations for Chinese NER Task with Cross-Modality Attention" at WSDM 2022. In the video, we introduce our method, a novel multi-modal pretraining model for Chinese NER with a cross-modality attention mechanism that fuses Chinese semantics, glyphs, and phonetics. The experimental results verify that our method outperforms previous state-of-the-art baselines and demonstrate the effectiveness of the multi-modal representations, shedding light on exploiting linguistic knowledge for Chinese NER.

    Published In

    WSDM '22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining
    February 2022
    1690 pages
    ISBN: 9781450391320
    DOI: 10.1145/3488560

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. Chinese named entity recognition
    2. cross-modality attention
    3. multi-modal representations
    4. pre-training model

    Qualifiers

    • Research-article

    Funding Sources

    • National Natural Science Foundation of China
    • Jiangsu Province Science & Technology Research Grant
    • Collaborative Innovation Center of Novel Software Technology and Industrialization, Jiangsu, China
    • National Key R&D Program of China

    Conference

    WSDM '22

    Acceptance Rates

    Overall Acceptance Rate 498 of 2,863 submissions, 17%
