Weibo-MEL, Wikidata-MEL and Richpedia-MEL: Multimodal Entity Linking Benchmark Datasets

Zhou, Xingchen; Wang, Peng; Li, Guozheng; Xie, Jiafeng; Wu, Jiangheng

doi:10.1007/978-981-16-6471-7_27

Xingchen Zhou¹¹,
Peng Wang¹¹,
Guozheng Li¹¹,
Jiafeng Xie¹¹ &
…
Jiangheng Wu¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1466))

Included in the following conference series:

China Conference on Knowledge Graph and Semantic Computing

2159 Accesses
2 Citations

Abstract

Multimodal entity linking (MEL) aims to utilize multimodal information to map mentions to corresponding entities defined in knowledge bases. In this paper, we release three MEL datasets: Weibo-MEL, Wikidata-MEL and Richpedia-MEL, containing 25,602, 18,880 and 17,806 samples from social media, encyclopedia and multimodal knowledge graphs respectively. A MEL dataset construction approach is proposed, including five stages: multimodal information extraction, mention extraction, entity extraction, triple construction and dataset construction. Experiment results demonstrate the usability of the datasets and the distinguishability between baseline models. All resources are available at https://github.com/seukgcode/MELBench.

The work is supported by All-Army Common Information System Equipment Pre-Research Project (No. 31514020501, No. 31514020503).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Moon, S., Neves, L., Carvalho, V.: Multimodal named entity disambiguation for noisy social media posts. In: Proceedings of the 56th ACL, pp. 2000–2008 (2018)
Google Scholar
Adjali, O., Besançon, R., Ferret, O., et al.: Multimodal entity linking for tweets. Adv. Inf. Retrieval 12035, 463 (2020)
Article Google Scholar
Xu, B., Xu, Y., Liang, J., et al.: Cn-dbpedia: a never-ending Chinese knowledge extraction system. In: Proceedings of the 30th IEA-AIE, pp. 428–438 (2017)
Google Scholar
Wang, M., Wang, H., Qi, G., et al.: Richpedia: a large-scale, comprehensive multi-modal knowledge graph. Big Data Res. 130–145 (2020)
Google Scholar
Zhang, H., Liu, L., Jiang, H., et al.: Texsmart: a text understanding system for fine-grained ner and enhanced semantic analysis. arXiv:2012.15639 (2020)
Eshel, Y., Cohen, N., Radinsky, K., et al.: Named entity disambiguation for noisy text. In: Proceedings of the 21st CoNLL, pp. 58–68 (2017)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., et al.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 17th NAACL-HLT, pp. 4171–4186 (2019)
Google Scholar
Adjali, O., Besançon, R., Ferret, O., et al.: Multimodal entity linking for tweets. In: Proceedings of the 42nd ECIR, pp. 463–478 (2020)
Google Scholar
Lu, J., Yang, J., Batra, D., et al.: Hierarchical question-image co-attention for visual question answering. In: Proceedings of the 30th NIPS, pp. 289–297 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Engineering, Southeast University, Nanjing, China
Xingchen Zhou, Peng Wang, Guozheng Li, Jiafeng Xie & Jiangheng Wu

Authors

Xingchen Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Peng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Guozheng Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiafeng Xie
View author publications
You can also search for this author in PubMed Google Scholar
Jiangheng Wu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peng Wang .

Editor information

Editors and Affiliations

Harbin Institute of Technology, Harbin, China
Bing Qin
Peking University, Beijing, China
Zhi Jin
Tongji University, Shanghai, China
Haofen Wang
University of Edinburgh, Edinburgh, UK
Jeff Pan
University of South China, Hengyang, China
Yongbin Liu
Chinese Academy of Sciences, Beijing, China
Bo An

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhou, X., Wang, P., Li, G., Xie, J., Wu, J. (2021). Weibo-MEL, Wikidata-MEL and Richpedia-MEL: Multimodal Entity Linking Benchmark Datasets. In: Qin, B., Jin, Z., Wang, H., Pan, J., Liu, Y., An, B. (eds) Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction. CCKS 2021. Communications in Computer and Information Science, vol 1466. Springer, Singapore. https://doi.org/10.1007/978-981-16-6471-7_27

Download citation

DOI: https://doi.org/10.1007/978-981-16-6471-7_27
Published: 28 October 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-6470-0
Online ISBN: 978-981-16-6471-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics