Abstract
Given the increasing volume of scientific literature published in conferences, journals, and open-access repositories, it is important to index these documents hierarchically for intelligent retrieval. We organized Track 1 of NLPCC2022 Shared Task 5 on multi-label classification for scientific literature. This paper summarizes the task setup, the dataset, the models submitted by the participants, and the final results. We also discuss key findings and challenges for hierarchical multi-label classification in the scientific domain.
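As a rough illustration of the task setting (not the organizers' baseline or any participant's system), the sketch below shows the standard multi-label formulation: each paper may carry several labels at once, so a classifier produces an independent probability per label via a sigmoid and is trained with binary cross-entropy. The feature size, label count, and 0.5 threshold are illustrative assumptions.

```python
# Minimal multi-label classification sketch (illustrative only; not the
# shared-task baseline). Each document may belong to several categories,
# so each label gets an independent sigmoid probability.
import torch
import torch.nn as nn

NUM_FEATURES = 768   # e.g. a document embedding size (assumption)
NUM_LABELS = 5       # toy label set; the real task hierarchy is much larger

class MultiLabelHead(nn.Module):
    def __init__(self, in_dim: int, num_labels: int):
        super().__init__()
        self.linear = nn.Linear(in_dim, num_labels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Return raw logits; BCEWithLogitsLoss applies the sigmoid internally.
        return self.linear(x)

model = MultiLabelHead(NUM_FEATURES, NUM_LABELS)
loss_fn = nn.BCEWithLogitsLoss()  # one binary decision per label

# Toy batch: 4 documents, each with a multi-hot target vector.
features = torch.randn(4, NUM_FEATURES)
targets = torch.tensor([[1., 0., 1., 0., 0.],
                        [0., 1., 0., 0., 1.],
                        [1., 1., 0., 0., 0.],
                        [0., 0., 0., 1., 0.]])

logits = model(features)
loss = loss_fn(logits, targets)
loss.backward()

# At inference time, threshold per-label probabilities (0.5 is a common default).
predictions = (torch.sigmoid(logits) > 0.5).int()
```

A hierarchical variant of this setup would additionally encourage predictions to respect the label taxonomy (e.g., a child label implies its parent), which is one of the challenges discussed in this overview.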
Cite this paper
Liu, M., et al. (2022). Overview of NLPCC2022 Shared Task 5 Track 1: Multi-label Classification for Scientific Literature. In: Lu, W., Huang, S., Hong, Y., Zhou, X. (eds.) Natural Language Processing and Chinese Computing. NLPCC 2022. Lecture Notes in Computer Science, vol. 13552. Springer, Cham. https://doi.org/10.1007/978-3-031-17189-5_28