Abstract
Multi-label Text Classification (MLTC) is a variant of classification problem where multiple labels are assigned to each instance. Most existing MLTC methods ignore the relationship between the target labels. Since the hierarchical relationship for addressing these problems is significant, a semantic network approach with the help of knowledge graphs can be used. This paper proposes a knowledge graph-based approach together with GRU (Gated Recurrent Unit) neural network model to solve an MLTC problem on a research text dataset. In particular, we leverage the Tax2Vec approach to extract hypernyms from the WordNet knowledge graph and enrich the dataset. The enrichment results in following a tree-like structure to identify the relationship between the semantic concepts. The result shows that the enriched dataset outperforms the traditional GRU neural network-based model based on different evaluation metrics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aljedani, N., Alotaibi, R., Taileb, M.: HMATC: hierarchical multi-label Arabic text classification model using machine learning. Egypt. Inform. J. 22, 225–237 (2020)
Beheshti, A., Benatallah, B., Sheng, Q.Z., Schiliro, F.: Intelligent knowledge lakes: the age of artificial intelligence and big data. In: U, L.H., Yang, J., Cai, Y., Karlapalem, K., Liu, A., Huang, X. (eds.) WISE 2020. CCIS, vol. 1155, pp. 24–34. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-3281-8_3
Beheshti, S.-M.-R., Benatallah, B., Motahari-Nezhad, H.R.: Scalable graph-based OLAP analytics over process execution data. Distrib. Parallel Databases 34(3), 379–423 (2014). https://doi.org/10.1007/s10619-014-7171-9
Gargiulo, F., Silvestri, S., Ciampi, M.: Deep convolution neural network for extreme multi-label text classification. In: Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies (2018)
Huang, W., et al.: Hierarchical multi-label text classification: an attention-based recurrent network approach. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 1051–1060 (2019)
Jiang, X., Shen, Y., Wang, Y., Jin, X., Cheng, X.: Bakgrastec: a background knowledge graph based method for short text classification. In: 2020 IEEE International Conference on Knowledge Graph (ICKG) (2020)
Škrlj, B., Martinc, M., Kralj, J., Lavrač, N., Pollak, S.: Tax2vec: constructing interpretable features from taxonomies for short text classification. Comput. Speech Lang. 65, 101104 (2021)
Pal, A., Selvakumar, M., Sankarasubbu, M.: Magnet: multi-label text classification using attention-based graph neural network. In: Proceedings of the 12th International Conference on Agents and Artificial Intelligence - vol. 2: ICAART, pp. 494–505. INSTICC, SciTePress (2020)
Pal, A., Selvakumar, M., Sankarasubbu, M.: Magnet: multi-label text classification using attention-based graph neural network. In: Proceedings of the 12th International Conference on Agents and Artificial Intelligence (2020)
Pal, A., Selvakumar, M., Sankarasubbu, M.: Multi-label text classification using attention-based graph neural network. arXiv preprint arXiv:2003.11644 (2020)
Peng, H., et al.: Hierarchical taxonomy-aware and attentional graph capsule RCNNs for large-scale multi-label text classification. IEEE Trans. Knowl. Data Eng. 33, 2505–2519 (2019)
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014)
Sharifirad, S., Jafarpour, B., Matwin, S.: Boosting text classification performance on sexist tweets by text augmentation and text generation using a combination of knowledge graphs. In: Proceedings of the 2nd Workshop on Abusive Language Online (ALW2), pp. 107–114 (2018)
Škrlj, B., Martinc, M., Kralj, J., Lavrač, N., Pollak, S.: Tax2vec: constructing interpretable features from taxonomies for short text classification. Comput. Speech Lang. 65, 101104 (2021)
Acknowledgement
The work presented in this paper was funded by Cape Breton University (RISE grant).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Prabhu, D., Rajabi, E., Ganta, M.K., Thomas, T. (2022). Improving Multi-label Text Classification Models with Knowledge Graphs. In: Hacid, H., et al. Service-Oriented Computing – ICSOC 2021 Workshops. ICSOC 2021. Lecture Notes in Computer Science, vol 13236. Springer, Cham. https://doi.org/10.1007/978-3-031-14135-5_9
Download citation
DOI: https://doi.org/10.1007/978-3-031-14135-5_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-14134-8
Online ISBN: 978-3-031-14135-5
eBook Packages: Computer ScienceComputer Science (R0)