Skip to main content

A Multi-view Active Learning Approach for the Hierarchical Multi-label Classification of Research Papers

  • Conference paper
  • First Online:
Advanced Information Networking and Applications (AINA 2021)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 226))

  • 900 Accesses

Abstract

In this paper, we focus on the hierarchical multi-label classification task of scientific papers, which consists in assigning to a paper the set of relevant classes, which are organized in a hierarchy. The difficulty of manually constructing sufficient labeled datasets renders challenging the automatic classification task of research papers according to hierarchical labels. Multi-view active learning is a widely adopted method to address this issue, by iteratively selecting the most useful unlabeled samples for the multi-view classifiers exploiting disjoint data’ views, and querying an oracle on their real labels. However, none of the state of the art studies in this field is proposed for the hierarchical multi-label classification task. In this paper, we propose an effective multi-view active learning framework for the hierarchical multi-label classification task, applied on scientific papers. Our approach adopts a novel selection strategy that relies on both uncertainty and representativeness criteria when selecting the most informative unlabeled samples in each iteration. Experimental results on a real world dataset of ACM research papers show the efficiency of our approach over several baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Our measure favors selecting samples with high disagreement level on all their contention labels irrespective of the number of contention labels.

  2. 2.

    https://www.connotate.com/connotateexpress/.

References

  1. Silla, C.N., Freitas, A.A.: A survey of hierarchical classification across different application domains. Data Min. Knowl. Disc. 22(1), 31–72 (2011)

    Article  MathSciNet  Google Scholar 

  2. Settles, B.: Active learning literature survey, computer sciences. Technical Report 1648. University of Wisconsin–Madison (2009)

    Google Scholar 

  3. He, T., Zhang, S., Xin, J., Zhao, P., Wu, J., Xian, X., Cui, Z.: An active learning approach with uncertainty, representativeness, and diversity. Sci. World J. (2014)

    Google Scholar 

  4. Zhao, J., Xie, X., Xu, X., Sun, S.: Multi-view learning overview: recent progress and new challenges. Inf. Fusion 38, 43–54 (2017)

    Article  Google Scholar 

  5. Muslea, I., Minton, S., Knoblock, C.A.: Active learning with multiple views. J. Artif. Intell. Res. 27, 203–233 (2006)

    Article  MathSciNet  Google Scholar 

  6. Hu, J., He, Z., Li, J., He, L., Wang, Y.: 3D-Gabor inspired multiview active learning for spectral-spatial hyperspectral image classification. Remote Sens. 10(7), 1070 (2018)

    Article  Google Scholar 

  7. Brinker, K.: On active learning in multi-label classification. In: From Data and Information Analysis to Knowledge Engineering, pp. 206–213. Springer, Berlin, Heidelberg (2006)

    Google Scholar 

  8. Nakano, F.K., Cerri, R., Vens, C.: Active learning for hierarchical multi-label classification. Data Min. Knowl. Disc. 34(5), 1496–1530 (2020)

    Article  MathSciNet  Google Scholar 

  9. Yan, Y., Huang, S.J.: Cost-effective active learning for hierarchical multi-label classification. In: IJCAI, pp. 2962–2968 (2018)

    Google Scholar 

  10. Settles, B., Craven, M.: An analysis of active learning strategies for sequence labeling tasks. In: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pp. 1070–1079 (2008)

    Google Scholar 

  11. Zhu, J., Wang, H., Tsou, B.K., Ma, M.: Active learning with sampling by uncertainty and density for data annotations. IEEE Trans. Audio, Speech Lang. Process. 18(6), 1323–1331 (2009)

    Google Scholar 

  12. Yang, K., Cai, Y., Cai, Z., Xie, H., Wong, T.L., Chan, W.H.: Top K representative: a method to select representative samples based on K nearest neighbors. Int. J. Mach. Learn. Cybern. 1–11 (2019)

    Google Scholar 

  13. Patra, S., Bruzzone, L.: A batch-mode active learning technique based on multiple uncertainty for SVM classifier. IEEE Geosci. Remote Sens. Lett. 9(3), 497–501 (2011)

    Article  Google Scholar 

  14. Di, W., Crawford, M.M.: Multi-view adaptive disagreement based active learning for hyperspectral image classification. In: 2010 IEEE International Geoscience and Remote Sensing Symposium, pp. 1374–1377 (2010)

    Google Scholar 

  15. Zhang, Z., Pasolli, E., Crawford, M.M.: An adaptive multiview active learning approach for spectral-spatial classification of hyperspectral images. IEEE Trans. Geosci. Remote Sens. 58(4), 2557–2570 (2019)

    Article  Google Scholar 

  16. Zhang, Y., Lv, D., Zhao, Y.: Multiple-view active learning for environmental sound classification. Int. J. Online Biomed. Eng. (iJOE) 12(12), 49–54 (2016)

    Article  Google Scholar 

  17. Hu, J., He, Z., Li, J., He, L., Wang, Y.: 3D-Gabor inspired multiview active learning for spectral-spatial hyperspectral image classification. Remote Sens. 10(7), 1070 (2018)

    Article  Google Scholar 

  18. Liu, C., He, L., Li, Z., Li, J.: Feature-driven active learning for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 56(1), 341–354 (2017)

    Article  Google Scholar 

  19. Zhang, X., Cheng, J., Xu, C., Lu, H., Ma, S.: Multi-view multi-label active learning for image classification. In: 2009 IEEE International Conference on Multimedia and Expo, pp. 258–261 (2009)

    Google Scholar 

  20. Zhang, M.L., Li, Y.K., Liu, X.Y., Geng, X.: Binary relevance for multi-label learning: an overview. Front. Comput. Sci. 12(2), 191–202 (2018)

    Article  Google Scholar 

  21. Masmoudi, A., Bellaaj, H., Drira, K., Jmaiel, M.: A co-training-based approach for the hierarchical multi-label classification of research papers. Expert Syst. e12613 (2020)

    Google Scholar 

  22. Liu, R.L.: A new bibliographic coupling measure with descriptive capability. Scientometrics 110(2), 915–935 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abir Masmoudi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Masmoudi, A., Bellaaj, H., Jmaiel, M. (2021). A Multi-view Active Learning Approach for the Hierarchical Multi-label Classification of Research Papers. In: Barolli, L., Woungang, I., Enokido, T. (eds) Advanced Information Networking and Applications. AINA 2021. Lecture Notes in Networks and Systems, vol 226. Springer, Cham. https://doi.org/10.1007/978-3-030-75075-6_33

Download citation

Publish with us

Policies and ethics