Skip to main content
Log in

Partial label learning based on label distributions and error-correcting output codes

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Partial label learning (PLL) is a class of weak supervision learning problems in which each data sample has a candidate set of labels, among which only one label is correct. In this paper, a new PLL algorithm with prior information of the label distribution based on ECOC (PL-PIE) is proposed. PL-PIE utilizes the ECOC framework to decompose the problem into multiple binary problems. Different from the instability of the existing random dichotomy, the proposal exploits the prior information of label distribution to generate positive and negative classes with stable performance. Extensive experimental results demonstrate that the proposed PL-PIE algorithm has highly competitive performance compared to the state-of-the-art PLL algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • JP Ainam, Qin K, Luo G (2018) Person re-identification through clustering and partial label smoothing regularization

  • Allwein EL, Schapire RE, Singer Y (2001) Reducing multiclass to binary: a unifying approach for margin classifiers. J Mach Learn Res 1:113–141

    MathSciNet  MATH  Google Scholar 

  • Bache K, Lichman M (2013) UCI machine learning repository. School of Information and Computer Sciences, University of California, Irvine. [Online]. Available: http://archive.ics.uci.edu/ml

  • Bautista MA, Escalera S, Baro X, Pujol O (2014) On the design of an ECOC-compliant genetic algorithm. Pattern Recognit 47:865–884

    Article  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  Google Scholar 

  • Briggs F, Lakshminarayanan B, Neal L, Fern XZ, Raich R, Hadley SJK, Hadley AS, Betts MG (2012) Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. J Acoust Soc Am 131:4640–4650

    Article  Google Scholar 

  • Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:1–27

    Article  Google Scholar 

  • Chen CLP, Liu Z (2018) Broad learning system: an effective and efficient incremental learning system without the need for deep architecture. IEEE Trans Neural Netw Learn Syst 29(1):10–24

    Article  MathSciNet  Google Scholar 

  • Cherfi ZL, Oukhellou L, Come E, Denoeux T, Aknin P (2012) Partially supervised independent factor analysis using soft labels elicited from multiple experts: application to railway track circuit diagnosis. Soft Comput 16:741–754

    Article  Google Scholar 

  • Chinchor N (1992) MUC-4 evaluation metrics. In: Proceedings of the 4th conference on message understanding. Association for Computational Linguistics, McLean, Virginia, pp 22–29

  • Cour T, Sapp B, Jordan C, Taskar B (2009) IEEE learning from ambiguously labeled images. In: Cvpr: 2009 IEEE conference on computer vision and pattern recognition, vol 1–4. IEEE, New York, pp 919–926

  • Cour T, Sapp B, Taskar B (2011) Learning from partial labels. J Mach Learn Res 12:1501–1536

    MathSciNet  MATH  Google Scholar 

  • Crammer K, Singer Y (2002) On the learnability and design of output codes for multiclass problems. Mach Learn 47:201–233

    Article  Google Scholar 

  • Deng W, Zhao HM, Zou L, Li GY, Yang XH, Wu DQ (2017a) A novel collaborative optimization algorithm in solving complex optimization problems. Soft Comput 21(15):4387–4398

    Article  Google Scholar 

  • Deng W, Zhao H, Yang X, Xiong J, Sun M, Li B (2017b) Study on an improved adaptive PSO algorithm for solving multi-objective gate assignment. Appl Soft Comput 59:288–302

    Article  Google Scholar 

  • Deng W, Xu JJ, Zhao HM (2019) An improved ant colony optimization algorithm based on hybrid strategies for scheduling problem. IEEE Access 7:20281–20292

    Article  Google Scholar 

  • Dietterich TG, Bakiri G (1994) Solving multiclass learning problems via ECOCs. J Artif Intell Res 2:263–286

    Article  Google Scholar 

  • Ding S, Zhang X, An Y, Xue Y (2017a) Weighted linear loss multiple birth support vector machine based on information granulation for multi-class classification. Pattern Recognit 67:32–46

    Article  Google Scholar 

  • Ding S, Zhang N, Zhang J, Xu X, Shi Z (2017b) Unsupervised extreme learning machine with representational features. Int J Mach Learn Cybern 8(2):587–595

    Article  Google Scholar 

  • Escalera S, Pujol O, Radeva P (2006) ECOC-ONE: a novel coding and decoding strategy. In: International conference on pattern recognition, p 578

  • Escalera S, Pujol O, Radeva P (2010) On the decoding process in ternary error-correcting output codes. IEEE Trans Pattern Anal Mach Intell 32:120–134

    Article  Google Scholar 

  • Evangelos S, David WA (1997) Special issue on lazy learning. Artif Intell Rev 11:7–10

    Article  Google Scholar 

  • Gershman SJ, Blei DM (2012) A tutorial on Bayesian nonparametric models. J Math Psychol 56:1–12

    Article  MathSciNet  Google Scholar 

  • Gong C, Liu TL, Tang YY, Yang J, Yang J, Tao DC (2018) A regularization approach for instance-based superset label learning. IEEE Trans Cybern 48:967–978

    Article  Google Scholar 

  • Guillaumin M, Verbeek J, Schmid C (2010) Multiple instance metric learning from automatically labeled bags of faces. In: Proceedings of the 11th European conference on computer vision: part I. Springer, Heraklion, Crete, Greece, pp 634–647

  • Huang G-B, Zhu Q-Y, Siew C-K (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), Budapest, vol 2, pp 985–990

  • Hullermeier E, Beringer J (2006) Learning from ambiguously labeled examples. Intell Data Anal 10:419–439

    Article  Google Scholar 

  • Liu LP, Dietterich TG (2012) A conditional multinomial mixture model for superset label learning. In: Proceedings of the 25th international conference on neural information processing systems, vol 1. Curran Associates Inc., Lake Tahoe, Nevada, pp 548–556

  • Luo J, Orabona F (2010) Learning from candidate labeling sets. In: NIPS

  • Meng WB, Liu Y, Zhang SL, Pei D, Dong H, Song L, Luo XL (2018) Device-agnostic log anomaly classification with partial labels. In: 2018 IEEE/ACM 26th international symposium on quality of service (Iwqos)

  • Nguyen N, Caruana R (2008) Classification with partial labels. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, Las Vegas, Nevada, USA, pp 551–559

  • Pujol O, Radeva P, Vitria J (2006) Discriminant ECOC: a heuristic method for application dependent design of error correcting output codes. IEEE Trans Pattern Anal 28:1007–1012

    Article  Google Scholar 

  • Pujol O, Escalera S, Radeva P (2008) An incremental node embedding technique for error correcting output codes. Pattern Recognit 41:713–725

    Article  Google Scholar 

  • Rajasekar L, Sharmila D (2019) Performance analysis of soft computing techniques for the automatic classification of fruits dataset. Soft Comput 23:2773–2788

    Article  Google Scholar 

  • Sun MX, Liu KH, Wu QQ, Hong QQ, Wang BZ, Zhang HY (2019) A novel ECOC algorithm for multiclass microarray data classification based on data complexity analysis. Pattern Recognit 90:346–362

    Article  Google Scholar 

  • Xu N, Tao A, Geng X (2018) Label enhancement for label distribution learning. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, IJCAI-18, international joint conferences on artificial intelligence organization, pp 2926–2932

  • Xue Y, Jiang JM, Zhao BP, Ma TH (2018) A self-adaptive artificial bee colony algorithm based on global best for global optimization. Soft Comput 22:2935–2952

    Article  Google Scholar 

  • Yan JJ, Zhang ZN, Xie LW, Zhu ZT (2019) A unified framework for decision tree on continuous attributes. IEEE Access 7:11924–11933

    Article  Google Scholar 

  • Ye XN, Liu KH (2018) A novel genetic algorithm based ECOC algorithm. In: International conference on semantics, knowledge and grids, pp 241–244

  • Yu F, Zhang ML (2017) Maximum margin partial label learning. Mach Learn 106:573–593

    Article  MathSciNet  Google Scholar 

  • Yuen SY, Lou Y, Zhang X (2019) Selecting evolutionary algorithms for black box design optimization problems. Soft Comput 23:6511–6531

    Article  Google Scholar 

  • Zeng ZN, Xiao SJ, Jia K, Chan TH, Gao SH, Xu D, Ma Y (2013) Learning by associating ambiguously labeled images. In: 2013 IEEE conference on computer vision and pattern recognition (Cvpr), pp 708–715

  • Zhang ML, Zhou BB, Liu XY (2016a) Partial label learning via feature-aware disambiguation. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, San Francisco, California, USA, pp 1335–1344

  • Zhang N, Ding S, Zhang J (2016b) Multi layer ELM-RBF for multi-label learning. Appl Soft Comput 43:535–545

    Article  Google Scholar 

  • Zhang ML, Yu F, Tang CZ (2017) Disambiguation-free partial label learning. IEEE Trans Knowl Data Eng 29:2155–2167

    Article  Google Scholar 

  • Zhao H, Zheng J, Deng W, Song Y (2020) Semi-supervised broad learning system based on manifold regularization and broad network. IEEE Trans Circuits Syst I Regul Pap 67(3):983–994

    Article  MathSciNet  Google Scholar 

  • Zhou Z (2012) Ensemble methods: foundations and algorithms. Chapman and Hall, Boca Raton

    Book  Google Scholar 

  • Zhou Y, Gu H (2018) Geometric mean metric learning for partial label data. Neurocomputing 275:394–402

    Article  Google Scholar 

  • Zhou Y, He JJ, Gu H (2017a) Partial label learning via Gaussian processes. IEEE Trans Cybern 47:4443–4450

    Article  Google Scholar 

  • Zhou LG, Wang QY, Fujita H (2017b) One versus one multi-class classification fusion using optimizing decision directed acyclic graph for predicting listing status of companies. Inf Fusion 36:80–89

    Article  Google Scholar 

  • Zhou DY, Zhang ZK, Zhang ML, He YL (2018) Weakly supervised POS tagging without disambiguation. ACM Trans Asian Low Resour Lang Inf Process 17:1–9

    Article  Google Scholar 

  • Zhou JT, Tsang IW, Ho SS, Muller KR (2019) N-ary decomposition for multi-class classification. Mach Learn 108:809–830

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61772023) and Natural Science Foundation of Fujian Province (No. 2016J 01320)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kunhong Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lin, G., Liu, K., Wang, B. et al. Partial label learning based on label distributions and error-correcting output codes. Soft Comput 25, 1049–1064 (2021). https://doi.org/10.1007/s00500-020-05203-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-020-05203-0

Keywords

Navigation