Abstract
Partial label learning (PLL) is a class of weak supervision learning problems in which each data sample has a candidate set of labels, among which only one label is correct. In this paper, a new PLL algorithm with prior information of the label distribution based on ECOC (PL-PIE) is proposed. PL-PIE utilizes the ECOC framework to decompose the problem into multiple binary problems. Different from the instability of the existing random dichotomy, the proposal exploits the prior information of label distribution to generate positive and negative classes with stable performance. Extensive experimental results demonstrate that the proposed PL-PIE algorithm has highly competitive performance compared to the state-of-the-art PLL algorithms.
Similar content being viewed by others
References
JP Ainam, Qin K, Luo G (2018) Person re-identification through clustering and partial label smoothing regularization
Allwein EL, Schapire RE, Singer Y (2001) Reducing multiclass to binary: a unifying approach for margin classifiers. J Mach Learn Res 1:113–141
Bache K, Lichman M (2013) UCI machine learning repository. School of Information and Computer Sciences, University of California, Irvine. [Online]. Available: http://archive.ics.uci.edu/ml
Bautista MA, Escalera S, Baro X, Pujol O (2014) On the design of an ECOC-compliant genetic algorithm. Pattern Recognit 47:865–884
Breiman L (2001) Random forests. Mach Learn 45:5–32
Briggs F, Lakshminarayanan B, Neal L, Fern XZ, Raich R, Hadley SJK, Hadley AS, Betts MG (2012) Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. J Acoust Soc Am 131:4640–4650
Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:1–27
Chen CLP, Liu Z (2018) Broad learning system: an effective and efficient incremental learning system without the need for deep architecture. IEEE Trans Neural Netw Learn Syst 29(1):10–24
Cherfi ZL, Oukhellou L, Come E, Denoeux T, Aknin P (2012) Partially supervised independent factor analysis using soft labels elicited from multiple experts: application to railway track circuit diagnosis. Soft Comput 16:741–754
Chinchor N (1992) MUC-4 evaluation metrics. In: Proceedings of the 4th conference on message understanding. Association for Computational Linguistics, McLean, Virginia, pp 22–29
Cour T, Sapp B, Jordan C, Taskar B (2009) IEEE learning from ambiguously labeled images. In: Cvpr: 2009 IEEE conference on computer vision and pattern recognition, vol 1–4. IEEE, New York, pp 919–926
Cour T, Sapp B, Taskar B (2011) Learning from partial labels. J Mach Learn Res 12:1501–1536
Crammer K, Singer Y (2002) On the learnability and design of output codes for multiclass problems. Mach Learn 47:201–233
Deng W, Zhao HM, Zou L, Li GY, Yang XH, Wu DQ (2017a) A novel collaborative optimization algorithm in solving complex optimization problems. Soft Comput 21(15):4387–4398
Deng W, Zhao H, Yang X, Xiong J, Sun M, Li B (2017b) Study on an improved adaptive PSO algorithm for solving multi-objective gate assignment. Appl Soft Comput 59:288–302
Deng W, Xu JJ, Zhao HM (2019) An improved ant colony optimization algorithm based on hybrid strategies for scheduling problem. IEEE Access 7:20281–20292
Dietterich TG, Bakiri G (1994) Solving multiclass learning problems via ECOCs. J Artif Intell Res 2:263–286
Ding S, Zhang X, An Y, Xue Y (2017a) Weighted linear loss multiple birth support vector machine based on information granulation for multi-class classification. Pattern Recognit 67:32–46
Ding S, Zhang N, Zhang J, Xu X, Shi Z (2017b) Unsupervised extreme learning machine with representational features. Int J Mach Learn Cybern 8(2):587–595
Escalera S, Pujol O, Radeva P (2006) ECOC-ONE: a novel coding and decoding strategy. In: International conference on pattern recognition, p 578
Escalera S, Pujol O, Radeva P (2010) On the decoding process in ternary error-correcting output codes. IEEE Trans Pattern Anal Mach Intell 32:120–134
Evangelos S, David WA (1997) Special issue on lazy learning. Artif Intell Rev 11:7–10
Gershman SJ, Blei DM (2012) A tutorial on Bayesian nonparametric models. J Math Psychol 56:1–12
Gong C, Liu TL, Tang YY, Yang J, Yang J, Tao DC (2018) A regularization approach for instance-based superset label learning. IEEE Trans Cybern 48:967–978
Guillaumin M, Verbeek J, Schmid C (2010) Multiple instance metric learning from automatically labeled bags of faces. In: Proceedings of the 11th European conference on computer vision: part I. Springer, Heraklion, Crete, Greece, pp 634–647
Huang G-B, Zhu Q-Y, Siew C-K (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), Budapest, vol 2, pp 985–990
Hullermeier E, Beringer J (2006) Learning from ambiguously labeled examples. Intell Data Anal 10:419–439
Liu LP, Dietterich TG (2012) A conditional multinomial mixture model for superset label learning. In: Proceedings of the 25th international conference on neural information processing systems, vol 1. Curran Associates Inc., Lake Tahoe, Nevada, pp 548–556
Luo J, Orabona F (2010) Learning from candidate labeling sets. In: NIPS
Meng WB, Liu Y, Zhang SL, Pei D, Dong H, Song L, Luo XL (2018) Device-agnostic log anomaly classification with partial labels. In: 2018 IEEE/ACM 26th international symposium on quality of service (Iwqos)
Nguyen N, Caruana R (2008) Classification with partial labels. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, Las Vegas, Nevada, USA, pp 551–559
Pujol O, Radeva P, Vitria J (2006) Discriminant ECOC: a heuristic method for application dependent design of error correcting output codes. IEEE Trans Pattern Anal 28:1007–1012
Pujol O, Escalera S, Radeva P (2008) An incremental node embedding technique for error correcting output codes. Pattern Recognit 41:713–725
Rajasekar L, Sharmila D (2019) Performance analysis of soft computing techniques for the automatic classification of fruits dataset. Soft Comput 23:2773–2788
Sun MX, Liu KH, Wu QQ, Hong QQ, Wang BZ, Zhang HY (2019) A novel ECOC algorithm for multiclass microarray data classification based on data complexity analysis. Pattern Recognit 90:346–362
Xu N, Tao A, Geng X (2018) Label enhancement for label distribution learning. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, IJCAI-18, international joint conferences on artificial intelligence organization, pp 2926–2932
Xue Y, Jiang JM, Zhao BP, Ma TH (2018) A self-adaptive artificial bee colony algorithm based on global best for global optimization. Soft Comput 22:2935–2952
Yan JJ, Zhang ZN, Xie LW, Zhu ZT (2019) A unified framework for decision tree on continuous attributes. IEEE Access 7:11924–11933
Ye XN, Liu KH (2018) A novel genetic algorithm based ECOC algorithm. In: International conference on semantics, knowledge and grids, pp 241–244
Yu F, Zhang ML (2017) Maximum margin partial label learning. Mach Learn 106:573–593
Yuen SY, Lou Y, Zhang X (2019) Selecting evolutionary algorithms for black box design optimization problems. Soft Comput 23:6511–6531
Zeng ZN, Xiao SJ, Jia K, Chan TH, Gao SH, Xu D, Ma Y (2013) Learning by associating ambiguously labeled images. In: 2013 IEEE conference on computer vision and pattern recognition (Cvpr), pp 708–715
Zhang ML, Zhou BB, Liu XY (2016a) Partial label learning via feature-aware disambiguation. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, San Francisco, California, USA, pp 1335–1344
Zhang N, Ding S, Zhang J (2016b) Multi layer ELM-RBF for multi-label learning. Appl Soft Comput 43:535–545
Zhang ML, Yu F, Tang CZ (2017) Disambiguation-free partial label learning. IEEE Trans Knowl Data Eng 29:2155–2167
Zhao H, Zheng J, Deng W, Song Y (2020) Semi-supervised broad learning system based on manifold regularization and broad network. IEEE Trans Circuits Syst I Regul Pap 67(3):983–994
Zhou Z (2012) Ensemble methods: foundations and algorithms. Chapman and Hall, Boca Raton
Zhou Y, Gu H (2018) Geometric mean metric learning for partial label data. Neurocomputing 275:394–402
Zhou Y, He JJ, Gu H (2017a) Partial label learning via Gaussian processes. IEEE Trans Cybern 47:4443–4450
Zhou LG, Wang QY, Fujita H (2017b) One versus one multi-class classification fusion using optimizing decision directed acyclic graph for predicting listing status of companies. Inf Fusion 36:80–89
Zhou DY, Zhang ZK, Zhang ML, He YL (2018) Weakly supervised POS tagging without disambiguation. ACM Trans Asian Low Resour Lang Inf Process 17:1–9
Zhou JT, Tsang IW, Ho SS, Muller KR (2019) N-ary decomposition for multi-class classification. Mach Learn 108:809–830
Acknowledgements
This work is supported by the National Natural Science Foundation of China (No. 61772023) and Natural Science Foundation of Fujian Province (No. 2016J 01320)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Communicated by V. Loia.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lin, G., Liu, K., Wang, B. et al. Partial label learning based on label distributions and error-correcting output codes. Soft Comput 25, 1049–1064 (2021). https://doi.org/10.1007/s00500-020-05203-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-020-05203-0