Partial label learning based on label distributions and error-correcting output codes

Lin, Guangyi; Liu, Kunhong; Wang, Beizhan; Zhang, Xiaoyan

doi:10.1007/s00500-020-05203-0

Partial label learning based on label distributions and error-correcting output codes

Methodologies and Application
Published: 01 August 2020

Volume 25, pages 1049–1064, (2021)
Cite this article

Soft Computing Aims and scope Submit manuscript

Guangyi Lin ORCID: orcid.org/0000-0001-7479-5684¹,
Kunhong Liu¹,
Beizhan Wang¹ &
…
Xiaoyan Zhang²

516 Accesses
7 Citations
Explore all metrics

Abstract

Partial label learning (PLL) is a class of weak supervision learning problems in which each data sample has a candidate set of labels, among which only one label is correct. In this paper, a new PLL algorithm with prior information of the label distribution based on ECOC (PL-PIE) is proposed. PL-PIE utilizes the ECOC framework to decompose the problem into multiple binary problems. Different from the instability of the existing random dichotomy, the proposal exploits the prior information of label distribution to generate positive and negative classes with stable performance. Extensive experimental results demonstrate that the proposed PL-PIE algorithm has highly competitive performance compared to the state-of-the-art PLL algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Partial Label Learning with Gradually Induced Error-Correction Output Codes

PP-PLL: Probability Propagation for Partial Label Learning

Partial Label Learning by Entropy Minimization

References

JP Ainam, Qin K, Luo G (2018) Person re-identification through clustering and partial label smoothing regularization
Allwein EL, Schapire RE, Singer Y (2001) Reducing multiclass to binary: a unifying approach for margin classifiers. J Mach Learn Res 1:113–141
MathSciNet MATH Google Scholar
Bache K, Lichman M (2013) UCI machine learning repository. School of Information and Computer Sciences, University of California, Irvine. [Online]. Available: http://archive.ics.uci.edu/ml
Bautista MA, Escalera S, Baro X, Pujol O (2014) On the design of an ECOC-compliant genetic algorithm. Pattern Recognit 47:865–884
Article Google Scholar
Breiman L (2001) Random forests. Mach Learn 45:5–32
Article Google Scholar
Briggs F, Lakshminarayanan B, Neal L, Fern XZ, Raich R, Hadley SJK, Hadley AS, Betts MG (2012) Acoustic classification of multiple simultaneous bird species: a multi-instance multi-label approach. J Acoust Soc Am 131:4640–4650
Article Google Scholar
Chang CC, Lin CJ (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:1–27
Article Google Scholar
Chen CLP, Liu Z (2018) Broad learning system: an effective and efficient incremental learning system without the need for deep architecture. IEEE Trans Neural Netw Learn Syst 29(1):10–24
Article MathSciNet Google Scholar
Cherfi ZL, Oukhellou L, Come E, Denoeux T, Aknin P (2012) Partially supervised independent factor analysis using soft labels elicited from multiple experts: application to railway track circuit diagnosis. Soft Comput 16:741–754
Article Google Scholar
Chinchor N (1992) MUC-4 evaluation metrics. In: Proceedings of the 4th conference on message understanding. Association for Computational Linguistics, McLean, Virginia, pp 22–29
Cour T, Sapp B, Jordan C, Taskar B (2009) IEEE learning from ambiguously labeled images. In: Cvpr: 2009 IEEE conference on computer vision and pattern recognition, vol 1–4. IEEE, New York, pp 919–926
Cour T, Sapp B, Taskar B (2011) Learning from partial labels. J Mach Learn Res 12:1501–1536
MathSciNet MATH Google Scholar
Crammer K, Singer Y (2002) On the learnability and design of output codes for multiclass problems. Mach Learn 47:201–233
Article Google Scholar
Deng W, Zhao HM, Zou L, Li GY, Yang XH, Wu DQ (2017a) A novel collaborative optimization algorithm in solving complex optimization problems. Soft Comput 21(15):4387–4398
Article Google Scholar
Deng W, Zhao H, Yang X, Xiong J, Sun M, Li B (2017b) Study on an improved adaptive PSO algorithm for solving multi-objective gate assignment. Appl Soft Comput 59:288–302
Article Google Scholar
Deng W, Xu JJ, Zhao HM (2019) An improved ant colony optimization algorithm based on hybrid strategies for scheduling problem. IEEE Access 7:20281–20292
Article Google Scholar
Dietterich TG, Bakiri G (1994) Solving multiclass learning problems via ECOCs. J Artif Intell Res 2:263–286
Article Google Scholar
Ding S, Zhang X, An Y, Xue Y (2017a) Weighted linear loss multiple birth support vector machine based on information granulation for multi-class classification. Pattern Recognit 67:32–46
Article Google Scholar
Ding S, Zhang N, Zhang J, Xu X, Shi Z (2017b) Unsupervised extreme learning machine with representational features. Int J Mach Learn Cybern 8(2):587–595
Article Google Scholar
Escalera S, Pujol O, Radeva P (2006) ECOC-ONE: a novel coding and decoding strategy. In: International conference on pattern recognition, p 578
Escalera S, Pujol O, Radeva P (2010) On the decoding process in ternary error-correcting output codes. IEEE Trans Pattern Anal Mach Intell 32:120–134
Article Google Scholar
Evangelos S, David WA (1997) Special issue on lazy learning. Artif Intell Rev 11:7–10
Article Google Scholar
Gershman SJ, Blei DM (2012) A tutorial on Bayesian nonparametric models. J Math Psychol 56:1–12
Article MathSciNet Google Scholar
Gong C, Liu TL, Tang YY, Yang J, Yang J, Tao DC (2018) A regularization approach for instance-based superset label learning. IEEE Trans Cybern 48:967–978
Article Google Scholar
Guillaumin M, Verbeek J, Schmid C (2010) Multiple instance metric learning from automatically labeled bags of faces. In: Proceedings of the 11th European conference on computer vision: part I. Springer, Heraklion, Crete, Greece, pp 634–647
Huang G-B, Zhu Q-Y, Siew C-K (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: 2004 IEEE international joint conference on neural networks (IEEE Cat. No. 04CH37541), Budapest, vol 2, pp 985–990
Hullermeier E, Beringer J (2006) Learning from ambiguously labeled examples. Intell Data Anal 10:419–439
Article Google Scholar
Liu LP, Dietterich TG (2012) A conditional multinomial mixture model for superset label learning. In: Proceedings of the 25th international conference on neural information processing systems, vol 1. Curran Associates Inc., Lake Tahoe, Nevada, pp 548–556
Luo J, Orabona F (2010) Learning from candidate labeling sets. In: NIPS
Meng WB, Liu Y, Zhang SL, Pei D, Dong H, Song L, Luo XL (2018) Device-agnostic log anomaly classification with partial labels. In: 2018 IEEE/ACM 26th international symposium on quality of service (Iwqos)
Nguyen N, Caruana R (2008) Classification with partial labels. In: Proceedings of the 14th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, Las Vegas, Nevada, USA, pp 551–559
Pujol O, Radeva P, Vitria J (2006) Discriminant ECOC: a heuristic method for application dependent design of error correcting output codes. IEEE Trans Pattern Anal 28:1007–1012
Article Google Scholar
Pujol O, Escalera S, Radeva P (2008) An incremental node embedding technique for error correcting output codes. Pattern Recognit 41:713–725
Article Google Scholar
Rajasekar L, Sharmila D (2019) Performance analysis of soft computing techniques for the automatic classification of fruits dataset. Soft Comput 23:2773–2788
Article Google Scholar
Sun MX, Liu KH, Wu QQ, Hong QQ, Wang BZ, Zhang HY (2019) A novel ECOC algorithm for multiclass microarray data classification based on data complexity analysis. Pattern Recognit 90:346–362
Article Google Scholar
Xu N, Tao A, Geng X (2018) Label enhancement for label distribution learning. In: Proceedings of the twenty-seventh international joint conference on artificial intelligence, IJCAI-18, international joint conferences on artificial intelligence organization, pp 2926–2932
Xue Y, Jiang JM, Zhao BP, Ma TH (2018) A self-adaptive artificial bee colony algorithm based on global best for global optimization. Soft Comput 22:2935–2952
Article Google Scholar
Yan JJ, Zhang ZN, Xie LW, Zhu ZT (2019) A unified framework for decision tree on continuous attributes. IEEE Access 7:11924–11933
Article Google Scholar
Ye XN, Liu KH (2018) A novel genetic algorithm based ECOC algorithm. In: International conference on semantics, knowledge and grids, pp 241–244
Yu F, Zhang ML (2017) Maximum margin partial label learning. Mach Learn 106:573–593
Article MathSciNet Google Scholar
Yuen SY, Lou Y, Zhang X (2019) Selecting evolutionary algorithms for black box design optimization problems. Soft Comput 23:6511–6531
Article Google Scholar
Zeng ZN, Xiao SJ, Jia K, Chan TH, Gao SH, Xu D, Ma Y (2013) Learning by associating ambiguously labeled images. In: 2013 IEEE conference on computer vision and pattern recognition (Cvpr), pp 708–715
Zhang ML, Zhou BB, Liu XY (2016a) Partial label learning via feature-aware disambiguation. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, San Francisco, California, USA, pp 1335–1344
Zhang N, Ding S, Zhang J (2016b) Multi layer ELM-RBF for multi-label learning. Appl Soft Comput 43:535–545
Article Google Scholar
Zhang ML, Yu F, Tang CZ (2017) Disambiguation-free partial label learning. IEEE Trans Knowl Data Eng 29:2155–2167
Article Google Scholar
Zhao H, Zheng J, Deng W, Song Y (2020) Semi-supervised broad learning system based on manifold regularization and broad network. IEEE Trans Circuits Syst I Regul Pap 67(3):983–994
Article MathSciNet Google Scholar
Zhou Z (2012) Ensemble methods: foundations and algorithms. Chapman and Hall, Boca Raton
Book Google Scholar
Zhou Y, Gu H (2018) Geometric mean metric learning for partial label data. Neurocomputing 275:394–402
Article Google Scholar
Zhou Y, He JJ, Gu H (2017a) Partial label learning via Gaussian processes. IEEE Trans Cybern 47:4443–4450
Article Google Scholar
Zhou LG, Wang QY, Fujita H (2017b) One versus one multi-class classification fusion using optimizing decision directed acyclic graph for predicting listing status of companies. Inf Fusion 36:80–89
Article Google Scholar
Zhou DY, Zhang ZK, Zhang ML, He YL (2018) Weakly supervised POS tagging without disambiguation. ACM Trans Asian Low Resour Lang Inf Process 17:1–9
Article Google Scholar
Zhou JT, Tsang IW, Ho SS, Muller KR (2019) N-ary decomposition for multi-class classification. Mach Learn 108:809–830
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61772023) and Natural Science Foundation of Fujian Province (No. 2016J 01320)

Author information

Authors and Affiliations

School of Informatics, Xiamen University, Xiamen, 361005, Fujian Province, People’s Republic of China
Guangyi Lin, Kunhong Liu & Beizhan Wang
Xiamen University Tan Kah Kee College, Xiamen, 363105, Fujian Province, People’s Republic of China
Xiaoyan Zhang

Authors

Guangyi Lin
View author publications
You can also search for this author in PubMed Google Scholar
Kunhong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Beizhan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyan Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kunhong Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Communicated by V. Loia.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, G., Liu, K., Wang, B. et al. Partial label learning based on label distributions and error-correcting output codes. Soft Comput 25, 1049–1064 (2021). https://doi.org/10.1007/s00500-020-05203-0

Download citation

Published: 01 August 2020
Issue Date: January 2021
DOI: https://doi.org/10.1007/s00500-020-05203-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Partial label learning based on label distributions and error-correcting output codes

Abstract

Access this article

Similar content being viewed by others

Partial Label Learning with Gradually Induced Error-Correction Output Codes

PP-PLL: Probability Propagation for Partial Label Learning

Partial Label Learning by Entropy Minimization

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Partial label learning based on label distributions and error-correcting output codes

Abstract

Access this article

Similar content being viewed by others

Partial Label Learning with Gradually Induced Error-Correction Output Codes

PP-PLL: Probability Propagation for Partial Label Learning

Partial Label Learning by Entropy Minimization

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation