Maximum relevancy maximum complementary based ordered aggregation for ensemble pruning

Xia, Xin; Lin, Tao; Chen, Zhi

doi:10.1007/s10489-017-1106-x

Published: 12 December 2017

Volume 48, pages 2568–2579, (2018)

Applied Intelligence

Xin Xia¹,
Tao Lin¹ &
Zhi Chen¹

Abstract

Ensemble methods have delivered exceptional performance in various applications. However, this performance comes at the expense of heavy storage requirements and slower predictions. Ensemble pruning aims to reduce the complexity of this popular learning paradigm without degrading its performance. This paper presents an efficient and effective ordering-based ensemble pruning method that ranks all the base classifiers with respect to a maximum relevancy maximum complementary (MRMC) measure. The MRMC measure evaluates a base classifier’s classification ability as well as its complementariness to the ensemble, so that a set of accurate and complementary base classifiers can be selected. Moreover, an evaluation function that deliberately favors candidate sub-ensembles with better performance in classifying low-margin instances is also proposed. Experiments performed on 25 benchmark datasets demonstrate the effectiveness of the proposed method.
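
The general idea of ordering base classifiers by relevance plus complementarity can be illustrated with a minimal Python sketch. The abstract does not give the MRMC formula, so the scoring below is an assumption made for illustration only: relevance is taken as the empirical mutual information between a classifier's predictions and the labels, and complementarity as the mean conditional mutual information given each already-selected classifier. The function names (`mutual_information`, `conditional_mutual_information`, `order_classifiers`) are hypothetical and do not come from the paper.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information I(X; Y) in bits for two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px = Counter(xs)
    py = Counter(ys)
    return sum(
        (c / n) * log2((c / n) / ((px[x] / n) * (py[y] / n)))
        for (x, y), c in joint.items()
    )


def conditional_mutual_information(xs, ys, zs):
    """Empirical I(X; Y | Z): MI of X and Y within each group Z = z, weighted by P(z)."""
    n = len(zs)
    groups = {}
    for x, y, z in zip(xs, ys, zs):
        groups.setdefault(z, []).append((x, y))
    total = 0.0
    for pairs in groups.values():
        gx, gy = zip(*pairs)
        total += (len(pairs) / n) * mutual_information(gx, gy)
    return total


def order_classifiers(predictions, labels):
    """Greedily rank classifiers by an assumed relevance + complementarity score.

    predictions[i][j] is the label predicted by classifier i for validation
    instance j. The score is an illustrative stand-in, not the paper's MRMC.
    """
    remaining = list(range(len(predictions)))
    relevance = {i: mutual_information(predictions[i], labels) for i in remaining}
    ordered = []
    while remaining:
        def score(i):
            if not ordered:
                return relevance[i]  # first pick: relevance only
            complementarity = sum(
                conditional_mutual_information(predictions[i], labels, predictions[s])
                for s in ordered
            ) / len(ordered)
            return relevance[i] + complementarity

        best = max(remaining, key=score)  # ties go to the lowest index
        ordered.append(best)
        remaining.remove(best)
    return ordered
```

Under this sketch, pruning would amount to keeping the first k classifiers of the returned ordering, with k chosen on validation data. The low-margin evaluation function mentioned in the abstract is not modelled here.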

Author information

Authors and Affiliations

College of Computer Science, Sichuan University, No. 24 South Section 1 of Yihuan Road, Chengdu, Sichuan, China
Xin Xia, Tao Lin & Zhi Chen

Corresponding author

Correspondence to Tao Lin.

About this article

Cite this article

Xia, X., Lin, T. & Chen, Z. Maximum relevancy maximum complementary based ordered aggregation for ensemble pruning. Appl Intell 48, 2568–2579 (2018). https://doi.org/10.1007/s10489-017-1106-x

Published: 12 December 2017
Issue Date: September 2018
DOI: https://doi.org/10.1007/s10489-017-1106-x
