Skip to main content
Log in

A new multi-colony fairness algorithm for feature selection

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

As the world gradually transforms from an information world to a data-driven world, areas of pattern recognition and data mining are facing more and more challenges. The process of feature subset selection becomes a necessary part of big data pattern recognition due to the data with explosive growth. Inspired by the behavior of grabbing resources in animals, this paper adds personal grabbing-resource behavior into the model of resource allocation transformed from the model of feature selection. Multi-colony fairness algorithm (MCFA) is proposed to deal with grabbing-resource behaviors in order to obtain a better distribution scheme (i.e., to obtain a better feature subset). The algorithm effectively fuses strategies of the random search and the heuristic search. In addition, it combines methods of filter and wrapper so as to reduce the amount of calculation while improving classification accuracies. The convergence and the effectiveness of the proposed algorithm are verified both from mathematical and experimental aspects. MCFA is compared with other four classic feature selection algorithms such as sequential forward selection, sequential backward selection, sequential floating forward selection, and sequential floating backward selection and three mainstream feature selection algorithms such as relevance–redundancy feature selection, minimal redundancy–maximal relevance, and ReliefF. The comparison results show that the proposed algorithm can obtain better feature subsets both in the aspects of feature subset length which is defined as the number of features in a feature subset and the classification accuracy. The two aspects indicate the efficiency and the effectiveness of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  • Averell L, Heathcote A (2011) The form of the forgetting curve and the fate of memories. J Math Psychol 55(1):25–35

    Article  MATH  MathSciNet  Google Scholar 

  • Azar AT, Elshazly HI, Hassanien AE et al (2014) A random forest classifier for lymph diseases. Comput Methods Programs Biomed 113(2):465–473

    Article  Google Scholar 

  • Bouatmane S, Roula MA, Bouridane A et al (2011) Round-robin sequential forward selection algorithm for prostate cancer classification and diagnosis using multispectral imagery. Mach Vis Appl 22:865–878

    Article  Google Scholar 

  • Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MATH  MathSciNet  Google Scholar 

  • Feng X, Yang T, Li S (2015) Network behavior-oriented CDN cache allocation strategy. Comput Sci 42:156–161

    Google Scholar 

  • Gan JQ, Hasan BAS, Tsui CSL (2014) A filter-dominating hybrid sequential forward floating search method for feature subset selection in high-dimensional space. Int J Mach Learn Cybern 5:413–423

    Article  Google Scholar 

  • Garcia S, Molina D, Lozano M et al (2009) A study on the use of non-parametric tests for analyzing the evolutionary algorithms behaviour: a case study on the CEC2005 special session on real parameter optimization. J Heuristics 15:617–644

    Article  MATH  Google Scholar 

  • Glten A (2013) Genetic algorithm wrapped bayesian network feature selection applied to differential diagnosis of erythemato-squamous diseases. Digit Signal Proc 23(1):230–237

    Article  MathSciNet  Google Scholar 

  • Guyon I (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182

    MATH  Google Scholar 

  • Han XH, Chang XM, Quan L et al (2014) Feature subset selection by gravitational search algorithm optimization. Inf Sci 281:128–146

    Article  MathSciNet  Google Scholar 

  • Herzfeld DJ, Vaswani PA, Marko MK et al (2014) A memory of errors in sensorimotor learning. Science 345(6202):1349–1353

    Article  Google Scholar 

  • Juanying X, Weixin X (2014) Several feature selection algorithms based on the discernibility of a feature subset and support vector machines. Chin J Comput 37(8):1704–1718

    Google Scholar 

  • Linksvayer T (2014) Evolutionary biology: Survival of the fittest group. Nature 514(7522):308–309

    Article  Google Scholar 

  • Mar T, Zaunseder S, Martinez JP et al (2011) Optimization of ECG classification by means of feature selection. IEEE Trans Biomed Eng 58(8):2168–2177

    Article  Google Scholar 

  • Mersch DP, Crespi A, Keller L (2013) Tracking individuals shows spatial fidelity is a key regulator of ant social organization. Science 340(6136):1090–1093

    Article  Google Scholar 

  • Moradi P, Rostami M (2015) Integration of graph clustering with ant colony optimization for feature selection. Knowl Based Syst 84:144–161

    Article  Google Scholar 

  • Nemati S, Basiri ME (2011) Text-independent speaker verification using ant colony optimization-based selected features. Expert Syst Appl 38(1):620–630

    Article  Google Scholar 

  • Parkka J, Ermes M, van Gils M (2010) Automatic feature selection and classification of physical and mental load using data from wearable sensors. IEEE, Washington

    Book  Google Scholar 

  • Peng H, Yinlian F, Liu J et al (2013) Optimal gene subset selection using the modified SFFS algorithm for tumor classification. Neural Comput Appl 23:1531–1538

    Article  Google Scholar 

  • Peter C, Jessica JK (2008) The interaction between predation and competition. Nature 456(7219):235–238

    Article  Google Scholar 

  • Uzer MS, Inan O, Yilmaz N (2013) A hybrid breast cancer detection system via neural network and feature selection based on sbs, sfs and pca. Neural Comput Appl 23:719–728

    Article  Google Scholar 

  • Vergara JR, Estevez PA (2014) A review of feature selection methods based on mutual information. Neural Comput Appl 24:175–186

    Article  Google Scholar 

  • Xiaofeng M, Yong L, Jianhua Z (2013) Social computing in the era of big data: opportunities and challenges. J Comput Res Dev 50(12):2483–2491

    Google Scholar 

  • Xie J, Lei J, Xie W (2013) Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases. Health Inf Sci Syst 1:1–14

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grant Nos. 61472139 and 61462073, the Software and Integrated Circuit Industry Development Special Funds of Shanghai Economic and Information Commission under Grant No. 140304.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Xiang Feng or Tan Yang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Communicated by V. Loia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Feng, X., Yang, T. & Yu, H. A new multi-colony fairness algorithm for feature selection. Soft Comput 21, 7141–7157 (2017). https://doi.org/10.1007/s00500-016-2257-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-016-2257-0

Keywords

Navigation