Skip to main content
Log in

A survey on feature selection methods for mixed data

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

Feature Selection for mixed data is an active research area with many applications in practical problems where numerical and non-numerical features describe the objects of study. This paper provides the first comprehensive and structured revision of the existing supervised and unsupervised feature selection methods for mixed data reported in the literature. Additionally, we present an analysis of the main characteristics, advantages, and disadvantages of the feature selection methods reviewed in this survey and discuss some important open challenges and potential future research opportunities in this field.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Notes

  1. Also called heterogeneous or assorted data.

  2. The label assigned to each object in the dataset can be a category, an ordered value, or a real value, depending on the specific task.

  3. For the case of UFS methods, class labels are not used in this step.

  4. A parameter given by the user in the range (0, 1) that specifies the average fraction of features per cluster.

References

Download references

Acknowledgements

The first author gratefully acknowledges to the Instituto Nacional de Astrofísica, Óptica y Electrónica (INAOE) for the collaboration grant awarded for the completion of this survey.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saúl Solorio-Fernández.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Solorio-Fernández, S., Carrasco-Ochoa, J. & Martínez-Trinidad, J.F. A survey on feature selection methods for mixed data. Artif Intell Rev 55, 2821–2846 (2022). https://doi.org/10.1007/s10462-021-10072-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-021-10072-6

Keywords

Navigation