Abstract:
In exploratory association studies of genes with certain diseases, a single or a small number of genes (features) related with the diseases are selected1 among many thous...Show MoreMetadata
Abstract:
In exploratory association studies of genes with certain diseases, a single or a small number of genes (features) related with the diseases are selected1 among many thousands investigated. We investigate the statistical bias and variance of simple yet common (correlation and mutual information based) feature selection algorithms using well-known cross-validation methods (leave-one-out and k-fold) on a gene finding study for hypertension prediction. Our findings show that selected genes are different for different methods and different cross-validation runs for both single gene selection and gene subset selection.
Published in: Proceedings of 2012 IEEE-EMBS International Conference on Biomedical and Health Informatics
Date of Conference: 05-07 January 2012
Date Added to IEEE Xplore: 07 June 2012
ISBN Information: