Reference Hub1
Improved Feature Selection by Incorporating Gene Similarity into the LASSO

Improved Feature Selection by Incorporating Gene Similarity into the LASSO

Christopher E. Gillies, Xiaoli Gao, Nilesh V. Patel, Mohammad-Reza Siadat, George D. Wilson
Copyright: © 2012 |Volume: 3 |Issue: 1 |Pages: 22
ISSN: 1947-9115|EISSN: 1947-9123|EISBN13: 9781466613218|DOI: 10.4018/jkdb.2012010101
Cite Article Cite Article

MLA

Gillies, Christopher E., et al. "Improved Feature Selection by Incorporating Gene Similarity into the LASSO." IJKDB vol.3, no.1 2012: pp.1-22. http://doi.org/10.4018/jkdb.2012010101

APA

Gillies, C. E., Gao, X., Patel, N. V., Siadat, M., & Wilson, G. D. (2012). Improved Feature Selection by Incorporating Gene Similarity into the LASSO. International Journal of Knowledge Discovery in Bioinformatics (IJKDB), 3(1), 1-22. http://doi.org/10.4018/jkdb.2012010101

Chicago

Gillies, Christopher E., et al. "Improved Feature Selection by Incorporating Gene Similarity into the LASSO," International Journal of Knowledge Discovery in Bioinformatics (IJKDB) 3, no.1: 1-22. http://doi.org/10.4018/jkdb.2012010101

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Personalized medicine is customizing treatments to a patient’s genetic profile and has the potential to revolutionize medical practice. An important process used in personalized medicine is gene expression profiling. Analyzing gene expression profiles is difficult, because there are usually few patients and thousands of genes, leading to the curse of dimensionality. To combat this problem, researchers suggest using prior knowledge to enhance feature selection for supervised learning algorithms. The authors propose an enhancement to the LASSO, a shrinkage and selection technique that induces parameter sparsity by penalizing a model’s objective function. Their enhancement gives preference to the selection of genes that are involved in similar biological processes. The authors’ modified LASSO selects similar genes by penalizing interaction terms between genes. They devise a coordinate descent algorithm to minimize the corresponding objective function. To evaluate their method, the authors created simulation data where they compared their model to the standard LASSO model and an interaction LASSO model. The authors’ model outperformed both the standard and interaction LASSO models in terms of detecting important genes and gene interactions for a reasonable number of training samples. They also demonstrated the performance of their method on a real gene expression data set from lung cancer cell lines.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.