Microarrays measure expression patterns of thousands of genes at a time, under same or diverse conditions, to facilitate faster analysis of biological processes. This gene expression data is being widely used for diagnosis, prognosis and tailored drug discovery. Microarray data, however, commonly contains missing values, which can have high impact on subsequent biological knowledge discovery methods. This has been catalyst for the manifest of different imputation algorithms, including Collateral Missing Value Estimation (CMVE), Bayesian Principal Component Analysis (BPCA), Least Square Impute (LSImpute), Local Least Square Impute (LLSImpute) and K-Nearest Neighbour (KNN). This Chapter investigates the impact of missing values on post genomic knowledge discovery methods like, Gene Selection and Gene Regulatory Network (GRN) reconstruction. A framework for robust subsequent biological knowledge inference has been proposed which has shown significant improvements in the outcomes of Gene Selection and GRN reconstruction methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
P. D. Sutphin, S. Raychaudhuri, N. C. Denko, R. B. Altman, and A. J. Giaccia, “Application of supervised machine learning to identify genes associated with the hypoxia response,” Nature Genetics, vol. 27, pp. 90, 2001.
D. Schmatz and S. Friend, “A simple recipe for drug interaction networks earns its stars,” Nature Genetics, vol. 38, pp. 405-406, 2006.
M. Joron, C. D. Jiggins, A. Papanicolaou, and W. O. McMillan, “Heliconius wing patterns: an evo-devo model for understanding phenotypic diversity,” Heredity, vol. 97, pp. 157-167, 2006.
I. P. Ioshikhes, I. Albert, S. J. Zanton, and B. F. Pugh, “Nucleosome positions predicted through comparative genomics,” Nature Genetics, vol. doi:10.1038/ng1878, 2006.
A. Brazma, P. Hingamp, J. Quackenbush, G. Sherlock, P. Spellman, C. Stoeckert, J. Aach, W. Ansorge, C. A, and et al, “Minimum information about a microarray experiment (MIAME)-toward standards for microarray data,” Nature Genetics, vol. 29, pp. 365-371, 2001.
S. Oba, M. A. Sato, I. Takemasa, M. Monden, K. Matsubara, and S. Ishii, “A Bayesian Missing Value Estimation Method for Gene Expression Profile Data,” Bioinformatics, vol. 19, pp. 2088-2096, 2003.
E. Wit and J. McClure, Statistics for Microarrays: Design, Analysis and Inference: John Wiley & Sons, 2004.
J. Tuikkala, L. Elo, O. S. Nevalainen, and T. Aittokallio, “Improving missing value estimation in microarray data with gene ontology 10.1093/bioinformatics/btk019,” Bioinformatics, pp. btk019, 2005.
E. Acuna and C. Rodriguez, “The treatment of missing values and its effect in the classifier accuracy,” Classification, Clustering and Data Mining Applications, pp. 639-648, 2004.
H. Kim, G. H. Golub, and H. Park, “Missing value estimation for DNA microarray gene expression data: local least squares imputation 10.1093/bioinformatics/bth499,” Bioinformatics, vol. 21, pp. 187-198, 2005.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “Collateral Missing Value Estimation: Robust missing value estimation for consequent microarray data processing,” Lecture Notes in Artificial Intelligence \(LNAI\), Springer-Verlag, pp. 274-283, 2005.
O. Troyanskaya, M. Cantor, G. Sherlock, P. Brown, T. Hastie, R. Tibshirani, D. Botstein, and R. Altman, “Missing Value Estimation Methods for DNA Microarrays,” Bioinformatics, vol. 17, pp. 520-525, 2001.
T. H. B, B. Dysvik, and I. Jonassen, “LSimpute: Accurate estimation of missing values in microarray data with least squares methods,” Nucleic Acids Res., pp. 32(3):e34, 2004.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “Collateral Missing Value Imputation: a new robust missing value estimation algorithm for microarray data,” Bioinformatics, vol. 21(10), pp. 2417-2423, 2005.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “Missing Values Imputation for DNA Microarray Data using Ranked Covariance Vectors,” The International Journal of Hybrid Intelligent Systems \(IJHIS\), vol. ISSN 1448-5869, 2005.
R. Jornsten, H.-Y. Wang, W. J. Welsh, and M. Ouyang, “DNA microarray data imputation and significance analysis of differential expression 10.1093/bioinformatics/bti638,” Bioinformatics, vol. 21, pp. 4155-4161, 2005.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “Missing Value Imputation Framework for Microarray Significant Gene Selection and Class Prediction,” Lecture Notes in Bioinformatics (LNBI), Springer-Verlag, vol. 3916/2006, pp. 131-142, 2006.
Z. Sidak, P. K. Sen, and J. Hajek, Theory of Rank Tests Probability and Mathematical Statistics: Academic Press, 1999.
I. Hedenfalk, D. Duggan, Y. Chen, M. Radmacher, M. Bittner, R. Simon, P. Meltzer, B. Gusterson, M. Esteller, O. P. Kallioniemi, B. Wilfond, A. Borg, and J. Trent, “Gene-expression profiles in hereditary breast cancer,” N. Engl. J. Med, pp. 22; 344\(8\):539-548, 2001.
X.-w. Chen, G. Anantha, and X. Wang, “An effective structure learning method for constructing gene networks 10.1093/bioinformatics/btl090,” Bioinformatics, vol. 22, pp. 1367-1374, 2006.
K. E. Lee, N. Sha, E. R. Dougherty, M. Vannucci, and B. K. Mallick, “Gene selection: a Bayesian variable selection approach 10.1093/bioinformatics/19.1.90,” Bioinformatics, vol. 19, pp. 90-97, 2003.
P. Y. Chen and P. M. Popovich, Correlation: Parametric and Nonparametric Measures, 1st edition ed: SAGE Publications, 2002.
M. Harvey and C. Arthur, “Fitting models to biological Data using linear and nonlinear regression,” Oxford University Press, 2004.
K. Basso, A. A. Margolin, G. Stolovitzky, U. Klein, R. Dalla-Favera, and A. Califano, “Reverse engineering of regulatory networks in human B cells,” Nature Genetics, vol. 37, pp. 382-390, 2005.
F. V. Jensen, Bayesian Networks and Decision Graphs, 2 ed: Springer, 2002.
J. Ihmels, R. Levy, and N. Barkai, “Principles of transcriptional control in the metabolic network of Saccharomyces cerevisiae,” Nature Biotechnology, vol. 22, pp. 86-92, 2003.
G. Casella and C. P. Robert, Monte Carlo Statistical Methods: Springer, 2005.
E. Jansen, J. S. E. Laven, H. B. R. Dommerholt, J. Polman, C. van Rijt, C. van den Hurk, J. Westland, S. Mosselman, and B. C. J. M. Fauser, “Abnormal Gene Expression Profiles in Human Ovaries from Polycystic Ovary Syndrome Patients 10.1210/me.2004-0074,” Mol Endocrinol, vol. 18, pp. 3050-3063, 2004.
A. A. Margolin, I. Nemenman, K. Basso, C. Wiggins, G. Stolovitzky, R. D. Favera, and A. Califano, “ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context,” BMC Bioinformatics, vol. 7, 2006.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “CF-GeNe: Fuzzy Framework for Robust Gene Regulatory Network Inference,” Journal of Computers, Academy Press, vol. 7, pp. 1-8, 2006.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “Missing Value Imputation Framework for Microarray Significant Gene Selection and Class Prediction,” Lecture Notes in Bioinformatics (LNBI), Springer-Verlag, vol. 3916, pp. 131-142, 2006.
M. S. B. Sehgal, I. Gondal, and L. Dooley, “Missing Values Imputation for DNA Microarray Data using Ranked Covariance Vectors,” The International Journal of Hybrid Intelligent Systems (IJHIS), vol. ISSN 1448-5869, 2005.
L. Liu, D. M. Hawkins, S. Ghosh, and S. S. Young, “Robust singular value decomposition analysis of microarray data.” vol. 100, 2003, pp. 13167-13172.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Sehgal, M.S., Gondal, I., Dooley, L. (2008). Gene Expression Imputation Techniques for Robust Post Genomic Knowledge Discovery. In: Kelemen, A., Abraham, A., Liang, Y. (eds) Computational Intelligence in Medical Informatics. Studies in Computational Intelligence, vol 85. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75767-2_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-75767-2_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75766-5
Online ISBN: 978-3-540-75767-2
eBook Packages: EngineeringEngineering (R0)