Abstract
Preprocessing microarray data consists of a number of statistical procedures that convert the observed intensities into quantities that represent biological events of interest, like gene expression and allele-specific abundances. Here, we present a summary of the theory behind microarray data preprocessing for expression, whole transcriptome and SNP designs and focus on the computational protocol used to obtain processed data that will be used on downstream analyses. We describe the main features of the oligo Bioconductor package, an application designed to support oligonucleotide microarrays using the R statistical environment and the infrastructure provided by Bioconductor, allowing the researcher to handle probe-level data and interface with advanced statistical tools under a simplified framework. We demonstrate the use of the package by preprocessing data originated from three different designs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Welter D, MacArthur J, Morales J et al (2014) The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 42(Database issue):D1001–D1006
Irizarry R, Hobbs B, Collin F et al (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4(2):249–264
Irizarry R, Bolstad BM, Collin F et al (2003) Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 31(4), e15
Gentleman RC, Carey VJ, Bates DM et al (2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 5(10):R80
R Core Team (2014) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org/
Carvalho BS, Irizarry RA (2010) A framework for oligonucleotide microarray preprocessing. Bioinformatics 26(19):2363–2367
Carvalho B, Bengtsson H, Speed TP, Irizarry R (2007) Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. Biostatistics 8(2):485–499
Zhang L, Yin S, Miclaus et al (2010) Assessment of variability in GWAS with CRLMM genotyping algorithm on WTCCC coronary artery disease. Pharmacogenomics J 10(4):347–354
Ritchie ME, Phipson B, Wu D, Hu Y et al (2015) limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43(7):e47
Clayton D (2014) snpStats: SnpMatrix and XSnpMatrix classes and methods. R package version 1.16.0
Carvalho B, Louis TA, Irizarry RA (2010) Quantifying uncertainty in genotype calls. Bioinformatics 26(2):242–249
Kwissa M, Nakaya HI, Onlamoon N et al (2014) Dengue virus infection induces expansion of a CD14(+)CD16(+) monocyte population that stimulates plasmablast differentiation. Cell Host Microbe 16(1):115–127
Ilmjärv S, Hundahl CA, Reimets R et al (2014) Estimating differential expression from multiple indicators. Nucleic Acids Res 42(8), e72
Koeman JM, Russell RC, Tan M-H et al (2008) Somatic pairing of chromosome 19 in renal oncocytoma is associated with deregulated EGLN2-mediated [corrected] oxygen-sensing response. PLoS Genet 4(9):e1000176
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this protocol
Cite this protocol
Carvalho, B.S. (2016). Working with Oligonucleotide Arrays. In: Mathé, E., Davis, S. (eds) Statistical Genomics. Methods in Molecular Biology, vol 1418. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-3578-9_7
Download citation
DOI: https://doi.org/10.1007/978-1-4939-3578-9_7
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-3576-5
Online ISBN: 978-1-4939-3578-9
eBook Packages: Springer Protocols