Integrative Exploratory Analysis of Two or More Genomic Datasets

Meng, Chen; Culhane, Aedin

doi:10.1007/978-1-4939-3578-9_2

Chen Meng MS⁴ &
Aedin Culhane PhD^5,6

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1418))

8744 Accesses
3 Citations
1 Altmetric

Abstract

Exploratory analysis is an essential step in the analysis of high throughput data. Multivariate approaches such as correspondence analysis (CA), principal component analysis, and multidimensional scaling are widely used in the exploratory analysis of single dataset. Modern biological studies often assay multiple types of biological molecules (e.g., mRNA, protein, phosphoproteins) on a same set of biological samples, thereby creating multiple different types of omics data or multiassay data. Integrative exploratory analysis of these multiple omics data is required to leverage the potential of multiple omics studies. In this chapter, we describe the application of co-inertia analysis (CIA; for analyzing two datasets) and multiple co-inertia analysis (MCIA; for three or more datasets) to address this problem. These methods are powerful yet simple multivariate approaches that represent samples using a lower number of variables, allowing a more easily identification of the correlated structure in and between multiple high dimensional datasets. Graphical representations can be employed to this purpose. In addition, the methods simultaneously project samples and variables (genes, proteins) onto the same lower dimensional space, so the most variant variables from each dataset can be selected and associated with samples, which can be further used to facilitate biological interpretation and pathway analysis. We applied CIA to explore the concordance between mRNA and protein expression in a panel of 60 tumor cell lines from the National Cancer Institute. In the same 60 cell lines, we used MCIA to perform a cross-platform comparison of mRNA gene expression profiles obtained on four different microarray platforms. Last, as an example of integrative analysis of multiassay or multi-omics data we analyzed transcriptomic, proteomic, and phosphoproteomic data from pluripotent (iPS) and embryonic stem (ES) cell lines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Hardcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Fellenberg K, Hauser NC, Brors B, Neutzner A, Hoheisel JD, Vingron M (2001) Correspondence analysis applied to microarray data. Proc Natl Acad Sci USA 98:10781–10786
Article CAS PubMed PubMed Central Google Scholar
Raychaudhuri S, Stuart JM, Altman RB (2000) Principal components analysis to summarize microarray experiments: application to sporulation time series. In: Pacific Symposium on Biocomputing, pp 455–466
Google Scholar
Culhane AC, Perriere G, Higgins DG (2003) Cross-platform comparison and visualisation of gene expression data using co-inertia analysis. BMC Bioinformatics 21(4):59
Article Google Scholar
Meng C, Kuster B, Culhane A, Gholami AM (2014) A multivariate approach to the integration of multi-omics datasets. BMC Bioinformatics 29(15):162
Google Scholar
Dolédec S, Chessel D (1994) Co-inertia analysis: an alternative method for studying species-environment relationships. Freshw Biol 31:277–294
Article Google Scholar
Culhane AC, Fagan A, Higgins DG (2007) A multivariate analysis approach to the integration of proteomic and gene expression data. Proteomics 7:2162–2171
Google Scholar
Reinhold WC, Sunshine M, Liu H, Varma S, Kohn KW, Morris J, Doroshow J, Pommier Y (2012) Cellminer: a web-based suite of genomic and pharmacologic tools to explore transcript and drug patterns in the NCI-60 cell line set. Cancer Res 72(14):3499–511
Article CAS PubMed PubMed Central Google Scholar
Moghaddas Gholami A, Hahne H, Wu Z, Auer FJ, Meng C, Wilhelm M, Kuster B (2013) Global proteome analysis of the NCI-60 cell line panel. Cell Rep 4:609–620
Google Scholar
Phanstiel DH, Brumbaugh J, Wenger CD, Tian S, Probasco MD, Bailey DJ, Swaney DL, Tervo MA, Bolin JM, Ruotti V, Stewart R, Thomson JA, Coon JJ (2011) Proteomic and phosphoproteomic comparison of human ES and iPS cells. Nat Methods 8:821–827
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Chair of Proteomics and Bioanalytics, Technische Universität Mnchen, Emil-Erlenmeyer-Forum 5, 85354, Freising, Germany
Chen Meng MS
Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, 450 Brookline Ave., Boston, MA, 02215, USA
Aedin Culhane PhD
Department of Biostatistics, Harvard T.H. Chan School of Public Health, 677 Huntington Avenue, Boston, MA, 02115, USA
Aedin Culhane PhD

Authors

Chen Meng MS
View author publications
You can also search for this author in PubMed Google Scholar
Aedin Culhane PhD
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chen Meng MS .

Editor information

Editors and Affiliations

Ohio State University, Biomed Informatics, College of Medicine, Columbus, Ohio, USA
Ewy Mathé
National Cancer Institute, National Institutes of Health, Columbia, Maryland, USA
Sean Davis

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Meng, C., Culhane, A. (2016). Integrative Exploratory Analysis of Two or More Genomic Datasets. In: Mathé, E., Davis, S. (eds) Statistical Genomics. Methods in Molecular Biology, vol 1418. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-3578-9_2

Download citation

DOI: https://doi.org/10.1007/978-1-4939-3578-9_2
Published: 24 March 2016
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-3576-5
Online ISBN: 978-1-4939-3578-9
eBook Packages: Springer Protocols

Publish with us

Policies and ethics