Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter February 21, 2018

Missing data in open-data era – a barrier to multiomics integration

  • Monika Piwowar EMAIL logo and Wiktor Jurkowski

Abstract

The exploration of complex interactions in biological systems is one of the main aims in nature science nowadays. Progress in this area is possible because of high-throughput omics technologies and the computational surge. The development of analytical methods “is trying to keep pace” with the development of molecular biology methods that provide increasingly large amounts of data – omics data. Specialized databases consist of ever-larger collections of experiments that are usually conducted by one next-generation sequencing technique (e.g. RNA-seq). Other databases integrate data by defining qualitative relationships between individual objects in the form of ontologies, interactions, and pathways (e.g. GO, KEGG, and String). However, there are no open-source complementary quantitative data sets for the biological processes studied, including information from many levels of the organism organization, which would allow the development of multidimensional data analysis methods (multiscale and insightful overviews of biological processes). In the paper, the lack of omics complementary quantitative data set, which would help integrate the defined qualitative biological relationships of individual biomolecules with statistical, computational methods, is discussed.

  1. Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

  2. Research funding: None declared.

  3. Employment or leadership: None declared.

  4. Honorarium: None declared.

  5. Competing interests: The funding organization(s) played no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the report for publication.

References

[1] Tebani A, Afonso C, Marret S, Bekri S. Omics-based strategies in precision medicine: toward a paradigm shift in inborn errors of metabolism investigations.. Int J Mol Sci. 2016;17(9): E1555.10.3390/ijms17091555Search in Google Scholar PubMed PubMed Central

[2] Trivedi DK, Hollywood KA, Goodacre R. Metabolomics for the masses: the future of metabolomics in a personalized world. New Horizons Transl Med [Internet] 2017;3:294–305.10.1016/j.nhtm.2017.06.001Search in Google Scholar PubMed PubMed Central

[3] Chen HH, Kuo MT. Improving radiotherapy in cancer treatment: promises and challenges [Internet]. Oncotarget 2017;8:62742–58.10.18632/oncotarget.18409Search in Google Scholar PubMed PubMed Central

[4] Weinshilboum RM, Wang L. Pharmacogenomics: precision medicine and drug response [Internet]. Mayo Clin Proc 2017;92:1711–22.10.1016/j.mayocp.2017.09.001Search in Google Scholar PubMed PubMed Central

[5] Neavin D, Kaddurah-Daouk R, Weinshilboum R. Pharmacometabolomics informs pharmacogenomics [Internet]. Metabolomics 2016;12:121.10.1007/s11306-016-1066-xSearch in Google Scholar PubMed PubMed Central

[6] Senft D, Leiserson MD, Ruppin E, Ronai ZA. Precision oncology: the road ahead [Internet]. Trends Mol Med 2017;23:874–98.10.1016/j.molmed.2017.08.003Search in Google Scholar PubMed PubMed Central

[7] Kan M, Shumyatcher M, Himes BE. Using omics approaches to understand pulmonary diseases [Internet]. Respir Res 2017;18:149.10.1186/s12931-017-0631-9Search in Google Scholar PubMed PubMed Central

[8] Moran S, Martinez-Cardús A, Boussios S, Esteller M. Precision medicine based on epigenomics: the paradigm of carcinoma of unknown primary [Internet]. Nat Rev Clin Oncol 2017;14:682–94.10.1038/nrclinonc.2017.97Search in Google Scholar PubMed

[9] Dunkler D, Sánchez-Cabo F, Heinze G. Statistical analysis principles for omics data. Methods Mol Biol 2011;719:113–31.10.1007/978-1-61779-027-0_5Search in Google Scholar PubMed

[10] Gottardo R, Pannucci JA, Kuske CR, Brettin T. Statistical analysis of microarray data: a Bayesian approach. Biostatistics 2003;4:597–620.10.1093/biostatistics/4.4.597Search in Google Scholar PubMed

[11] Chadeau-Hyam M, Campanella G, Jombart T, Bottolo L, Portengen L, Vineis P, et al. Deciphering the complex: methodological overview of statistical models to derive OMICS-based biomarkers [Internet]. Environ Mol Mutagen 2013;54:542–57.10.1002/em.21797Search in Google Scholar PubMed

[12] Leipzig J. A review of bioinformatic pipeline frameworks [Internet]. Brief Bioinform 2016;18:bbw020.10.1093/bib/bbw020Search in Google Scholar PubMed PubMed Central

[13] Merrick BA, London RE, Bushel PR, Grissom SF, Paules RS. Platforms for biomarker analysis using high-throughput approaches in genomics, transcriptomics, proteomics, metabolomics, and bioinformatics. IARC Sci Publ. 2011;(163):121–42.Search in Google Scholar PubMed

[14] Waller T, Gubała T, Sarapata K, Piwowar M, Jurkowski W. DNA microarray integromics analysis platform [Internet]. BioData Min 2015;8:18.10.1186/s13040-015-0052-6Search in Google Scholar PubMed PubMed Central

[15] Grene R, Klumas C, Suren H, Yang K, Collakova E, Myers E, et al. Mining and visualization of microarray and metabolomic data reveal extensive cell wall remodeling during winter hardening in Sitka spruce (Picea sitchensis). Front Plant Sci 2012;3:241.10.3389/fpls.2012.00241Search in Google Scholar PubMed PubMed Central

[16] Li S, Todor A, Luo R. Blood transcriptomics and metabolomics for personalized medicine. Comput Struct Biotechnol J 2016;14:1–7.10.1016/j.csbj.2015.10.005Search in Google Scholar PubMed PubMed Central

[17] Su G, Burant CF, Beecher CW, Athey BD, Meng F, Ferrara C, et al. Integrated metabolome and transcriptome analysis of the NCI60 dataset. BMC Bioinform 2011;12:S36.10.1186/1471-2105-12-S1-S36Search in Google Scholar PubMed PubMed Central

[18] Kamburov A, Cavill R, Ebbels TM, Herwig R, Keun HC. Integrated pathway-level analysis of transcriptomics and metabolomics data with IMPaLA [Internet]. Bioinformatics 2011;27:2917–8.10.1093/bioinformatics/btr499Search in Google Scholar PubMed

[19] Kuo T-C, Tian T-F, Tseng Y, Kolbe A, Oliver S, Fernie A, et al. 3Omics: a web-based systems biology tool for analysis, integration and visualization of human transcriptomic, proteomic and metabolomic data. BMC Syst Biol 2013;7:64.10.1186/1752-0509-7-64Search in Google Scholar PubMed PubMed Central

[20] Lin D, Zhang J, Li J, Xu C, Deng H-W, Wang Y-P. An integrative imputation method based on multi-omics datasets. BMC Bioinform 2016;17:247.10.1186/s12859-016-1122-6Search in Google Scholar PubMed PubMed Central

[21] Argelaguet R, Velten B, Arnol D, Dietrich S, Zenz T, Marioni JC, et al. Multi-omics factor analysis disentangles heterogeneity in blood cancer [Internet]. bioRxiv 2017. Article no: 217554. https://www.biorxiv.org/content/early/2017/11/10/217554.Search in Google Scholar

[22] Acharjee A, Ament Z, West JA, Stanley E, Griffin JL. Integration of metabolomics, lipidomics and clinical data using a machine learning method. BMC Bioinform 2016;17:440.10.1186/s12859-016-1292-2Search in Google Scholar PubMed PubMed Central

[23] Acharjee A, Kloosterman B, de Vos RC, Werij JS, Bachem CW, Visser RG, et al. Data integration and network reconstruction with ∼omics data using random forest regression in potato. Anal Chim Acta 2011;705:56–63.10.1016/j.aca.2011.03.050Search in Google Scholar PubMed

[24] Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship [Internet]. Sci Data 2016;3. Article no: 160018.Search in Google Scholar

[25] Schneider MV, Orchard S. Omics technologies, data and bioinformatics principles. Methods Mol Biol 2011;719:3–30.10.1007/978-1-61779-027-0_1Search in Google Scholar PubMed

[26] Fletcher B. First paper with “living figure” published. Available at: https://www.biosciencetechnology.com/article/2015/04/first-paper-living-figure-published. Accessed: 24 Jan 2018.Search in Google Scholar

[27] Colomb J, Brembs B. Sub-strains of Drosophila Canton-S differ markedly in their locomotor behavior [Version 2]. F1000Res. 2015;3:176. DOI: 10.12688/f1000research.4263.Search in Google Scholar

Received: 2017-11-5
Accepted: 2017-11-30
Published Online: 2018-2-21

©2018 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 30.4.2024 from https://www.degruyter.com/document/doi/10.1515/bams-2017-0026/html
Scroll to top button