DEAME - Differential Expression Analysis Made Easy

Kraus, Milena; Hesse, Guenter; Slosarek, Tamara; Danner, Marius; Kesar, Ajay; Bhushan, Akshay; Schapranow, Matthieu-P.

doi:10.1007/978-3-030-14177-6_13

Milena Kraus¹⁸,
Guenter Hesse¹⁸,
Tamara Slosarek¹⁸,
Marius Danner¹⁸,
Ajay Kesar¹⁸,
Akshay Bhushan¹⁸ &
…
Matthieu-P. Schapranow¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11470))

Included in the following conference series:

472 Accesses

Abstract

Differential gene and protein expression analysis reveals clinically significant insights that are crucial, e.g., for systems medicine approaches. However, processing of data still needs expertise of a computational biologist and existing bioinformatics tools are developed to answer only one research question at a time. As a result, current automated analysis pipelines and software platforms are not fully suited to help research-oriented clinicians answering their hypotheses arising during their clinical routine. Thus, we conducted user interviews in order to identify software requirements and evaluate our research prototype of an application that (i) automates the complete preprocessing of RNA sequencing data in a way that enables rapid hypothesis testing, (ii) can be run by a clinician and (iii) helps interpreting the data. In our contribution, we share details of our preprocessing pipeline, software architecture of our first prototype and the identified functionalities needed for rapid and clinically relevant hypothesis testing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Clustergrammer’s Documentation. http://clustergrammer.readthedocs.io/index.html
FASTQC Documentation. http://www.bioinformatics.bbsrc.ac.uk/projects/fastqc
Afgan, E., et al.: The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucl. Acids Res. 44, W537–W544 (2016)
Article Google Scholar
Bolger, A.M., Lohse, M., Usadel, B.: Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014). https://doi.org/10.1093/bioinformatics/btu170
Article Google Scholar
Byron, S.A., et al.: Translating RNA sequencing into clinical diagnostics: opportunities and challenges. Nat. Rev. Genet. 17, 257 (2016)
Article Google Scholar
Conesa, A., et al.: A survey of best practices for RNA-Seq data analysis. Genome Biol. 17(1), 13 (2016)
Article Google Scholar
D’Antonio, M., et al.: RAP: RNA-Seq analysis pipeline, a new cloud-based NGS web application. BMC Genom. 16(6), S3 (2015)
Article Google Scholar
Dobin, A., et al.: STAR: ultrafast universal RNA-Seq aligner. Bioinformatics 29(1), 15–21 (2013)
Article Google Scholar
Gaur, P., Chaturvedi, A.: A survey of bioinformatics-based tools in RNA-Sequencing (RNA-Seq) data analysis. In: Wei, D.Q., Ma, Y., Cho, W., Xu, Q., Zhou, F. (eds.) Translational Bioinformatics and Its Application, pp. 223–248. Springer, Dordrecht (2017). https://doi.org/10.1007/978-94-024-1045-7_10
Chapter Google Scholar
Gietzelt, M., et al.: The use of tools, modelling methods, data types, and endpoints in systems medicine: a survey on projects of the German e: Med-Programme. Stud. Health Technol. Inform. 228, 670–674 (2016)
Google Scholar
Han, H., Jiang, X.: Disease biomarker query from RNA-Seq data. Cancer Inform. 13(Suppl. 1), 81 (2014)
Google Scholar
Kraus, M., Schapranow, M.P.: An in-memory database platform for systems medicine. In: Proceedings of the 9th International Conference on Bioinformatics and Computational Biology. ISCA (2017)
Google Scholar
Kraus, M., et al.: Olelo: a web application for intuitive exploration of biomedical literature. Nucl. Acids Res. 45(W1), W478–W483 (2017)
Article Google Scholar
Li, J., et al.: From gigabyte to kilobyte: a bioinformatics protocol for mining large RNA-Seq transcriptomics data. PloS ONE 10(4), e0125000 (2015)
Article Google Scholar
Liao, Y., Smyth, G.K., Shi, W.: FeatureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30(7), 923–930 (2014)
Article Google Scholar
Love, M., Anders, S., Huber, W.: Differential analysis of count data-the DESeq2 package. Genome Biol. 15, 550 (2014)
Article Google Scholar
Love, M.I., Anders, S., Kim, V., Huber, W.: RNA-Seq workflow: gene-level exploratory analysis and differential expression. F1000Research 4 (2015)
Article Google Scholar
Plattner, H., Schapranow, M.P. (eds.): High-Performance In-Memory Genome Data Analysis: How In-Memory Database Technology Accelerates Personalized Medicine. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-03035-7
Book Google Scholar
Queralt-Rosinach, N., Piñero, J., Bravo, A., Sanz, F., Furlong, L.: DisGeNET-RDF: harnessing the innovative power of the semantic web to explore the genetic basis of diseases. Bioinformatics 32(14), 2236–2238 (2016)
Article Google Scholar
Trapnell, C., Pachter, L., Salzberg, S.L.: TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25(9), 1105–1111 (2009)
Article Google Scholar
Wagle, P., Nikolić, M., Frommolt, P.: QuickNGS elevates next-generation sequencing data analysis to a new level of automation. BMC Genom. 16(1), 487 (2015)
Article Google Scholar
Wolfien, M., et al.: TRAPLINE: a standardized and automated pipeline for RNA sequencing data analysis, evaluation and annotation. BMC Bioinform. 17(1), 21 (2016)
Article Google Scholar

Download references

Acknowledgement

Parts of this work were generously supported by a grant of the German Federal Ministry of Education and Research (031A427B).

Author information

Authors and Affiliations

Hasso Plattner Institute, Prof.-Dr.-Helmert-Str. 2-3, 14482, Potsdam, Germany
Milena Kraus, Guenter Hesse, Tamara Slosarek, Marius Danner, Ajay Kesar, Akshay Bhushan & Matthieu-P. Schapranow

Authors

Milena Kraus
View author publications
You can also search for this author in PubMed Google Scholar
Guenter Hesse
View author publications
You can also search for this author in PubMed Google Scholar
Tamara Slosarek
View author publications
You can also search for this author in PubMed Google Scholar
Marius Danner
View author publications
You can also search for this author in PubMed Google Scholar
Ajay Kesar
View author publications
You can also search for this author in PubMed Google Scholar
Akshay Bhushan
View author publications
You can also search for this author in PubMed Google Scholar
Matthieu-P. Schapranow
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Milena Kraus .

Editor information

Editors and Affiliations

Massachusetts Institute of Technology, Lexington, MA, USA
Vijay Gadepally
Intel Corporation, Hillsboro, OR, USA
Timothy Mattson
Massachusetts Institute of Technology, Cambridge, MA, USA
Michael Stonebraker
Stony Brook University, Stony Brook, NY, USA
Fusheng Wang
University of Washington, Seattle, WA, USA
Gang Luo
University of Brasília, Brasilia, Brazil
George Teodoro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kraus, M. et al. (2019). DEAME - Differential Expression Analysis Made Easy. In: Gadepally, V., Mattson, T., Stonebraker, M., Wang, F., Luo, G., Teodoro, G. (eds) Heterogeneous Data Management, Polystores, and Analytics for Healthcare. DMAH Poly 2018 2018. Lecture Notes in Computer Science(), vol 11470. Springer, Cham. https://doi.org/10.1007/978-3-030-14177-6_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-14177-6_13
Published: 21 February 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14176-9
Online ISBN: 978-3-030-14177-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics