Abstract
The analysis of biological data is an important part of modern biological and medical research. Many statistical and computational tools are available for statisticians and data scientists, providing them with both computational power and flexibility. However, these tools are often not suitable for biomedical researchers performing data processing and simple analyses. Learning to code is indeed challenging and quite difficult when coming from a non-technical background.This chapter introduces MetaR, a tool designed to help biologists and clinicians to learn, and perform, the basis of data analysis. Originally created as an educational software, it eventually evolved into a mature and stable Domain Specific Language (DSL) used to support various aspects of a research project up to the creation of figures suitable for scientific publications. MetaR generates to R code, but users do not even see it. By providing high-level abstractions, MetaR hides most of the details of each step in the data analysis process. Because of its capability to blend scripting and graphical elements, it provides a novel approach to the practice of analyzing data. Simplified executions and syntax-completion are additional features that make MetaR easy to use for beginners. Language composition is available for advanced users that wish to contribute and extend the project.In our data-rich age, there is a great divide between biomedical researchers and data analysts. MetaR has proved to be an educational bridge between these two worlds.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Campagne, F., Digan, W.E.R., Simi, M.: MetaR: simple, high-level languages for data analysis with the R ecosystem. (2015). [Online]. https://doi.org/10.1101/030254
Simi, M.: MetaR Case Study. (2020). [Online]. https://resources.jetbrains.com/storage/products/mps/docs/MPS_MetaR_Case_Study.pdf
Erdweg, S., van der Storm, T., Völter, M., et al.: The state of the art in language workbenches. In: Lecture Notes in Computer Science. (2013).
Dmitriev, S.: LOP – Language Oriented Programming: the next programming paradigm. (2004). [Online]. https://resources.jetbrains.com/storage/products/mps/docs/Language_Oriented_Programming.pdf
Di Gesù, V.: Data analysis and bioinformatics. In: Pattern Recognition and Machine Intelligence. PReMI 2007. Lecture Notes in Computer Science, vol. 4815. (2007).
Chu, Y., Corey, R.D.: RNA sequencing: platform selection, experimental design, and data interpretation. Nucleic Acid Therap. 22(4), 271–274 (2012)
Ihaka, R., Gentleman, R.: R: a language for data analysis and graphics. J Comput Graphical Stat. 5(3), 299–314 (1996)
Fowler, M.: Domain-specific languages. Addison-Wesley (2010)
Gentleman, R., Carey, V., Huber, W., Irizarry, R., Dudoit, S.: Bioinformatics and computational biology solutions using R and Bioconductor. Springer, Berlin (2005)
Bentley, J.: Programming pearls: little languages. Commun ACM. 29(8), 711–721 (1986)
Stroustrup, B.: Programming: principles and practice using C++, 2nd edn. Addison-Wesley Professional (2014)
Aho, V.A., Lam, S.M., Sethi, R., Ullman, D.J.: Compilers: principles, techniques, and tools, 2nd edn. Addison-Wesley Longman, Boston, MA (2006)
Simi, M., Campagne, F.: Composable languages for bioinformatics: the NYoSh experiment. PeerJ. 2(562), e241 (2014)
Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design patterns: elements of reusable object-oriented software, pp. 139–142. Addison Wesley (1994)
Conway, R.J., Lex, A., Gehlenborg, N.: UpSetR: an R package for the visualization of intersecting sets and their properties. Bioinformatics. 33(18), 2938–2940 (2017)
Ritchie, E.M., Phipson, B., Wu, W.D., Hu, Y., Law, W.C., Shi, W., Smyth, K.G.: Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43(7), e47 (2015)
Merkel, D.: Docker: lightweight linux containers for consistent development and deployment. Linux J. 2014(239), 2 (2014)
Acknowledgments
Work reported in this chapter was supported by the National Center for Advancing Translational Science of the National Institute of Health under awards number UL1RR024996 and UL1TR002384. Additional support was provided by the National Institute of Health NIAID award number 5R01AI107762-02 to Fabien Campagne.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Simi, M. (2021). Learning Data Analysis with MetaR. In: Bucchiarone, A., Cicchetti, A., Ciccozzi, F., Pierantonio, A. (eds) Domain-Specific Languages in Practice. Springer, Cham. https://doi.org/10.1007/978-3-030-73758-0_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-73758-0_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73757-3
Online ISBN: 978-3-030-73758-0
eBook Packages: Computer ScienceComputer Science (R0)