skip to main content
10.1145/3575882.3575958acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesic3inaConference Proceedingsconference-collections
research-article

Implementation of Workflow Engine on BRIN HPC Infrastructure

Published:27 February 2023Publication History

ABSTRACT

National Research and Innovation Agency (BRIN)- Indonesia, hosts high performance computing (HPC) facilities to support research and innovation that need high computation resources. One example of a research area is bioinformatics. As sequencing technology advances, any lab with next generation sequencing (NGS) access can generate a huge amount of data in a very short time. However, the difficulties then have shifted to the data analysis step that follows. It usually requires significant computation resources, many specific tools that need to be chained together, and man resources that are familiar with command line syntax. In addition, the chaining of multiple tools into a comprehensive workflow is also difficult since one needs to understand both the computer system administration and biological information related to the problems they try to answer. These hinder the biologist to take advantage of sequencing technology for their research. In this technical report, we described our approaches to integrate Galaxy and BRIN HPC, to ease users to deploy their analysis workflow on BRIN HPC facility.

Skip Supplemental Material Section

Supplemental Material

Implementation of Workflow Engine on BRIN HPC Infrastructure.mp4

mp4

23.7 MB

References

  1. Enis Afgan, Dannon Baker, Marius Van den Beek, Daniel Blankenberg, Dave Bouvier, Martin Čech, John Chilton, Dave Clements, Nate Coraor, Carl Eberhard, 2016. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic acids research 44, W1 (2016), W3–W10.Google ScholarGoogle Scholar
  2. El Mustapha Bahassi and Peter J Stambrook. 2014. Next-generation sequencing technologies: breaking the sound barrier of human genetics. Mutagenesis 29, 5 (2014), 303–310.Google ScholarGoogle ScholarCross RefCross Ref
  3. Daniel Blankenberg, Gregory Von Kuster, Nathaniel Coraor, Guruprasad Ananda, Ross Lazarus, Mary Mangan, Anton Nekrutenko, and James Taylor. 2010. Galaxy: a web-based genome analysis tool for experimentalists. Current protocols in molecular biology 89, 1 (2010), 19–10.Google ScholarGoogle Scholar
  4. Peter JA Cock, Björn A Grüning, Konrad Paszkiewicz, and Leighton Pritchard. 2013. Galaxy tools and workflows for sequence analysis with applications in molecular plant pathology. PeerJ 1(2013), e167.Google ScholarGoogle ScholarCross RefCross Ref
  5. Jeremy Goecks, Anton Nekrutenko, and James Taylor. 2010. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome biology 11, 8 (2010), 1–13.Google ScholarGoogle Scholar
  6. Björn Grüning, John Chilton, Johannes Köster, Ryan Dale, Nicola Soranzo, Marius van den Beek, Jeremy Goecks, Rolf Backofen, Anton Nekrutenko, and James Taylor. 2018. Practical computational reproducibility in the life sciences. Cell systems 6, 6 (2018), 631–635.Google ScholarGoogle Scholar
  7. Tobias Gysi, Carlos Osuna, Oliver Fuhrer, Mauro Bianco, and Thomas C Schulthess. 2015. STELLA: A domain-specific tool for structured grid methods in weather and climate models. In Proceedings of the international conference for high performance computing, networking, storage and analysis. 1–12.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Pratik D Jagtap, James E Johnson, Getiria Onsongo, Fredrik W Sadler, Kevin Murray, Yuanbo Wang, Gloria M Shenykman, Sricharan Bandhakavi, Lloyd M Smith, and Timothy J Griffin. 2014. Flexible and accessible workflows for improved proteogenomic analysis using the Galaxy framework. Journal of proteome research 13, 12 (2014), 5898–5908.Google ScholarGoogle ScholarCross RefCross Ref
  9. Sean Kross and Philip J Guo. 2019. Practitioners teaching data science in industry and academia: Expectations, workflows, and challenges. In Proceedings of the 2019 CHI conference on human factors in computing systems. 1–14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Arnida L Latifah, R Kartika Lestari, Inna Syafarina, and Kei Yoshimura. 2020. Sensitivity Experiments of Rainfall to Warm Cloud Auto-Conversion Threshold and Relative Humidity Threshold of Cloudiness in RegCM4. 6 over the Maritime Continent. Atmosphere-Ocean 58, 1 (2020), 1–12.Google ScholarGoogle ScholarCross RefCross Ref
  11. Phong VV Le, Praveen Kumar, Albert J Valocchi, and Hoang-Vu Dang. 2015. GPU-based high-performance computing for integrated surface–sub-surface flow modeling. Environmental Modelling & Software 73 (2015), 1–13.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Richard M Leggett, Ricardo H Ramirez-Gonzalez, Bernardo J Clavijo, Darren Waite, and Robert P Davey. 2013. Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics. Frontiers in genetics 4(2013), 288.Google ScholarGoogle Scholar
  13. Wolfgang Maier, Simon Bray, Marius van den Beek, Dave Bouvier, Nathan Coraor, Milad Miladi, Babita Singh, Jordi Rambla De Argila, Dannon Baker, Nathan Roach, 2021. Ready-to-use public infrastructure for global SARS-CoV-2 monitoring. Nature biotechnology 39, 10 (2021), 1178–1179.Google ScholarGoogle Scholar
  14. Maria Luiza Mondelli, Thiago Magalhães, Guilherme Loss, Michael Wilde, Ian Foster, Marta Mattoso, Daniel Katz, Helio Barbosa, Ana Tereza R de Vasconcelos, Kary Ocaña, 2018. BioWorkbench: a high-performance framework for managing and analyzing bioinformatics experiments. PeerJ 6(2018), e5551.Google ScholarGoogle ScholarCross RefCross Ref
  15. Furqon Hensan Muttaqien, Laksmita Rahadianti, and Arnida L Latifah. 2021. Downscaling for Climate Data in Indonesia Using Image-to-Image Translation Approach. In 2021 International Conference on Advanced Computer Science and Information Systems (ICACSIS). IEEE, 1–8.Google ScholarGoogle Scholar
  16. Alexis K Navarre-Sitchler, Reed M Maxwell, Erica R Siirila, Glenn E Hammond, and Peter C Lichtner. 2013. Elucidating geochemical response of shallow heterogeneous aquifers to CO2 leakage using high-performance computing: Implications for monitoring of CO2 sequestration. Advances in Water Resources 53 (2013), 45–55.Google ScholarGoogle ScholarCross RefCross Ref
  17. Osamu Ogasawara, Yuichi Kodama, Jun Mashima, Takehide Kosuge, and Takatomo Fujisawa. 2020. DDBJ Database updates and computational infrastructure enhancement. Nucleic acids research 48, D1 (2020), D45–D50.Google ScholarGoogle Scholar
  18. Eric E Schadt, Michael D Linderman, Jon Sorenson, Lawrence Lee, and Garry P Nolan. 2010. Computational solutions to large-scale data management and analysis. Nature reviews genetics 11, 9 (2010), 647–657.Google ScholarGoogle Scholar
  19. Bertil Schmidt and Andreas Hildebrandt. 2017. Next-generation sequencing: big data meets high performance computing. Drug discovery today 22, 4 (2017), 712–717.Google ScholarGoogle Scholar
  20. Thamarai Selvi Somasundaram and Kannan Govindarajan. 2014. CLOUDRB: A framework for scheduling and managing High-Performance Computing (HPC) applications in science cloud. Future Generation Computer Systems 34 (2014), 47–65.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Implementation of Workflow Engine on BRIN HPC Infrastructure

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      IC3INA '22: Proceedings of the 2022 International Conference on Computer, Control, Informatics and Its Applications
      November 2022
      415 pages
      ISBN:9781450397902
      DOI:10.1145/3575882

      Copyright © 2022 ACM

      © 2022 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 27 February 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)19
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format