skip to main content
10.1145/1341811.1341853acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesmardi-grasConference Proceedingsconference-collections
research-article

Experiences with developing and deploying dynamic BLAST

Published: 29 January 2008 Publication History

Abstract

Basic Local Alignment Search Tool (BLAST) is a heavily used bioinformatics application that has gotten significant attention from the high performance computing community. The authors have taken BLAST execution a step further and enabled it to execute on grid resources. Adapting BLAST to execute on the grid brings up concerns regarding grid resource heterogeneity, which inevitably cause difficulty with application availability, fault tolerance, interoperability, and variability in performance of individual segments that are being distributed throughout grid resources. In addition difficulties arise because of tools, technologies, and middleware dependencies that an application developer must deal with. This paper describes Dynamic BLAST and experiences with developing and deploying the application on grid resources over a two year period. Dynamic BLAST is a BLAST-specific metascheduler, a multithreaded, master-worker type application that handles all aspects of a BLAST job submission on the grid for the user. It was developed with the goal of bringing the grid closer to a typical scientist by eliminating the initial learning curve necessary for use of many grid applications. Associated research and development have resulted in the authors' extensive experience in dealing with grid related issues with respect to available tools and technologies. Lessons learned and suggestions are also presented in this paper.

References

[1]
The Grid: Blueprint for a New Computing Infrastructure: Morgan Kaufmann Publishers, 1998.
[2]
F. Berman, G. Fox, and T. Hey, "The Grid: Past, Present, and Future," in Grid Computing - Making the Global Infrastructure a Reality, F. Berman, G. Fox, and T. Hey, Eds. Hoboken, NJ: John Wiley & Sons Inc., 2003, pp. 9--51.
[3]
I. Foster, C. Kesselman, and S. Tuecke, "The Anatomy of the Grid," Lecture Notes in Computer Science, vol. 2150, pp. 1--28, 2001.
[4]
E. Afgan, P. Bangalore, and J. Gray, "A Domain-Specific Language for Describing Grid Applications," in Designing Software-Intensive Systems: Methods and Principles, P. F. Tiako, Ed., 2007.
[5]
M. Halappanavar, J.-P. Robinson, E. Afgan, M. F. Yafchak, and P. Bangalore, "A Common Application Platform for the SURAgrid (CAP)," Report in preparation.
[6]
S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, "Basic local alignment search tool," Mol Biol, vol. 215, pp. 403--410, 1990.
[7]
B. Bergeron, Bioinformatics Computing. Upper Saddle River, New Jersey: Prentice Hall PTR, 2002.
[8]
A. E. Darling, L. Carey, and W.-C. Feng, "The Design, Implementation, and Evaluation of mpiBLAST," San Jose, CA, 2003.
[9]
R. D. Bjomson, A. H. Sherman, S. B. Weston, N. Willard, and J. Wing, "TurboBLAST: A Parallel Implementation of BLAST Built on the TurboHub," Ft. Lauderdale, FL, 2002.
[10]
A. Krishnan, "GridBLAST: A Globus-based high-throughput implementation of BLAST in a Grid computing framework," Concurrency And Computation: Practice And Experience, vol. 17, pp. 1607--1623, 2005.
[11]
D. Sulakhe, A. Rodriguez, M. D'Souza, M. Wilde, V. Nefedova, I. Foster, and N. Maltsev, "GNARE: An Environment for Grid-Based High-Throughput Genome Analysis," Grid 2005, Cardiff, UK, 2005.
[12]
I. Foster and C. Kesselman, "The Globus toolkit," in The Grid: Blueprint for a New Computing Infrastructure, I. Foster and C. Kesselman, Eds. San Francisco, California: Morgan Kaufmann, 1999, pp. 259--278.
[13]
H. Rajic, R. Brobst, W. Chan, F. Ferstl, J. Gardiner, A. Haas, B. Nitzberg, and J. Tollefsrud, "Distributed Resource Management Application API (DRMAA) Specification 1.0 FD-R-P.022," Global Grid Forum (GGF) 2004.
[14]
E. Huedo, R. S. Montero, and I. M. Llorente, "A Framework for Adaptive Execution on Grids," Journal of Software - Practice and Experience, vol. 34, pp. 631--651, 2004.
[15]
E. Afgan and P. Bangalore, "Performance Characterization of BLAST for the Grid," BIBE 2007, Boston, MA, 2007.
[16]
C. Wang and E. J. Lefkowitz, "SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters," BMC Bioinformatics, vol. 5, 2004.
[17]
C. Dwan, "Bioinformatics Benchmarks on the Dual Core Intel Xeon Processor," The BioTeam, Inc., Cambridge, MA 2006.
[18]
SURAgrid, "SURAgrid," Available from http://www.sura.org/programs/sura_grid.html. Last accessed December 12, 2007.
[19]
NCBI, "BLAST Frequently Asked Questions," Available from www.ncbi.nlm.nih.gov/blast/blast_FAQs.shtml, Last accessed December 12, 2007.
[20]
E. Afgan and P. Bangalore, "GridAtlas: A Grid Service for Storing Application Parameters on Grid Resources," Report in preparation.

Cited By

View all

Index Terms

  1. Experiences with developing and deploying dynamic BLAST

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        MG '08: Proceedings of the 15th ACM Mardi Gras conference: From lightweight mash-ups to lambda grids: Understanding the spectrum of distributed computing requirements, applications, tools, infrastructures, interoperability, and the incremental adoption of key capabilities
        January 2008
        178 pages
        ISBN:9781595938350
        DOI:10.1145/1341811
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Sponsors

        • National e-Science Institute (Edinburgh, UK)
        • Louisiana State University (USA)

        In-Cooperation

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 29 January 2008

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. BLAST
        2. best practices
        3. grid computing
        4. grid-enabling
        5. load-balancing
        6. scheduling

        Qualifiers

        • Research-article

        Funding Sources

        Conference

        Mardi Gras'08
        Sponsor:
        Mardi Gras'08: 15th Mardi Gras Conference on Distributed Applications
        January 29 - February 3, 2008
        Louisiana, Baton Rouge, USA

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)1
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 19 Feb 2025

        Other Metrics

        Citations

        Cited By

        View all

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Figures

        Tables

        Media

        Share

        Share

        Share this Publication link

        Share on social media