Abstract
Stored procedures are an important feature of all major database systems that allow application logic to be executed within the database server. This paper reports on our experience implementing a popular scientific algorithm, the Basic Local Alignment Search Tool (BLAST), as stored procedures within a relational database. We implemented the ungapped nucleotide version of the BLAST algorithm on four different relational database engines, both commercial and open source. In an experimental evaluation, we compared our dbBLAST implementations with a standard file-based BLAST implementation from NCBI with regard to implementation effort, runtime performance, and scalability. The results show that although dbBLAST runs faster than the file-based BLAST program for short query sequences, all implementations lack scalability. However, the results also indicate that stored procedures require significantly less development effort, in both time and space, than traditional programming approaches.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Röhm, U., Diep, TM. (2006). How to BLAST Your Database — A Study of Stored Procedures for BLAST Searches. In: Li Lee, M., Tan, KL., Wuwongse, V. (eds) Database Systems for Advanced Applications. DASFAA 2006. Lecture Notes in Computer Science, vol 3882. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11733836_58
DOI: https://doi.org/10.1007/11733836_58
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33337-1
Online ISBN: 978-3-540-33338-8
eBook Packages: Computer Science, Computer Science (R0)