Abstract
Search engines of mainstream literature digital libraries such as the ACM Digital Library, Google Scholar, and PubMed employ file-based systems and provide users with only basic boolean keyword search functionality. As a result, new and powerful querying capabilities are not easy to implement on top of such systems and are not provided. In comparison, query languages of database systems traditionally have high expressive power. This paper evaluates the scalability of deploying relational databases as backend systems for digital libraries, thereby making use of the query languages and query processing capabilities of database query engines for literature digital libraries.
To evaluate our approach, we built a scalable prototype digital library on top of a relational database management system, together with an advanced query interface that allows users to specify dynamic text and path queries in an intuitive, hierarchical manner. We evaluate the scalability of two search query processing approaches, namely ad-hoc queries and pre-compiled queries (stored procedures). We demonstrate that, with reasonably priced hardware, we are able to build an RDBMS-based digital library search engine that can scale to handle millions of queries per day.
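The distinction between the two query processing approaches can be illustrated with a minimal sketch. It is not the prototype's implementation: the `paper` table, its rows, and the function names are hypothetical, and Python's built-in `sqlite3` module stands in for the RDBMS. SQLite has no stored procedures, so a fixed, parameterized statement is used here as the closest analogue of a pre-compiled query.

```python
import sqlite3

# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE paper (id INTEGER PRIMARY KEY, title TEXT, year INTEGER);
    INSERT INTO paper (title, year) VALUES
        ('Scalable query processing for digital libraries', 2005),
        ('Tree-structured query interfaces', 2004),
        ('Keyword search over metadata', 2003);
""")


def search_ad_hoc(keyword, min_year):
    """Ad-hoc style: a new SQL string is assembled for every request.

    The statement text differs from call to call, so the engine has to
    parse and plan it each time. Interpolating literals is also unsafe
    for untrusted input; it is shown only to contrast the two styles.
    """
    escaped = keyword.replace("'", "''")
    sql = (
        "SELECT title FROM paper "
        f"WHERE title LIKE '%{escaped}%' AND year >= {int(min_year)} "
        "ORDER BY id"
    )
    return [row[0] for row in conn.execute(sql)]


# Pre-compiled style (assumed analogue of a stored procedure): the
# statement text is fixed and only the parameter values change.
SEARCH_SQL = (
    "SELECT title FROM paper "
    "WHERE title LIKE '%' || ? || '%' AND year >= ? "
    "ORDER BY id"
)


def search_precompiled(keyword, min_year):
    """Run the fixed statement with bound parameters."""
    return [row[0] for row in conn.execute(SEARCH_SQL, (keyword, min_year))]


if __name__ == "__main__":
    print(search_ad_hoc("query", 2004))
    print(search_precompiled("query", 2004))
```

Both functions return the same rows; the difference lies in how much work the engine can reuse between calls. Because the parameterized statement's text never changes, the `sqlite3` module can serve it from its internal statement cache, which is loosely comparable to the plan reuse that stored procedures offer in a server RDBMS.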
This research is supported by the US National Science Foundation grant ITR-0312200.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chmura, J., Ratprasartporn, N., Ozsoyoglu, G. (2005). Scalability of Databases for Digital Libraries. In: Fox, E.A., Neuhold, E.J., Premsmit, P., Wuwongse, V. (eds) Digital Libraries: Implementing Strategies and Sharing Experiences. ICADL 2005. Lecture Notes in Computer Science, vol 3815. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11599517_53
DOI: https://doi.org/10.1007/11599517_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30850-8
Online ISBN: 978-3-540-32291-7
eBook Packages: Computer Science, Computer Science (R0)