
Scalability of Databases for Digital Libraries

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3815)

Abstract

Search engines of mainstream literature digital libraries such as the ACM Digital Library, Google Scholar, and PubMed employ file-based systems and provide users with only basic Boolean keyword search functionality. As a result, new and powerful querying capabilities are difficult to implement on top of such systems and are not provided. In comparison, query languages of database systems traditionally have high expressive power. This paper evaluates the scalability of deploying relational databases as backend systems for digital libraries, thereby making the query languages and query processing capabilities of database query engines available to literature digital libraries.

To evaluate our approach, we built a scalable prototype digital library on top of a relational database management system (RDBMS), together with an advanced query interface that allows users to specify dynamic text and path queries in an intuitive, hierarchical manner. We evaluate the scalability of two search query processing approaches, namely ad hoc queries and precompiled queries (stored procedures). We demonstrate that, with reasonably priced hardware, we are able to build an RDBMS-based digital library search engine that can scale to handle millions of queries per day.
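The difference between the two query processing approaches can be illustrated with a minimal sketch. It is not the prototype described in the paper: it uses SQLite from Python's standard library as a stand-in engine, and the `papers` table, its columns, and the function names are hypothetical. SQLite has no stored procedures, so a fixed, parameterized statement plays the role of the precompiled query here, on the assumption that the relevant contrast is between new SQL text per request and one reusable statement.

```python
import sqlite3


def build_library():
    """Create a tiny in-memory table standing in for a digital library (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE papers (id INTEGER PRIMARY KEY, title TEXT, year INTEGER)")
    conn.executemany(
        "INSERT INTO papers (title, year) VALUES (?, ?)",
        [
            ("Scalability of Databases for Digital Libraries", 2005),
            ("Sideway Value Algebra for Object-Relational Databases", 2002),
            ("A Tree-Structured Query Interface for Querying Semi-Structured Data", 2004),
        ],
    )
    return conn


def ad_hoc_search(conn, keyword):
    """Ad hoc style: a new SQL string is assembled for every request,
    so the engine has to parse and plan each distinct string."""
    escaped = keyword.replace("'", "''")  # minimal quoting, for illustration only
    sql = f"SELECT title FROM papers WHERE title LIKE '%{escaped}%' ORDER BY id"
    return [row[0] for row in conn.execute(sql)]


# Fixed SQL text; only the bound parameter changes between requests.
PRECOMPILED_SEARCH = (
    "SELECT title FROM papers WHERE title LIKE '%' || ? || '%' ORDER BY id"
)


def precompiled_search(conn, keyword):
    """Precompiled style (approximation): the statement text never changes,
    so the sqlite3 module can reuse its cached compiled statement."""
    return [row[0] for row in conn.execute(PRECOMPILED_SEARCH, (keyword,))]


if __name__ == "__main__":
    conn = build_library()
    print(ad_hoc_search(conn, "Databases"))
    print(precompiled_search(conn, "Databases"))
```

Both functions return the same rows; what differs is how much parsing and planning work the engine can reuse across requests, which is the cost a scalability comparison of this kind would measure.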

This research is supported by the US National Science Foundation grant ITR-0312200.




Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chmura, J., Ratprasartporn, N., Ozsoyoglu, G. (2005). Scalability of Databases for Digital Libraries. In: Fox, E.A., Neuhold, E.J., Premsmit, P., Wuwongse, V. (eds) Digital Libraries: Implementing Strategies and Sharing Experiences. ICADL 2005. Lecture Notes in Computer Science, vol 3815. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11599517_53


  • DOI: https://doi.org/10.1007/11599517_53

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30850-8

  • Online ISBN: 978-3-540-32291-7

  • eBook Packages: Computer Science, Computer Science (R0)
