Skip to main content

A High Performance System for Processing Queries on Distributed Geospatial Data Sets

  • Conference paper
High Performance Computing for Computational Science - VECPAR 2004 (VECPAR 2004)

Abstract

The size of many geospatial databases has grown exponentially in recent years. This increase in size brings with it an increased requirement for additional CPU and I/O resources to handle the querying and retrieval of this data. A number of proprietary systems could be ideally suited for such tasks, but are impractical in many situations because of their high cost. On the other hand, Beowulf clusters have gained popularity for providing such resources in a cost-effective manner. In this paper, we present a system that uses the compute nodes of a Beowulf cluster to store fragments of a large geospatial database and allows for the seamless viewing, querying, and retrieval of desired geospatial data in a parallel fashion i.e. utilizing the compute and I/O resources of multiple nodes in the cluster. Experimental results are provided to quantify the performance of the system and ascertain its feasibility versus traditional GIS architectures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Shekhar, S., Chawla, S.: Spatial Databases: A Tour. Prentice Hall, Englewood Cliffs (2002)

    Google Scholar 

  2. Garcia-Molina, H., Ullman, J.D., Widom, J.: Database System Implementation. Prentice Hall, Englewood Cliffs (2000)

    Google Scholar 

  3. Abdelguerfi, M., Wong, K.-F. (eds.): Parallel Database Techniques. IEEE Computer Society, Los Alamitos (1998)

    Google Scholar 

  4. Abdelguerfi, M., Lavington, S. (eds.): Emerging Trends in Database and Knowledge-Base Machines. IEEE Computer Society, Los Alamitos (1995)

    Google Scholar 

  5. Ridge, D., Becker, D., Merkey, P., Sterling, T.: Beowulf: Harnessing the Power of Parallelism in a Pile-of-PCs. In: Proceedings IEEE Aerospace Goddard Space Flight Center, CACR Caltech (1997)

    Google Scholar 

  6. Kurtzer, G.: The Warewulf Clustering System, http://www.runlevelzero.net/greg/warewulf/stuff/lbnl-pres/

  7. Dongara, J.J., Luszczek, P., Petitet, A.: The LINPACK Benchmark: Past, Present, and Future (2001)

    Google Scholar 

  8. Worsley, J., Drake, J., Brookins, A., Holloway, M. (eds.): Practical PostgreSQL. Command Prompt Inc., (2002)

    Google Scholar 

  9. PostGIS Manual. Refractions Research Inc., (2003), http://postgis.refractions.net/docs/

  10. OpenGIS Simple Features Specification for SQL. OpenGIS Consortium Inc., (1999)

    Google Scholar 

  11. Bureau of Transportation Statistics (BTS), “2002 National Transportation Atlas Data Hydrographic Features” BTS (2002), http://www.bts.gov/gis/download_sites/ntad02/metadata/hydrolin.htm

  12. Geospatial Database Queries in a Parallel Environment (August 2003)

    Google Scholar 

  13. http://www.cs.uno.edu/Facilities/vcrl/RapportV3.doc

  14. Beckmann, N., Kreigel, R., Seeger, B.: The R*-tree: An Efficient and Robust Access method for Points and rectangles. In: Proceedings of the ACM SIGMOD International Conference on management of Data, pp. 322–331 (1990)

    Google Scholar 

  15. Moon, B., Jagadish, H., Faloutsos, C., Saltz, J.H.: Analysis of the Clustering Properties of Hilbert Space-filling Curve, Technical Report UMIACS-TR-96-20, CS Department

    Google Scholar 

  16. Patel, J., DeWitt, D., Join, C., Join, S.: Two Parallel Spatial Join Algorithms. In: ACM GIS Conference, Washington D.C (2000)

    Google Scholar 

  17. DeWitt, D., Gray, J.: Parallel Database Systems: The future of Database. In: Processing or a passing Fad? Communications of the ACM (June 1992)

    Google Scholar 

  18. Patel, J., et al.: Building A Scalable GeoSpatial Database System: Technology, Implementation, and Evaluation. In: SIGMOD 1997 (1997)

    Google Scholar 

  19. Wilson, R., Cobb, M., McCreedy, F., Ladner, R., Olivier, D., Lovitt, T., Shaw, K., Petry, F., Abdelguerfi, M.: Geographical Data Interchange Using XML-Enabled Technology within the GIDBTM System. In: Chaudhri, A.B. (ed.) XML Data Management. John Wiley & Sons, Chichester (2003)

    Google Scholar 

  20. Digital Mapping, Charting and Geodesy Analysis Program (DMAP) Team, http://dmap.nrlssc.navy.mil

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abdelguerfi, M., Mahadevan, V., Challier, N., Flanagin, M., Shaw, K., Ratcliff, J. (2005). A High Performance System for Processing Queries on Distributed Geospatial Data Sets. In: Daydé, M., Dongarra, J., Hernández, V., Palma, J.M.L.M. (eds) High Performance Computing for Computational Science - VECPAR 2004. VECPAR 2004. Lecture Notes in Computer Science, vol 3402. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11403937_10

Download citation

  • DOI: https://doi.org/10.1007/11403937_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25424-9

  • Online ISBN: 978-3-540-31854-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics