Skip to main content

Bitmap Indices for Speeding Up High-Dimensional Data Analysis

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2453))

Abstract

Bitmap indices have gained wide acceptance in data warehouse applications and are an efficient access method for querying large amounts of read-only data. The main trend in bitmap index research focuses on typical business applications based on discrete attribute values. However, scientific data that is mostly characterised by non-discrete attributes cannot be queried efficiently by currently supported access methods.

In our previous work [13] we introduced a novel bitmap algorithm called GenericRangeEval for efficiently querying scientific data. We evaluated our approach based primarily on uniformly distributed and independent data. In this paper we analyse the behaviour of our bitmap index algorithm against various queries based on different data distributions.

We have implemented an improved version of one of the most cited bitmap compression algorithms called Byte Aligned Bitmap Compression and adapted it to our bitmap indices. To prove the efficiency of our access method, we carried out high-dimensional queries against real data taken from two different scientific applications, namely High Energy Physics and Astronomy. The results clearly show that depending on the underlying data distribution and the query access patterns, our proposed bitmap indices can significantly improve the response time of high-dimensional queries when compared to conventional access methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. G. Antoshenkov, Byte-Aligned Bitmap Compression, Technical Report, Oracle Corp., 1994.

    Google Scholar 

  2. S. Amer-Yahia, T. Johnson, Optimizing Queries On Compressed Bitmaps, Proceedings of 26th International Conference on Very Large Data Bases, Cairo, Egypt, Sept. 2000, Morgan Kaufmann.

    Google Scholar 

  3. S. Berchtold, C. Boehm, H.-P. Kriegel, The Pyramid-Tree: Breaking the Curse of D imensionality, SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, June 1998.

    Google Scholar 

  4. C. Chan, Y.E. Ioannidis, Bitmap Index Design and Evaluation, In Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, June 1998.

    Google Scholar 

  5. C. Chan, Y.E. Ioannidis, An Efficient Bitmap Encoding Scheme for Selection Queries, Proceedings ACM SIGMOD International Conference on Management of Data, Philadephia, Pennsylvania, USA, June 1999.

    Google Scholar 

  6. T. Johnson, Performance Measurements of Compressed Bitmap Indices, Proceedings of 25th International Conference on Very Large Data Bases, Edinburgh, Scotland, UK, September 1999, Morgan Kaufmann.

    Google Scholar 

  7. N. Koudas, Space Efficient Bitmap Indexing, International Conference on Information and Knowledge Management, McLean, VA, USA, November 2000.

    Google Scholar 

  8. P. O’Neil, D. Quass, Improved Query Performance with Variant Indexes, Proceedings ACM SIGMOD International Conference on Management of Data, Tucson, Arizona, USA, May 1997.

    Google Scholar 

  9. A. Shoshani, L.M. Bernardo, H. Nordberg, D. Rotem, A. Sim, Multidimensional Indexing and Query Coordination for Tertiary Storage Management, 11th International Conference on Scientific and Statistical Database Management, Cleveland, Ohio, USA, July 1999.

    Google Scholar 

  10. A. Szalay, P. Kunszt, A. Thakar, J. Gray, D. Slutz, Designing and Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey, Proceedings ACM SIGMOD International Conference on Management of Data, Philadephia, Pennsylvania, USA, June 1999.

    Google Scholar 

  11. K. Stockinger, D. Duellmann, W. Hoschek, E. Schikuta. Improving the Performance of High Energy Physics Analysis through Bitmap Indices. International Conference on Database and Expert Systems Applications, London-Greenwich, UK, Sept. 2000. Springer-Verlag.

    Google Scholar 

  12. K. Stockinger, Design and Implementation of Bitmap Indices for Scientific Data, International Database Engineering & Applications Symposium, Grenoble, France, July 2001, IEEE Computer Society Press.

    Google Scholar 

  13. K. Stockinger, Performance Analysis of Generic vs. Sliced Tags in HepODBMS, International Conference on Computing in High Energy and Nuclear Physics, Beijing, China, September, 2001.

    Google Scholar 

  14. K. Stockinger, Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN, Ph.D. Thesis, University of Vienna, Austria, November 2001.

    Google Scholar 

  15. M. Wu, A.P. Buchmann, Encoded Bitmap Indexing for Data Warehouses, Proceedings of the Fourteenth International Conference on Data Engineering, Orlando, Florida, USA, February 1998.

    Google Scholar 

  16. M. Wu, Query Optimization for Selections Using Bitmaps, SIGMOD 1999, Proceedings ACM SIGMOD International Conference on Management of Data, Philadephia, Pennsylvania, USA, June 1999.

    Google Scholar 

  17. K. Wu, E. J. Otoo, A. Shoshani, A Performance Comparison of Bitmap Indexes, International Conference on Information and Knowledge Management, Atlanta, Georgia, USA, November, 2001.

    Google Scholar 

  18. K. Wu, P.S. Yu, Range-Based Bitmap Indexing for High-Cardinality Attributes with Skew, Technical Report, IBM Watson Research Center, May 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Stockinger, K. (2002). Bitmap Indices for Speeding Up High-Dimensional Data Analysis. In: Hameurlain, A., Cicchetti, R., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2002. Lecture Notes in Computer Science, vol 2453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46146-9_87

Download citation

  • DOI: https://doi.org/10.1007/3-540-46146-9_87

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44126-7

  • Online ISBN: 978-3-540-46146-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics