Improving the Processing of DW Star-Queries under Concurrent Query Workloads

Costa, João Pedro; Furtado, Pedro

doi:10.1007/978-3-319-10160-6_22

João Pedro Costa¹⁷ &
Pedro Furtado¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8646))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

1895 Accesses

Abstract

Currently, Data Warehouse (DW) analyses are extensively being used not only for strategic business decisions by a few, but also for feedback to a wider audience and into daily operational decisions. As a result, there’s an increase in the number of aggregation star-queries that are being concurrently submitted. Although such queries require similar processing patterns, they are stressing the database engine ability to deliver timely execution, due to the fact that each query executes independently from the others (query-at-time processing model). Recently, there’s an increasing interest in approaches that cooperate to manage large numbers of concurrent aggregation star-queries. We have proposed SPIN in a previous paper [1]. It is a data processing model that shares data and computation in order to handle large concurrent query loads, and its data organization provides almost constant and predictable execution times for all submitted queries. It has a data reader that reads data in circular loop, placing it in a pipeline, before being processed by branches that combine common processing computations. SPIN is IO dependent, i.e. a query is only be answered after a full circular loop, even though tuples and similar predicates have been evaluated in the past. In this paper we propose data processing approach that uses a set of bitsets, built on-the-fly, to significantly reduce the query processing time, the tuple evaluation cost and the number of predicates and tuples evaluated, without sacrificing its predictability features. The data read from storage is reduced to the minimum needed by the current query load.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Distributed Data Warehouse Resource Monitoring

Cut-and-Rewind: Extending Query Engine for Continuous Stream Analytics

Supporting Real-Time Analytic Queries in Big and Fast Data Environments

References

Costa, J., Furtado, P.: SPIN: Concurrent Workload Scaling over Data Warehouses. In: Proc. of 15th International Conference on Data Warehousing and Knowledge Discovery - DaWaK 2013, Prague, Czech Republic (2013)
Google Scholar
Costa, J.P., Cecílio, J., Martins, P., Furtado, P.: ONE: a predictable and scalable DW model. In: Proceedings of the 13th International Conference on Data Warehousing and Knowledge Discovery, Toulouse, France, pp. 1–13 (2011)
Google Scholar
Costa, J.P., Martins, P., Cecílio, J., Furtado, P.: A Predictable Storage Model for Scalable Parallel DW. In: 15th International Database Engineering and Applications Symposium (IDEAS 2011), Lisbon, Portugal (2011)
Google Scholar
Zukowski, M., Héman, S., Nes, N., Boncz, P.: Cooperative scans: dynamic bandwidth sharing in a DBMS. In: Proceedings of the 33rd International Conference on Very Large Data Bases, Vienna, Austria, pp. 723–734 (2007)
Google Scholar
Harizopoulos, S., Shkapenyuk, V., Ailamaki, A.: QPipe: A Simultaneously Pipelined Relational Query Engine. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 383–394 (2005)
Google Scholar
Candea, G., Polyzotis, N., Vingralek, R.: A scalable, predictable join operator for highly concurrent data warehouses. Proc. VLDB Endow. 2, 277–288 (2009)
Article Google Scholar
Candea, G., Polyzotis, N., Vingralek, R.: Predictable performance and high query concurrency for data analytics. The VLDB Journal 20(2), 227–248 (2011)
Article Google Scholar

Download references

Author information

Authors and Affiliations

ISEC, DEIS, Polytechnic Institute of Coimbra, Portugal
João Pedro Costa
University of Coimbra, Portugal
Pedro Furtado

Authors

João Pedro Costa
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Furtado
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

LIAS/ISAE-ENSMA, Téléport 2, 1 avenue Clément Ader, BP 40109, 86961, Futuroscope Chasseneuil Cedex, France
Ladjel Bellatreche
IBM Research - India, 4, Block-C, Institutional Area, 110070, Vasant Kunj, New Delhi, India
Mukesh K. Mohania

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Costa, J.P., Furtado, P. (2014). Improving the Processing of DW Star-Queries under Concurrent Query Workloads. In: Bellatreche, L., Mohania, M.K. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2014. Lecture Notes in Computer Science, vol 8646. Springer, Cham. https://doi.org/10.1007/978-3-319-10160-6_22

Download citation

DOI: https://doi.org/10.1007/978-3-319-10160-6_22
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10159-0
Online ISBN: 978-3-319-10160-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Improving the Processing of DW Star-Queries under Concurrent Query Workloads

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Distributed Data Warehouse Resource Monitoring

Cut-and-Rewind: Extending Query Engine for Continuous Stream Analytics

Supporting Real-Time Analytic Queries in Big and Fast Data Environments

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Improving the Processing of DW Star-Queries under Concurrent Query Workloads

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Distributed Data Warehouse Resource Monitoring

Cut-and-Rewind: Extending Query Engine for Continuous Stream Analytics

Supporting Real-Time Analytic Queries in Big and Fast Data Environments

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation