Partitioning Algorithms for the Computation of Average Iceberg Queries

Bae, Jinuk; Lee, Sukho

doi:10.1007/3-540-44466-1_27

Jinuk Bae⁷ &
Sukho Lee⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1874))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

Abstract

Iceberg queries are to compute aggregate functions over an attribute (or set of attributes) to find aggregate values above some specified threshold. It’s difficult to execute these queries because the number of unique data is greater than the number of counter buckets in memory. However, previous research has the limitation that average functions were out of consideration among aggregate functions. So, in order to compute average iceberg queries efficiently we introduce the theorem to select candidates by means of partitioning, and propose POP algorithm based on it. The characteristics of this algorithm are to partition a relation logically and to postpone partitioning to use memory efficiently until all buckets are occupied with candidates. Experiments show that proposed algorithm is affected by memory size, data order, and the distribution of data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M. Fang, N. Shivakumar, H. Garcia-Molina, R. Motwani, and J. D. Ullman, ”Computing Iceberg Queries Efficiently”, In Proc. of the 24th VLDB Conf., pages 299–310, 1998
Google Scholar
R. T. Ng, L. V. S. Lakshmanan, J. Han, and A. Fang, ” Exploratory Mining and Pruning Optimizations of Constrained Associations Rules”, In Proc. of the ACM SIGMOD Conf. on Management of Data, pages 13–24, 1998
Google Scholar
K. Beyer and R. Ramakrishnan, ” Bottom-Up Computation of Sparse and Iceberg CUBEs”, In Proc. of the ACM SIGMOD Conf., pages 359–370, 1999
Google Scholar
A. Savasere, E. Omiecinski, and S. Navathe, ”An Efficient Algorithm for Mining Association Rules in Large Databases”, In Proc. of the 21st VLDB Conf., pages 432–444, 1995
Google Scholar
S. Christodoulakis, ” Multimedia Data Base Management: Applications and A Position Paper”, In Proc. of the ACM SIGMOD Conf, pages 304–305, 1985
Google Scholar
A. Ghafoor, ”Multimedia Database Management System”, Computing Surveys, Vol. 27, No. 4, pages 593–598, 1985
Article Google Scholar
K. Whang, B. T. Vander-Zanden, and H. M. Taylor, ”A Linear-time Probabilistic Counting Algorithms for DB Applications”, ACM Transactions on Database Systems, 15(2):208–229, 1990
Article Google Scholar
J. S. Park, M. S. Chen, and P. S. Yu, ”An Effiective Hash Based Algorithm For Mining Association Rules”, In Proc. of ACM SIGMOD Conf., pages 175–186, 1995
Google Scholar
R. Agrawal and R. Srikant, ”Fast Algorithms for Mining Association Rules”, In Proc. of the 20th VLDB Conf., pages 487–499, 1994
Google Scholar
R. T. Ng and J. Han, ”Efficient and Effiective Clustering Method for Spatial Data Mining”, In Proc. of the 20th VLDB Conf., pages 144–155, 1994
Google Scholar
T. Zhang, R. Ramakrishnan, and M. Livny, ” BIRCH: An Efficient Data Clustering Method for Very Large Databases”, In Proc. of ACM SIGMOD Conf., pages 103–114, 1996
Google Scholar
A. Bouju, A. Stockus, F. Bertrand, P. Boursier, ”Client-Server Architecture for Accessing Multimedia and Geographic Databases within Embedded Systems”, DEXA Workshop 1999
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Engineering, Seoul National University, Korea
Jinuk Bae & Sukho Lee

Authors

Jinuk Bae
View author publications
You can also search for this author in PubMed Google Scholar
Sukho Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, Kyoto, 606-8501, Japan
Yahiko Kambayashi
Computer Science Department, Western Michigan University, Kalamazoo, MI, 49008, USA
Mukesh Mohania
Vienna University of Technology, IFS, Favoritenstr. 9-11/188, 1040, Vienna, Austria
A. Min Tjoa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bae, J., Lee, S. (2000). Partitioning Algorithms for the Computation of Average Iceberg Queries. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2000. Lecture Notes in Computer Science, vol 1874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44466-1_27

Download citation

DOI: https://doi.org/10.1007/3-540-44466-1_27
Published: 06 July 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67980-6
Online ISBN: 978-3-540-44466-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics