Abstract
Given an OLAP query expressed over multiple source OLAP databases, we study the problem of evaluating the result OLAP target database. The problem arises when it is not possible to derive the result from a single database. The method we use is the linear indirect estimator, commonly used for statistical estimation. We examine two obvious computational methods for computing such a target database, called the “Full-cross-product” (F) and the “Pre-aggregation” (P) methods. We study the accuracy and computational complexity of these methods. While the method F provides a more accurate estimate, it is more expensive computationally than P. Our contribution is in proposing a third new method, called the “Partial-Pre-aggregation” method (PP), which is significantly less expensive than F, but is just as accurate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chan, P., Shoshani, A.: SUBJECT: A Directory Driven System for Organizing and Accessing Large Statistical Databases. In: Conference on Very Large Data Bases, pp. 553–563 (1981)
Gray, J., Bosworth, A., Layman, A., Pirahesh, H.: Data cube: a Relational Aggregation Operator Generalizing Group-by, Cross-tabs and Subtotals. In: 12th IEEE Int. Conf. on Data Engineering, pp. 152–159 (1996)
Codd, E.F., Codd, S.B., Salley, C.T.: Providing OLAP (On-Line Analytical Processing) to User-Analysts: An IT mandate. Technical report (1993)
Ghosh, M., Rao, J.N.K.: Small Area Estimation: An Appraisal. Journal of Statistical Science 9, 55–93 (1994)
Pfeffermann, D.: Samll Area Estimation - New Developments and Directions. International Statistical Review 70 (2002)
Pourabbas, E., Shoshani, A.: Joint Queries Estimation from Aggregate OLAP Databases. LBNL Technical Report, LBNL-48750 (2001)
Shoshani, A.: OLAP and Statistical Databases: Similarities and Differences. In: 16th ACM Symposium on Principles of Database Systems, pp. 185–196 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pourabbas, E., Shoshani, A. (2003). Answering Joint Queries from Multiple Aggregate OLAP Databases. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2003. Lecture Notes in Computer Science, vol 2737. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45228-7_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-45228-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40807-9
Online ISBN: 978-3-540-45228-7
eBook Packages: Springer Book Archive