skip to main content
10.1145/1031763.1031779acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

An analysis of additivity in OLAP systems

Published:12 November 2004Publication History

ABSTRACT

Accurate summary data is of paramount concern in data warehouse systems; however, there have been few attempts to completely characterize the ability to summarize measures. The sum operator is the typical aggregate operator for summarizing the large amount of data in these systems. We look to uncover and characterize potentially inaccurate summaries resulting from aggregating measures using the sum operator. We discuss the effect of classification hierarchies, and non-, semi-, and fully- additive measures on summary data, and develop a taxonomy of the additive nature of measures. Additionally, averaging and rounding rules can add complexity to seemingly simple aggregations. To deal with these problems, we describe the importance of storing metadata that can be used to restrict potentially inaccurate aggregate queries. These summary constraints could be integrated into data warehouses, just as integrity constraints and are integrated into OLTP systems. We conclude by suggesting methods for identifying and dealing with non- and semi- additive attributes.

References

  1. Adamson, C., Venerable, M. (1998). Data Warehouse Design Solutions, John Wiley and Sons, Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bedell, J. (1998). "Outstanding Challenges in OLAP." Data Engineering. Proceedings of 14th ICDE, 23-27 Feb. 1998. 178--179. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chaudhuri, S., and Dayal, U. (1997). "An Overview of Data Warehousing and OLAP Technology". SIGMOD Record. 65--74. ACM Press, New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Ewen, E. F., Medsker, C. E., and Dusterhoft, L. E. (1998). "Data Warehousing in an Integrated Health System; Building the Business Case". DOLAP '98, Washington, DC. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Golfarelli, M., Maio, D., and Rizzi, S. (1998). "Conceptual Design of Data Warehouses from E/R Schemes". Proceedings of the Thirty-First Hawaii International Conference, 6-9 Jan. 1998, 7, 334--343. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Holowczak, R., Adam, N., Artigas, J., and Bora, I. (2003). "Data Warehousing in Environmental Digital Libraries". Communications Of The ACM, September 2003, 46. 172--178. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Hüsemann, B., Lechtenbörger, J, and Vossen, G. (2000). "Conceptual data warehouse design". Proc. Of International Workshop on Design and Management of Data Warehouses, 2000.Google ScholarGoogle Scholar
  8. Kim, B., Choi, K., Kim, S., and Lee,D. (2003). "A Taxonomy of Dirty Data". Data Mining and Knowledge Discovery". 81--99. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Kimball, R. and Ross, M. (2002). The Data Warehouse Toolkit: Second Edition. John Wiley and Sons, Inc. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lehner, W. (1998). "Modeling Large Scale OLAP Scenarios". In Proceedings of the Sixth International Conference on Extending Database Technology, 153--167. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Martyn, T. (2004). "Reconsidering multi-dimensional schemas". SIGMOD Record. 33 - 1. 83--88 ACM Press New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Pourabbas, E. and Rafanelli, M. (1999). "Characterizations of hierarchies and some operators in OLAP environments". Proceedings of the 2nd ACM international workshop on Data warehousing and OLAP (DOLAP'99). Kansas City, Missouri. 54--59. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Shoshani, A. (1997). "OLAP and statistical databases: Similarities and differences". Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems. Tucson, Arizona. 185--196. ACM Press New York, NY. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Trujillo, J., Palomar, M. Gomez, J., and Song, I. (2001). "Designing Data Warehouses with OO Conceptual Models". IEEE Computer. V34, No 12, 66--75. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Tryfona, N., Busborg, F., and Borch Christiansen, J. (1999). "StarER: A Conceptual Model for Data Warehouse Design". ACM, DOLAP '99 Kansas City, MO. USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. United States Environmental Protection Agency (US EPA). 1997. National Ambient Air Quality Standards for Particulate Matter, Final Rule, US EPA, Part 50 of Title 40 of the Code of Federal Regulations.Google ScholarGoogle Scholar

Index Terms

  1. An analysis of additivity in OLAP systems

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        DOLAP '04: Proceedings of the 7th ACM international workshop on Data warehousing and OLAP
        November 2004
        130 pages
        ISBN:1581139772
        DOI:10.1145/1031763

        Copyright © 2004 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 12 November 2004

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate29of79submissions,37%

        Upcoming Conference

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader