skip to main content
10.1145/1967486.1967576acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiiwasConference Proceedingsconference-collections
research-article

Distributed construction of data cubes from tuple stream

Published: 08 November 2010 Publication History

Abstract

We propose a distributed construction scheme of MOLAP data cubes in related sites on the network. The server that directly receives tuple data sent from tuple generation source constructs the base cuboid, and sends them to downstream sites in tuple stream. The downstream site receives the tuple data to aggregate and construct its own cuboid whose dimension is lower than the upstream cuboid, then sends the aggregated data to its downstream sites. Using the implementation scheme of multidimensional datasets based on the history-offset tuple encoding method, the tuple stream can be processed efficiently to construct cuboids in real-time on each site, while MOLAP operations can be processed against one of the data cube versions in background. In this paper, we describe our tuple stream processing scheme and distributed data cube construction, then evaluate the required communication cost.

References

[1]
Jim Gray, Surajit Chaudhuri, Adam Bosworth, et al., Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals, Journal of Data Mining and Knowledge Discover, 1, 1997, pp. 29--53.
[2]
Ying Chen, Frank Dehne, Todd Eavis, A. R. Chaplin, Parallel ROLAP Data Cube Construction on Shared-Nothing Multiprocessors, Distributed and parallel Databases, Vol. 15, 2004, pp. 219--236.
[3]
S. Muto, M. Kitsuregawa, A Dynamic Load Balancing Strategy for Parallel Datacube Computation, Proc. of DOLAP, 1999, pp. 67--72.
[4]
K. Y. Lee, K. M. H. Kim, Efficient Incremental Maintenance of Data Cubes, Proc. of VLDB, 2006, pp. 823--833.
[5]
Lok Hang Lee, Man Hon Wong, Aggregate Sum Retrieval in Sensor Network by Distributed Prefix Sum Data Cube, Proceedings of the 19th International Conference on Advanced Information Networking and Applications, vol. 1, pp. 331--336.
[6]
Dan Wu, Chi Hong Cheong, Man Hon Wong Supporting asynchronous update for distributed data cubes, Journal of Network and Computer Applications, 32, 2009, pp. 889--900.
[7]
E. J. Otoo, T. H. Merrett, "A Storage Scheme for Extendible Arrays", Computing, Vol. 31, 1983, pp. 1--9.
[8]
T. Tsuji, G. Mizuno, T. Hochin, K. Higuchi, Deferred Allocation Scheme of Extendible Arrays, Transactions of IEICE, Vol. J86-D-I, No. 5, 2003, pp. 351--356.
[9]
K. M. A. Hasan, M. Kuroda, N. Azuma, T. Tsuji, K. Higuchi, An Extendible Array Based Implementation of Relational Tables for Multi Dimensional Databases, Proc. of DaWaK 2005, 2005, pp. 233--242.
[10]
T. Tsuji, M. Kuroda, K. Higuchi, History offset implementation scheme for large scale multidimensional data sets, Proc. of ACM Symposium on Applied computing, 2008, pp. 1021--1028.
[11]
J. Dong, T. Tsuji, K. Higuchi, An Incremental Maintenance Scheme of Data Cubes and Its Evaluation, IPSJ Online Transaction, Vol. 2, 2009, pp. 36--48.
[12]
Shivnath Babu, Jennifer Widom, Continuous Queries Over Data Streams, ACM SIGMOD Tuple, Volume 30, Issue 3, 2001, pp. 109--120.
[13]
Chunk Cranor, Theodore Johnson, Oliver Spataschek, Gigascope: A Stream Database for Network Applications, Proceedings of the 2003 ACM SIGMOD international conference on Management of data, 2003, pp. 647--651.
[14]
Ron Avnur, Joseph M. Hellerstein, Eddies: Continuously Adaptive Query Processing, ACM SIGMOD Tuples, Volume 29, Issue 2, 2000, pp. 261--272.
[15]
R. Motwani, J. Widom, A. Arasu, B. Babcock, S. Badu, M. Datar, G. Manku, C. Olston, J. Rosenstein, R. Varma, Query Processing, Resource Management, and Approximation in a Data Stream Management System, Proceedings of the First Conference on Innovative Data Systems Research CIDR 2003, 2003, pp. 245--256.
[16]
J. Chen, D. J. DeWitt, F. Tian, Y. Wang, NiagaraCQ: A Scalable Continuous Query System for Internet Databases, Proceedings of the 2000 ACM SIGMOD international conference on Management of data, 2000, pp. 379--390.
[17]
B. Babcock, S. Babu, M. Datar, R. Motwani, and J. Widom, Models and issues in data stream systems, Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, 2002, pp. 1--16.
[18]
Joong Hyuk Chang, Hye-Chung Kum, Frequency-based load shedding over a data stream of tuples, Information Science, Elsevier, 2009, pp. 3733--3744.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
iiWAS '10: Proceedings of the 12th International Conference on Information Integration and Web-based Applications & Services
November 2010
895 pages
ISBN:9781450304214
DOI:10.1145/1967486
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • IIWAS: International Organization for Information Integration
  • Web-b: Web-b

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 November 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MOLAP
  2. data cube
  3. data stream
  4. network
  5. tuple stream

Qualifiers

  • Research-article

Conference

iiWAS '10
Sponsor:
  • IIWAS
  • Web-b

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 105
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media