Improving performance by creating a native join-index for OLAP

Zhang, Yansong; Wang, Shan; Lu, Jiaheng

doi:10.1007/s11704-011-9181-3

Improving performance by creating a native join-index for OLAP

Research Article
Published: 16 February 2011

Volume 5, pages 236–249, (2011)
Cite this article

Frontiers of Computer Science in China Aims and scope Submit manuscript

Yansong Zhang¹,
Shan Wang^2,3 &
Jiaheng Lu^2,3

117 Accesses
6 Citations
Explore all metrics

Abstract

The performance of online analytical processing (OLAP) is critical for meeting the increasing requirements of massive volume analytical applications. Typical techniques, such as in-memory processing, column-storage, and join indexes focus on high performance storage media, efficient storage models, and reduced query processing. While they effectively perform OLAP applications, there is a vital limitation: mainmemory database based OLAP (MMOLAP) cannot provide high performance for a large size data set. In this paper, we propose a novel memory dimension table model, in which the primary keys of the dimension table can be directly mapped to dimensional tuple addresses. To achieve higher performance of dimensional tuple access, we optimize our storage model for dimension tables based on OLAP query workload features. We present directly dimensional tuple accessing (DDTA) based join (DDTAJOIN), a technique to optimize query processing on the memory dimension table by direct dimensional tuple access. We also contribute by proposing an optimization of the predicate tree to shorten predicate operation length by pruning useless predicate processing. Our experimental results show that the DDTA-JOIN algorithm is superior to both simulated row-store main memory query processing and the open-source column-store main memory database MonetDB, thanks to the reduced join cost and simple yet efficient query processing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Main-memory foreign key joins on advanced processors: design and re-evaluations for OLAP workloads

Article 23 May 2018

Efficient Key-Value Encoding for MOLAP Query Processing

Efficient Query Processing for Multidimensional Data Cubes

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

O’Neil P, O’Neil B, Chen X. The star schema benchmark (SSB). http://www.cs.umb.edu/~poneil/StarSchemaB.PDF
Johnson R, Raman V, Sidle R, Swart G. Row-wise parallel predicate evaluation. In: Proceedings of VLDB Endowment, 2008, 1(1): 622–634
Google Scholar
Stonebraker M, Abadi D J, Batkin A, Chen X, Cherniack M, Ferreira M, Lau E, Lin A, Madden S, O’Neil E J, O’Neil P E, Rasin A, Tran N, Zdonik S B. C-store: A column-oriented DBMS. In: Proceedings of the 29th International Conference on Very Large Data Bases. 2005, 553–564
MacNicol R, French B. Sybase IQ multiplex-designed for analytics. In: Proceedings of the 30th International Conference on Very Large Data Bases. 2004, 1227–1230
Boncz P A, Manegold S, Kersten M L. Database architecture optimized for the new bottleneck: memory access. In: Proceedings of the 25th International Conference on Very Large Data Bases. 1999, 54–65
Ailamaki D J, DeWitt D J, Hill M D. Data page layouts for relational databases on deep memory hierarchies. VLDB Journal, 2002, 11(3): 198–215
Article MATH Google Scholar
Hankins R A, Patel J M. Data morphing: an adaptive, cacheconscious storage technique. In: Proceedings of the 29th international conference on Very Large Data Bases. 2003, 417–428
Bruno N. Teaching an old elephant new tricks. In: Proceedings of 4th Biennial Conference on Innovative Data Systems Research. 2009
Abadi D J, Myers D S, DeWitt D J, Madden S. Materialization strategies in a column-oriented DBMS. In: Proceedings of the 23rd International Conference on Data Engineering. 2007, 466–475
Zukowski M, Nes N, Boncz P A. DSM vs. NSM: CPU performance tradeoffs in block-oriented query processing. In: Proceedings of the 4th International Workshop on Data Management on New Hardware. 2008, 47–54
Abadi D J, Madden S R, Hachem N. Column-stores vs. row-stores: how different are they really? In: Proceedings of the ACM SIGMOD International Conference on Management of Data. 2008, 967–980

Download references

Author information

Authors and Affiliations

National Survey Research Center at Renmin University of China, Beijing, 100872, China
Yansong Zhang
Key Laboratory of the Ministry of Education for Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, 100872, China
Shan Wang & Jiaheng Lu
School of Information, Renmin University of China, Beijing, 100872, China
Shan Wang & Jiaheng Lu

Authors

Yansong Zhang
View author publications
Search author on:PubMed Google Scholar
Shan Wang
View author publications
Search author on:PubMed Google Scholar
Jiaheng Lu
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Yansong Zhang.

Additional information

Yansong Zhang was born in 1973 in Mudanjiang of Heilongjiang province in China. He is a postdoc researcher in the National Survey Research Center (NSRC) at Renmin University. His research interests include main memory database systems, OLAP, Data warehouse and cloud computing.

Professor Shan Wang was born in 1944. She is a Ph.D. supervisor in Renmin University and she is a senior member of CCF. Her research interests include main memory database systems, OLAP, Data warehouse and video database.

Jiaheng Lu is an associate professor in Renmin University. His research interests are in the fields of database and information systems, including XML query processing, data mining, XML keyword suggestion, approximate string matching, cloud data management.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, Y., Wang, S. & Lu, J. Improving performance by creating a native join-index for OLAP. Front. Comput. Sci. China 5, 236–249 (2011). https://doi.org/10.1007/s11704-011-9181-3

Download citation

Received: 17 December 2009
Accepted: 02 July 2010
Published: 16 February 2011
Issue Date: June 2011
DOI: https://doi.org/10.1007/s11704-011-9181-3

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Improving performance by creating a native join-index for OLAP

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Main-memory foreign key joins on advanced processors: design and re-evaluations for OLAP workloads

Efficient Key-Value Encoding for MOLAP Query Processing

Efficient Query Processing for Multidimensional Data Cubes

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now