Optimizing Scalar User-Defined Functions in In-Memory Column-Store Database Systems

Ryu, Cheol; Lee, Sunho; Kim, Kihong; Park, Kunsoo; Kwon, Yongsik; Cha, Sang Kyun; Song, Changbin; Ziegler, Emanuel; Muench, Stephan

doi:10.1007/978-3-319-55699-4_35

Optimizing Scalar User-Defined Functions in In-Memory Column-Store Database Systems

Cheol Ryu²⁰,
Sunho Lee²⁰,
Kihong Kim¹⁸,
Kunsoo Park²⁰,
Yongsik Kwon¹⁸,
Sang Kyun Cha^18,20,
Changbin Song¹⁸,
Emanuel Ziegler¹⁹ &
…
Stephan Muench¹⁹

Conference paper
First Online: 22 March 2017

2591 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10178))

Abstract

User-defined functions such as currency conversion and factory calendar are important ingredients in many business applications. Since currency conversion and factory calendar are expensive user-defined functions, optimizing these functions is essential to high performance business applications. We optimize scalar user-defined functions by caching function call results. In this paper we investigate which method for function result caching is best in the context of in-memory column-store database systems. Experiments show that our method, which implements a function result cache as an array, combined with SAP HANA in-memory column store provides the high performance required by real-time global business applications.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Abadi, D.J., Madden, S.R., Hachem, N.: Column-stores vs. row-stores: how different are they really? In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 967–980 (2008)
Google Scholar
Balkesen, Ç., Teubner, J., Alonso, G., Özsu, M.T.: Main-memory hash joins on modern processor architectures. IEEE Trans. Knowl. Data Eng. 27(7), 1754–1766 (2015)
Article Google Scholar
Binnig, C., May, N., Mindnich, T.: SQLScript: efficiently analyzing big enterprise data in SAP HANA. In: Database Systems for Business, Technology, and Web, pp. 363–382 (2013)
Google Scholar
Books online for SQL server 2016. https://msdn.microsoft.com/en-us/library/ms191007.aspx
Chaudhuri, S., Shim, K.: Optimization of queries with user-defined predicates. ACM Trans. Database Syst. 24(2), 177–228 (1999)
Article Google Scholar
Cormen, T.H., Leiserson, C.E., Rivest, R.L., Stein, C.: Introduction to Algorithms, 3rd edn. The MIT Press, Cambridge (2009)
Google Scholar
Courtois, P.J., Heymans, F., Parnas, D.L.: Concurrent control with “readers” and “writers”. Commun. ACM 14(10), 667–668 (1971)
Article Google Scholar
Färber, F., Cha, S.K., Primsch, J., Bornhövd, C., Sigg, S., Lehner, W.: SAP HANA database: data management for modern business applications. SIGMOD Rec. 40(4), 45–51 (2012)
Article Google Scholar
Färber, F., May, N., Lehner, W., Große, P., Müller, I., Rauhe, H., Dees, J.: The SAP HANA database-an architecture overview. IEEE Data Eng. Bull. 35(1), 423–434 (2012)
Google Scholar
Friedman, E., Pawlowski, P., Cieslewicz, J.: SQL/Mapreduce: a practical approach to self-describing, polymorphic, and parallelizable user-defined functions. Proc. VLDB Endow. 2(2), 1402–1413 (2009)
Article Google Scholar
Gan, Q., Suel, T.: Improved techniques for result caching in web search engines. In: Proceedings of the 18th International Conference on WWW, pp. 431–440 (2009)
Google Scholar
Garrod, C., Manjhi, A., Ailamaki, A., Maggs, B., Mowry, T., Olston, C., Tomasic, A.: Scalable query result caching for web applications. Proc. VLDB Endow. 1(1), 550–561 (2008)
Article Google Scholar
Google sparsehash. http://goog-sparsehash.sourceforge.net/
Hash table benchmarks. http://incise.org/hash-table-benchmarks.html
Hellerstein, J.M., Naughton, J.F.: Query execution techniques for caching expensive methods. SIGMOD Rec. 25(2), 423–434 (1996)
Article Google Scholar
Hellerstein, J.M., Stonebraker, M.: Predicate migration: optimizing queries with expensive predicates. SIGMOD Rec. 22(2), 267–276 (1993)
Article Google Scholar
Heydon, A., Levin, R., Yu, Y.: Caching function calls using precise dependencies. SIGPLAN Not. 35(5), 311–320 (2000)
Article Google Scholar
IBM i version 7.2, database SQL programming. https://www.ibm.com/support/knowledgecenter/ssw_ibm_i_72/sqlp/rbafypdf.pdf
Jaedicke, M., Mitschang, B.: On parallel processing of aggregate and scalar functions in object-relational DBMS. SIGMOD Rec. 27(2), 379–389 (1998)
Google Scholar
Jarke, M.: Common subexpression isolation in multiple query optimization. In: Query Processing in Database Systems, pp. 191–205 (1985)
Google Scholar
Knuth, D.E.: The Art of Computer Programming, vol. 3: Sorting and Searching, 2nd edn. Addison Wesley Longman Publishing Co., Inc, Boston (1998)
Google Scholar
Mistry, H., Roy, P., Sudarshan, S., Ramamritham, K.: Materialized view selection and maintenance using multi-query optimization. SIGMOD Rec. 30(2), 307–318 (2001)
Article Google Scholar
Oracle database performance tuning guide, 12c release 1. https://docs.oracle.com/database/121/TGDBA/toc.htm
Performance notes. http://goog-sparsehash.sourceforge.net/doc/performance.html
Richardson, S.E.: Caching function results: faster arithmetic by avoiding unnecessary computation. Technical report, Mountain View, CA, USA (1992)
Google Scholar
Ross, K.A., Srivastava, D., Sudarshan, S.: Materialized view maintenance and integrity constraint checking: trading space for time. SIGMOD Rec. 25(2), 447–458 (1996)
Article Google Scholar
Sap, ERP 6.0 enhancement package 8. http://help.sap.com/erp2005_ehp_08/helpdata/en/59/cdc8109ce34bca896115f8ae660a69/content.htm
Sellis, T.K.: Multiple-query optimization. ACM Trans. Database Syst. 13(1), 23–52 (1988)
Google Scholar
Sikka, V., Färber, F., Lehner, W., Cha, S.K., Peh, T., Bornhövd, C.: Efficient transaction processing in SAP HANA database: The end of a column store myth. In: Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pp. 731–742 (2012)
Google Scholar

Download references

Acknowledgments

The work of Ryu, Lee and Park was supported in part by the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (No. 2012M3A9D1054622).

Author information

Authors and Affiliations

SAP Labs Korea, Seoul, Korea
Kihong Kim, Yongsik Kwon, Sang Kyun Cha & Changbin Song
SAP SE Germany, Walldorf, Germany
Emanuel Ziegler & Stephan Muench
Seoul National University, Seoul, Korea
Cheol Ryu, Sunho Lee, Kunsoo Park & Sang Kyun Cha

Authors

Cheol Ryu
View author publications
You can also search for this author in PubMed Google Scholar
Sunho Lee
View author publications
You can also search for this author in PubMed Google Scholar
Kihong Kim
View author publications
You can also search for this author in PubMed Google Scholar
Kunsoo Park
View author publications
You can also search for this author in PubMed Google Scholar
Yongsik Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Sang Kyun Cha
View author publications
You can also search for this author in PubMed Google Scholar
Changbin Song
View author publications
You can also search for this author in PubMed Google Scholar
Emanuel Ziegler
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Muench
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kunsoo Park .

Editor information

Editors and Affiliations

Arizona State University, Tempe - Phoenix, Arizona, USA
Selçuk Candan
of Science and Technology, Hong Kong University of Science and Technology, Hong Kong, China
Lei Chen
Aalborg University , Aalborg, Denmark
Torben Bach Pedersen
University of New South Wales , Sydney, New South Wales, Australia
Lijun Chang
The University of Queensland , Brisbane, Queensland, Australia
Wen Hua

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ryu, C. et al. (2017). Optimizing Scalar User-Defined Functions in In-Memory Column-Store Database Systems. In: Candan, S., Chen, L., Pedersen, T., Chang, L., Hua, W. (eds) Database Systems for Advanced Applications. DASFAA 2017. Lecture Notes in Computer Science(), vol 10178. Springer, Cham. https://doi.org/10.1007/978-3-319-55699-4_35

Download citation

DOI: https://doi.org/10.1007/978-3-319-55699-4_35
Published: 22 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-55698-7
Online ISBN: 978-3-319-55699-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics