DOI: 10.1145/1376616.1376736

Multi-tenant databases for software as a service: schema-mapping techniques

Published: 09 June 2008

Abstract

In the implementation of hosted business services, multiple tenants are often consolidated into the same database to reduce total cost of ownership. Common practice is to map multiple single-tenant logical schemas in the application to one multi-tenant physical schema in the database. Such mappings are challenging to create because enterprise applications allow tenants to extend the base schema, e.g., for vertical industries or geographic regions. Assuming the workload stays within bounds, the fundamental limitation on scalability for this approach is the number of tables the database can handle. To get good consolidation, certain tables must be shared among tenants and certain tables must be mapped into fixed generic structures such as Universal and Pivot Tables, which can degrade performance.
This paper describes a new schema-mapping technique for multi-tenancy called Chunk Folding, where the logical tables are vertically partitioned into chunks that are folded together into different physical multi-tenant tables and joined as needed. The database's "meta-data budget" is divided between application-specific conventional tables and a large fixed set of generic structures called Chunk Tables. Good performance is obtained by mapping the most heavily utilized parts of the logical schemas into the conventional tables and the remaining parts into Chunk Tables that match their structure as closely as possible. We present the results of several experiments designed to measure the efficacy of Chunk Folding and describe the multi-tenant database testbed in which these experiments were performed.
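
As a rough illustration of the idea described above, the following Python sketch uses an in-memory SQLite database to fold one tenant's extension columns into a generic chunk table and to rebuild the logical rows with a join. The table names (`account_base`, `chunk_int_str`), the column names, the tenant IDs, and the sample rows are all assumptions made for this example; the abstract does not specify a concrete layout, and this is not the authors' implementation.

```python
import sqlite3

# Hypothetical layout: one conventional table for the heavily used base
# columns, plus one generic chunk table holding an (integer, string) pair.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE account_base (
    tenant INTEGER, rid INTEGER, aid INTEGER, name TEXT,
    PRIMARY KEY (tenant, rid)
);
CREATE TABLE chunk_int_str (
    tenant INTEGER, tbl INTEGER, chunk INTEGER, rid INTEGER,
    int1 INTEGER, str1 TEXT,
    PRIMARY KEY (tenant, tbl, chunk, rid)
);
""")

# Base columns of the logical Account table, shared by all tenants.
conn.executemany(
    "INSERT INTO account_base VALUES (?, ?, ?, ?)",
    [(17, 0, 1, "Acme"), (17, 1, 2, "Gump"), (42, 0, 1, "Ball")],
)

# Tenant 17 extends Account with (beds, hospital). Both extension columns
# are folded into chunk 0 of logical table 0: int1 = beds, str1 = hospital.
conn.executemany(
    "INSERT INTO chunk_int_str VALUES (?, ?, ?, ?, ?, ?)",
    [(17, 0, 0, 0, 135, "St. Mary"), (17, 0, 0, 1, 1042, "State")],
)


def read_accounts(conn, tenant, tbl=0, chunk=0):
    """Rebuild a tenant's extended Account rows by joining base and chunk."""
    return conn.execute(
        """
        SELECT b.aid, b.name, c.str1 AS hospital, c.int1 AS beds
        FROM account_base AS b
        JOIN chunk_int_str AS c
          ON c.tenant = b.tenant AND c.rid = b.rid
        WHERE b.tenant = ? AND c.tbl = ? AND c.chunk = ?
        ORDER BY b.rid
        """,
        (tenant, tbl, chunk),
    ).fetchall()


print(read_accounts(conn, 17))
```

The inner join in this sketch returns rows only for tenants that have extension data in the chunk table; a tenant using just the base schema would be read from `account_base` directly.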


      Published In

      SIGMOD '08: Proceedings of the 2008 ACM SIGMOD international conference on Management of data
      June 2008
      1396 pages
      ISBN:9781605581026
      DOI:10.1145/1376616
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. chunk folding
      2. multi-tenancy
      3. software as a service

      Qualifiers

      • Research-article

      Conference

      SIGMOD/PODS '08
      Acceptance Rates

      Overall Acceptance Rate 785 of 4,003 submissions, 20%
