skip to main content
research-article

A Multi-tenant Key-value SSD with Secondary Index for Search Query Processing and Analysis

Published: 26 July 2023 Publication History

Abstract

Key-value SSDs (KVSSDs) introduced so far are limited in their use as an alternative to the key-value store running on the host due to the following technical limitations. First, they were designed only for a single tenant, limiting the use of multiple tenants. Second, they mainly focused on designing indexes for primary key-based searches, without supporting various queries using a combination of primary key and non-primary attribute-based searches. This article proposes Cerberus, a Log Structured Merged (LSM) tree-based KVSSD armed with (1) namespace and performance isolation for multiple tenants in a multi-tenant environment and (2) capability for processing non-primary attribute-based search queries. Specifically, Cerberus identifies the tenant’s namespace and splits a single large LSM-tree into namespace-specific LSM-tree indexes for tenants. Cerberus also manages secondary LSM-tree indexes to enable non-primary attribute-based data access and fast search query processing. With the SSD-internal CPU/DRAM resources, Cerberus supports non-primary attribute-based search queries and handles complex queries that are combined with search and computing operations. We prototyped Cerberus on the Cosmos+ OpenSSD platform. When there are multiple tenants, Cerberus exhibits up to 2.9× higher read throughput and negligible write overhead compared to existing KVSSD. Cerberus also shows lower latency by up to 9.31× for non-primary attribute-based queries.

References

[2]
2017. Cosmos+ OpenSSD Platform. http://www.openssd-project.org/.
[3]
[5]
Stefan Aulbach, Torsten Grust, Dean Jacobs, Alfons Kemper, and Jan Rittinger. 2008. Multi-tenant databases for software as a service: Schema-mapping techniques. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. 1195–1206.
[6]
Nicholas J. Belkin, Colleen Cool, W. Bruce Croft, and James P. Callan. 1993. The effect multiple query representations on information retrieval system performance. In Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 339–346.
[7]
Janki Bhimani, Jingpei Yang, Ningfang Mi, Changho Choi, and Manoj Saha. 2021. Fine-grained control of concurrency within KV-SSDs. In Proceedings of the 14th ACM International System and Storage Conference (SYSTOR’21). ACM, 1–12.
[8]
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. 2008. Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems 26, 2 (2008), 1–26.
[9]
Wonil Choi, Bhuvan Urgaonkar, Mahmut Taylan Kandemir, and George Kesidis. 2022. Multi-resource fair allocation for consolidated flash-based caching systems. In Proceedings of the 23rd Conference on 23rd ACM/IFIP International Middleware Conference. 202–215.
[10]
Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. 2010. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM Symposium on Cloud Computing. 143–154.
[11]
Peter C. Dillinger and Stefan Walzer. 2021. Ribbon filter: Practically smaller than bloom and xor. arXiv preprint arXiv:2103.02515 (2021).
[12]
Samsung Electronics.2018. Samsung Smart SSD. https://samsungatfirst.com/smartssd-ocp/.
[14]
Mike Folk, Gerd Heber, Quincey Koziol, Elena Pourmal, and Dana Robinson. 2011. An overview of the HDF5 technology suite and its applications. In Proceedings of the EDBT/ICDT 2011 Workshop on Array Databases(AD’11). 36–47.
[15]
Google. 2012. RocksDB: A Persistent Key-value Store for Fast Storage Environment. https://rocksdb.org.
[16]
Boncheol Gu, Andre S. Yoon, Duck-Ho Bae, Insoon Jo, Jinyoung Lee, Jonghyun Yoon, Jeong-Uk Kang, Moonsang Kwon, Chanho Yoon, Sangyeun Cho, Jaeheon Jeong, and Duckhyun Chang. 2016. Biscuit: A framework for near-data processing of big data workloads. In Proceedings of the 43rd International Symposium on Computer Architecture (ISCA’16). 153–165.
[17]
Ajay Gulati, Arif Merchant, and Peter J. Varman. 2007. pClock: An arrival curve based approach for QoS guarantees in shared storage systems. ACM SIGMETRICS Performance Evaluation Review 35, 1 (2007), 13–24.
[18]
John L. Hennessy and David A. Patterson. 2011. Computer Architecture: A Quantitative Approach. Elsevier.
[19]
Yang Hu, Hong Jiang, Dan Feng, Lei Tian, Hao Luo, and Chao Ren. 2012. Exploring and exploiting the multilevel parallelism inside SSDs for improved performance and endurance. IEEE Trans. Comput. 62, 6 (2012), 1141–1155.
[20]
Junsu Im, Jinwook Bae, Chanwoo Chung, Arvind, and Sungjin Lee. 2020. PinK: High-speed in-storage key-value store with bounded tails. In Proceedings of the USENIX Annual Technical Conference (ATC’20). USENIX, 173–187.
[21]
Shvetank Jain, Fareha Shafique, Vladan Djeric, and Ashvin Goel. 2008. Application-level isolation and recovery with solitude. In Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems. 95–107.
[22]
Yanqin Jin, Hung-Wei Tseng, Yannis Papakonstantinou, and Steven Swanson. 2017. KAML: A flexible, high-performance key-value SSD. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA’17). IEEE, 373–384.
[23]
Sang-Woo Jun, Ming Liu, Sungjin Lee, Jamey Hicks, John Ankcorn, Myron King, Shuotao Xu, and Arvind. 2015. BlueDBM: An appliance for big data analytics. In Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA’15). ACM, 1–13.
[24]
Yangwook Kang, Yang-suk Kee, Ethan L. Miller, and Chanik Park. 2013. Enabling cost-effective data processing with smart SSD. In Proceedings of the 29th Symposium on Mass Storage Systems and Technologies (MSST’13). IEEE, 1–12.
[25]
Awais Khan, Hyogi Sim, Sudharshan S. Vazhkudai, and Youngjae Kim. 2021. MOSIQS: Persistent memory object storage with metadata indexing and querying for scientific computing. IEEE Access 9 (2021), 85217–85231.
[26]
Jaeho Kim, Donghee Lee, and Sam H. Noh. 2015. Towards SLO complying ssds through OPS isolation. In 13th USENIX Conference on File and Storage Technologies (FAST’15). USENIX, 183–189.
[27]
Gunjae Koo, Kiran Kumar Matam, Te I, H. V. Krishna Giri Narra, Jing Li, Hung-Wei Tseng, Steven Swanson, and Murali Annavaram. 2017. Summarizer: Trading communication with computing near storage. In Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-50’17). 219–231.
[28]
Dongup Kwon, Dongryeong Kim, Junehyuk Boo, Wonsik Lee, and Jangwoo Kim. 2021. A fast and flexible hardware-based virtualization mechanism for computational storage devices. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC’21). 729–743.
[29]
Avinash Lakshman and Prashant Malik. 2010. Cassandra: A decentralized structured storage system. ACM SIGOPS Operating Systems Review 44, 2 (2010), 35–40.
[30]
Willis Lang, Srinath Shankar, Jignesh M. Patel, and Ajay Kalhan. 2013. Towards multi-tenant performance SLOs. IEEE Transactions on Knowledge and Data Engineering 26, 6 (2013), 1447–1463.
[31]
Chang-Gyu Lee, Hyeongu Kang, Donggyu Park, Sungyong Park, Youngjae Kim, Jungki Noh, Woosuk Chung, and Kyoung Park. 2019. iLSM-SSD: An intelligent LSM-Tree based key-value SSD for data analytics. In Proceedings of the 27th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS’19). IEEE, 384–395.
[32]
Shengwen Liang, Ying Wang, Youyou Lu, Zhe Yang, Huawei Li, and Xiaowei Li. 2019. Cognitive SSD: A deep learning engine for in-storage data retrieval. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC’19). USENIX, 395–410.
[33]
Lanyue Lu, Thanumalayan Sankaranarayana Pillai, Hariharan Gopalakrishnan, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. 2016. Wisckey: Separating keys from values in SSD-conscious storage. In Proceedings of the File and Storage Technologies (FAST’16). USENIX, 133–148.
[34]
Yoshinori Matsunobu, Siying Dong, and Herman Lee. 2020. MyRocks: LSM-tree database storage engine serving Facebook’s social graph. Proceedings of the VLDB Endowment 13, 12 (2020), 3217–3230.
[35]
Donghyun Min and Youngjae Kim. 2021. Isolating namespace and performance in key-value SSDs for multi-tenant environments. In Proceedings of the 13th ACM Workshop on Hot Topics in Storage and File Systems. 8–13.
[36]
Patrick O’Neil, Edward Cheng, Dieter Gawlick, and Elizabeth O’Neil. 1996. The log-structured merge-tree (LSM-tree). Acta Informatica 33, 4 (1996), 351–385.
[37]
Mohiuddin Abdul Qader, Shiwen Cheng, and Vagelis Hristidis. 2018. A comparative study of secondary indexing techniques in LSM-based NoSQL databases. In Proceedings of the 2018 ACM SIGMOD International Conference on Management of Data. 551–566.
[38]
Sean Rhea, Brighten Godfrey, Brad Karp, John Kubiatowicz, Sylvia Ratnasamy, Scott Shenker, Ion Stoica, and Harlan Yu. 2005. OpenDHT: A public DHT service and its SSEs. In Proceedings of the 2005 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications. 73–84.
[39]
Zhenyuan Ruan, Tong He, and Jason Con. 2019. Insider: Designing in-storage computing system for emerging high-performance drive. In Proceedings of the 2019 USENIX Annual Technical Conference (ATC’19). USENIX, 379–394.
[40]
Partho Sarthi, Kaushik Rajan, Akash Lal, Abhishek Modi, Prakhar Jain, Mo Liu, Ashit Gosalia, and Saurabh Kalikar. 2020. Generalized sub-query fusion for eliminating redundant I/O from big-data queries. In 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI’20). USENIX, 209–224.
[41]
David Shue, Michael J. Freedman, and Anees Shaikh. 2012. Performance isolation and fairness for multi-tenant cloud storage. In Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI’12). USENIX, 349–362.
[42]
Swaminathan Sivasubramanian. 2012. Amazon dynamoDB: A seamlessly scalable non-relational database service. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. 729–730.
[43]
National Snow and Ice Data Center. 2016. World Glacier Inventory: Name, location, altitude, and area of every glacier on the planet. https://www.kaggle.com/nsidcorg/glacier-inventory.
[44]
Satoru Watanabe, Kazuhisa Fujimoto, Yuji Saeki, Yoshifumi Fujikawa, and Hiroshi Yoshino. 2019. Column-oriented database acceleration using FPGAs. In Proceedings of 2019 IEEE 35th International Conference on Data Engineering (ICDE’19). 686–697.
[45]
ScaleFlux; Zhushi Cheng Alibaba; Ning Zheng ScaleFlux; Wei Li Wei Cao, Alibaba; Yang Liu, ScaleFlux; Peng Wang Wenjie Wu, Alibaba; Linqiang Ouyang, ScaleFlux; Zhenjun Liu Yijing Wang, Alibaba; Ray Kuan, and ScaleFlux Feng Zhu, Alibaba; Tong Zhang. 2014. POLARDB meets computational storage: Efficiently support analytical workloads in cloud-native relational database. In Proceedings of the 18th USENIX Conference on File and Storage Technologies (FAST’14). USENIX, 29–41.
[46]
Sung-Ming Wu, Kai-Hsiang Lin, and Li-Pin Chang. 2018. KVSSD: Close integration of LSM trees and flash translation layer for write-efficient KV store. In Proceedings of the Design, Automation & Test in Europe Conference & Exhibition (DATE’18). IEEE, 563–568.
[47]
[48]
Shuotao Xu, Thomas Bourgeat, Tianhao Huang, Hojun Koim, Sungjin Lee, and Arvind. 2020. AQUOMAN: An analytic-query offloading machine. In Proceedings of the 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO’20). 386–399.
[49]
Yelp. 2021. A trove of reviews, businesses, users, tips, and check-in data!https://www.kaggle.com/yelp-dataset/yelp-dataset.
[50]
Wei Zhang, Suren Byna, Chenxu Niu, and Yong Chen. 2019. Exploring metadata search essentials for scientific data management. In Proceedings of the 26th International Conference on High Performance Computing, Data, and Analytics (HiPC’19). IEEE, 83–92.
[51]
Wei Zhang, Suren Byna, Houjun Tang, Brody Williams, and Yong Chen. 2019. MIQS: Metadata indexing and querying service for self-describing file formats. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC’19). Article 5, 24 pages.
[52]
Xiao Zongshui, Lanju Kong, Qingzhong Li, and Pang Cheng. 2015. Global index oriented non-shard key for multi-tenant database. In 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing. IEEE, 831–836.

Index Terms

  1. A Multi-tenant Key-value SSD with Secondary Index for Search Query Processing and Analysis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Embedded Computing Systems
    ACM Transactions on Embedded Computing Systems  Volume 22, Issue 4
    July 2023
    551 pages
    ISSN:1539-9087
    EISSN:1558-3465
    DOI:10.1145/3610418
    • Editor:
    • Tulika Mitra
    Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Journal Family

    Publication History

    Published: 26 July 2023
    Online AM: 11 April 2023
    Accepted: 14 March 2023
    Revised: 15 February 2023
    Received: 29 June 2022
    Published in TECS Volume 22, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Key-value solid-state drive
    2. NoSQL database storage engine

    Qualifiers

    • Research-article

    Funding Sources

    • National Research Foundation of Korea (NRF)
    • Korean government (MSIT)

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 495
      Total Downloads
    • Downloads (Last 12 months)236
    • Downloads (Last 6 weeks)18
    Reflects downloads up to 14 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    Full Text

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media