Abstract
Efficient multi-dimensional indexing is critical to the overall performance of a cloud storage system. To provide an efficient query service, an indexing scheme should guarantee low routing cost and few false positives. In this paper, we propose RB-Index, a distributed multi-dimensional indexing scheme for modular data centers with the BCube topology. RB-Index is a two-layer indexing scheme that integrates a BCube-based routing protocol with R-tree-based indexing. In the lower layer, each server indexes its local data with an R-tree; in the upper layer, the global index is distributed across the servers in the network. Based on the characteristics of BCube, we build several indexing spaces and propose a way to map servers into them. The dimensions of these indexing spaces are selected dynamically according to both the data distribution and the query patterns. We also introduce index construction and query algorithms. We simulate a three-level BCube to evaluate the efficiency of our indexing scheme and compare the performance of RB-Index with RT-CAN, a similar design for P2P networks.
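The two-layer idea can be illustrated with a minimal Python sketch. It is not the RB-Index algorithm: it omits BCube routing and the mapping of servers into indexing spaces, it keeps the global index in one dictionary instead of distributing it, and a linear scan stands in for each server's R-tree. All names (`LocalServer`, `two_layer_query`, the server ids) are hypothetical. It only shows how an upper layer of bounding boxes selects candidate servers, and how a candidate that holds no matching data counts as a false positive.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Point = Tuple[float, ...]
Box = Tuple[Point, Point]  # (lower corner, upper corner)


def intersects(a: Box, b: Box) -> bool:
    """True if two axis-aligned boxes overlap in every dimension."""
    return all(alo <= bhi and blo <= ahi
               for alo, ahi, blo, bhi in zip(a[0], a[1], b[0], b[1]))


def contains(box: Box, p: Point) -> bool:
    """True if point p lies inside the box (boundaries included)."""
    return all(lo <= x <= hi for lo, x, hi in zip(box[0], p, box[1]))


@dataclass
class LocalServer:
    """Lower layer. Assumption: a linear scan replaces the local R-tree."""
    points: List[Point] = field(default_factory=list)

    def mbr(self) -> Box:
        """Minimum bounding rectangle of the local data (must be non-empty)."""
        lower = tuple(min(c) for c in zip(*self.points))
        upper = tuple(max(c) for c in zip(*self.points))
        return (lower, upper)

    def range_query(self, q: Box) -> List[Point]:
        return [p for p in self.points if contains(q, p)]


def two_layer_query(servers: Dict[str, LocalServer], q: Box):
    """Upper layer picks candidate servers; lower layer answers locally."""
    # Assumption: the global index is a plain dict of per-server MBRs.
    global_index = {sid: s.mbr() for sid, s in servers.items()}
    candidates = [sid for sid, box in global_index.items()
                  if intersects(box, q)]
    results = {sid: servers[sid].range_query(q) for sid in candidates}
    # A candidate that returns nothing was contacted needlessly.
    false_positives = [sid for sid, hits in results.items() if not hits]
    return results, false_positives
```

Calling `two_layer_query` with a query box that overlaps a server's bounding rectangle but none of its points returns that server in the false-positive list, which is the cost the abstract says an indexing scheme should keep low.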
This work has been supported in part by the National Natural Science Foundation of China (Grant Nos. 61202024, 61472252, 61133006, 61422208), the China 973 project (2012CB316200), the Natural Science Foundation of Shanghai (Grant No. 12ZR1445000), the Shanghai Educational Development Foundation (Chenguang Grant No. 12CG09), the Shanghai Pujiang Program (No. 13PJ1403900), and in part by the Jiangsu Future Network Research Project (No. BY2013095-1-10) and the CCF-Tencent Open Fund.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Gao, L., Zhang, Y., Gao, X., Chen, G. (2015). Indexing Multi-dimensional Data in Modular Data Centers. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds) Database and Expert Systems Applications. DEXA 2015. Lecture Notes in Computer Science, vol 9262. Springer, Cham. https://doi.org/10.1007/978-3-319-22852-5_26
DOI: https://doi.org/10.1007/978-3-319-22852-5_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22851-8
Online ISBN: 978-3-319-22852-5
eBook Packages: Computer Science, Computer Science (R0)