Skip to main content

Indexing Multi-dimensional Data in Modular Data Centers

  • Conference paper
  • First Online:
Database and Expert Systems Applications (Globe 2015, DEXA 2015)

Abstract

Providing efficient multi-dimensional indexing is critically important to improve the overall performance of the cloud storage system. To achieve efficient querying service, the indexing scheme should guarantee lower routing cost and less false positive. In this paper, we propose RB-Index, a distributed multi-dimensional indexing scheme in modular data centers with Bcube topology. RB-Index is a two-layer indexing scheme, which integrates Bcube-based routing protocol and R-tree-based indexing technology. In its lower layer, each server in the network indexes the local data with R-tree, while in the upper layer the global index is distributed across different servers in the network. Based on the characteristics of Bcube, we build several indexing spaces and propose the way to map servers into the indexing spaces. The dimension of these indexing spaces are dynamically selected according to both the data distribution and the query habit. Index construction and query algorithms are also introduced. We simulate a three-level Bcube to evaluate the efficiency of our indexing scheme and compare the performance of RB-Index with RT-CAN, a similar design in P2P network.

This work has been supported in part by the National Natural Science Foundation of China (Grant number 61202024, 61472252, 61133006, 61422208), China 973 project (2012CB316200), the Natural Science Foundation of Shanghai (Grant No. 12ZR1445000), Shanghai Educational Development Foundation (Chenguang Grant No. 12CG09), Shanghai Pujiang Program 13PJ1403900, and in part by Jiangsu Future Network Research Project No. BY2013095-1-10 and CCF-Tencent Open Fund.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chen, G., Vo, H.T., Wu, S., Ooi, B.C., Özsu, M.T.: A framework for supporting DBMSlike indexes in the cloud. VLDB 4(11), 702–713 (2011)

    Google Scholar 

  2. Guo, C., Lu, G., Li, D., Wu, H., Zhang, X., Shi, Y., Tian, C., Zhang, Y., Lu, S.: BCube: a high performance, server-centric network architecture for modular data centers. In: SIGCOMM (2009)

    Google Scholar 

  3. du Mouza, C., Litwin, W., Rigaux, P.: SD-Rtree: a scalable distributed Rtree. In: ICDE pp. 296–305 (2007)

    Google Scholar 

  4. Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: OSDI, pp. 205–218 (2006)

    Google Scholar 

  5. Li, F., Liang, W., Gao, X., Yao, B., Chen, G.: Efficient R-tree based indexing for cloud storage system with dual-port servers. In: Decker, H., Lhotská, L., Link, S., Spies, M., Wagner, R.R. (eds.) DEXA 2014, Part II. LNCS, vol. 8645, pp. 375–391. Springer, Heidelberg (2014)

    Google Scholar 

  6. DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., Vogels, W.: Dynamo: amazons highly available key-value store. SIGOPS 41(6), 205–220 (2007)

    Article  Google Scholar 

  7. Wang, J., Wu, S., Gao, H., Li, J., Ooi, B.C.: Indexing multi-dimensional data in a cloud system. In: SIGMOD, pp. 591–602 (2010)

    Google Scholar 

  8. Hamilton, J.: An architecture for modular data centers. In: CIDR (2007)

    Google Scholar 

  9. Avinash, L., Prashant, M.: Cassandra: a decentralized structured storage system. ACM Spec. Interest Group Oper. Syst. (SIGOPS) 44(2), 35–40 (2010)

    Google Scholar 

  10. Aguilera, M.K., Golab, W., Shah, M.A.: A practical scalable distributed B-tree. PVLDB 1, 598–609 (2008)

    Google Scholar 

  11. Escriva, R., Wong, B., Sirer, E.G.: HyperDex: a distributed, searchable key-value store. In: SIGCOMM, pp. 25–36 (2012)

    Google Scholar 

  12. Sai, W., Wu, K.-L.: An indexing framework for efficient retrieval on the cloud. ICDE 32(1), 75–82 (2009)

    MATH  Google Scholar 

  13. Wu, S., Jiang, D., Ooi, B.C., Wu, K.-L.: Efficient B-tree based indexing for cloud data processing. VLDB 3(1–2), 1207–1218 (2010)

    Google Scholar 

  14. Gao, X., Li, B., Chen, Z., Yin, M., Chen, G., Jin, Y.: FT-INDEX: a distributed indexing scheme for switch-centric cloud storage system. In: ICC (2015)

    Google Scholar 

  15. Liang, Y., Balcan, M.-F., Kanchanapally, V.: Distributed PCA and k-means clustering. In: NIPS (2012)

    Google Scholar 

  16. Liu, Y., Gao, X., Chen, G.: Design and optimization for distributed indexing scheme in switch-centric cloud storage system. In: ISCC, pp. 804–809 (2015)

    Google Scholar 

  17. Jolliffe, I.T.: Principal Component Analysis, 2nd edn. Springer, New York (2002)

    MATH  Google Scholar 

  18. IBM: Scalable Modular Data Center. http://www-935.ibm.com/services/us/its/pdf/smdc-eb-sfe03001-usen-00-022708.pdf

  19. Rackable Systems: ICE Cube Modular Data Center. http://www.rackable.com/products/icecube.aspx

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaofeng Gao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Gao, L., Zhang, Y., Gao, X., Chen, G. (2015). Indexing Multi-dimensional Data in Modular Data Centers. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds) Database and Expert Systems Applications. Globe DEXA 2015 2015. Lecture Notes in Computer Science(), vol 9262. Springer, Cham. https://doi.org/10.1007/978-3-319-22852-5_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22852-5_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22851-8

  • Online ISBN: 978-3-319-22852-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics