Abstract
With the development of cloud computing, more and more large scale multi-dimensional data are stored on cloud platforms. Multi-dimensional index is an efficient technique to support processing data efficiently. Designing a multi-dimensional index which supports multi-user concurrent access efficiently has become a challenging problem. In this paper, we propose a multi-version R-tree based on HBase (HMVR-tree) to support multiple concurrent access. HMVR-tree maintains the newest version of tree while keeping all the old versions of the nodes for efficient concurrent update and query access to different nodes. The evaluation results show that MHVR-tree has good scalability and has much higher update throughput and the same level query throughput compared to the original R-tree on HBase.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
HBase: Bigtable-like structured storage for Hadoop HDFS (2010). http://hadoop.apache.org/hbase/
ZooKeeper (2014). http://zookeeper.apache.org/
Borthakur, D.: The hadoop distributed file system: architecture and design. Hadoop Proj. Website 11, 21 (2007)
Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation, OSDI 2006, vol. 7, p. 15. USENIX Association, Berkeley (2006)
Guttman, A.: R-trees: a dynamic index structure for spatial searching. SIGMOD Rec. 14(2), 47–57 (1984)
Hsu, Y.T., Pan, Y.C., Wei, L.Y., Peng, W.C., Lee, W.C.: Key formulation schemes for spatial index in cloud data managements. In: 2012 IEEE 13th International Conference on Mobile Data Management (MDM), pp. 21–26, July 2012
Huang, S., Wang, B., Zhu, J., Wang, G., Yu, G.: R-HBase: a multi-dimensional indexing framework for cloud computing environment. In: 2014 IEEE International Conference on Data Mining Workshop, pp. 569–574. IEEE, December 2014
Nishimura, S., Das, S., Agrawal, D., El Abbadi, A.: \(\cal {MD}\)-HBase: design and implementation of an elastic data infrastructure for cloud-scale location services. Distrib. Parallel Databases 31(2), 289–319 (2013)
Van, L.H., Takasu, A.: An efficient distributed index for geospatial databases. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds.) DEXA 2015. LNCS, vol. 9261, pp. 28–42. Springer, Heidelberg (2015)
Wagner, R.E.: Indexing design considerations. IBM Syst. J. 12(4), 351–367 (1973)
Wang, L., Chen, B., Liu, Y.: Distributed storage and index of vector spatial data based on HBase. In: 2013 21st International Conference on Geoinformatics, No. 2011, pp. 1–5. IEEE, June 2013
Zhou, X., Zhang, X., Wang, Y., Li, R., Wang, S.: Efficient distributed multi-dimensional index for big data management. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 130–141. Springer, Heidelberg (2013)
Acknowledgments
This research was partially supported by the National Natural Science Foundation of China under Grant nos. 61173030, 61272181, 61272182, 61173029, 61332014; and the Fundamental Research Funds for the Central Universities (N120816001).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Huang, S., Wang, B., Deng, S., Zhao, K., Wang, G., Yu, G. (2016). HMVR-tree: A Multi-version R-tree Based on HBase for Concurrent Access. In: Wang, Y., Yu, G., Zhang, Y., Han, Z., Wang, G. (eds) Big Data Computing and Communications. BigCom 2016. Lecture Notes in Computer Science(), vol 9784. Springer, Cham. https://doi.org/10.1007/978-3-319-42553-5_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-42553-5_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42552-8
Online ISBN: 978-3-319-42553-5
eBook Packages: Computer ScienceComputer Science (R0)