Abstract
Many location-based services need to query objects existing in a specific space, such as location-based tourism resource recommendation. Both a large number of spatial objects and the real-time object access requirements of location-based services pose a big challenge for spatial object storage and query management. In this paper, we propose HGeoHashBase, an improved storage model by integrating GeoHash with key-value structure, to organize spatial objects for efficient range queries. GeoHash is responsible for spatial encoding and key-value structure as underlying data storage. Both the similarity of the encodings for objects in the close geographical locations and the multi-version data mechanism are blended into the proposed model well. Considering the tradeoff between encoding precision and query performance, a theoretical proof is presented. Extensive experiments are designed and conducted, whose results show that the proposed model can gain significant performance improvement.
Similar content being viewed by others
References
Guttman A. R-trees: a dynamic index structure for spatial searching. In: Proceedings of ACM SIGMOD International Conference on Management of Data. 1984, 47–57
Beckmann N, Kriegel H P, Schneider R, Seeger B. The R*-tree: an efficient and robust access method for points and rectangles. In: Proceedings of ACM SIGMOD International Conference on Management of Data. 1990, 322–331
Chang F, Dean J, Ghemawat S, Hsieh W C, Wallach D A, Burrows M, Chandra T, Fikes A, Gruber K E. Bigtable: a distributed storage system for structured data. In: Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation. 2006, 205–218
Chang F, Dean J, Ghemawat S, Hsieh W C, Wallach D A, Burrows M, Chandra T, Fikes A, Gruber K E. Bigtable: a distributed storage system for structured data. ACM Transactions on Computer Systems, 2008, 26(2): 4
Lakshman A, Malik P. Cassandra: a decentralized structured storage system. ACM SIGOPS Operating System Review, 2010, 44(2): 35–40
Wang L, Cheng C Q, Wu S Z, Wu F L, Teng W. Massive remote sensing image data management based on HBase and GeoSOT. In: Proceedings of IEEE International Geoscience and Remote Sensing Symposium. 2015, 4558–4561
Wang L, Chen B, Liu Y H. Distributed storage and index of vector spatial data based on HBase. In: Proceedings of the 21st International Conference on Geoinformatics. 2013, 1–5
Dean J, Ghemawat S. Mapreduce: simplified data processing on large clusters. Communications of the ACM, 2008, 51(1): 107–113
Noghabi A S, Subramanian S, Narayanan P, Narayanan S, Holla G, Zaldeh M, Li T, Gupta I, Campbell R H. Ambry: linkedin’s scalable geo-distributed object store. In: Proceedings of the 2016 International Conference on Management of Data. 2016, 253–265
Shanbhag A, Jindal A, Lu Y, Madden S. Amoeba: a shape changing storage system for big data. Proceedings of the VLDB Endowment, 2016, 9(13): 1569–1572
DeCandia G, Hastorun D, Jampani M, Kakulapati G, Cakshman A, Pilchin A, Sivasubramanian S, Vosshall P, Vogels W. Dynamo: Amazon’s highly available key-value store. In: Proceedings of the 21st ACM SIGOPS Symposium on Operating Systems Principles. 2007, 205–220
Halevy A, Korn F, Noy N F, Olston C, Polyzotis N, Roy S, Whang S E. Goods: organizing google’s datasets. In: Proceedings of the 2016 International Conference on Management of Data. 2016, 795–806
Samet H, Webber E R. Storing a collection of polygons using quadtrees. ACM Transactions on Graphics, 1985, 4(3): 182–222
Han D, Ztroulia E. HGrid: a data model for large geospatial data sets in HBase. In: Proceedings of the IEEE International Conference on Cloud Computing. 2013, 910–917
Fox A, Eichelberger C, Hughes J, Lyon S. Spatio-temporal indexing in non-relational distributed databases. In: Proceedings of the 2013 IEEE International Conference on Big Data. 2013, 291–299
Xie D, Li F, Yao B, Li G F, Zhou L, Guo M Y. Simba: efficient inmemory spatial analytics. In: Proceedings of the 2016 International Conference on Management of Data. 2016, 1071–1085
Tang MJ, Yu Y Y, Malluhi MQ, Ouzzani M, Aref G W. Locationspark: a distributed in-memory data management system for big spatial data. Proceedings of the VLDB Endowment, 2016, 9(13): 1565–1568
Fernandes R, Zaczkowski P, Göttler B, Ettinoffe C, Moussa A. TrafficDB: here’s high performance shared-memory data store. Proceedings of the VLDB Endowment, 2016, 9(13): 1365–1376
Lakshman S, Melkote S, Liang J, Mayuram R. Nitro: a fast, scalable in-memory storage engine for NoSQL global secondary index. Proceedings of the VLDB Endowment, 2016, 9(13): 1413–1424
Liu J J, Li H R, Gao Y, Yu H, Jiang D. A Geohash-based index for spatial data management in distributed memory. In: Proceedings of the 22nd International Conference on Geoinformatics. 2014, 1–4
Arnold T. An entropy maximizing Geohash for distributed spatiotemporal database indexing. 2015, arXiv preprint arXiv:1506.05158
Acknowledgements
This study is supported by the National Natural Science Foundation of China (Grant Nos. 61462017, 61363005, U1501252, 61662013), Guangxi Natural Science Foundation of China (2017GXNSFAA198035, 2014GXNSFAA118353, 2014GXNSFAA118390), Guangxi Key Laboratory of Automatic Detection Technology and Instrument Foundation (YQ15110), Guangxi Cooperative Innovation Center of Cloud Computing and Big Data, and the High Level Innovation Team of Colleges and Universities in Guangxi and Outstanding Scholars Program Funding.
Author information
Authors and Affiliations
Corresponding author
Additional information
Jingwei Zhang is an associate professor at School of Computer and Information Security, Guilin University of Electronic Technology, China. He obtained his PhD from East China Normal University, China in 2012. His research interests include massive data management, distributed computing framework, Web data analysis and big data analytics for emerging applications.
Chao Yang has obtained the master degree from Guilin University of Electronic Technology, China. His research interests include large-scale data computation optimization and big data services for smart tourism.
Qing Yang is an associate professor at School of Electronics Engineering and Automation, Guilin University of Electronic Technology, China. Her research interests include intelligent information processing, social network analysis, and large-scale data processing optimization.
Yuming Lin is an associate professor at School of Computer and Information Security, Guilin University of Electronic Technology, China. He obtained his PhD from East China Normal University, China in 2013. His research interests include sentiment analysis, Web data mining and knowledge graph.
Yanchun Zhang is a professor and the director of the Centre for Applied Informatics, Victoria University, Australia. He received his PhD degree from the University of Queensland, Australia in 1991. He is the Editor-In-Chief of World Wide Web journal (Springer), and Health Information Science and Systems (BioMed Central). He is the Chairman of the International Web Information Systems Engineering Society (WISE Society). He was a member of ARC College of Experts from 2008 to 2010. He received the National “Thousand Talent Program” Award from China in 2010. His research interests include big data analytics, eHealth, Social networking, and Web services.
Electronic supplementary material
Rights and permissions
About this article
Cite this article
Zhang, J., Yang, C., Yang, Q. et al. HGeoHashBase: an optimized storage model of spatial objects for location-based services. Front. Comput. Sci. 14, 208–218 (2020). https://doi.org/10.1007/s11704-018-7030-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11704-018-7030-3