Skip to main content

Advertisement

Log in

HGeoHashBase: an optimized storage model of spatial objects for location-based services

  • Research Article
  • Published:
Frontiers of Computer Science Aims and scope Submit manuscript

Abstract

Many location-based services need to query objects existing in a specific space, such as location-based tourism resource recommendation. Both a large number of spatial objects and the real-time object access requirements of location-based services pose a big challenge for spatial object storage and query management. In this paper, we propose HGeoHashBase, an improved storage model by integrating GeoHash with key-value structure, to organize spatial objects for efficient range queries. GeoHash is responsible for spatial encoding and key-value structure as underlying data storage. Both the similarity of the encodings for objects in the close geographical locations and the multi-version data mechanism are blended into the proposed model well. Considering the tradeoff between encoding precision and query performance, a theoretical proof is presented. Extensive experiments are designed and conducted, whose results show that the proposed model can gain significant performance improvement.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Guttman A. R-trees: a dynamic index structure for spatial searching. In: Proceedings of ACM SIGMOD International Conference on Management of Data. 1984, 47–57

    Google Scholar 

  2. Beckmann N, Kriegel H P, Schneider R, Seeger B. The R*-tree: an efficient and robust access method for points and rectangles. In: Proceedings of ACM SIGMOD International Conference on Management of Data. 1990, 322–331

    Google Scholar 

  3. Chang F, Dean J, Ghemawat S, Hsieh W C, Wallach D A, Burrows M, Chandra T, Fikes A, Gruber K E. Bigtable: a distributed storage system for structured data. In: Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation. 2006, 205–218

    Google Scholar 

  4. Chang F, Dean J, Ghemawat S, Hsieh W C, Wallach D A, Burrows M, Chandra T, Fikes A, Gruber K E. Bigtable: a distributed storage system for structured data. ACM Transactions on Computer Systems, 2008, 26(2): 4

    Article  Google Scholar 

  5. Lakshman A, Malik P. Cassandra: a decentralized structured storage system. ACM SIGOPS Operating System Review, 2010, 44(2): 35–40

    Article  Google Scholar 

  6. Wang L, Cheng C Q, Wu S Z, Wu F L, Teng W. Massive remote sensing image data management based on HBase and GeoSOT. In: Proceedings of IEEE International Geoscience and Remote Sensing Symposium. 2015, 4558–4561

    Google Scholar 

  7. Wang L, Chen B, Liu Y H. Distributed storage and index of vector spatial data based on HBase. In: Proceedings of the 21st International Conference on Geoinformatics. 2013, 1–5

    Google Scholar 

  8. Dean J, Ghemawat S. Mapreduce: simplified data processing on large clusters. Communications of the ACM, 2008, 51(1): 107–113

    Article  Google Scholar 

  9. Noghabi A S, Subramanian S, Narayanan P, Narayanan S, Holla G, Zaldeh M, Li T, Gupta I, Campbell R H. Ambry: linkedin’s scalable geo-distributed object store. In: Proceedings of the 2016 International Conference on Management of Data. 2016, 253–265

    Google Scholar 

  10. Shanbhag A, Jindal A, Lu Y, Madden S. Amoeba: a shape changing storage system for big data. Proceedings of the VLDB Endowment, 2016, 9(13): 1569–1572

    Article  Google Scholar 

  11. DeCandia G, Hastorun D, Jampani M, Kakulapati G, Cakshman A, Pilchin A, Sivasubramanian S, Vosshall P, Vogels W. Dynamo: Amazon’s highly available key-value store. In: Proceedings of the 21st ACM SIGOPS Symposium on Operating Systems Principles. 2007, 205–220

    Google Scholar 

  12. Halevy A, Korn F, Noy N F, Olston C, Polyzotis N, Roy S, Whang S E. Goods: organizing google’s datasets. In: Proceedings of the 2016 International Conference on Management of Data. 2016, 795–806

    Google Scholar 

  13. Samet H, Webber E R. Storing a collection of polygons using quadtrees. ACM Transactions on Graphics, 1985, 4(3): 182–222

    Article  Google Scholar 

  14. Han D, Ztroulia E. HGrid: a data model for large geospatial data sets in HBase. In: Proceedings of the IEEE International Conference on Cloud Computing. 2013, 910–917

    Google Scholar 

  15. Fox A, Eichelberger C, Hughes J, Lyon S. Spatio-temporal indexing in non-relational distributed databases. In: Proceedings of the 2013 IEEE International Conference on Big Data. 2013, 291–299

    Chapter  Google Scholar 

  16. Xie D, Li F, Yao B, Li G F, Zhou L, Guo M Y. Simba: efficient inmemory spatial analytics. In: Proceedings of the 2016 International Conference on Management of Data. 2016, 1071–1085

    Google Scholar 

  17. Tang MJ, Yu Y Y, Malluhi MQ, Ouzzani M, Aref G W. Locationspark: a distributed in-memory data management system for big spatial data. Proceedings of the VLDB Endowment, 2016, 9(13): 1565–1568

    Article  Google Scholar 

  18. Fernandes R, Zaczkowski P, Göttler B, Ettinoffe C, Moussa A. TrafficDB: here’s high performance shared-memory data store. Proceedings of the VLDB Endowment, 2016, 9(13): 1365–1376

    Article  Google Scholar 

  19. Lakshman S, Melkote S, Liang J, Mayuram R. Nitro: a fast, scalable in-memory storage engine for NoSQL global secondary index. Proceedings of the VLDB Endowment, 2016, 9(13): 1413–1424

    Article  Google Scholar 

  20. Liu J J, Li H R, Gao Y, Yu H, Jiang D. A Geohash-based index for spatial data management in distributed memory. In: Proceedings of the 22nd International Conference on Geoinformatics. 2014, 1–4

    Google Scholar 

  21. Arnold T. An entropy maximizing Geohash for distributed spatiotemporal database indexing. 2015, arXiv preprint arXiv:1506.05158

    Google Scholar 

Download references

Acknowledgements

This study is supported by the National Natural Science Foundation of China (Grant Nos. 61462017, 61363005, U1501252, 61662013), Guangxi Natural Science Foundation of China (2017GXNSFAA198035, 2014GXNSFAA118353, 2014GXNSFAA118390), Guangxi Key Laboratory of Automatic Detection Technology and Instrument Foundation (YQ15110), Guangxi Cooperative Innovation Center of Cloud Computing and Big Data, and the High Level Innovation Team of Colleges and Universities in Guangxi and Outstanding Scholars Program Funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qing Yang.

Additional information

Jingwei Zhang is an associate professor at School of Computer and Information Security, Guilin University of Electronic Technology, China. He obtained his PhD from East China Normal University, China in 2012. His research interests include massive data management, distributed computing framework, Web data analysis and big data analytics for emerging applications.

Chao Yang has obtained the master degree from Guilin University of Electronic Technology, China. His research interests include large-scale data computation optimization and big data services for smart tourism.

Qing Yang is an associate professor at School of Electronics Engineering and Automation, Guilin University of Electronic Technology, China. Her research interests include intelligent information processing, social network analysis, and large-scale data processing optimization.

Yuming Lin is an associate professor at School of Computer and Information Security, Guilin University of Electronic Technology, China. He obtained his PhD from East China Normal University, China in 2013. His research interests include sentiment analysis, Web data mining and knowledge graph.

Yanchun Zhang is a professor and the director of the Centre for Applied Informatics, Victoria University, Australia. He received his PhD degree from the University of Queensland, Australia in 1991. He is the Editor-In-Chief of World Wide Web journal (Springer), and Health Information Science and Systems (BioMed Central). He is the Chairman of the International Web Information Systems Engineering Society (WISE Society). He was a member of ARC College of Experts from 2008 to 2010. He received the National “Thousand Talent Program” Award from China in 2010. His research interests include big data analytics, eHealth, Social networking, and Web services.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, J., Yang, C., Yang, Q. et al. HGeoHashBase: an optimized storage model of spatial objects for location-based services. Front. Comput. Sci. 14, 208–218 (2020). https://doi.org/10.1007/s11704-018-7030-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11704-018-7030-3

Keywords