skip to main content
10.1145/3291064.3291068acmotherconferencesArticle/Chapter ViewAbstractPublication PagescciotConference Proceedingsconference-collections
research-article

Research on Distributed Storage and Query Optimization of Multi-source Heterogeneous Meteorological Data

Authors Info & Claims
Published:29 October 2018Publication History

ABSTRACT

The growth of massive data makes the real-time data service of meteorological forecast and climate analysis facing severe challenge. Distributed database is a good solution to meet the needs for massive multi-source heterogeneous meteorological data storage. Since the current mainstream HBase database fails to support the non-Rowkey query, the poor performance of the real-time query of meteorological data is unsatisfactory. To address this issue, three kinds of distributed data query optimization strategies are proposed in this paper, including query optimization based on secondary index, secondary index query optimization based on hotscore, and query optimization based on the Redis hot data caching strategy. The corresponding experimental results indicate that the search scheme based on the Redis hot data caching strategy has the best performance among the three schemes, not only can meet the needs of meteorological service query, but also can achieve 3-8 times efficiency enhancement than standard HBase.

References

  1. Gantz J, Reinsel D.The digital universe in 2020: big data, bigger digital shadows, and biggest growth in the far east{J}. IDC iView: IDC Analyze the Future, 2012: 1--16.Google ScholarGoogle Scholar
  2. Shen Wen Hai. Discussion on the future infrastructure of meteorological service information system -- the role of "cloud computing" and "big data" in meteorological information technology {J}. Meteorological Science and Technology Progress, 2015 (3):64--66.Google ScholarGoogle Scholar
  3. Xiong Anyuan, Zhao Fang, Wang Ying, et al. Design and implementation of the national integrated meteorological information sharing system {J}. Journal of Applied Meteorology, 2015 (4): 500--512.Google ScholarGoogle Scholar
  4. Karun A K, Chitharanjan K. A review on hadoop-HDFS infrastructure extensions{C}//Information & Communication Technologies (ICT), 2013 IEEE Conference on. IEEE, 2013: 132--137.Google ScholarGoogle Scholar
  5. Taylor R C.An overview of the Hadoop MapReduce HBase framework and its current applications in bioinformatics{J}.Bmc Bioinformatics, 2010, 11(6):3395--3407.Google ScholarGoogle Scholar
  6. Bhupathiraju V, Ravuri R P. The dawn of big data-HBase{C}//IT in Business, Industry and Government (CSIBIG), 2014 Conference on.IEEE, 2014:1--4.Google ScholarGoogle Scholar
  7. Vashishtha H, Stroulia E. Enhancing query support in hbase via an extended coprocessors framework{C}//European Conference on a Service-Based Internet. Springer, Berlin, Heidelberg, 2011: 75--87. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Liu Xiaoli, Xu Pandeng, Zhu Guobin, et al. Parallel and distributed remote sensing images combined with MapReduce and HBase {J}. Geographic and Geographic Information Science, 2014, 30 (5): 26--28.Google ScholarGoogle Scholar
  9. Feng C, Li B. Research of Temporal Information Index Strategy Based on HBase{J}. Procedia Computer Science, 2017, 107:367--372. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Ma T, Xu X, Tang M, et al. MHBase: A Distributed Real-Time Query Scheme for Meteorological Data Based on HBase{J}. Future Internet, 2016, 8(1):6.Google ScholarGoogle ScholarCross RefCross Ref
  11. Ge Wei, Luo Shengmei, Zhou Wenhui, Zhao Di, Tang Yun, Zhou Juan, Qu Wen Wu, Yuan Chunfeng, Huang Yihua.HiBase: a hierarchical indexing based efficient HBase query technology and system {J}. Computer Journal, 2016, 39 (01): 140--153.Google ScholarGoogle Scholar
  12. Cui Chen, Zheng Linjiang, Han Fengping, et al. Design of HBase based two level index based on memory {J}. Computer Application, 2018: 1--8.Google ScholarGoogle Scholar
  13. Ge W, Huang Y, Zhao D, et al. Cinhba: A secondary index with hotscore caching policy on key-value data store{C}//International Conference on Advanced Data Mining and Applications. Springer, Cham, 2014: 602--615.Google ScholarGoogle Scholar

Index Terms

  1. Research on Distributed Storage and Query Optimization of Multi-source Heterogeneous Meteorological Data

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      CCIOT '18: Proceedings of the 2018 International Conference on Cloud Computing and Internet of Things
      October 2018
      91 pages
      ISBN:9781450365765
      DOI:10.1145/3291064

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 October 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader