Skip to main content

Multi-Column Query Method Research and Optimization on HBase

  • Conference paper
  • First Online:
  • 2756 Accesses

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 224))

Abstract

This paper focuses on analyzing how to improve the HBase non RowKey’s query and designing an approach to improve its performance. Analyzing and summarizing the advantages and disadvantages of index technology applied in typical application scenarios, we design a secondary index approach for HBase based on the pre-partition. Through experiments, we found this method can effectively improve the query performance of HBase on non RowKey column, have little effect on the performance of the original data writing and reduce data redundancy. Compared with other methods, this approach has certain performance advantages.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bakshi, K.: Considerations for big data: Architecture and approach. In: Aerospace Conference, 2012 IEEE, pp. 1–7. IEEE (2012)

    Google Scholar 

  2. Wang, J.: Index and query processing technology research in the cloud computing system. HIT (2013)

    Google Scholar 

  3. Du, X.: Big data environment based on HBase distributed query optimization research. Computer CD Software And Application 8 (2014)

    Google Scholar 

  4. Vora, M.N.: Hadoop-HBase for large-scale data. In: 2011 IEEE International Conference on Computer Science and Network Technology (ICCSNT), pp. 601–605 (2011)

    Google Scholar 

  5. Liu, X.: The depth analysis of the HBase performance. Programmer 7, 102–104 (2011)

    Google Scholar 

  6. Zhuo, H.: Real-time query system desigan and implementation based on HBase huge data, BUPT (2013)

    Google Scholar 

  7. Vashishtha, H., Stroulia, E.: Enhancing query support in hbase via an extended coprocessors framework[M]. Springer, Berlin Heidelberg (2011)

    Book  Google Scholar 

  8. Yunzhi, G.: Application of Remote Procedure Call Protocol(RPC) in Heterogeneous Dual-Core Communication[J]. Fire Control Radar Technology (2013)

    Google Scholar 

  9. Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters[J]. Commun. ACM 51(1), 107–113 (2008)

    Article  Google Scholar 

Download references

Acknowledgements

This work has been supported by the National Natural Science Foundation of China under Grant 61172072, 61271308, and Beijing Natural Science Foundation under Grant 4112045, and the Research Fund for the Doctor-al Program of Higher Education of China under Grant W11C100030, the Beijing Science and Technology Pro-gram under Grant Z121100000312024.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bo Shen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Shen, B., Chao, HC., Xu, W. (2015). Multi-Column Query Method Research and Optimization on HBase. In: Uden, L., Heričko, M., Ting, IH. (eds) Knowledge Management in Organizations. KMO 2015. Lecture Notes in Business Information Processing, vol 224. Springer, Cham. https://doi.org/10.1007/978-3-319-21009-4_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-21009-4_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-21008-7

  • Online ISBN: 978-3-319-21009-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics