Multi-Column Query Method Research and Optimization on HBase

Shen, Bo; Chao, Han-Chieh; Xu, Weijia

doi:10.1007/978-3-319-21009-4_32

Multi-Column Query Method Research and Optimization on HBase

Bo Shen⁹,
Han-Chieh Chao¹⁰ &
Weijia Xu⁹

Conference paper
First Online: 01 January 2015

2756 Accesses

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 224))

Abstract

This paper focuses on analyzing how to improve the HBase non RowKey’s query and designing an approach to improve its performance. Analyzing and summarizing the advantages and disadvantages of index technology applied in typical application scenarios, we design a secondary index approach for HBase based on the pre-partition. Through experiments, we found this method can effectively improve the query performance of HBase on non RowKey column, have little effect on the performance of the original data writing and reduce data redundancy. Compared with other methods, this approach has certain performance advantages.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bakshi, K.: Considerations for big data: Architecture and approach. In: Aerospace Conference, 2012 IEEE, pp. 1–7. IEEE (2012)
Google Scholar
Wang, J.: Index and query processing technology research in the cloud computing system. HIT (2013)
Google Scholar
Du, X.: Big data environment based on HBase distributed query optimization research. Computer CD Software And Application 8 (2014)
Google Scholar
Vora, M.N.: Hadoop-HBase for large-scale data. In: 2011 IEEE International Conference on Computer Science and Network Technology (ICCSNT), pp. 601–605 (2011)
Google Scholar
Liu, X.: The depth analysis of the HBase performance. Programmer 7, 102–104 (2011)
Google Scholar
Zhuo, H.: Real-time query system desigan and implementation based on HBase huge data, BUPT (2013)
Google Scholar
Vashishtha, H., Stroulia, E.: Enhancing query support in hbase via an extended coprocessors framework[M]. Springer, Berlin Heidelberg (2011)
Book Google Scholar
Yunzhi, G.: Application of Remote Procedure Call Protocol(RPC) in Heterogeneous Dual-Core Communication[J]. Fire Control Radar Technology (2013)
Google Scholar
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters[J]. Commun. ACM 51(1), 107–113 (2008)
Article Google Scholar

Download references

Acknowledgements

This work has been supported by the National Natural Science Foundation of China under Grant 61172072, 61271308, and Beijing Natural Science Foundation under Grant 4112045, and the Research Fund for the Doctor-al Program of Higher Education of China under Grant W11C100030, the Beijing Science and Technology Pro-gram under Grant Z121100000312024.

Author information

Authors and Affiliations

School of Electronic and Information Engineering, Key Laboratory of Communication and Information Systems, Beijing Municipal Commission of Education, Beijing Jiaotong University, Beijing, China
Bo Shen & Weijia Xu
Department of Electronic Engineering, National Ilan University, Yilan, Taiwan
Han-Chieh Chao

Authors

Bo Shen
View author publications
You can also search for this author in PubMed Google Scholar
Han-Chieh Chao
View author publications
You can also search for this author in PubMed Google Scholar
Weijia Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bo Shen .

Editor information

Editors and Affiliations

Staffordshire University, Stoke-on-Trent, United Kingdom
Lorna Uden
University of Maribor, Maribor, Slovenia
Marjan Heričko
National University of Kaohsiung, Kaohsiung City, Taiwan
I-Hsien Ting

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shen, B., Chao, HC., Xu, W. (2015). Multi-Column Query Method Research and Optimization on HBase. In: Uden, L., Heričko, M., Ting, IH. (eds) Knowledge Management in Organizations. KMO 2015. Lecture Notes in Business Information Processing, vol 224. Springer, Cham. https://doi.org/10.1007/978-3-319-21009-4_32

Download citation

DOI: https://doi.org/10.1007/978-3-319-21009-4_32
Published: 04 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21008-7
Online ISBN: 978-3-319-21009-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics