High-throughput query scheduling with spatial clustering based on distributed exponential moving average

Nam, Beomseok; Hwang, Deukyeon; Kim, Jinwoong; Shin, Minho

doi:10.1007/s10619-012-7098-y

High-throughput query scheduling with spatial clustering based on distributed exponential moving average

Published: 08 August 2012

Volume 30, pages 401–414, (2012)
Cite this article

Distributed and Parallel Databases Aims and scope Submit manuscript

Beomseok Nam¹,
Deukyeon Hwang¹,
Jinwoong Kim¹ &
…
Minho Shin²

362 Accesses
2 Citations
Explore all metrics

Abstract

In distributed scientific query processing systems, leveraging distributed cached data is becoming more important. In such systems, a front-end query scheduler distributes queries among many application servers rather than processing queries in a few high-performance workstations. Although many query scheduling policies exist such as round-robin and load-monitoring, they are not sophisticated enough to exploit cached results as well as balance the workload. Efforts were made to improve the query processing performance using statistical methods such as exponential moving average. However, existing methods have limitations for certain query patterns: queries with hotspots, or dynamic query distributions. In this paper, we propose novel query scheduling policies that take into account both the contents of distributed caching infrastructure and the load balance among the servers. Our experiments show that the proposed query scheduling policies outperform existing policies by producing better query plans in terms of load balance and cache-hit ratio.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Modified Query Processing Algorithm Based on Dynamic Clustering for Big Data Applications

DEMB: Cache-Aware Scheduling for Distributed Query Processing

MobiPLACE*: A Distributed Framework for Spatio-Temporal Data Streams Processing Utilizing Mobile Clients’ Processing Power

References

Andrade, H., Kurc, T., Sussman, A., Saltz, J.: Multiple query optimization for data analysis applications on clusters of SMPs. In: Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid). IEEE Comput. Soc., Los Alamitos (2002)
Google Scholar
Aron, M., Sanders, D., Druschel, P., Zwaenepoel, W.: Scalable content-aware request distribution in cluster-based network servers. In: Proceedings of Usenix Annual Technical Conference (2000)
Google Scholar
Arya, S., Mount, D.M., Netanyahu, N.S., Silverman, R., Wu, A.Y.: An optimal algorithm for approximate nearest neighbor searching fixed dimensions. J. ACM 45(6), 891–923 (1998)
Article MathSciNet MATH Google Scholar
de Berg, M., Cheong, O., van Kreveld, M., Overmars, M.: Computational Geometry, Algorithms and Applications. Springer, Berlin (1998)
Google Scholar
Chen, F.C.F., Dunham, M.H.: Common subexpression processing in multiple-query processing. Transactions on Knowledge and Data Engineering 10(5), 493–499 (199)
Google Scholar
Chou, Y.l.: Statistical Analysis. Holt International (1975)
Kullback, S., Leibler, R.A.: On information and sufficiency. Ann. Math. Stat. 22(1), 79–86 (1951)
Article MathSciNet MATH Google Scholar
Menasce, D.A., Almeida, V.A.F.: Scaling for E-Business: Technologies, Models, Performance, and Capacity Planning. Prentice Hall, New York (2000)
Google Scholar
Nam, B., Shin, M., Andrade, H., Sussman, A.: Multiple query scheduling for distributed semantic caches. J. Parallel Distrib. Comput. 70(5), 598–611 (2010)
Article MATH Google Scholar
Pai, V., Aron, M., Banga, G., Svendsen, M., Druschel, P., Zwaenepoel, W., Nahum, E.: Locality-aware request distribution in cluster-based network servers. In: Proceedings of ACM ASPLOS (1998)
Google Scholar
Ren, Q., Dunham, M.H., Kumar, V.: Semantic caching and query processing. IEEE Trans. Knowl. Data Eng. 15(1), 192–210 (2003)
Article Google Scholar
Roy, P., Sehadri, S., Sudarshan, S., Bhobe, S.: Efficient and extensible algorithms for multi query optimization. In: Proceedings of 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 249–260 (2000)
Chapter Google Scholar
Sellis, T.K., Ghosh, S.: On the multiple-query optimization problem. IEEE Trans. Knowl. Data Eng. 2(2), 262–266 (1990)
Article Google Scholar
Xiong, X., Mokbel, M.F., Aref, W.G., Hambrusch, S.E., Prabhakar, S.: Scalable spatio-temporal continuous query processing for location-aware services. In: Proceedings of 16th International Conference on Scientific and Statistical Database Management (SSDBM) (2004)
Google Scholar
Zhang, K., Andrade, H., Raschid, L., Sussman, A.: Query planning for the Grid: adapting to dynamic resource availability. In: Proceedings of the 5th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid), Cardiff, UK (2005)
Google Scholar
Zhao, Y., Desshpande, P.M., Naughton, J.F., Shukla, A.: Simultaneous optimization and evaluation of multiple dimensional queries. In: Proceedings of 1998 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 271–282 (1998)
Chapter Google Scholar

Download references

Acknowledgements

This research was supported by PLSI resources, 1.100027.01 Research Fund of the UNIST (Ulsan National Institute of Science and Technology), and 2.110147.01 National Research Foundation of Korea. This work was also supported by 2011 Research Fund of Myongji University.

Author information

Authors and Affiliations

Electrical and Computer Engineering, Ulsan National Inst. of Science and Technology, Ulsan, 689-798, Korea
Beomseok Nam, Deukyeon Hwang & Jinwoong Kim
Dept. of Computer Engineering, Myongji University, Yongin, Gyonggido, Korea
Minho Shin

Authors

Beomseok Nam
View author publications
You can also search for this author in PubMed Google Scholar
Deukyeon Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Jinwoong Kim
View author publications
You can also search for this author in PubMed Google Scholar
Minho Shin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Minho Shin.

Additional information

Communicated by Judy Qiu and Dennis Gannon.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nam, B., Hwang, D., Kim, J. et al. High-throughput query scheduling with spatial clustering based on distributed exponential moving average. Distrib Parallel Databases 30, 401–414 (2012). https://doi.org/10.1007/s10619-012-7098-y

Download citation

Published: 08 August 2012
Issue Date: October 2012
DOI: https://doi.org/10.1007/s10619-012-7098-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

High-throughput query scheduling with spatial clustering based on distributed exponential moving average

Abstract

Access this article

Similar content being viewed by others

A Modified Query Processing Algorithm Based on Dynamic Clustering for Big Data Applications

DEMB: Cache-Aware Scheduling for Distributed Query Processing

MobiPLACE*: A Distributed Framework for Spatio-Temporal Data Streams Processing Utilizing Mobile Clients’ Processing Power

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

High-throughput query scheduling with spatial clustering based on distributed exponential moving average

Abstract

Access this article

Similar content being viewed by others

A Modified Query Processing Algorithm Based on Dynamic Clustering for Big Data Applications

DEMB: Cache-Aware Scheduling for Distributed Query Processing

MobiPLACE*: A Distributed Framework for Spatio-Temporal Data Streams Processing Utilizing Mobile Clients’ Processing Power

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation