Abstract
Cloud computing is a suitable platform for running applications to process large volumes of data. Currently, with the growth of geographic and spatial data volume, conceptualized as Big Geospatial Data, some tools have been developed to allow the processing of this data efficiently. This work presents a cost-efficient method for processing geospatial data, optimizing the number of data nodes in a SpatialHadoop cluster according to dataset size. With this, it is possible to analyse and compare the costs for this type of application on public cloud providers.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Alarabi, L., Eldawy, A., Alghamdi, R., Mokbel, M.F.: TAREEG: a MapReduce-based web service for extracting spatial data from OpenStreetMap. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 897–900. ACM (2014)
Bachiega, J., Reis, M., Araujo, A., Holanda, M.: Cost optimization on public cloud provider for big geospatial data: a case study using Open Street Map. In: Proceedings of the 7th International Conference on Cloud Computing and Services Science, pp. 54–62 (2017)
Chaisiri, S., Lee, B.S., Niyato, D.: Optimization of resource provisioning cost in cloud computing. IEEE Trans. Serv. Comput. 5(2), 164–177 (2012)
Das, J., Dasgupta, A., Ghosh, S.K., Buyya, R.: A geospatial orchestration framework on cloud for processing user queries. In: 2016 IEEE International Conference on Cloud Computing in Emerging Markets (CCEM), pp. 1–8. IEEE (2016)
Eldawy, A., Li, Y., Mokbel, M.F., Janardan, R.: CG\__Hadoop: computational geometry in MapReduce. In: Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 294–303. ACM (2013)
Eldawy, A., Mokbel, M.F.: A demonstration of SpatialHadoop: an efficient mapreduce framework for spatial data. Proc. VLDB Endowment 6(12), 1230–1233 (2013)
Eldawy, A., Mokbel, M.F.: Pigeon: a spatial MapReduce language. In: 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 1242–1245. IEEE (2014)
Eldawy, A., Mokbel, M.F.: SpatialHadoop: a MapReduce framework for spatial data. In: 2015 IEEE 31st International Conference on Data Engineering (ICDE), pp. 1352–1363. IEEE (2015)
Eldawy, A., Mokbel, M.F., Jonathan, C.: HadoopViz: a MapReduce framework for extensible visualization of big spatial data. In: 2016 IEEE 32nd International Conference on Data Engineering (ICDE), pp. 601–612. IEEE (2016)
Ji, C., Li, Y., Qiu, W., Awada, U., Li, K.: Big data processing in cloud computing environments. In: Pervasive Systems, Algorithms and Networks (ISPAN), 2012 12th International Symposium on. pp. 17–23. IEEE (2012)
Kambatla, K., Pathak, A., Pucha, H.: Towards optimizing hadoop provisioning in the cloud. HotCloud 9, 12 (2009)
Krämer, M., Senner, I.: A modular software architecture for processing of big geospatial data in the cloud. Comput. Graph. 49, 69–81 (2015)
Li, A., Yang, X., Kandula, S., Zhang, M.: CloudCmp: comparing public cloud providers. In: Proceedings of the 10th ACM SIGCOMM Conference on Internet Measurement, pp. 1–14. ACM (2010)
Mell, P., Grance, T., et al.: The NIST definition of cloud computing (2011)
Mokbel, M.F., Alarabi, L., Bao, J., Eldawy, A., Magdy, A., Sarwat, M., Waytas, E., Yackel, S.: A demonstration of MNTG-a web-based road network traffic generator. In: 2014 IEEE 30th International Conference on Data Engineering (ICDE), pp. 1246–1249. IEEE (2014)
Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: a not-so-foreign language for data processing. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of data, pp. 1099–1110. ACM (2008)
Rosa, M., Moura, B., Vergara, G., Santos, L., Ribeiro, E., Holanda, M., Walter, M.E., Araújo, A.: BioNimbuZ: a federated cloud platform for bioinformatics applications. In: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 548–555. IEEE (2016)
Sagiroglu, S., Sinanc, D.: Big data: a review. In: 2013 International Conference on Collaboration Technologies and Systems (CTS), pp. 42–47. IEEE (2013)
Yang, C., Goodchild, M., Huang, Q., Nebert, D., Raskin, R., Xu, Y., Bambacus, M., Fay, D.: Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing? Int. J. Digital Earth 4(4), 305–329 (2011)
Yang, C., Yu, M., Hu, F., Jiang, Y., Li, Y.: Utilizing cloud computing to address big geospatial data challenges. Comput. Environ. Urban Syst. 61, 120–128 (2017)
Zhao, Y., Calheiros, R.N., Bailey, J., Sinnott, R.: SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environments. In: 2016 IEEE International Conference on Big Data (Big Data), pp. 432–441. IEEE (2016)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Bachiega, J., Reis, M.S., Araújo, A.P.F., Holanda, M. (2018). Cost Analysis for Big Geospatial Data Processing in Public Cloud Providers. In: Ferguson, D., Muñoz, V., Cardoso, J., Helfert, M., Pahl, C. (eds) Cloud Computing and Service Science. CLOSER 2017. Communications in Computer and Information Science, vol 864. Springer, Cham. https://doi.org/10.1007/978-3-319-94959-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-94959-8_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-94958-1
Online ISBN: 978-3-319-94959-8
eBook Packages: Computer ScienceComputer Science (R0)