Abstract
Infrastructure-as-a-Service (IaaS) cloud platforms offer resources with diverse buying options. Users can run an instance on the on-demand market which is stable but expensive or on the spot market with a significant discount. However, users have to carefully weigh the low cost of spot instances against their poor availability. Spot instances will be revoked when the revocation event occurs. Thus, an important problem that an IaaS user faces now is how to use spot instances in a cost-effective and low-risk way. Based on the replication-based fault tolerance mechanism, we propose an online termination algorithm that optimizes the cost of using spot instances while ensuring operational stability. We prove that in most cases, the cost of our proposed online algorithm will not exceed twice the minimum cost of the optimal offline algorithm that knows the exact future a priori. Through a large number of experiments, we verify that our algorithm in most cases has a competitive ratio of no more than 2, and in other cases it can also reach the guaranteed competitive ratio.
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Neto J P A, Pianto D M, Ralha C G. MULTS: A multicloud fault-tolerant architecture to manage transient servers in cloud computing. Journal of Systems Architecture, 2019, 101: 101651. https://doi.org/10.1016/j.sysarc.2019.101651.
Calatrava A, Romero E, Moltó G, Caballer M, Alonso J M. Self-managed cost-efficient virtual elastic clusters on hybrid Cloud infrastructures. Future Generation Computer Systems, 2016, 61: 13–25. https://doi.org/10.1016/j.future.2016.01.018.
Yi S, Kondo D, Andrzejak A. Reducing costs of spot instances via checkpointing in the Amazon Elastic Compute Cloud. In Proc. the 3rd IEEE International Conference on Cloud Computing, Jul. 2010, pp.236–243. https://doi.org/10.1109/CLOUD.2010.35.
Jangjaimon I, Tzeng N F. Effective cost reduction for elastic clouds under spot instance pricing through adaptive checkpointing. IEEE Trans. Computers, 2015, 64(2): 396–409. https://doi.org/10.1109/TC.2013.225.
Poola D, Ramamohanarao K, Buyya R. Fault-tolerant workflow scheduling using spot instances on clouds. Procedia Computer Science, 2014, 29: 523–533. https://doi.org/10.1016/j.procs.2014.05.047.
Sampaio A M, Barbosa J G. Constructing reliable computing environments on top of Amazon EC2 spot instances. Algorithms, 2020, 13(8): 187. https://doi.org/10.3390/a13080187.
Sampaio A M, Barbosa J G. Enhancing reliability of compute environments on Amazon EC2 spot instances. In Proc. the 2019 International Conference on High Performance Computing & Simulation, Jul. 2019, pp.708–715. https://doi.org/10.1109/HPCS48598.2019.9188116.
Subramanya S, Guo T, Sharma P, Irwin D, Shenoy P. SpotOn: A batch computing service for the spot market. In Proc. the 6th ACM Symposium on Cloud Computing, Aug. 2015, pp.329–341. https://doi.org/10.1145/2806777.2806851.
Domanal S G, Reddy G R M. An efficient cost optimized scheduling for spot instances in heterogeneous cloud environment. Future Generation Computer Systems, 2018, 84: 11–21. https://doi.org/10.1016/j.future.2018.02.003.
Fabra J, Ezpeleta J, Álvarez P. Reducing the price of resource provisioning using EC2 spot instances with prediction models. Future Generation Computer Systems, 2019, 96: 348–367. https://doi.org/10.1016/j.future.2019.01.025.
Ben-Yehuda O A, Ben-Yehuda M, Schuster A, Tsafrir D. Deconstructing Amazon EC2 spot instance pricing. In Proc. the 3rd IEEE International Conference on Cloud Computing Technology and Science, Nov. 29–Dec. 1, 2011, pp.304–311. https://doi.org/10.1109/CloudCom.2011.48.
Wallace R M, Turchenko V, Sheikhalishahi M, Turchenko I, Shults V, Vazquez-Poletti J L, Grandinetti L. Applications of neural-based spot market prediction for cloud computing. In Proc. the 7th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems, Sept. 2013, pp.710–716. https://doi.org/10.1109/IDAACS.2013.6663017.
Gong Y F, He B S, Zhou A C. Monetary cost optimizations for MPI-based HPC applications on Amazon clouds: Checkpoints and replicated execution. In Proc. the 2015 International Conference for High Performance Computing, Networking, Storage and Analysis, Nov. 2015, Article No. 32. https://doi.org/10.1145/2807591.2807612.
Song Y, Zafer M, Lee K W. Optimal bidding in spot instance market. In Proc. the 2012 IEEE INFOCOM, Mar. 2012, pp.190–198. https://doi.org/10.1109/INFCOM.2012.6195567.
Dubois D J, Casale G. OptiSpot: Minimizing application deployment cost using spot cloud resources. Cluster Computing, 2016, 19(2): 893–909. https://doi.org/10.1007/s10586-016-0568-7.
Fleischer R. On the Bahncard problem. Theoretical Computer Science, 2001, 268(1): 161–174. https://doi.org/10.1016/S0304-3975(00)00266-8.
Yang S S, Pan L, Wang Q Y, Liu S J. To sell or not to sell: Trading your reserved instances in Amazon EC2 marketplace. In Proc. the 38th IEEE International Conference on Distributed Computing Systems, Jul. 2018, pp.939–948. https://doi.org/10.1109/ICDCS.2018.00095.
Zhang S, Yuan D, Pan L, Liu S J, Cui L Z, Meng X X. Selling reserved instances through pay-as-you-go model in cloud computing. In Proc. the 2017 IEEE International Conference on Web Services, Jun. 2017, pp.130–137. https://doi.org/10.1109/ICWS.2017.25.
Wang W, Liang B, Li B C. Optimal online multi-instance acquisition in IaaS clouds. IEEE Transactions on Parallel and Distributed Systems, 2015, 26(12): 3407–3419. https://doi.org/10.1109/TPDS.2014.2385697.
Wang C, Liang Q L, Urgaonkar B. An empirical analysis of Amazon EC2 spot instance features affecting cost-effective resource procurement. In Proc. the 8th ACM/SPEC on International Conference on Performance Engineering, Apr. 2017, pp.63–74. https://doi.org/10.1145/3030207.3030210.
Nicolae B, Cappello F. BlobCR: Efficient checkpoint-restart for HPC applications on IaaS clouds using virtual disk image snapshots. In Proc. the 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, Nov. 2011, Article No. 34. https://doi.org/10.1145/2063384.2063429.
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Computation, 1997, 9(8): 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.
Borodin A, El-Yaniv R. Online Computation and Competitive Analysis. Cambridge University Press, 2005.
Author information
Authors and Affiliations
Corresponding author
Supplementary Information
ESM 1
(PDF 414 kb)
Rights and permissions
About this article
Cite this article
Xu, ZW., Pan, L. & Liu, SJ. An Online Algorithm Based on Replication for Using Spot Instances in IaaS Clouds. J. Comput. Sci. Technol. 39, 103–115 (2024). https://doi.org/10.1007/s11390-023-1535-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-023-1535-4