Abstract
Although optimal resource allocation is a well-known and studied problem, the recent technological innovations are bringing to light new specificities and issues. Some relevant real-life applications are the optimal management of cloud/high-performance computing resources, and the optimal budget allocation for multi-channel marketing. Recent formulations have led to the definition of the Semi-Bandit Feedback approach, that is the reference method in these emerging real-life settings. In this paper we propose a novel approach, extending the Bayesian Optimization framework to specifically deal with the resource allocation problem, and finally resulting more efficient than Semi-Bandit Feedback. Moreover, the proposed approach can also deal with specific (real-life) settings that cannot be covered by Semi-Bandit Feedback. We have validated our approach on (i) the case study reported in the original paper proposing Semi-Bandit Feedback, (ii) a multi-channel marketing application, and (iii) the optimal mix of water sources in water distribution networks.
This research was supported by the following grant: ENERGIDRICA – Efficienza energetica nelle reti idriche (CUP B42F20000390006) Programma PON “Ricerca e Innovazione” 2014- 2020 – Azione II – OS 1.b.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Archetti, F., Candelieri, A.: Bayesian optimization and data science. Springer (2019). https://doi.org/10.1007/978-3-030-24494-1
Bakker, H., Dunke, F., Nickel, S.: A structuring review on multi-stage optimization under uncertainty: aligning concepts from theory and practice. Omega 96, 102080 (2020)
Balcan, M.F., Dick, T., Pegden, W.: Semi-bandit optimization in the dispersed setting. In: Conference on Uncertainty in Artificial Intelligence, pp. 909–918. PMLR (2020)
Barrier, A., Garivier, A., Stoltz, G.: On best-arm identification with a fixed budget in non-parametric multi-armed bandits. In: International Conference on Algorithmic Learning Theory. pp. 136–181. PMLR (2023)
Berk, J., Gupta, S., Rana, S., Venkatesh, S.: Randomised gaussian process upper confidence bound for Bayesian optimisation. In: Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence, pp. 2284–2290 (2021)
Brandt, J., Haddenhorst, B., Bengs, V., Hüllermeier, E.: Finding optimal arms in non-stochastic combinatorial bandits with semi-bandit feedback and finite budget. arXiv preprint arXiv:2202.04487 (2022)
Candelieri, A.: A gentle introduction to Bayesian optimization. In: 2021 Winter Simulation Conference (WSC), pp. 1–16. IEEE (2021)
Candelieri, A.: Sequential model based optimization of partially defined functions under unknown constraints. J. Global Optim. 79(2), 281–303 (2021)
Candelieri, A., Ponti, A., Giordani, I., Archetti, F.: On the use of Wasserstein distance in the distributional analysis of human decision making under uncertainty. Ann. Math. Artif. Intell., 1–22 (2022)
Carpentier, P., Chancelier, J.P., Cohen, G., De Lara, M.: Stochastic multi-stage optimization. Probability Theory and Stochastic Modelling 75 (2015)
Chen, W., Wang, L., Zhao, H., Zheng, K.: Combinatorial semi-bandit in the non-stationary environment. In: Uncertainty in Artificial Intelligence, pp. 865–875. PMLR (2021)
Dagan, Y., Koby, C.: A better resource allocation algorithm with semi-bandit feedback. In: Algorithmic Learning Theory, pp. 268–320. PMLR (2018)
Frazier, P.I.: Bayesian optimization. In: Recent Advances in Optimization and modeling of contemporary problems, pp. 255–278. Informs (2018)
Gelbart, M.A., Snoek, J., Adams, R.P.: Bayesian optimization with unknown constraints. In: 30th Conference on Uncertainty in Artificial Intelligence, UAI 2014, pp. 250–259. AUAI Press (2014)
Gelbart, M.A.: Constrained Bayesian optimization and applications. Ph.D. thesis (2015)
Gramacy, R.B.: Surrogates: Gaussian process modeling, design, and optimization for the applied sciences. Chapman and Hall/CRC (2020)
Gunjan, A., Bhattacharyya, S.: A brief review of portfolio optimization techniques. Artif. Intell. Rev., 1–40 (2022)
Jourdan, M., Mutnỳ, M., Kirschner, J., Krause, A.: Efficient pure exploration for combinatorial bandits with semi-bandit feedback. In: Algorithmic Learning Theory, pp. 805–849. PMLR (2021)
Lattimore, T., Crammer, K., Szepesvári, C.: Optimal resource allocation with semi-bandit feedback. arXiv preprint arXiv:1406.3840 (2014)
Lattimore, T., Crammer, K., Szepesvári, C.: Linear multi-resource allocation with semi-bandit feedback. In: Advances in Neural Information Processing Systems, vol. 28 (2015)
Lattimore, T., Szepesvári, C.: Bandit algorithms. Cambridge University Press (2020)
Letham, B., Karrer, B., Ottoni, G., Bakshy, E.: Constrained Bayesian optimization with noisy experiments. Bayesian Anal. 14(2), 495–519 (2019)
Liu, B., Rao, Y., Lu, J., Zhou, J., Hsieh, C.J.: Multi-proxy Wasserstein classifier for image classification. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, pp. 8618–8626 (2021)
Neu, G., Bartók, G.: An efficient algorithm for learning with semi-bandit feedback. In: International Conference on Algorithmic Learning Theory, pp. 234–248. Springer (2013). https://doi.org/10.1007/978-3-642-40935-6_17
Patriksson, M.: A survey on the continuous nonlinear resource allocation problem. Eur. J. Oper. Res. 185(1), 1–46 (2008)
Peyré, G., Cuturi, M., et al.: Computational optimal transport: with applications to data science. Found. Trends® Mach. Learn. 11(5-6), 355–607 (2019)
Ponti, A., Candelieri, A., Archetti, F.: A new evolutionary approach to optimal sensor placement in water distribution networks. Water 13(12), 1625 (2021)
Ponti, A., Candelieri, A., Archetti, F.: A Wasserstein distance based multiobjective evolutionary algorithm for the risk aware optimization of sensor placement. Intell. Syst. Appl. 10, 200047 (2021)
Slivkins, A., et al.: Introduction to multi-armed bandits. Found. Trends® Mach. Learn. 12(1-2), 1–286 (2019)
Sonkar, S., Kharat, M.: A review on resource allocation and VM scheduling techniques and a model for efficient resource management in cloud computing environment. In: 2016 International Conference on ICT in Business Industry & Government (ICTBIG), pp. 1–7. IEEE (2016)
Srinivas, N., Krause, A., Kakade, S.M., Seeger, M.W.: Information-theoretic regret bounds for gaussian process optimization in the bandit setting. IEEE Trans. Inf. Theory 58(5), 3250–3265 (2012)
Thananjeyan, B., Kandasamy, K., Stoica, I., Jordan, M., Goldberg, K., Gonzalez, J.: Resource allocation in multi-armed bandit exploration: Overcoming sublinear scaling with adaptive parallelism. In: International Conference on Machine Learning, pp. 10236–10246. PMLR (2021)
Verma, A., Hanawal, M., Rajkumar, A., Sankaran, R.: Censored Semi-Bandits: a framework for resource allocation with censored feedback. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
Villani, C.: The Wasserstein distances. In: Optimal Transport, pp. 93–111. Springer (2009). https://doi.org/10.1007/978-3-642-40935-6_17
Vinothina, V.V., Sridaran, R., Ganapathi, P.: A survey on resource allocation strategies in cloud computing. Int. J. Adv. Comput. Sci. Appl. 3(6) (2012)
Wang, C., Long, S., Zeng, R., Lu, Y.: Imputation method for fetal heart rate signal evaluation based on optimal transport theory. SN Comput. Sci. 2(6), 1–12 (2021)
Wang, S., Chen, W.: Thompson sampling for combinatorial semi-bandits. In: International Conference on Machine Learning, pp. 5114–5122. PMLR (2018)
Wen, Z., Kveton, B., Ashkan, A.: Efficient learning in large-scale combinatorial semi-bandits. In: International Conference on Machine Learning, pp. 1113–1122. PMLR (2015)
Williams, C.K., Rasmussen, C.E.: Gaussian Processes for Machine Learning, vol. 2. MIT Press, Cambridge, MA (2006)
Xidonas, P., Steuer, R., Hassapis, C.: Robust portfolio optimization: a categorized bibliographic review. Ann. Oper. Res. 292(1), 533–552 (2020)
Yousafzai, A., et al.: Cloud resource allocation schemes: review, taxonomy, and opportunities. Knowl. Inf. Syst. 50(2), 347–381 (2017)
Zhang, D., et al.: Domain-oriented language modeling with adaptive hybrid masking and optimal transport alignment. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pp. 2145–2153 (2021)
Zhang, J., Liu, T., Tao, D.: An optimal transport analysis on generalization in deep learning. IEEE Trans. Neural Netw. Learn. Syst. (2021)
Zhang, S., Huang, K., Yuan, Y.: Spare parts inventory management: a literature review. Sustainability 13(5), 2460 (2021)
Ziukov, S.: A literature review on models of inventory management under uncertainty. Bus. Syst. Econ. 5(1), 26–35 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Candelieri, A. (2025). Resource Allocation via Bayesian Optimization: an Efficient Alternative to Semi-Bandit Feedback. In: Sergeyev, Y.D., Kvasov, D.E., Astorino, A. (eds) Numerical Computations: Theory and Algorithms. NUMTA 2023. Lecture Notes in Computer Science, vol 14476. Springer, Cham. https://doi.org/10.1007/978-3-031-81241-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-031-81241-5_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-81240-8
Online ISBN: 978-3-031-81241-5
eBook Packages: Computer ScienceComputer Science (R0)