Multi-armed Bandit-Based Metaheuristic Operator Selection: The Pendulum Algorithm Binarization Case

Ábrego-Calderón, Pablo; Crawford, Broderick; Soto, Ricardo; Rodriguez-Tello, Eduardo; Cisternas-Caneo, Felipe; Monfroy, Eric; Giachetti, Giovanni

doi:10.1007/978-3-031-34020-8_19

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1824))

Included in the following conference series:

International Conference on Optimization and Learning

539 Accesses

Abstract

Multi-armed bandit (MAB) is a well-known reinforcement learning algorithm that has shown outstanding performance for recommendation systems and other areas. On the other hand, metaheuristic algorithms have gained much popularity due to their great performance in solving complex problems with endless search spaces. Pendulum Search Algorithm (PSA) is a recently created metaheuristic inspired by the harmonic motion of a pendulum. Its main limitation is to solve combinatorial optimization problems, characterized by using variables in the discrete domain. To overcome this limitation, we propose to use a two-step binarization technique, which offers a large number of possible options that we call scheme. For this, we use MAB as an algorithm that learns and recommends a binarization schemes during the execution of the iterations (online). With the experiments carried out, we show that it delivers better results in solving the Set Covering problem than using a fixed binarization scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Data-Driven Dynamic Discretization Framework to Solve Combinatorial Problems Using Continuous Metaheuristics

Stochastic online decisioning hyper-heuristic for high dimensional optimization

Article 15 December 2023

Fixed Set Search Matheuristic Applied to the min-Knapsack Problem with Compactness Constraints and Penalty Values

References

Ab. Aziz, N.A., Ab. Aziz, K.: Pendulum search algorithm: an optimization algorithm based on simple harmonic motion and its application for a vaccine distribution problem. Algorithms 15(6) (2022)
Google Scholar
Rahman, T.A., Ibrahim, Z., Ab. Aziz, N.A., Zhao, S., Aziz, N.H.A.: Single-agent finite impulse response optimizer for numerical optimization problems. IEEE Access 6, 9358–9374 (2018)
Article Google Scholar
Alizadeh, R., Nishi, T.: Hybrid set covering and dynamic modular covering location problem: Application to an emergency humanitarian logistics problem. Appl. Sci. 10(20), 7110 (2020)
Article Google Scholar
Audibert, J.-Y., Munos, R., Szepesvári, C.: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits. Theoret. Comput. Sci. 410(19), 1876–1902 (2009)
Article MathSciNet MATH Google Scholar
Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47(2), 235–256 (2002)
Article MATH Google Scholar
Becerra-Rozas, M., et al.: Continuous metaheuristics for binary optimization problems: an updated systematic literature review. Mathematics 11(1), 129 (2022)
Article MathSciNet Google Scholar
Crawford, B., Soto, R., Astorga, G., García, J., Castro, C., Paredes, F.: Putting continuous metaheuristics to work in binary search spaces. Complexity 2017 (2017)
Google Scholar
Crawford, B., Soto, R., Monfroy, E., Astorga, G., García, J., Cortes, E.: A meta-optimization approach for covering problems in facility location. In: Figueroa-García, J.C., López-Santana, E.R., Villa-Ramírez, J.L., Ferro-Escobar, R. (eds.) WEA 2017. CCIS, vol. 742, pp. 565–578. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-66963-2_50
Chapter Google Scholar
DaCosta, L., Fialho, A., Schoenauer, M., Sebag, M.: Adaptive operator selection with dynamic multi-armed bandits. In: Proceedings of the 10th Annual Conference on Genetic and Evolutionary Computation, pp. 913–920 (2008)
Google Scholar
Elena, G., Milos, K., Eugene, I.: Survey of multiarmed bandit algorithms applied to recommendation systems. Int. J. Open Inf. Technol. 9(4), 12–27 (2021)
Google Scholar
Lanza-Gutierrez, J.M., Caballe, N.C., Crawford, B., Soto, R., Gomez-Pulido, J.A., Paredes, F.: Exploring further advantages in an alternative formulation for the set covering problem. Math. Probl. Eng. 2020 (2020)
Google Scholar
Lanza-Gutierrez, J.M., Crawford, B., Soto, R., Berrios, N., Gomez-Pulido, J.A., Paredes, F.: Analyzing the effects of binarization techniques when solving the set covering problem through swarm optimization. Expert Syst. Appl. 70, 67–82 (2017)
Article Google Scholar
Lemus-Romani, J., et al.: A novel learning-based binarization scheme selector for swarm algorithms solving combinatorial problems. Mathematics 9(22), 2887 (2021)
Article Google Scholar
Mandal, S., Patra, N., Pal, M.: Covering problem on fuzzy graphs and its application in disaster management system. Soft. Comput. 25(4), 2545–2557 (2021)
Article MATH Google Scholar
Patil, V., Ghalme, G., Nair, V., Narahari, Y.: Achieving fairness in the stochastic multi-armed bandit problem. In: AAAI, pp. 5379–5386 (2020)
Google Scholar
Rodriguez-Tello, E., Narvaez-Teran, V., Lardeux, F.: Dynamic multi-armed bandit algorithm for the cyclic bandwidth sum problem. IEEE Access 7, 40258–40270 (2019)
Article MATH Google Scholar
Song, H., Triguero, I., Özcan, E.: A review on the self and dual interactions between machine learning and optimisation. Progr. Artif. Intell. 8(2), 143–165 (2019). https://doi.org/10.1007/s13748-019-00185-z
Article Google Scholar
Soto, R., et al.: A reactive population approach on the dolphin echolocation algorithm for solving cell manufacturing systems. Mathematics 8(9), 1389 (2020)
Article Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn. The MIT Press, Cambridge (2018)
MATH Google Scholar
Talbi, E.-G.: Metaheuristics: From Design to Implementation. Wiley, Hoboken (2009)
Book MATH Google Scholar
Xiang, X., Qiu, J., Xiao, J., Zhang, X.: Demand coverage diversity based ant colony optimization for dynamic vehicle routing problems. Eng. Appl. Artif. Intell. 91, 103582 (2020)
Google Scholar
Fialho, Á., Da Costa, L., Schoenauer, M., Sebag, M.: Dynamic multi-armed bandits and extreme value-based rewards for adaptive operator selection in evolutionary algorithms. LION 3, 176–190 (2009)
Google Scholar

Download references

Acknowledgements

Broderick Crawford, Ricardo Soto, Eduardo Rodriguez-Tello and Felipe Cisternas-Caneo are supported by Dirección de Investigación, VINCI-PUCV; Project: DI Investigación Asociativa Interdisciplinaria 2022 "SELECCIÓN DE CARACTERÍSTICAS USANDO METAHEURÍSTICAS PARA POTENCIAR MODELOS PREDICTIVOS EN SALUD”.

Broderick Crawford and Ricardo Soto are supported by Grant ANID/ FONDECYT/REGULAR/1210810.

Felipe Cisternas-Caneo is supported by Beca INF-PUCV.

Author information

Authors and Affiliations

Pontificia Universidad Católica de Valparaíso, Valparaíso, Chile
Pablo Ábrego-Calderón, Broderick Crawford, Ricardo Soto & Felipe Cisternas-Caneo
Cinvestav, Unidad Tamaulipas, Km. 5.5 Carretera Victoria - Soto La Marina, 87130, Victoria, Tamaulipas, Mexico
Eduardo Rodriguez-Tello
Université d’ Angers, LERIA, Angers, France
Eric Monfroy
Universidad Andres Bello, Santiago, Chile
Giovanni Giachetti

Authors

Pablo Ábrego-Calderón
View author publications
You can also search for this author in PubMed Google Scholar
Broderick Crawford
View author publications
You can also search for this author in PubMed Google Scholar
Ricardo Soto
View author publications
You can also search for this author in PubMed Google Scholar
Eduardo Rodriguez-Tello
View author publications
You can also search for this author in PubMed Google Scholar
Felipe Cisternas-Caneo
View author publications
You can also search for this author in PubMed Google Scholar
Eric Monfroy
View author publications
You can also search for this author in PubMed Google Scholar
Giovanni Giachetti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pablo Ábrego-Calderón .

Editor information

Editors and Affiliations

University of Cadiz, Cadiz, Spain
Bernabé Dorronsoro
University of Malaga, Malaga, Spain
Francisco Chicano
University of Luxembourg, Esch-sur-Alzette, Luxembourg
Gregoire Danoy
University of Lille, Lille, France
El-Ghazali Talbi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ábrego-Calderón, P. et al. (2023). Multi-armed Bandit-Based Metaheuristic Operator Selection: The Pendulum Algorithm Binarization Case. In: Dorronsoro, B., Chicano, F., Danoy, G., Talbi, EG. (eds) Optimization and Learning. OLA 2023. Communications in Computer and Information Science, vol 1824. Springer, Cham. https://doi.org/10.1007/978-3-031-34020-8_19

Download citation

DOI: https://doi.org/10.1007/978-3-031-34020-8_19
Published: 27 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-34019-2
Online ISBN: 978-3-031-34020-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics