Skip to main content

Reinforcement Learning-Based Adaptive Operator Selection

  • Conference paper
  • First Online:
Optimization and Learning (OLA 2021)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1443))

Included in the following conference series:

  • 710 Accesses

Abstract

Metaheuristic and swarm intelligence approaches require devising optimisation algorithms with operators to let produce neighbouring solutions to conduct a move. The efficiency of algorithms using single operator remains recessive in comparison with those with multiple operators. However, use of multiple operators require a selection mechanism, which may not be always as productive as expected; therefore an adaptive selection scheme is always needed. In this study, an experience-based, reinforcement learning algorithm has been used to build an adaptive selection scheme implemented to work with a binary artificial bee colony algorithm in which the selection mechanism learns when and subject to which circumstances an operator can help produce better and worse neighbours. The implementations have been tested with commonly used benchmarks of uncapacitated facility location problem. The results demonstrates that the selection scheme developed based on reinforcement learning, which can also be named as smart selection scheme, performs much better that state-of-art adaptive selection schemes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aslan, M., Gunduz, M., Kiran, M.S.: Jayax: jaya algorithm with xor operator for binary optimization. Appl. Soft Comput. 82, 105576 (2019)

    Article  Google Scholar 

  2. Aydin, M.E., Öztemel, E.: Dynamic job-shop scheduling using reinforcement learning agents. Robot. Auton. Syst. 33(2–3), 169–178 (2000)

    Article  Google Scholar 

  3. Coronato, A., Naeem, M., De Pietro, G., Paragliola, G.: Reinforcement learning for intelligent healthcare applications: a survey. Artif. Intell. Med. 109, 101964 (2020)

    Article  Google Scholar 

  4. DaCosta, L., Fialho, A., Schoenauer, M., Sebag, M.: Adaptive operator selection with dynamic multi-armed bandits. In: Proceedings of the 10th annual conference on Genetic and evolutionary computation, pp. 913–920 (2008)

    Google Scholar 

  5. Dokeroglu, T., Sevinc, E., Kucukyilmaz, T., Cosar, A.: A survey on new generation metaheuristic algorithms. Comput. Ind. Eng. 137, 106040 (2019)

    Article  Google Scholar 

  6. Durgut, R.: Improved binary artificial bee colony algorithm. Frontiers of Information Technology & Electronic Engineering (in press) (2020)

    Google Scholar 

  7. Durgut, R., Aydin, M.E.: Adaptive binary artificial bee colony algorithm. Appl. Soft Comput. 101, 107054 (2021)

    Article  Google Scholar 

  8. Fialho, Á.: Adaptive operator selection for optimization. Ph.D. thesis, Université Paris Sud-Paris XI (2010)

    Google Scholar 

  9. Fialho, Á., Da Costa, L., Schoenauer, M., Sebag, M.: Extreme value based adaptive operator selection. In: Rudolph, G., Jansen, T., Beume, N., Lucas, S., Poloni, C. (eds.) PPSN 2008. LNCS, vol. 5199, pp. 175–184. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87700-4_18

    Chapter  Google Scholar 

  10. Fialho, Á., Da Costa, L., Schoenauer, M., Sebag, M.: Analyzing bandit-based adaptive operator selection mechanisms. Ann. Math. Artif. Intell. 60(1–2), 25–64 (2010)

    Article  MathSciNet  Google Scholar 

  11. Hussain, A., Muhammad, Y.S.: Trade-off between exploration and exploitation with genetic algorithm using a novel selection operator. Complex Intell. Syst. 6, 1–14 (2019)

    Article  Google Scholar 

  12. Hussain, K., Salleh, M.N.M., Cheng, S., Shi, Y.: On the exploration and exploitation in popular swarm-based metaheuristic algorithms. Neural Comput. Appl. 31(11), 7665–7683 (2019)

    Article  Google Scholar 

  13. Karaboga, D., Basturk, B.: On the performance of artificial bee colony (ABC) algorithm. Appl. Soft Comput. 8(1), 687–697 (2008)

    Article  Google Scholar 

  14. Kashan, M.H., Nahavandi, N., Kashan, A.H.: DisABC: a new artificial bee colony algorithm for binary optimization. Appl. Soft Comput. 12(1), 342–352 (2012)

    Article  Google Scholar 

  15. Kiran, M.S., Gündüz, M.: Xor-based artificial bee colony algorithm for binary optimization. Turkish J. Electr. Eng. Comput. Sci. 21(Sup. 2), 2307–2328 (2013)

    Article  Google Scholar 

  16. Moerland, T.M., Broekens, J., Jonker, C.M.: Model-based reinforcement learning: A survey. arXiv preprint arXiv:2006.16712 (2020)

  17. Ozturk, C., Hancer, E., Karaboga, D.: Dynamic clustering with improved binary artificial bee colony algorithm. Appl. Soft Comput. 28, 69–80 (2015)

    Article  Google Scholar 

  18. Xue, Y., Xue, B., Zhang, M.: Self-adaptive particle swarm optimization for large-scale feature selection in classification. ACM Trans. Knowl. Discov. Data (TKDD) 13(5), 1–27 (2019)

    Article  Google Scholar 

  19. Yang, T., Zhao, L., Li, W., Zomaya, A.Y.: Reinforcement learning in sustainable energy and electric systems: a survey. Ann. Rev. Control 49, 145–163 (2020)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rafet Durgut .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Durgut, R., Aydin, M.E. (2021). Reinforcement Learning-Based Adaptive Operator Selection. In: Dorronsoro, B., Amodeo, L., Pavone, M., Ruiz, P. (eds) Optimization and Learning. OLA 2021. Communications in Computer and Information Science, vol 1443. Springer, Cham. https://doi.org/10.1007/978-3-030-85672-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-85672-4_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-85671-7

  • Online ISBN: 978-3-030-85672-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics