Attention, Filling in the Gaps for Generalization in Routing Problems

Bdeir, Ahmad; Falkner, Jonas K.; Schmidt-Thieme, Lars

doi:10.1007/978-3-031-26422-1_31

Ahmad Bdeir¹³,
Jonas K. Falkner¹³ &
Lars Schmidt-Thieme¹³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13718))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

1361 Accesses

Abstract

Machine Learning (ML) methods have become a useful tool for tackling vehicle routing problems, either in combination with popular heuristics or as standalone models. However, current methods suffer from poor generalization when tackling problems of different sizes or different distributions. As a result, ML in vehicle routing has witnessed an expansion phase with new methodologies being created for particular problem instances that become infeasible at larger problem sizes.

This paper aims at encouraging the consolidation of the field through understanding and improving current existing models, namely the attention model by Kool et al. We identify two discrepancy categories for VRP generalization. The first is based on the differences that are inherent to the problems themselves, and the second relates to architectural weaknesses that limit the model’s ability to generalize. Our contribution becomes threefold: We first target model discrepancies by adapting the Kool et al. method and its loss function for Sparse Dynamic Attention based on the alpha-entmax activation. We then target inherent differences through the use of a mixed instance training method that has been shown to outperform single instance training in certain scenarios. Finally, we introduce a framework for inference level data augmentation that improves performance by leveraging the model’s lack of invariance to rotation and dilation changes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Comparison of Attention Mechanisms in Machine Learning Models for Vehicle Routing Problems

Comparison of Machine Learning Algorithms for Vehicle Routing Problems

Vehicle Routing Problem Using Reinforcement Learning: Recent Advancements

References

Bai, R., et al.: Analytics and machine learning in vehicle routing research. arXiv:2102.10012 [cs, math] (2021)
Bdeir, A., et al.: RP-DQN: an application of Q-learning to vehicle routing problems. In: Proceedings of the KI: Advances in Artificial Intelligence, pp. 3–16 (2021)
Google Scholar
Correia, G.M., et al.: Adaptively sparse transformers. arXiv:1909.00015 [cs, stat] (2019)
Dantzig, G.B., Ramser, J.H.: The truck dispatching problem. Manag. Sci. 6(1), 80–91 (1959). https://doi.org/10.1287/mnsc.6.1.80
Article MathSciNet Google Scholar
Falkner, J.K., Lars, S.-T.: Learning to solve vehicle routing problems with time windows through joint attention. arXiv:2006.09100 [cs] (2020)
Kool, W., et al.: Attention, learn to solve routing problems! 25 (2019)
Google Scholar
Kwon, Y.-D., et al.: POMO: policy optimization with multiple optima for reinforcement learning. arXiv:2010.16011 [cs] (2021)
Nazari, M., et al.: Reinforcement learning for solving the vehicle routing problem. arXiv:1802.04240 [cs, stat] (2018)
Peng, B., et al.: A deep reinforcement learning algorithm using dynamic attention model for vehicle routing problems. arXiv:2002.03282 [cs, stat] (2020)
Peters, B., et al.: Sparse sequence-to-sequence models. arXiv:1905.05702 [cs] (2019)
Toth, P., Vigo, D.: Vehicle Routing: Problems, Methods, and Applications. Society for Industrial and Applied Mathematics. SIAM, Philadelphia (2015)
Google Scholar
Vaswani, A., et al.: Attention is all you need. arXiv:1706.03762 [cs] (2017)
Williams, R., Peng, J.: Function optimization using connectionist reinforcement learning algorithms. Connect. Sci. 3, 241 (1991). https://doi.org/10.1080/09540099108946587
Article Google Scholar
Wu, Y., et al.: Learning improvement heuristics for solving routing problems. IEEE Trans. Neural Netw. Learn. Syst. 1–13 (2021). https://doi.org/10.1109/TNNLS.2021.3068828
Xin, L., et al.: Multi-decoder attention model with embedding glimpse for solving vehicle routing problems. In: AAAI 2021, pp. 12042–12049 (2021)
Google Scholar

Download references

Acknowledgements

This work was supported by the German Federal Ministry of Education and Research (BMBF), project “Learning to Optimize” (01IS20013A:L2O).

Author information

Authors and Affiliations

Hildesheim Universität, Hildesheim, Germany
Ahmad Bdeir, Jonas K. Falkner & Lars Schmidt-Thieme

Authors

Ahmad Bdeir
View author publications
You can also search for this author in PubMed Google Scholar
Jonas K. Falkner
View author publications
You can also search for this author in PubMed Google Scholar
Lars Schmidt-Thieme
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ahmad Bdeir .

Editor information

Editors and Affiliations

Grenoble Alpes University, Saint Martin d'Hères, France
Massih-Reza Amini
INSA Rouen Normandy, Saint Etienne du Rouvray, France
Stéphane Canu
Ruhr-Universität Bochum, Bochum, Germany
Asja Fischer
KU Leuven, Leuven, Belgium
Tias Guns
Central European University, Vienna, Austria
Petra Kralj Novak
Aristotle University of Thessaloniki, Thessaloniki, Greece
Grigorios Tsoumakas

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bdeir, A., Falkner, J.K., Schmidt-Thieme, L. (2023). Attention, Filling in the Gaps for Generalization in Routing Problems. In: Amini, MR., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13718. Springer, Cham. https://doi.org/10.1007/978-3-031-26422-1_31

Download citation

DOI: https://doi.org/10.1007/978-3-031-26422-1_31
Published: 18 March 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26421-4
Online ISBN: 978-3-031-26422-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Attention, Filling in the Gaps for Generalization in Routing Problems