Skip to main content

Attention, Filling in the Gaps for Generalization in Routing Problems

  • Conference paper
  • First Online:
Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13718))

  • 1361 Accesses

Abstract

Machine Learning (ML) methods have become a useful tool for tackling vehicle routing problems, either in combination with popular heuristics or as standalone models. However, current methods suffer from poor generalization when tackling problems of different sizes or different distributions. As a result, ML in vehicle routing has witnessed an expansion phase with new methodologies being created for particular problem instances that become infeasible at larger problem sizes.

This paper aims at encouraging the consolidation of the field through understanding and improving current existing models, namely the attention model by Kool et al. We identify two discrepancy categories for VRP generalization. The first is based on the differences that are inherent to the problems themselves, and the second relates to architectural weaknesses that limit the model’s ability to generalize. Our contribution becomes threefold: We first target model discrepancies by adapting the Kool et al. method and its loss function for Sparse Dynamic Attention based on the alpha-entmax activation. We then target inherent differences through the use of a mixed instance training method that has been shown to outperform single instance training in certain scenarios. Finally, we introduce a framework for inference level data augmentation that improves performance by leveraging the model’s lack of invariance to rotation and dilation changes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bai, R., et al.: Analytics and machine learning in vehicle routing research. arXiv:2102.10012 [cs, math] (2021)

  2. Bdeir, A., et al.: RP-DQN: an application of Q-learning to vehicle routing problems. In: Proceedings of the KI: Advances in Artificial Intelligence, pp. 3–16 (2021)

    Google Scholar 

  3. Correia, G.M., et al.: Adaptively sparse transformers. arXiv:1909.00015 [cs, stat] (2019)

  4. Dantzig, G.B., Ramser, J.H.: The truck dispatching problem. Manag. Sci. 6(1), 80–91 (1959). https://doi.org/10.1287/mnsc.6.1.80

    Article  MathSciNet  Google Scholar 

  5. Falkner, J.K., Lars, S.-T.: Learning to solve vehicle routing problems with time windows through joint attention. arXiv:2006.09100 [cs] (2020)

  6. Kool, W., et al.: Attention, learn to solve routing problems! 25 (2019)

    Google Scholar 

  7. Kwon, Y.-D., et al.: POMO: policy optimization with multiple optima for reinforcement learning. arXiv:2010.16011 [cs] (2021)

  8. Nazari, M., et al.: Reinforcement learning for solving the vehicle routing problem. arXiv:1802.04240 [cs, stat] (2018)

  9. Peng, B., et al.: A deep reinforcement learning algorithm using dynamic attention model for vehicle routing problems. arXiv:2002.03282 [cs, stat] (2020)

  10. Peters, B., et al.: Sparse sequence-to-sequence models. arXiv:1905.05702 [cs] (2019)

  11. Toth, P., Vigo, D.: Vehicle Routing: Problems, Methods, and Applications. Society for Industrial and Applied Mathematics. SIAM, Philadelphia (2015)

    Google Scholar 

  12. Vaswani, A., et al.: Attention is all you need. arXiv:1706.03762 [cs] (2017)

  13. Williams, R., Peng, J.: Function optimization using connectionist reinforcement learning algorithms. Connect. Sci. 3, 241 (1991). https://doi.org/10.1080/09540099108946587

    Article  Google Scholar 

  14. Wu, Y., et al.: Learning improvement heuristics for solving routing problems. IEEE Trans. Neural Netw. Learn. Syst. 1–13 (2021). https://doi.org/10.1109/TNNLS.2021.3068828

  15. Xin, L., et al.: Multi-decoder attention model with embedding glimpse for solving vehicle routing problems. In: AAAI 2021, pp. 12042–12049 (2021)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the German Federal Ministry of Education and Research (BMBF), project “Learning to Optimize” (01IS20013A:L2O).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ahmad Bdeir .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bdeir, A., Falkner, J.K., Schmidt-Thieme, L. (2023). Attention, Filling in the Gaps for Generalization in Routing Problems. In: Amini, MR., Canu, S., Fischer, A., Guns, T., Kralj Novak, P., Tsoumakas, G. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2022. Lecture Notes in Computer Science(), vol 13718. Springer, Cham. https://doi.org/10.1007/978-3-031-26422-1_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-26422-1_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-26421-4

  • Online ISBN: 978-3-031-26422-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics