An Online Learning Approach to a Multi-player N-armed Functional Bandit

O’Neill, Sam; Bagdasar, Ovidiu; Liotta, Antonio

doi:10.1007/978-3-030-40616-5_41

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11974))

Included in the following conference series:

International Conference on Numerical Computations: Theory and Algorithms

753 Accesses

Abstract

Congestion games possess the property of emitting at least one pure Nash equilibrium and have a rich history of practical use in transport modelling. In this paper we approach the problem of modelling equilibrium within congestion games using a decentralised multi-player probabilistic approach via stochastic bandit feedback. Restricting the strategies available to players under the assumption of bounded rationality, we explore an online multiplayer exponential weights algorithm for unweighted atomic routing games and compare this with a $\epsilon $-greedy algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Computing Approximate Nash Equilibria in Network Congestion Games with Polynomially Decreasing Cost Functions

Congestion Games with Player-Specific Costs Revisited

Improving Approximate Pure Nash Equilibria in Congestion Games

Notes

1.
$(a_i; a_{-i})$ is commonly used to refer to player i’s strategy given the strategy profile $\mathbf {a}=(a_1,\cdots ,a_i, \cdots ,a_N)$.
2.
In general an unweighted traffic rate routes the same quantity $k_i =k \quad \forall i \in \mathcal {N}$.
3.
The source code is available at https://github.com/samtoneill/congestionbanditgames.

References

Belmega, E.V., Mertikopoulos, P., Negrel, R., Sanguinetti, L.: Online convex optimization and no-regret learning: algorithms, guarantees and applications (2018). http://arxiv.org/abs/1804.04529
Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, Cambridge (2006)
Book Google Scholar
Cohen, J., Héliou, A., Mertikopoulos, P.: Learning with bandit feedback in potential games (2017). https://hal.archives-ouvertes.fr/hal-01643352
Gigerenzer, G., Selten, R.: Bounded Rationality: The Adaptive Toolbox. MIT Press, Cambridge (2001)
Google Scholar
Patriksson, M.: The Traffic Assignment Problem: Models and Methods. Dover Publications, Mineola (1994)
Google Scholar
Rosenthal, R.W.: A class of games possessing pure-strategy Nash equilibria. Int. J. Game Theory 2(1), 65–67 (1973). https://doi.org/10.1007/BF01737559
Article MathSciNet MATH Google Scholar
Roughgarden, T.: Routing games. In: Nisan, N., Roughgarden, T., Tardos, E., Vazirani, V.V. (eds.) Algorithmic Game Theory, pp. 461–486. Cambridge University Press, Cambridge (2007). https://doi.org/10.1017/CBO9780511800481.020
Chapter Google Scholar
Vinitsky, E., et al.: Benchmarks for reinforcement learning in mixed-autonomy traffic. In: Billard, A., Dragan, A., Peters, J., Morimoto, J. (eds.) Proceedings of the 2nd Conference on Robot Learning. Proceedings of Machine Learning Research, vol. 87, pp. 399–409. PMLR (2018). http://proceedings.mlr.press/v87/vinitsky18a.html

Download references

Author information

Authors and Affiliations

University of Derby, Kedleston Road, Derby, DE22 1GB, UK
Sam O’Neill & Ovidiu Bagdasar
Edinburgh Napier University, Sighthill Campus, Sighthill Court, Edinburgh, EH11 4BN, UK
Antonio Liotta

Authors

Sam O’Neill
View author publications
You can also search for this author in PubMed Google Scholar
Ovidiu Bagdasar
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Liotta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sam O’Neill .

Editor information

Editors and Affiliations

University of Calabria, Rende, Italy
Yaroslav D. Sergeyev
University of Calabria, Rende, Italy
Dmitri E. Kvasov

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

O’Neill, S., Bagdasar, O., Liotta, A. (2020). An Online Learning Approach to a Multi-player N-armed Functional Bandit. In: Sergeyev, Y., Kvasov, D. (eds) Numerical Computations: Theory and Algorithms. NUMTA 2019. Lecture Notes in Computer Science(), vol 11974. Springer, Cham. https://doi.org/10.1007/978-3-030-40616-5_41

Download citation

DOI: https://doi.org/10.1007/978-3-030-40616-5_41
Published: 14 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-40615-8
Online ISBN: 978-3-030-40616-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

An Online Learning Approach to a Multi-player N-armed Functional Bandit

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Computing Approximate Nash Equilibria in Network Congestion Games with Polynomially Decreasing Cost Functions

Congestion Games with Player-Specific Costs Revisited

Improving Approximate Pure Nash Equilibria in Congestion Games

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

An Online Learning Approach to a Multi-player N-armed Functional Bandit

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Computing Approximate Nash Equilibria in Network Congestion Games with Polynomially Decreasing Cost Functions

Congestion Games with Player-Specific Costs Revisited

Improving Approximate Pure Nash Equilibria in Congestion Games

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation