Abstract
Dynamic capacity management (or dynamic provisioning) is the process of dynamically adjusting the capacity allocation (reservation) of a virtual path (or pseudo-wire) established between two network end points. The adjustment is based on criteria such as the instantaneous traffic load of the pseudo-wire, overall network utilization, hour of day, or day of week. Frequent capacity adjustment creates a scalability problem: it generates a significant amount of message distribution and processing (i.e., signaling) at the network elements involved in the capacity update process. We therefore use the term "signaling rate" for the number of capacity updates per unit time. On the other hand, if the capacity is adjusted only once, sized for the most heavily loaded traffic conditions, a significant amount of bandwidth may be wasted depending on the actual traffic load. There is thus a need for dynamic capacity management that takes into account the tradeoff between signaling scalability and bandwidth efficiency. In this paper, we introduce a Markov decision framework for an optimal capacity management scheme. Moreover, for large problem instances, and for cases in which the desired signaling rate is imposed as a constraint, we provide suboptimal schemes based on reinforcement learning. Our numerical results demonstrate that the proposed reinforcement learning schemes achieve significantly better bandwidth efficiency than static allocation without violating the signaling rate requirements of the underlying network.
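The tradeoff between bandwidth efficiency and signaling rate described above can be illustrated with a small tabular Q-learning agent. This is a minimal sketch, not the paper's actual algorithm or problem formulation: the discrete load levels, the random-walk traffic model, the penalty weights, and all learning parameters below are assumptions chosen only to make the tradeoff concrete.

```python
import random

random.seed(0)

# Illustrative toy model (all values are assumptions, not from the paper):
# traffic load wanders over discrete levels, and the agent decides at each
# step whether to lower, hold, or raise the allocated capacity.
LEVELS = 8                  # discrete load/capacity levels 0..7 (assumed)
ACTIONS = (-1, 0, +1)       # decrease / hold / increase allocation
BETA = 5.0                  # per-update signaling penalty (assumed)
ALPHA, GAMMA, EPS = 0.1, 0.95, 0.1

def step_traffic(load):
    """Random-walk stand-in for the pseudo-wire's traffic load."""
    return max(0, min(LEVELS - 1, load + random.choice((-1, 0, 1))))

def reward(load, cap, changed):
    """Penalize wasted bandwidth, unmet demand, and signaling."""
    waste = max(0, cap - load)            # over-provisioned units
    deficit = 10.0 * max(0, load - cap)   # under-provisioning costs more
    return -(waste + deficit + (BETA if changed else 0.0))

Q = {}  # tabular action values: (load, cap, action) -> estimate

def greedy(load, cap):
    """Best known action in the given state."""
    return max(ACTIONS, key=lambda a: Q.get((load, cap, a), 0.0))

def train(steps=20000):
    """Epsilon-greedy Q-learning over the joint (load, capacity) state."""
    load, cap = 0, 0
    for _ in range(steps):
        a = random.choice(ACTIONS) if random.random() < EPS else greedy(load, cap)
        new_cap = max(0, min(LEVELS - 1, cap + a))
        new_load = step_traffic(load)
        r = reward(new_load, new_cap, new_cap != cap)
        target = r + GAMMA * max(Q.get((new_load, new_cap, b), 0.0) for b in ACTIONS)
        key = (load, cap, a)
        Q[key] = Q.get(key, 0.0) + ALPHA * (target - Q.get(key, 0.0))
        load, cap = new_load, new_cap

train()
```

Raising the assumed signaling penalty `BETA` makes the learned policy hold its allocation longer (lower signaling rate, more wasted bandwidth); lowering it makes the policy track the load closely, which is exactly the tension the paper addresses with a signaling-rate constraint.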
This work is supported by The Scientific and Technical Research Council of Turkey (TUBITAK) under grant EEEAG-101E048.
© 2003 Springer-Verlag Berlin Heidelberg
Akar, N., Sahin, C. (2003). Reinforcement Learning as a Means of Dynamic Aggregate QoS Provisioning. In: Burakowski, W., Bęben, A., Koch, B. (eds) Architectures for Quality of Service in the Internet. Art-QoS 2003. Lecture Notes in Computer Science, vol 2698. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45020-3_8
Print ISBN: 978-3-540-40444-6
Online ISBN: 978-3-540-45020-7