Multi-armed Bandit Based Tariff Generation Strategy for Multi-agent Smart Grid Systems

Chandlekar, Sanjay; Subramanian, Easwar; Gujar, Sujit

doi:10.1007/978-3-031-48539-8_8

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14378))

Included in the following conference series:

International Workshop on Engineering Multi-Agent Systems

277 Accesses

Abstract

The emergence of smart grid technology has opened the door for wide-scale automation in decision-making. A distribution company, an integral part of a smart grid system, has to procure electricity from the wholesale market and then sell it to customers in the retail market by publishing attractive tariff contracts. It can deploy autonomous agents to make decisions on its behalf. In this work, we describe the tariff contracts generation strategy of one such autonomous agent, which is based on a Contextual Multi-armed Bandit (ConMAB) based learning technique to generate tariff contracts for various types of customers in the retail market of smart grids. We particularly utilize the Exponential-weight algorithm for Exploration and Exploitation (EXP-3) for ConMAB-based learning. We call our proposed strategy GenerateTariffs-EXP3. Our previous work shows that maintaining an appropriate market share in the retail market yields high net revenue. Thus, we first present a game-theoretic analysis that determines an optimal level of market share. Then we train our proposed strategy to achieve and maintain the suggested level of market share by adapting to the market situation and revising the tariff contracts periodically. We validate our proposed strategy in PowerTAC, a close-to real-world smart grid simulator, and showcase that it is able to maintain the suggested market share.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Fixed-Price Tariff Generation Using Reinforcement Learning

AgentUDE17: A Genetic Algorithm to Optimize the Parameters of an Electricity Tariff in a Smart Grid Environment

Demand Response Management in Smart Grid Networks: a Two-Stage Game-Theoretic Learning-Based Approach

Article 04 October 2018

References

Chandlekar, S., Pedasingu, B.S., Subramanian, E., Bhat, S., Paruchuri, P., Gujar, S.: VidyutVanika21: an autonomous intelligent broker for smart-grids. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI-2022, pp. 158–164. International Joint Conferences on Artificial Intelligence Organization (2022). https://doi.org/10.24963/ijcai.2022/23
Ghosh, S., Subramanian, E., Bhat, S.P., Gujar, S., Paruchuri, P.: VidyutVanika: a reinforcement learning based broker agent for a power trading competition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 914–921 (2019). https://doi.org/10.1609/aaai.v33i01.3301914
Jain, S., Narayanaswamy, B., Narahari, Y.: A multiarmed bandit incentive mechanism for crowdsourcing demand response in smart grids. In: AAAI Conference on Artificial Intelligence, Canada (2014)
Google Scholar
Ketter, W., Collins, J., de Weerdt, M.: The 2020 power trading agent competition. ERIM report series reference no. 2020-002 (2020). https://doi.org/10.2139/ssrn.3564107
Li, Y., Hu, Q., Li, N.: Learning and selecting the right customers for reliability: a multi-armed bandit approach. In: 2018 IEEE Conference on Decision and Control (CDC), pp. 4869–4874 (2018). https://doi.org/10.1109/CDC.2018.8619481
Ma, H., Parkes, D.C., Robu, V.: Generalizing demand response through reward bidding. In: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, AAMAS 2017, Brazil, pp. 60–68. (2017). http://dl.acm.org/citation.cfm?id=3091125.3091140
Ma, H., Robu, V., Li, N.L., Parkes, D.C.: Incentivizing reliability in demand-side response. In: The Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), pp. 352–358 (2016). http://www.ijcai.org/Abstract/16/057
McKelveya, R.D., McLennan, A.M., Turocy, T.L.: Gambit: software tools for game theory, version 16.0.1 (2014). http://www.gambit-project.org. Accessed 27 Dec 2021
Methenitis, G., Kaisers, M., La Poutré, H.: Forecast-based mechanisms for demand response. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1600–1608 (2019)
Google Scholar
Orfanoudakis, S., Kontos, S., Akasiadis, C., Chalkiadakis, G.: Aiming for half gets you to the top: winning PowerTAC 2020. In: Rosenfeld, A., Talmon, N. (eds.) EUMAS 2021. LNCS (LNAI), vol. 12802, pp. 144–159. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-82254-5_9
Chapter Google Scholar
Reddy, P.P., Veloso, M.M.: Strategy learning for autonomous agents in smart grid markets. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, IJCAI 2011, vol. 2, pp. 1446–1451. AAAI Press (2011)
Google Scholar
Serrano Cuevas, J., Rodriguez-Gonzalez, A.Y., Munoz de Cote, E.: Fixed-price tariff generation using reinforcement learning. In: Fujita, K., et al. (eds.) Modern Approaches to Agent-based Complex Automated Negotiation. SCI, vol. 674, pp. 121–136. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51563-2_8
Chapter Google Scholar
Shweta, J., Sujit, G.: A multiarmed bandit based incentive mechanism for a subset selection of customers for demand response in smart grids. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 2046–2053 (2020)
Google Scholar
Techopedia.com: Smart Grid (2021). https://www.techopedia.com/definition/692/smart-grid. Accessed 19 Jan 2023
Urieli, D., Stone, P.: Autonomous electricity trading using time-of-use tariffs in a competitive market. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-2016). Association for the Advancement of Artificial Intelligence (2016)
Google Scholar
Özdemir, S., Unland, R.: AgentUDE17: a genetic algorithm to optimize the parameters of an electricity tariff in a smart grid environment. In: Demazeau, Y., An, B., Bajo, J., Fernández-Caballero, A. (eds.) PAAMS 2018. LNCS (LNAI), vol. 10978, pp. 224–236. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-94580-4_18
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

International Institute of Information Technology (IIIT) Hyderabad, Hyderabad, India
Sanjay Chandlekar & Sujit Gujar
TCS Innovation Labs, Hyderabad, India
Easwar Subramanian

Authors

Sanjay Chandlekar
View author publications
You can also search for this author in PubMed Google Scholar
Easwar Subramanian
View author publications
You can also search for this author in PubMed Google Scholar
Sujit Gujar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sanjay Chandlekar .

Editor information

Editors and Affiliations

University of St.Gallen, St. Gallen, Switzerland
Andrei Ciortea
Utrecht University, Utrecht, The Netherlands
Mehdi Dastani
Zhejiang University, Hangzhou, China
Jieting Luo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chandlekar, S., Subramanian, E., Gujar, S. (2023). Multi-armed Bandit Based Tariff Generation Strategy for Multi-agent Smart Grid Systems. In: Ciortea, A., Dastani, M., Luo, J. (eds) Engineering Multi-Agent Systems. EMAS 2023. Lecture Notes in Computer Science(), vol 14378. Springer, Cham. https://doi.org/10.1007/978-3-031-48539-8_8

Download citation

DOI: https://doi.org/10.1007/978-3-031-48539-8_8
Published: 26 November 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-48538-1
Online ISBN: 978-3-031-48539-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Multi-armed Bandit Based Tariff Generation Strategy for Multi-agent Smart Grid Systems