Skip to main content

Multi-armed Bandit Based Tariff Generation Strategy for Multi-agent Smart Grid Systems

  • Conference paper
  • First Online:
Engineering Multi-Agent Systems (EMAS 2023)

Abstract

The emergence of smart grid technology has opened the door for wide-scale automation in decision-making. A distribution company, an integral part of a smart grid system, has to procure electricity from the wholesale market and then sell it to customers in the retail market by publishing attractive tariff contracts. It can deploy autonomous agents to make decisions on its behalf. In this work, we describe the tariff contracts generation strategy of one such autonomous agent, which is based on a Contextual Multi-armed Bandit (ConMAB) based learning technique to generate tariff contracts for various types of customers in the retail market of smart grids. We particularly utilize the Exponential-weight algorithm for Exploration and Exploitation (EXP-3) for ConMAB-based learning. We call our proposed strategy GenerateTariffs-EXP3. Our previous work shows that maintaining an appropriate market share in the retail market yields high net revenue. Thus, we first present a game-theoretic analysis that determines an optimal level of market share. Then we train our proposed strategy to achieve and maintain the suggested level of market share by adapting to the market situation and revising the tariff contracts periodically. We validate our proposed strategy in PowerTAC, a close-to real-world smart grid simulator, and showcase that it is able to maintain the suggested market share.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Chandlekar, S., Pedasingu, B.S., Subramanian, E., Bhat, S., Paruchuri, P., Gujar, S.: VidyutVanika21: an autonomous intelligent broker for smart-grids. In: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI-2022, pp. 158–164. International Joint Conferences on Artificial Intelligence Organization (2022). https://doi.org/10.24963/ijcai.2022/23

  2. Ghosh, S., Subramanian, E., Bhat, S.P., Gujar, S., Paruchuri, P.: VidyutVanika: a reinforcement learning based broker agent for a power trading competition. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 914–921 (2019). https://doi.org/10.1609/aaai.v33i01.3301914

  3. Jain, S., Narayanaswamy, B., Narahari, Y.: A multiarmed bandit incentive mechanism for crowdsourcing demand response in smart grids. In: AAAI Conference on Artificial Intelligence, Canada (2014)

    Google Scholar 

  4. Ketter, W., Collins, J., de Weerdt, M.: The 2020 power trading agent competition. ERIM report series reference no. 2020-002 (2020). https://doi.org/10.2139/ssrn.3564107

  5. Li, Y., Hu, Q., Li, N.: Learning and selecting the right customers for reliability: a multi-armed bandit approach. In: 2018 IEEE Conference on Decision and Control (CDC), pp. 4869–4874 (2018). https://doi.org/10.1109/CDC.2018.8619481

  6. Ma, H., Parkes, D.C., Robu, V.: Generalizing demand response through reward bidding. In: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, AAMAS 2017, Brazil, pp. 60–68. (2017). http://dl.acm.org/citation.cfm?id=3091125.3091140

  7. Ma, H., Robu, V., Li, N.L., Parkes, D.C.: Incentivizing reliability in demand-side response. In: The Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI 2016), pp. 352–358 (2016). http://www.ijcai.org/Abstract/16/057

  8. McKelveya, R.D., McLennan, A.M., Turocy, T.L.: Gambit: software tools for game theory, version 16.0.1 (2014). http://www.gambit-project.org. Accessed 27 Dec 2021

  9. Methenitis, G., Kaisers, M., La Poutré, H.: Forecast-based mechanisms for demand response. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1600–1608 (2019)

    Google Scholar 

  10. Orfanoudakis, S., Kontos, S., Akasiadis, C., Chalkiadakis, G.: Aiming for half gets you to the top: winning PowerTAC 2020. In: Rosenfeld, A., Talmon, N. (eds.) EUMAS 2021. LNCS (LNAI), vol. 12802, pp. 144–159. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-82254-5_9

    Chapter  Google Scholar 

  11. Reddy, P.P., Veloso, M.M.: Strategy learning for autonomous agents in smart grid markets. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, IJCAI 2011, vol. 2, pp. 1446–1451. AAAI Press (2011)

    Google Scholar 

  12. Serrano Cuevas, J., Rodriguez-Gonzalez, A.Y., Munoz de Cote, E.: Fixed-price tariff generation using reinforcement learning. In: Fujita, K., et al. (eds.) Modern Approaches to Agent-based Complex Automated Negotiation. SCI, vol. 674, pp. 121–136. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51563-2_8

    Chapter  Google Scholar 

  13. Shweta, J., Sujit, G.: A multiarmed bandit based incentive mechanism for a subset selection of customers for demand response in smart grids. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 2046–2053 (2020)

    Google Scholar 

  14. Techopedia.com: Smart Grid (2021). https://www.techopedia.com/definition/692/smart-grid. Accessed 19 Jan 2023

  15. Urieli, D., Stone, P.: Autonomous electricity trading using time-of-use tariffs in a competitive market. In: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-2016). Association for the Advancement of Artificial Intelligence (2016)

    Google Scholar 

  16. Özdemir, S., Unland, R.: AgentUDE17: a genetic algorithm to optimize the parameters of an electricity tariff in a smart grid environment. In: Demazeau, Y., An, B., Bajo, J., Fernández-Caballero, A. (eds.) PAAMS 2018. LNCS (LNAI), vol. 10978, pp. 224–236. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-94580-4_18

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sanjay Chandlekar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chandlekar, S., Subramanian, E., Gujar, S. (2023). Multi-armed Bandit Based Tariff Generation Strategy for Multi-agent Smart Grid Systems. In: Ciortea, A., Dastani, M., Luo, J. (eds) Engineering Multi-Agent Systems. EMAS 2023. Lecture Notes in Computer Science(), vol 14378. Springer, Cham. https://doi.org/10.1007/978-3-031-48539-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-48539-8_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-48538-1

  • Online ISBN: 978-3-031-48539-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics