CO-DECYBER: Co-operative Decision Making for Cybersecurity Using Deep Multi-agent Reinforcement Learning

Cheah, Madeline; Stone, Jack; Haubrick, Peter; Bailey, Samuel; Rimmer, David; Till, Demian; Lacey, Matt; Kruczynska, Jo; Dorn, Mark

doi:10.1007/978-3-031-54129-2_37

Madeline Cheah²⁹,
Jack Stone²⁹,
Peter Haubrick²⁹,
Samuel Bailey²⁹,
David Rimmer²⁹,
Demian Till²⁹,
Matt Lacey²⁹,
Jo Kruczynska²⁹ &
…
Mark Dorn²⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14399))

Included in the following conference series:

European Symposium on Research in Computer Security

859 Accesses

Abstract

Autonomous decision making for cyber-defence in operational situations is desirable but challenging. This is due to the nature of operational technology (because of its cyber-physical nature) as well as the need to account for multiple contexts. Our contribution is the creation of a co-operative decision-making framework to enable autonomous cyber-defence (which we call Co-Decyber). This framework allows us to break up a big multi-contextual action space into smaller decisions that multiple agents can optimize between. We apply this framework to an autonomous vehicle platooning scenario. Results show that Co-Decyber agents are outperforming random reference agents in the cyber-attack scenarios we have tested. We aim to extend this work with more complex attack scenarios, along with training more agents to defend more of the attack surface. We conclude that this framework when mature will contribute to the goal of providing autonomous cyber-defence for operational technology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

2v2 Air Combat Confrontation Strategy Based on Reinforcement Learning

Perception of Cyber Threats

Research on Multi-aircraft Cooperative Combat Based on Deep Reinforcement Learning

References

Dhir, N., Hoeltgebaum, H., Adams, N., Briers, M., Burke, A., Jones, P.: Prospective artificial intelligence approaches for active cyber defence (2021). https://arxiv.org/pdf/2104.09981.pdf
Vyas, S., Hannay, J., Bolton, A., Burnap, P.P.: Automated cyber defence: a review (2023). arXiv preprint arXiv:2303.04926
Bridges, R.A., et al.: Testing SOAR tools in use. Comput. Secur. 129, 103201 (2023)
Article Google Scholar
Jhawar, R., Mauw, S., Zakiuddin, I.: Automating cyber defence responses using attack-defence trees and game theory. In: European Conference on Cyber Warfare and Security, p. 163. Academic Conferences International Limited (2016)
Google Scholar
Kordy, B., Mauw, S., Melissen, M., Schweitzer, P.: Attack–defense trees and two-player binary zero-sum extensive form games are equivalent. In: Alpcan, T., Buttyán, L., Baras, J.S. (eds) Decision and Game Theory for Security. GameSec 2010. Lecture Notes in Computer Science, vol. 6442. Springer, Berlin, Heidelberg (2010). https://doi.org/10.1007/978-3-642-17197-0_17
Eom, T., Hong, J.B., An, S., Park, J.S., Kim, D.S.: A framework for real-time intrusion response in software defined networking using precomputed graphical security models. Secur. Commun. Networks 2020, 1–15 (2020)
Article Google Scholar
Nguyen, T.T., Reddi, V.J.: Deep reinforcement learning for cyber security. IEEE Transactions on Neural Networks and Learning Systems 34, 1–17 (2021)
Google Scholar
Object Management Group: About the DDS security specification version 1.1 (2018). https://www.omg.org/spec/DDS-SECURITY/
Chowdhary, A., Huang, D., Sabur, A., Vadnere, N., Kang, M., Montrose, B.: SDN-based moving target defense using multi-agent reinforcement learning. In: Proceedings of the first International Conference on Autonomous Intelligent Cyber defense Agents, p. 15. Paris, France (2021)
Google Scholar
Yao, Q., Wang, Y., Xiong, X., Wang, P., Li, Y.: Adversarial decision-making for moving target defense: a multi-agent Markov game and reinforcement learning approach. Entropy 25(4), 605 (2023)
Article Google Scholar
Kordy, B., Piètre-Cambacédès, L., Schweitzer, P.: DAG-based attack and defense modeling: don’t miss the forest for the attack trees. Comput. Sci. Rev. 13, 1–38 (2014)
Article Google Scholar
Soviany, P., Ionescu, R.T., Rota, P., Sebe, N.: Curriculum learning: a survey. Int. J. Comput. Vision 130(6), 1526–1565 (2022)
Article Google Scholar
Jeon, J., Kim, W., Jung, W., Sung, Y.: Maser: Multi-agent reinforcement learning with subgoals generated from experience replay buffer. In International Conference on Machine Learning, pp. 10041–10052. PMLR (2022)
Google Scholar
Brockman, G., et al.: Openai gym. arXiv Preprint arXiv:1606.01540 (2016)
Terry, J., et al.: Pettingzoo: gym for multi-agent reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 34, pp. 15032–15043 (2021)
Google Scholar

Download references

Acknowledgements

This research is funded by Frazer-Nash Consultancy Ltd. on behalf of the Defence Science and Technology Laboratory (Dstl), an executive agency of the UK Ministry of Defence. The research forms part of the Autonomous Resilient Cyber Defence (ARCD) project within the Dstl Cyber Defence Enhancement programme.

Author information

Authors and Affiliations

Cambridge Consultants, 29 Cambridge Science Park, Milton Road, Cambridge, CB4 0DW, UK
Madeline Cheah, Jack Stone, Peter Haubrick, Samuel Bailey, David Rimmer, Demian Till, Matt Lacey, Jo Kruczynska & Mark Dorn

Authors

Madeline Cheah
View author publications
You can also search for this author in PubMed Google Scholar
Jack Stone
View author publications
You can also search for this author in PubMed Google Scholar
Peter Haubrick
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Bailey
View author publications
You can also search for this author in PubMed Google Scholar
David Rimmer
View author publications
You can also search for this author in PubMed Google Scholar
Demian Till
View author publications
You can also search for this author in PubMed Google Scholar
Matt Lacey
View author publications
You can also search for this author in PubMed Google Scholar
Jo Kruczynska
View author publications
You can also search for this author in PubMed Google Scholar
Mark Dorn
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Madeline Cheah .

Editor information

Editors and Affiliations

Norwegian University of Science and Technology, Gjøvik, Norway
Sokratis Katsikas
Norwegian Computing Center, Oslo, Norway
Habtamu Abie
University of Trento, Trento, Italy
Silvio Ranise
University of Genoa, Genoa, Italy
Luca Verderame
Consiglio Nazionale delle Ricerche (CNR), Genoa, Italy
Enrico Cambiaso
SINTEF A.S., Oslo, Norway
Rita Ugarelli
Instituto Superior de Engenharia do Porto, Porto, Portugal
Isabel Praça
Hong Kong Polytechnic University, Hong Kong, China
Wenjuan Li
Technical University of Denmark, Kongens Lyngby, Denmark
Weizhi Meng
University of Nottingham, Nottingham, UK
Steven Furnell
Norwegian University of Science and Technology, Gjøvik, Norway
Basel Katt
Norwegian Computing Center, Oslo, Norway
Sandeep Pirbhulal
Institute for Energy Technology (IFE), Halden, Norway
Ankur Shukla
University of Calabria, Rende, Italy
Michele Ianni
University of Verona, Verona, Italy
Mila Dalla Preda
The University of Texas at San Antonio, San Antonio, TX, USA
Kim-Kwang Raymond Choo
University of Lisbon, Lisbon, Portugal
Miguel Pupo Correia
University of Twente, Enschede, The Netherlands
Abhishta Abhishta
University of Amsterdam, Amsterdam, The Netherlands
Giovanni Sileno
Open University in the Netherlands, Heerlen, The Netherlands
Mina Alishahi
Robert Gordon University, Aberdeen, UK
Harsha Kalutarage
Osaka University, Osaka, Japan
Naoto Yanai

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cheah, M. et al. (2024). CO-DECYBER: Co-operative Decision Making for Cybersecurity Using Deep Multi-agent Reinforcement Learning. In: Katsikas, S., et al. Computer Security. ESORICS 2023 International Workshops. ESORICS 2023. Lecture Notes in Computer Science, vol 14399. Springer, Cham. https://doi.org/10.1007/978-3-031-54129-2_37

Download citation

DOI: https://doi.org/10.1007/978-3-031-54129-2_37
Published: 12 March 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-54128-5
Online ISBN: 978-3-031-54129-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics