skip to main content
10.1145/3490354.3494386acmconferencesArticle/Chapter ViewAbstractPublication PagesicaifConference Proceedingsconference-collections
research-article

Learning to classify and imitate trading agents in continuous double auction markets

Published: 04 May 2022 Publication History

Abstract

Continuous double auctions such as the limit order book employed by exchanges are widely used in practice to match buyers and sellers of a variety of financial instruments. In this work, we develop an agent-based model for trading in a limit order book and show (1) how opponent modelling techniques can be applied to classify trading agent archetypes and (2) how behavioural cloning can be used to imitate these agents in a simulated setting. We experimentally compare a number of techniques for both tasks and evaluate their applicability and use in real-world scenarios.

References

[1]
Stefano Vittorino Albrecht, Jacob William Crandall, and Subramanian Ramamoorthy. 2015. E-HBA: Using action policies for expert advice and agent typification. In Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence.
[2]
Stefano V Albrecht and Peter Stone. 2018. Autonomous agents modelling other agents: A comprehensive survey and open problems. Artificial Intelligence 258 (2018), 66--95.
[3]
David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster, Karl Tuyls, and Thore Graepel. 2018. The mechanics of n-player differentiable games. arXiv preprint arXiv:1802.05642 (2018).
[4]
Samuel Barrett, Peter Stone, Sarit Kraus, and Avi Rosenfeld. 2013. Teamwork with limited knowledge of teammates. In Twenty-Seventh AAAI Conference on Artificial Intelligence.
[5]
James Bergstra, Daniel Yamins, and David Cox. 2013. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In International conference on machine learning. 115--123.
[6]
Jean-Philippe Bouchaud, Julius Bonart, Jonathan Donier, and Martin Gould. 2018. Trades, quotes and prices: financial markets under the microscope. Cambridge University Press.
[7]
George W Brown. 1951. Iterative solution of games by fictitious play. Activity analysis of production and allocation 13, 1 (1951), 374--376.
[8]
David Byrd, Maria Hybinette, and Tucker Hybinette Balch. 2019. ABIDES: Towards High-Fidelity Market Simulation for AI Research. arXiv preprint arXiv:1904.12066 (2019).
[9]
Sandra Carberry. 2001. Techniques for plan recognition. User Modeling and User-Adapted Interaction 11, 1--2 (2001), 31--48.
[10]
David Carmel and Shaul Markovitch. 1995. Opponent modeling in multi-agent systems. In International Joint Conference on Artificial Intelligence. Springer, 40--52.
[11]
David Carmel and Shaul Markovitch. 1996. Incorporating opponent models into adversary search. In AAAI/IAAI, Vol. 1. 120--125.
[12]
Tanmoy Chakraborty and Michael Kearns. 2011. Market making and mean reversion. In Proceedings of the 12th ACM conference on Electronic commerce. ACM, 307--314.
[13]
Michael M. Dacorogna, Ulrich Alfons Müller, Robert J. Nagler, Richard B. Olsen, and Olivier V. Pictet. 1993. A geographical model for the daily and weekly seasonal volatility in the foreign exchange market.
[14]
Jakob Foerster, Richard Y Chen, Maruan Al-Shedivat, Shimon Whiteson, Pieter Abbeel, and Igor Mordatch. 2018. Learning with opponent-learning awareness. In Proceedings of the 17th International Conference on Autonomous Agents and Multi-Agent Systems. International Foundation for Autonomous Agents and Multiagent Systems, 122--130.
[15]
Dhananjay K Gode and Shyam Sunder. 1993. Allocative efficiency of markets with zero-intelligence traders: Market as a partial substitute for individual rationality. Journal of political economy 101, 1 (1993), 119--137.
[16]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in neural information processing systems. 2672--2680.
[17]
Markus Gsell. 2008. Assessing the impact of Algorithmic Trading on markets: A simulation approach. Technical Report. CFS Working Paper.
[18]
Trevor Hastie, Saharon Rosset, Ji Zhu, and Hui Zou. 2009. Multi-class adaboost. Statistics and its Interface 2, 3 (2009), 349--360.
[19]
Nikolaus Hautsch and Ruihong Huang. 2012. On the dark side of the market: identifying and analyzing hidden order placements. Available at SSRN 2004231 (2012).
[20]
Pablo Hernandez-Leal, Bilal Kartal, and Matthew E Taylor. 2019. Agent modeling as auxiliary task for deep reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Vol. 15. 31--37.
[21]
Ronald A Howard and James E Matheson. 2005. Influence diagrams. Decision Analysis 2, 3 (2005), 127--143.
[22]
Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, and Koray Kavukcuoglu. 2016. Reinforcement learning with unsupervised auxiliary tasks. arXiv preprint arXiv:1611.05397 (2016).
[23]
Andrej Karpathy and Li Fei-Fei. 2015. Deep visual-semantic alignments for generating image descriptions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 3128--3137.
[24]
Albert S Kyle. 1985. Continuous auctions and insider trading. Econometrica: Journal of the Econometric Society (1985), 1315--1335.
[25]
Mahmoud Mahfouz, Angelos Filos, Cyrine Chtourou, Joshua Lockhart, Samuel Assefa, Manuela Veloso, Danilo Mandic, and Tucker Balch. 2019. On the Importance of Opponent Modeling in Auction Markets. arXiv preprint arXiv:1911.12816 (2019).
[26]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529.
[27]
Takayuki Osa, Joni Pajarinen, Gerhard Neumann, J Andrew Bagnell, Pieter Abbeel, and Jan Peters. 2018. An algorithmic perspective on imitation learning. arXiv preprint arXiv:1811.06711 (2018).
[28]
Mark Paddrik, Roy Hayes, William Scherer, and Peter Beling. 2017. Effects of limit order book information level on market stability metrics. Journal of Economic Interaction and Coordination 12, 2 (2017), 221--247.
[29]
Dean A Pomerleau. 1989. Alvinn: An autonomous land vehicle in a neural network. In Advances in neural information processing systems. 305--313.
[30]
Tobias Preis. 2011. Price-time priority and pro rata matching in an order book model of financial markets. In Econophysics of Order-driven Markets. Springer, 65--72.
[31]
Yoav Shoham and Kevin Leyton-Brown. 2008. Multiagent systems: Algorithmic, game-theoretic, and logical foundations. Cambridge University Press.
[32]
David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. nature 529, 7587 (2016), 484--489.
[33]
Peter Stone and Manuela Veloso. 1999. Task decomposition, dynamic role assignment, and low-bandwidth communication for real-time strategic teamwork. Artificial Intelligence 110, 2 (1999), 241--273.
[34]
Manuela M Veloso. 1994. Planning and learning by analogical reasoning. Vol. 886. Springer Science & Business Media.
[35]
Elaine Wah, Mason Wright, and Michael P Wellman. 2017. Welfare effects of market making in continuous double auctions. Journal of Artificial Intelligence Research 59 (2017), 613--650.
[36]
Xintong Wang and Michael P Wellman. 2017. Spoofing the limit order book: An agent-based model. In Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems. International Foundation for Autonomous Agents and Multiagent Systems, 651--659.
[37]
Ben G Weber and Michael Mateas. 2009. A data mining approach to strategy prediction. In 2009 IEEE Symposium on Computational Intelligence and Games. IEEE, 140--147.
[38]
Michael P Wellman. 2011. Trading agents. Synthesis Lectures on Artificial Intelligence and Machine Learning 5, 3 (2011), 1--107.
[39]
Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, and Steve Young. 2015. Semantically conditioned lstm-based natural language generation for spoken dialogue systems. arXiv preprint arXiv:1508.01745 (2015).
[40]
Michael Wooldridge. 2009. An introduction to multiagent systems. John Wiley & Sons.
[41]
Steve Yang, Mark Paddrik, Roy Hayes, Andrew Todd, Andrei Kirilenko, Peter Beling, and William Scherer. 2012. Behavior based learning in identifying high frequency trading strategies. In 2012 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr). IEEE, 1--8.
[42]
Steve Y Yang, Qifeng Qiao, Peter A Beling, William T Scherer, and Andrei A Kirilenko. 2015. Gaussian process-based algorithmic trading strategy identification. Quantitative Finance 15, 10 (2015), 1683--1703.

Cited By

View all
  • (2022)Deep Reinforcement Learning-Based Pricing Strategy in Double-Auction Market for Edge Computing Resource Allocation2022 International Conference on Networks, Communications and Information Technology (CNCIT)10.1109/CNCIT56797.2022.00017(52-61)Online publication date: Jun-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICAIF '21: Proceedings of the Second ACM International Conference on AI in Finance
November 2021
450 pages
ISBN:9781450391481
DOI:10.1145/3490354
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 04 May 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. agent-based modelling
  2. auction markets
  3. behavioural cloning
  4. deep learning
  5. opponent modelling

Qualifiers

  • Research-article

Conference

ICAIF'21
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022)Deep Reinforcement Learning-Based Pricing Strategy in Double-Auction Market for Edge Computing Resource Allocation2022 International Conference on Networks, Communications and Information Technology (CNCIT)10.1109/CNCIT56797.2022.00017(52-61)Online publication date: Jun-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media