GPU Accelerated Data Preparation for Limit Order Book Modeling

  • Conference paper
Machine Learning, Optimization, and Data Science (LOD 2020)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 12565)

Abstract

Financial processes are frequently explained by econometric models; however, data-driven approaches may outperform analytical models given data of adequate amount and quality and suitable algorithms. With today’s state-of-the-art deep learning methods, more data leads to better models. However, even if the model is trained on massively parallel hardware, the preprocessing of large amounts of data is usually still done in a traditional way (e.g. a few hundred threads on a Central Processing Unit, CPU).

In this paper, we propose a GPU-accelerated pipeline that addresses the time burden of data preparation for machine learning in financial applications. The reduced preparation time lets users experiment with multiple parameter setups much faster. The pipeline processes and models a specific type of financial data, limit order books, on massively parallel hardware. It handles data collection, order book preprocessing, data normalisation, and batching into training samples, which can be used for training deep neural networks and for inference. The paper includes time comparisons of the baseline and optimized approaches.
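
The normalisation and batching stages named in the abstract can be illustrated with a minimal CPU-only sketch. It is not the authors' GPU implementation: the snapshot layout, the function names, the choice of z-score normalisation, and the window and batch sizes are all assumptions made for illustration.

```python
from statistics import fmean, pstdev


def zscore_columns(rows):
    """Normalise each feature column to zero mean and unit variance."""
    columns = list(zip(*rows))
    means = [fmean(c) for c in columns]
    # Fall back to 1.0 for constant columns to avoid division by zero.
    stds = [pstdev(c) or 1.0 for c in columns]
    return [[(x - m) / s for x, m, s in zip(row, means, stds)] for row in rows]


def sliding_windows(rows, length, stride=1):
    """Cut a time-ordered sequence into overlapping fixed-length samples."""
    return [rows[i:i + length] for i in range(0, len(rows) - length + 1, stride)]


def batches(samples, batch_size):
    """Yield consecutive groups of at most batch_size samples."""
    for i in range(0, len(samples), batch_size):
        yield samples[i:i + batch_size]


# Hypothetical top-of-book snapshots (assumed layout):
# [best_bid_price, best_bid_size, best_ask_price, best_ask_size]
snapshots = [
    [100.0, 5.0, 100.5, 3.0],
    [100.1, 4.0, 100.6, 2.0],
    [100.2, 6.0, 100.6, 4.0],
    [100.1, 5.0, 100.7, 3.0],
    [100.3, 7.0, 100.8, 5.0],
]

normalised = zscore_columns(snapshots)
samples = sliding_windows(normalised, length=3)
batched = list(batches(samples, batch_size=2))
```

Each step here is an independent per-column or per-window operation, which is the kind of work that maps naturally onto massively parallel hardware; how the paper's pipeline actually partitions it is not stated in the abstract.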

Notes

  1. Such an order book does not exist for so-called dark pools, because orders are typically not published there [1].

References

  1. Ganchev, K., Kearns, M., Nevmyvaka, Y., Vaughan, J.W.: Censored exploration and the dark pool problem (2012)

  2. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015)

  3. Black, F., Scholes, M.: The pricing of options and corporate liabilities. J. Polit. Econ. 81(3), 637–654 (1973)

  4. Ranaldo, A.: Order aggressiveness in limit order book markets. J. Financ. Mark. 7, 53–74 (2004)

  5. Cont, R., Stoikov, S., Talreja, R.: A stochastic model for order book dynamics. Oper. Res. 58, 549–563 (2010)

  6. Beck, J., et al.: Sensing social media signals for cryptocurrency news. In: Companion Proceedings of the 2019 World Wide Web Conference (WWW 2019) (2019)

  7. Mirtaheri, M., Abu-El-Haija, S., Morstatter, F., Steeg, GV., Galstyan, A.: Identifying and analyzing cryptocurrency manipulations in social media (2019)

  8. Tsantekidis, A., Passalis, N., Tefas, A., Kanniainen, J., Gabbouj, M., Iosifidis, A.: Using deep learning for price prediction by exploiting stationary limit order book features (2018)

  9. Ntakaris, A., Magris, M., Kanniainen, J., Gabbouj, M., Iosifidis, A.: Benchmark dataset for mid-price forecasting of limit order book data with machine learning methods. J. Forecast. 37(8), 852–866 (2018)

  10. Bibinger, M., Neely, C., Winkelmann, L.: Estimation of the discontinuous leverage effect: evidence from the NASDAQ order book. In: Federal Reserve Bank of St. Louis, Working Papers, April 2017

  11. Wei, H., Wang, Y., Mangu, L., Decker, K.: Model-based reinforcement learning for predictions and control for limit order books (2019)

  12. Di Persio, L., Honchar, O.: Artificial neural networks architectures for stock price prediction: comparisons and applications. Int. J. Circ. Syst. Signal Process. 10, 403–413 (2016)

  13. Sirignano, J., Cont, R.: Universal features of price formation in financial markets: perspectives from deep learning (2018)

  14. Zhang, Z., Zohren, S., Roberts, S.: DeepLOB: deep convolutional neural networks for limit order books. IEEE Trans. Signal Process. 67(11), 3001–3012 (2019)

Acknowledgment

The research presented in this paper, carried out by BME, was supported by the Ministry of Innovation and the National Research, Development and Innovation Office within the framework of the Artificial Intelligence National Laboratory Programme, by the NRDI Fund based on the charter of bolster issued by the NRDI Office under the auspices of the Ministry for Innovation and Technology, by the European Union, co-financed by the European Social Fund (EFOP-3.6.2-16-2017-00013, Thematic Fundamental Research Collaborations Grounding Innovation in Informatics and Infocommunications), by the János Bolyai Research Scholarship of the Hungarian Academy of Sciences, and by the Doctoral Research Scholarship of the Ministry of Human Resources (ÚNKP-20-5-BME-210) in the scope of the New National Excellence Program. We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan V GPU used for this research.

Author information

Corresponding authors

Correspondence to Viktor Burján or Bálint Gyires-Tóth.

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Cite this paper

Burján, V., Gyires-Tóth, B. (2020). GPU Accelerated Data Preparation for Limit Order Book Modeling. In: Nicosia, G., et al. Machine Learning, Optimization, and Data Science. LOD 2020. Lecture Notes in Computer Science, vol 12565. Springer, Cham. https://doi.org/10.1007/978-3-030-64583-0_35

  • DOI: https://doi.org/10.1007/978-3-030-64583-0_35

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-64582-3

  • Online ISBN: 978-3-030-64583-0

  • eBook Packages: Computer Science, Computer Science (R0)
