Skip to main content

SoccerMix: Representing Soccer Actions with Mixture Models

  • Conference paper
  • First Online:
Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track (ECML PKDD 2020)

Abstract

Analyzing playing style is a recurring task within soccer analytics that plays a crucial role in club activities such as player scouting and match preparation. It involves identifying and summarizing prototypical behaviors of teams and players that reoccur both within and across matches. Current techniques for analyzing playing style are often hindered by the sparsity of event stream data (i.e., the same player rarely performs the same action in the same location more than once). This paper proposes SoccerMix, a soft clustering technique based on mixture models that enables a novel probabilistic representation for soccer actions. SoccerMix overcomes the sparsity of event stream data by probabilistically grouping together similar actions in a data-driven manner. We show empirically how SoccerMix can capture the playing style of both teams and players and present an alternative view of a team’s style that focuses not on the team’s own actions, but rather on how the team forces its opponents to deviate from their usual playing style.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/ML-KULeuven/soccermix.

  2. 2.

    https://github.com/ML-KULeuven/socceraction.

  3. 3.

    More details on our approach to select the number of components used in each mixture model can be found in the public implementation.

  4. 4.

    https://sport.optus.com.au/articles/os6422/trent-alexander-arnold-is-changing-the-full-back-position.

References

  1. Bailey, T.L., Elkan, C., et al.: Fitting a mixture model by expectation maximization to discover motifs in bipolymers (1994)

    Google Scholar 

  2. Bekkers, J., Dabadghao, S.: Flow motifs in soccer: what can passing behaviortell us? J. Sports Anal. (Preprint), 1–13 (2017)

    Google Scholar 

  3. Best, D., Fisher, N.I.: Efficient simulation of the von mises distribution. J. Royal Stat. Soc. Ser. C (Applied Statistics) 28(2), 152–157 (1979)

    MATH  Google Scholar 

  4. Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: Lof: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of data, pp. 93–104 (2000)

    Google Scholar 

  5. Cintia, P., Rinzivillo, S., Pappalardo, L.: A network-based approach to evaluate the performance of football teams. In: Machine Learning and Data Mining for Sports Analytics Workshop, Porto, Portugal (2015)

    Google Scholar 

  6. Decroos, T., Bransen, L., Van Haaren, J., Davis, J.: Actions speak louder than goals: Valuing player actions in soccer. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2019, pp. 1851–1861. ACM, New York (2019). https://doi.org/10.1145/3292500.3330758

  7. Decroos, T., Davis, J.: Player vectors: characterizing soccer players’ playing style from match event streams. In: Brefeld, U., Fromont, E., Hotho, A., Knobbe, A., Maathuis, M., Robardet, C. (eds.) ECML PKDD 2019. LNCS (LNAI), vol. 11908, pp. 569–584. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-46133-1_34

    Chapter  Google Scholar 

  8. Decroos, T., Van Haaren, J., Davis, J.: Automatic discovery of tactics in spatio-temporal soccer match data. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 223–232 (2018)

    Google Scholar 

  9. Gyarmati, L., Hefeeda, M.: Analyzing in-game movements of soccer players at scale. arXiv preprint arXiv:1603.05583 (2016)

  10. Gyarmati, L., Kwak, H., Rodriguez, P.: Searching for a unique style in soccer. arXiv preprint arXiv:1409.0308 (2014)

  11. Mardia, K.V., Jupp, P.E.: Directional Statistics, vol. 494. Wiley, Chichester (2009)

    MATH  Google Scholar 

  12. McLachlan, G.J., Basford, K.E.: Mixture Models: Inference and Applications to Clustering, vol. 38. M. Dekker, New York (1988)

    MATH  Google Scholar 

  13. Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  14. Pena, J.L.: A Markovian model for association football possession and its outcomes. arXiv preprint arXiv:1403.7993 (2014)

  15. Reynolds, D.A.: Gaussian mixture models. Encycl. Biometrics 741, 659–663 (2009)

    Article  Google Scholar 

  16. Van Haaren, J., Dzyuba, V., Hannosset, S., Davis, J.: Automatically discovering offensive patterns in soccer match data. In: Fromont, E., De Bie, T., van Leeuwen, M. (eds.) IDA 2015. LNCS, vol. 9385, pp. 286–297. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24465-5_25

    Chapter  Google Scholar 

  17. Van Haaren, J., Hannosset, S., Davis, J.: Strategy discovery in professional soccer match data. In: Proceedings of the KDD-16 Workshop on Large-Scale Sports Analytics, pp. 1–4 (2016)

    Google Scholar 

  18. Wang, Q., Zhu, H., Hu, W., Shen, Z., Yao, Y.: Discerning tactical patterns for professional soccer teams: an enhanced topic model with applications. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2197–2206 (2015)

    Google Scholar 

Download references

Acknowledgements

Tom Decroos is supported by the Research Foundation-Flanders (FWO-Vlaanderen). Maaike Van Roy is supported by the Research Foundation-Flanders under EOS No. 30992574. Jesse Davis is partially supported by KU Leuven Research Fund (C14/17/07), Research Foundation - Flanders (EOS No. 30992574, G0D8819N). Thanks to StatsBomb for providing the data used in this paper.

figure a

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tom Decroos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Decroos, T., Van Roy, M., Davis, J. (2021). SoccerMix: Representing Soccer Actions with Mixture Models. In: Dong, Y., Ifrim, G., Mladenić, D., Saunders, C., Van Hoecke, S. (eds) Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020. Lecture Notes in Computer Science(), vol 12461. Springer, Cham. https://doi.org/10.1007/978-3-030-67670-4_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-67670-4_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-67669-8

  • Online ISBN: 978-3-030-67670-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics