Abstract
Analyzing playing style is a recurring task within soccer analytics that plays a crucial role in club activities such as player scouting and match preparation. It involves identifying and summarizing prototypical behaviors of teams and players that reoccur both within and across matches. Current techniques for analyzing playing style are often hindered by the sparsity of event stream data (i.e., the same player rarely performs the same action in the same location more than once). This paper proposes SoccerMix, a soft clustering technique based on mixture models that enables a novel probabilistic representation for soccer actions. SoccerMix overcomes the sparsity of event stream data by probabilistically grouping together similar actions in a data-driven manner. We show empirically how SoccerMix can capture the playing style of both teams and players and present an alternative view of a team’s style that focuses not on the team’s own actions, but rather on how the team forces its opponents to deviate from their usual playing style.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
More details on our approach to select the number of components used in each mixture model can be found in the public implementation.
- 4.
References
Bailey, T.L., Elkan, C., et al.: Fitting a mixture model by expectation maximization to discover motifs in bipolymers (1994)
Bekkers, J., Dabadghao, S.: Flow motifs in soccer: what can passing behaviortell us? J. Sports Anal. (Preprint), 1–13 (2017)
Best, D., Fisher, N.I.: Efficient simulation of the von mises distribution. J. Royal Stat. Soc. Ser. C (Applied Statistics) 28(2), 152–157 (1979)
Breunig, M.M., Kriegel, H.P., Ng, R.T., Sander, J.: Lof: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of data, pp. 93–104 (2000)
Cintia, P., Rinzivillo, S., Pappalardo, L.: A network-based approach to evaluate the performance of football teams. In: Machine Learning and Data Mining for Sports Analytics Workshop, Porto, Portugal (2015)
Decroos, T., Bransen, L., Van Haaren, J., Davis, J.: Actions speak louder than goals: Valuing player actions in soccer. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2019, pp. 1851–1861. ACM, New York (2019). https://doi.org/10.1145/3292500.3330758
Decroos, T., Davis, J.: Player vectors: characterizing soccer players’ playing style from match event streams. In: Brefeld, U., Fromont, E., Hotho, A., Knobbe, A., Maathuis, M., Robardet, C. (eds.) ECML PKDD 2019. LNCS (LNAI), vol. 11908, pp. 569–584. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-46133-1_34
Decroos, T., Van Haaren, J., Davis, J.: Automatic discovery of tactics in spatio-temporal soccer match data. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 223–232 (2018)
Gyarmati, L., Hefeeda, M.: Analyzing in-game movements of soccer players at scale. arXiv preprint arXiv:1603.05583 (2016)
Gyarmati, L., Kwak, H., Rodriguez, P.: Searching for a unique style in soccer. arXiv preprint arXiv:1409.0308 (2014)
Mardia, K.V., Jupp, P.E.: Directional Statistics, vol. 494. Wiley, Chichester (2009)
McLachlan, G.J., Basford, K.E.: Mixture Models: Inference and Applications to Clustering, vol. 38. M. Dekker, New York (1988)
Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Pena, J.L.: A Markovian model for association football possession and its outcomes. arXiv preprint arXiv:1403.7993 (2014)
Reynolds, D.A.: Gaussian mixture models. Encycl. Biometrics 741, 659–663 (2009)
Van Haaren, J., Dzyuba, V., Hannosset, S., Davis, J.: Automatically discovering offensive patterns in soccer match data. In: Fromont, E., De Bie, T., van Leeuwen, M. (eds.) IDA 2015. LNCS, vol. 9385, pp. 286–297. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24465-5_25
Van Haaren, J., Hannosset, S., Davis, J.: Strategy discovery in professional soccer match data. In: Proceedings of the KDD-16 Workshop on Large-Scale Sports Analytics, pp. 1–4 (2016)
Wang, Q., Zhu, H., Hu, W., Shen, Z., Yao, Y.: Discerning tactical patterns for professional soccer teams: an enhanced topic model with applications. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 2197–2206 (2015)
Acknowledgements
Tom Decroos is supported by the Research Foundation-Flanders (FWO-Vlaanderen). Maaike Van Roy is supported by the Research Foundation-Flanders under EOS No. 30992574. Jesse Davis is partially supported by KU Leuven Research Fund (C14/17/07), Research Foundation - Flanders (EOS No. 30992574, G0D8819N). Thanks to StatsBomb for providing the data used in this paper.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Decroos, T., Van Roy, M., Davis, J. (2021). SoccerMix: Representing Soccer Actions with Mixture Models. In: Dong, Y., Ifrim, G., Mladenić, D., Saunders, C., Van Hoecke, S. (eds) Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track. ECML PKDD 2020. Lecture Notes in Computer Science(), vol 12461. Springer, Cham. https://doi.org/10.1007/978-3-030-67670-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-030-67670-4_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67669-8
Online ISBN: 978-3-030-67670-4
eBook Packages: Computer ScienceComputer Science (R0)