Skip to main content
Log in

The higher-order PLS-SEM confirmatory approach for composite indicators of football performance quality

  • Original paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

Supporting the strategic decisions of a football team’s management is becoming crucial. We create some new composite indicators to measure the performance quality, applying both Confirmatory Tetrad Analysis (CTA) and Confirmatory Composite Analysis (CCA) to a Third-Order Partial Least Squares Structural Equation Model (PLS-SEM). To do this, data provided by Electronic Arts (EA) Sports experts and available on the Kaggle data science platform has been used; in particular, the dataset was composed of 29 Key Performance Indices defined by EA Sports experts, concerning the top 5 European leagues. A PLS-SEM for each player’s role was developed, relying on the most recent season, 2021/2022. In order to improve each model, a CTA to evaluate the nature of the constructs (formative or reflective) and a CCA were applied. The results underline how some sub-areas of performance have different significance weights depending on the player’s role; as concurrent and predictive analysis, our third-order Player Indicator overall was compared with the existing EA overall and with some performance quality proxies, such as the player’s market value and wage, showing interesting and consistent relations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. www.sofifa.com.

  2. www.kaggle.com/stefanoleone992/fifa-22-complete-player-dataset.

References

  • Becker JM, Klein K, Wetzels M (2012) Hierarchical latent variable models in pls-sem: guidelines for using reflective-formative type models. Long Range Plann 45(5–6):359–394

    Article  Google Scholar 

  • Biecek P, Burzykowski T (2021) Explanatory model analysis: Explore, explain and examine predictive models. Chapman and Hall/CRC

  • Bollen KA, Kf Ting (2000) A tetrad test for causal indicators. Psychol Methods 5(1):3

    Article  Google Scholar 

  • Borsboom D, Mellenbergh GJ, Van Heerden J (2004) The concept of validity. Psychol Rev 111(4):1061

    Article  Google Scholar 

  • Carpita M, Golia S (2021) Discovering associations between players’ performance indicators and matches’ results in the European soccer leagues. J Appl Stat 48(9):1696–1711

    Article  MathSciNet  Google Scholar 

  • Carpita M, Sandri M, Simonetto A, Zuccolotto P (2015) Discovering the drivers of football match outcomes with data mining. Qual Technol Quant Manag 12(4):561–577

    Article  Google Scholar 

  • Carpita M, Ciavolino E, Pasca P (2019) Exploring and modelling team performances of the Kaggle European soccer database. Stat Modell 19(1):74–101

    Article  Google Scholar 

  • Carpita M, Ciavolino E, Pasca P (2021) Players’ role-based performance composite indicators of soccer teams: a statistical perspective. Soc Indic Res 156:815–830

    Article  Google Scholar 

  • Cefis M (2022a) Football analytics: a bibliometric study about the last decade contributions. Electron J Appl Stat Anal 15(1):232–248

    Google Scholar 

  • Cefis M (2022) Observed heterogeneity in players’ football performance analysis using pls-pm. J Appl Stat. https://doi.org/10.1080/02664763.2022.2101044

    Article  MathSciNet  Google Scholar 

  • Cefis M, Carpita M (2021) Football analytics: a higher-order pls-sem approach to evaluate players’ performance. Book Short Papers SIS 2021:508–513

    Google Scholar 

  • Cefis M, Carpita M (2022) Pls-sem with cca for football goalkeeper’s performance indicators. In: Book of Short Papers IES 2022 Innovation & Society 50: Statistical and Economic Methodologies for Quality Assessment, pp 288–293

  • Cheah JH, Ting H, Ramayah T, Memon MA, Cham TH, Ciavolino E (2019) A comparison of five reflective-formative estimation approaches: reconsideration and recommendations for tourism research. Qual Quant 53(3):1421–1458

    Article  Google Scholar 

  • Cheah JH, Roldán JL, Ciavolino E, Ting H, Ramayah T (2021) Sampling weight adjustments in partial least squares structural equation modeling: guidelines and illustrations. Total Qual Manag Bus Excell 32(13–14):1594–1613

    Article  Google Scholar 

  • Ciavolino, E., Nitti, M. (2013a). Simulation study for pls path modelling with high-order construct: A job satisfaction model evidence. In: Advanced Dynamic Modeling of Eeconomic and Social Systems, Springer, pp 185–207

  • Ciavolino E, Nitti M (2013b) Using the hybrid two-step estimation approach for the identification of second-order latent variable models. J Appl Stat 40(3):508–526

    Article  MathSciNet  Google Scholar 

  • Ciavolino E, Aria M, Cheah JH, Roldán JL (2022a) A tale of pls structural equation modelling: Episode i-a bibliometrix citation analysis. Soc Ind Res pp 1–26

  • Ciavolino E, Ferrante L, Sternativo GA, Cheah JH, Rollo S, Marinaci T, Venuleo C (2022b) A confirmatory composite analysis for the italian validation of the interactions anxiousness scale: a higher-order version. Behaviormetrika 49(1):23–46

    Article  Google Scholar 

  • Commission JRCE, et al. (2008) Handbook on constructing composite indicators: methodology and user guide. OECD publishing

  • Crocetta C, Antonucci L, Cataldo R, Galasso R, Grassia MG, Lauro CN, Marino M (2021) Higher-order pls-pm approach for different types of constructs. Soc Ind Res 154(2):725–754

    Article  Google Scholar 

  • Filetti C, Ruscello B, D’Ottavio S, Fanelli V (2017) A study of relationships among technical, tactical, physical parameters and final outcomes in elite soccer matches as analyzed by a semiautomatic video tracking system. Percept Motor Skills 124(3):601–620

    Article  Google Scholar 

  • Gudergan SP, Ringle CM, Wende S, Will A (2008) Confirmatory tetrad analysis in pls path modeling. J Bus Res 61(12):1238–1249

    Article  Google Scholar 

  • Hair Jr JF, Sarstedt M, Ringle CM, Gudergan SP (2017) Advanced issues in partial least squares structural equation modeling. saGe publications

  • Hair JF Jr, Howard MC, Nitzl C (2020) Assessing measurement model quality in pls-sem using confirmatory composite analysis. J Bus Res 109:101–110

    Article  Google Scholar 

  • Hughes MD, Caudrelier T, James N, Redwood-Brown A, Donnelly I, Kirkbride A, Duschesne C (2012) Moneyball and soccer- an analysis of the key performance indicators of elite male soccer players by position. J Hum Sport Exerc 7:402–412

    Article  Google Scholar 

  • Jöreskog KG (1978) Structural analysis of covariance and correlation matrices. Psychometrika 43(4):443–477

    Article  MathSciNet  Google Scholar 

  • Kirschstein T, Liebscher S (2019) Assessing the market values of soccer players- a robust analysis of data from german 1. and 2. bundesliga. J Appl Stat 46(7):1336–1349

    Article  MathSciNet  Google Scholar 

  • Leguina A (2015) A primer on partial least squares structural equation modeling (pls-sem)

  • Lohmöller JB (1989) Predictive vs. structural modeling: Pls vs. ml. In: Latent variable path modeling with partial least squares, Springer, pp 199–226

  • Matano F, Richardson LF, Pospisil T, Eubanks C, Qin J (2018) Augmenting adjusted plus-minus in soccer with fifa ratings. arXiv preprint arXiv:1810.08032

  • McHale IG, Scarf PA, Folker DE (2012) On the development of a soccer player performance rating system for the English premier league. Interfaces 42(4):339–351

    Article  Google Scholar 

  • Mehmetoglu M, Venturini S (2021) Structural equation modelling with partial least squares using Stata and R. CRC Press

  • Monecke A, Leisch F (2012) sempls: structural equation modeling using partial least squares. J Stat Softw 48(1):1–32

    Google Scholar 

  • Pappalardo L, Cintia P, Ferragina P, Massucco E, Pedreschi D, Giannotti F (2019) Playerank: data-driven performance evaluation and player ranking in soccer via a machine learning approach. ACM Trans Intell Syst Technol 10(5):1–27

    Article  Google Scholar 

  • Ringle C, Da Silva D, Bido D (2015) Structural equation modeling with the smartpls. Braz J Market 13(2)

  • Sanchez G, Trinchera L (2013) Package ‘plspm’. State College, PA, USA http://www.scribd.com/plspm

  • Schultze SR, Wellbrock CM (2018) A weighted plus/minus metric for individual soccer player performance. J Sports Anal 4(2):121–131

    Article  Google Scholar 

  • Shmueli G, Ray S, Estrada JMV, Chatla SB (2016) The elephant in the room: predictive performance of pls models. J Bus Res 69(10):4552–4564

    Article  Google Scholar 

  • Wilson B (2010) Using pls to investigate interaction effects between higher order branding constructs. In: Handbook of partial least squares, Springer, pp 621–652

  • Wold H (1985) Encyclopedia of statistical sciences. Partial least squares Wiley, New York pp 581–591

Download references

Acknowledgements

This research was supported by the project Big Data Analytics in Sports of the University of Brescia https://bodai.unibs.it/bdsports/.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mattia Cefis.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: The PLS-SEM third-order for each model-role

Appendix: The PLS-SEM third-order for each model-role

See Figs. 5, 6 and 7.

Fig. 5
figure 5

Path diagram by defensive roles and estimates significant (\(95\%\) BCa -two tailed- bootstrap CIs with 5000 replications). Legend central backs (CB), full backs (FB)

Fig. 6
figure 6

Path diagram by midfielder roles and estimates significant (\(95\%\) BCa -two tailed- bootstrap CIs with 5000 replications). Legend midfelders (MF), offensive midfelders (OM)

Fig. 7
figure 7

Path diagram by offensive roles and estimates significant (\(95\%\) BCa -two tailed- bootstrap CIs with 5000 replications). Legend wings (WG), forwards (FW)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cefis, M., Carpita, M. The higher-order PLS-SEM confirmatory approach for composite indicators of football performance quality. Comput Stat 39, 93–116 (2024). https://doi.org/10.1007/s00180-022-01295-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-022-01295-4

Keywords

Navigation