Abstract
Sports betting has become one of the most profitable businesses in the world, generating millions of dollars every year. One of the sports most affected is baseball, which changed significantly after statistical methods were introduced to tune team strategy. This effect, known as Moneyball, started in 2002 when the Oakland Athletics began to choose players according to their statistics. After this successful approach, several teams adopted the same strategy, building statistically strong teams. Statistical information about players and matches has become highly important, leading to datasets such as Retrosheet, which collects detailed information about players, teams and matches from 1956 to the present. This work aims to build a forecasting model for baseball that predicts the results of new matches from previous statistical information. We combine time-series and clustering algorithms to generate a model that learns how teams and matches evolve and tries to predict the final results. Although this model is not completely accurate, it is a good starting point for future models.
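The abstract does not say which time-series or clustering algorithms are combined, so the following is only an illustrative Python sketch and not the authors' method. It assumes a rolling win rate as the time-series feature and plain one-dimensional k-means as the clustering step. The function names, the `(home, away, home_won)` tuple layout, and the team labels are all hypothetical.

```python
from collections import defaultdict, deque


def rolling_win_rates(matches, window=3):
    """Win rate of each team over its last `window` games.

    `matches` is a chronological list of (home, away, home_won) tuples;
    this layout is an assumption, not the Retrosheet format.
    """
    recent = defaultdict(lambda: deque(maxlen=window))
    for home, away, home_won in matches:
        recent[home].append(1 if home_won else 0)
        recent[away].append(0 if home_won else 1)
    return {team: sum(r) / len(r) for team, r in recent.items()}


def kmeans_1d(values, k=2, iterations=20):
    """Plain k-means on scalars; returns (centroids, labels)."""
    ordered = sorted(values)
    # Deterministic start: spread the centroids across the sorted values.
    centroids = [ordered[i * (len(ordered) - 1) // max(k - 1, 1)]
                 for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iterations):
        # Assign each value to its nearest centroid.
        labels = [min(range(k), key=lambda c: abs(v - centroids[c]))
                  for v in values]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:
                centroids[c] = sum(members) / len(members)
    return centroids, labels


def cluster_strengths(rates, k=2):
    """Map each team to the centroid (mean form) of its cluster."""
    teams = list(rates)
    centroids, labels = kmeans_1d([rates[t] for t in teams], k=k)
    return {t: centroids[lab] for t, lab in zip(teams, labels)}


def predict_home_win(home, away, rates, strengths):
    """Predict a home win if the home team's cluster is at least as strong.

    Teams in the same cluster are compared on their own rolling rate;
    exact ties go to the home team (an arbitrary choice for this sketch).
    """
    if strengths[home] != strengths[away]:
        return strengths[home] > strengths[away]
    return rates[home] >= rates[away]


# Toy history with hypothetical teams A-D.
history = [
    ("A", "B", True), ("C", "D", False), ("A", "C", True),
    ("D", "B", True), ("A", "D", True), ("B", "C", False),
]
rates = rolling_win_rates(history)
strengths = cluster_strengths(rates)
print(predict_home_win("C", "A", rates, strengths))
```

In this sketch the clustering step groups teams by recent form, so a prediction first compares group strength and only falls back to individual form inside a group. A real model would use richer per-match statistics and a validated forecasting method.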
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Menéndez, H.D., Vázquez, M., Camacho, D. (2015). Mixed Clustering Methods to Forecast Baseball Trends. In: Camacho, D., Braubach, L., Venticinque, S., Badica, C. (eds) Intelligent Distributed Computing VIII. Studies in Computational Intelligence, vol 570. Springer, Cham. https://doi.org/10.1007/978-3-319-10422-5_19
DOI: https://doi.org/10.1007/978-3-319-10422-5_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10421-8
Online ISBN: 978-3-319-10422-5
eBook Packages: Engineering, Engineering (R0)