Abstract
Mutation validation (MV) is a recently proposed approach for model selection, garnering significant interest due to its unique characteristics and potential benefits compared to the widely used cross-validation (CV) method. In this study, we empirically compared MV and k-fold CV using benchmark and real-world datasets. Employing Bayesian tests, we compared their generalization estimates, yielding three posterior probabilities: practical equivalence, CV superiority, and MV superiority. We also evaluated differences in the capacity of the selected models and in computational efficiency. We found that both MV and CV select models with practically equivalent generalization performance across various machine learning algorithms and the majority of benchmark datasets. MV exhibited advantages in terms of selecting simpler models and lower computational costs. However, in some cases MV selected overly simplistic models, leading to underfitting, and showed instability in hyperparameter selection. These limitations of MV became more evident in the evaluation of a real-world neuroscientific task of predicting sex at birth using brain functional connectivity.
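The sketch below illustrates the kind of comparison described in the abstract, under simplifying assumptions: a synthetic binary classification task, a support vector classifier with the regularisation strength C as the only hyperparameter, a simplified mutation-validation score (agreement of a model trained on partially flipped labels with the original clean labels; an illustrative proxy, not the exact metric proposed by Zhang et al.), and the Bayesian correlated t-test of Corani and Benavoli to obtain the three posterior probabilities. All function names and parameter values are illustrative choices, not the paper's actual experimental setup.

import numpy as np
from scipy.stats import t as student_t
from sklearn.base import clone
from sklearn.datasets import make_classification
from sklearn.model_selection import KFold, cross_val_score
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X, y = make_classification(n_samples=400, n_features=20, random_state=0)  # binary labels 0/1


def mutation_validation_score(estimator, X, y, mutation_rate=0.2, n_repeats=5):
    # Train on labels with a fraction randomly flipped and score the fitted model
    # against the ORIGINAL labels: memorising the flipped labels (overfitting) and
    # fitting nothing (underfitting) both lower the score. Simplified proxy only.
    scores = []
    for _ in range(n_repeats):
        y_mut = y.copy()
        flip = rng.choice(len(y), size=int(mutation_rate * len(y)), replace=False)
        y_mut[flip] = 1 - y_mut[flip]                  # binary labels assumed
        fitted = clone(estimator).fit(X, y_mut)
        scores.append(fitted.score(X, y))
    return float(np.mean(scores))


def correlated_ttest_probs(diffs, test_frac, rope):
    # Bayesian correlated t-test (Corani & Benavoli, 2015): posterior probabilities
    # that method A is worse than, practically equivalent to (within +/- rope),
    # or better than method B, given paired per-fold score differences A - B.
    diffs = np.asarray(diffs, dtype=float)
    n = len(diffs)
    rho = test_frac                                    # correlation between overlapping folds
    scale = np.sqrt((1.0 / n + rho / (1.0 - rho)) * diffs.var(ddof=1))
    posterior = student_t(df=n - 1, loc=diffs.mean(), scale=scale)
    p_worse = posterior.cdf(-rope)
    p_equiv = posterior.cdf(rope) - p_worse
    p_better = 1.0 - posterior.cdf(rope)
    return p_worse, p_equiv, p_better


# Hyperparameter selection: pick C by 5-fold CV and by the MV proxy.
candidates = [0.01, 0.1, 1.0, 10.0, 100.0]
cv_scores = {C: cross_val_score(SVC(C=C), X, y, cv=5).mean() for C in candidates}
mv_scores = {C: mutation_validation_score(SVC(C=C), X, y) for C in candidates}
best_cv = max(cv_scores, key=cv_scores.get)
best_mv = max(mv_scores, key=mv_scores.get)
print(f"CV pick: C={best_cv}   MV pick: C={best_mv}")

# Compare the two selected models with the Bayesian test (skipped if the picks
# agree, in which case the posterior is degenerate).
if best_cv != best_mv:
    folds = KFold(n_splits=10, shuffle=True, random_state=0)   # shared folds -> paired scores
    diffs = (cross_val_score(SVC(C=best_cv), X, y, cv=folds)
             - cross_val_score(SVC(C=best_mv), X, y, cv=folds))
    p_mv, p_eq, p_cv = correlated_ttest_probs(diffs, test_frac=0.1, rope=0.01)
    print(f"P(MV-selected better)={p_mv:.2f}  "
          f"P(equivalent)={p_eq:.2f}  P(CV-selected better)={p_cv:.2f}")

Using the same KFold object for both selected models keeps the fold-wise scores paired, which is what the correlated t-test assumes; the rope width (here 0.01 accuracy) defines what counts as practical equivalence.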
Acknowledgements
This work was partly supported by the Helmholtz Portfolio Theme “Supercomputing and Modelling for the Human Brain” and by the Max Planck School of Cognition supported by the Federal Ministry of Education and Research (BMBF) and the Max Planck Society (MPG).
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Yu, J., Hamdan, S., Sasse, L., Morrison, A., Patil, K.R. (2024). Empirical Comparison Between Cross-Validation and Mutation-Validation in Model Selection. In: Miliou, I., Piatkowski, N., Papapetrou, P. (eds) Advances in Intelligent Data Analysis XXII. IDA 2024. Lecture Notes in Computer Science, vol 14642. Springer, Cham. https://doi.org/10.1007/978-3-031-58553-1_5
Print ISBN: 978-3-031-58555-5
Online ISBN: 978-3-031-58553-1