Abstract
The traditional cross-validation usually selects an over-smoothing bandwidth for kernel regression. The penalty function based cross-validation (e.g., generalized cross-validation (\(\mathrm{{CV}}_{\mathrm{{GCV}}}\)), the Shibata’s model selector (\(\mathrm{{CV}}_{\mathrm{{S}}}\)), the Akaike’s information criterion (\(\mathrm{{CV}}_{\mathrm{{AIC}}}\)) and the Akaike’s finite prediction error (\(\mathrm{{CV}}_{\mathrm{{FPE}}}\))) are introduced to relieve the problem of selecting over-smoothing bandwidth parameter by the traditional cross-validation for kernel regression problems. In this paper, we investigate the influence of these four different penalty functions on the cross-validation based bandwidth selection in the framework of a typical kernel regression method, i.e., the Nadaraya-Watson kernel estimator (NWKE). Firstly, we discuss the mathematical properties of these four penalty functions. Then, experiments are given to compare the performance of aforementioned cross-validation methods. Finally, we give guidelines for the selection of different penalty functions in practical applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Harta, J.D., Wehrlya, T.E.: Kernel Regression Estimation Using Repeated Measurements Data. Journal of the American Statistical Association 81(396), 1080–1088 (1986)
Hart, J.D.: Kernel Regression Estimation With Time Series Errors. Journal of the Royal Statistical Society, Series B: Methodological 53(1), 173–187 (1991)
Herrmanna, E.: Local Bandwidth Choice in Kernel Regression Estimation. Journal of Computational and Graphical Statistics 6(1), 35–54 (1997)
Dabo-Nianga, S., Rhomarib, N.: Kernel Regression Estimation in a Banach Space. Journal of Statistical Planning and Inference 139(4), 1421–1434 (2009)
Girarda, S., Guilloub, A., Stupflerc, G.: Frontier Estimation with Kernel Regression on High Order Moments. Journal of Multivariate Analysis 116, 172–189 (2013)
Watson, G.S.: Smooth Regression Analysis. Sankhy: The Indian Journal of Statistics, Series A (1961–2002) 26(4), 359–372 (1964)
Priestley, M.B., Chao, M.T.: Non-Parametric Function Fitting. Journal of the Royal Statistical Society. Series B (Methodological) 34(3), 385–392 (1972)
Gasser, T., Müller, H.G.: Kernel estimation of regression functions. In: Gasser, T., Rosenblatt, M. (eds.) Smoothing Techniques for Curve Estimation. Lecture Notes in Mathematics, vol. 757, pp. 23–68. Springer, Heidelberg (1979)
Parzen, E.: On Estimation of a Probability Density Function and Mode. Annals of Mathematical Statistics 33(3), 1065–1076 (1962)
Scott, D.W.: Multivariate Density Estimation: Theory, Practice, and Visualization. John Wiley and Sons, Inc. (1992)
Wand, M.P., Jones, M.C.: Kernel Smoothing. Chapman and Hall (1995)
Wang, X.Z., He, Y.L., Wang, D.D.: Non-Naive Bayesian Classifiers for Classification Problems with Continuous Attributes. IEEE Transactions on Cybernetics (2013), doi:10.1109/TCYB.2013.2245891
Leung, D.H.Y.: Cross-Validation in Nonparametric Regression With Outliers. The Annals of Statistics 33(5), 2291–2310 (2005)
Härdle, W.: Applied Nonparametric Regression. Cambridge University Press (1994)
Hardle, W., Marron, J.S.: Bootstrap Simultaneous Error Bars for Nonparametric Regression. The Annals of Statistics 19(2), 778–796 (1991)
Golub, G.H., Heath, M., Wahba, G.: Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter 21(2), 215–223 (1979)
Wechsler, H., Duric, Z., Li, F.Y., et al.: Motion estimation using statistical learning theory. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(4), 466–478 (2004)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Y. (2014). Bandwidth Selection for Nadaraya-Watson Kernel Estimator Using Cross-Validation Based on Different Penalty Functions. In: Wang, X., Pedrycz, W., Chan, P., He, Q. (eds) Machine Learning and Cybernetics. ICMLC 2014. Communications in Computer and Information Science, vol 481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45652-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-662-45652-1_10
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45651-4
Online ISBN: 978-3-662-45652-1
eBook Packages: Computer ScienceComputer Science (R0)