Bandwidth Selection for Nadaraya-Watson Kernel Estimator Using Cross-Validation Based on Different Penalty Functions

  • Conference paper
Machine Learning and Cybernetics (ICMLC 2014)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 481)

Abstract

Traditional cross-validation usually selects an over-smoothing bandwidth for kernel regression. Penalty-function-based cross-validation methods, such as generalized cross-validation (\(\mathrm{CV}_{\mathrm{GCV}}\)), Shibata’s model selector (\(\mathrm{CV}_{\mathrm{S}}\)), Akaike’s information criterion (\(\mathrm{CV}_{\mathrm{AIC}}\)), and Akaike’s finite prediction error (\(\mathrm{CV}_{\mathrm{FPE}}\)), were introduced to alleviate this problem. In this paper, we investigate how these four penalty functions influence cross-validation-based bandwidth selection for a typical kernel regression method, the Nadaraya-Watson kernel estimator (NWKE). First, we discuss the mathematical properties of the four penalty functions. Then, we report experiments comparing the performance of the corresponding cross-validation methods. Finally, we give guidelines for choosing among the penalty functions in practical applications.
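
To make the setting concrete, the following is a minimal sketch of penalized bandwidth selection for the NWKE, not the paper's own implementation. It assumes a Gaussian kernel and the standard textbook forms of the four penalty functions, as found for example in Härdle's Applied Nonparametric Regression: \(\Xi_{\mathrm{GCV}}(u) = (1-u)^{-2}\), \(\Xi_{\mathrm{S}}(u) = 1+2u\), \(\Xi_{\mathrm{AIC}}(u) = \exp(2u)\), and \(\Xi_{\mathrm{FPE}}(u) = (1+u)/(1-u)\). The paper's exact definitions may differ. The score is taken to be the mean of the squared residuals, each multiplied by the penalty evaluated at that point's own smoother weight. All function names (`nw_fit`, `penalized_cv`, `select_bandwidth`) are hypothetical.

```python
import numpy as np


def gaussian_kernel(u):
    """Standard Gaussian kernel (an assumed choice, not taken from the paper)."""
    return np.exp(-0.5 * u ** 2) / np.sqrt(2.0 * np.pi)


# Assumed textbook forms of the four penalty functions.
PENALTIES = {
    "GCV": lambda u: (1.0 - u) ** -2,
    "S": lambda u: 1.0 + 2.0 * u,
    "AIC": lambda u: np.exp(2.0 * u),
    "FPE": lambda u: (1.0 + u) / (1.0 - u),
}


def nw_fit(x, y, h):
    """Nadaraya-Watson fit at the sample points.

    Returns the fitted values and the diagonal of the smoother matrix,
    i.e. the weight each observation receives in its own prediction.
    """
    k = gaussian_kernel((x[:, None] - x[None, :]) / h)
    w = k / k.sum(axis=1, keepdims=True)  # each row sums to one
    return w @ y, np.diag(w)


def penalized_cv(x, y, h, penalty):
    """Mean squared residual, inflated pointwise by the chosen penalty."""
    fitted, self_weight = nw_fit(x, y, h)
    return np.mean((y - fitted) ** 2 * PENALTIES[penalty](self_weight))


def select_bandwidth(x, y, grid, penalty):
    """Return the bandwidth in `grid` with the smallest penalized score."""
    scores = [penalized_cv(x, y, h, penalty) for h in grid]
    return grid[int(np.argmin(scores))]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = np.sort(rng.uniform(0.0, 1.0, 100))
    y = np.sin(2.0 * np.pi * x) + rng.normal(0.0, 0.2, x.size)
    grid = np.linspace(0.02, 0.5, 25)
    for name in PENALTIES:
        print(name, select_bandwidth(x, y, grid, name))
```

Under these assumed forms, all four penalties agree to first order (\(1 + 2u\)) and satisfy \(\Xi_{\mathrm{S}} \le \Xi_{\mathrm{AIC}} \le \Xi_{\mathrm{FPE}} \le \Xi_{\mathrm{GCV}}\) on \(0 \le u < 1\), so they differ only in how strongly they punish small bandwidths. The grid should exclude bandwidths so small that an observation's own weight approaches 1, where the GCV and FPE penalties diverge.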

Author information

Corresponding author

Correspondence to Yumin Zhang.

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, Y. (2014). Bandwidth Selection for Nadaraya-Watson Kernel Estimator Using Cross-Validation Based on Different Penalty Functions. In: Wang, X., Pedrycz, W., Chan, P., He, Q. (eds) Machine Learning and Cybernetics. ICMLC 2014. Communications in Computer and Information Science, vol 481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45652-1_10

  • DOI: https://doi.org/10.1007/978-3-662-45652-1_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-45651-4

  • Online ISBN: 978-3-662-45652-1

  • eBook Packages: Computer Science, Computer Science (R0)
