SiZer for smoothing splines

Marron, J. S.; Zhang, Jin Ting

doi:10.1007/BF02741310

Published: 01 September 2005

Volume 20, pages 481–502, (2005)

Computational Statistics

J. S. Marron¹ &
Jin Ting Zhang²


Abstract

Smoothing splines are an attractive method for scatterplot smoothing. The SiZer approach to statistical inference is adapted to this smoothing method, and the resulting procedure is named SiZerSS. This allows quick and sure inference as to “which features in the smooth are really there” as opposed to “which are due to sampling artifacts” when using smoothing splines for data analysis. Applications of SiZerSS to mode, linearity, quadraticity and monotonicity tests are illustrated using a real data example. Small-scale simulations demonstrate that SiZerSS and SiZerLL (the original local linear version of SiZer) often perform similarly in exploring data structure, but neither can completely replace the other.


References

Chaudhuri, P. and Marron, J. S. (1999), SiZer for exploration of structure in curves, Journal of the American Statistical Association, 94, 807–823.
Eubank, R. L. (2000), Nonparametric Regression and Spline Smoothing, Marcel Dekker, New York.
Fan, J. and Gijbels, I. (1996), Local Polynomial Modelling and Its Applications, Chapman and Hall, London.
Fan, J. and Marron, J. S. (1994), Fast implementations of nonparametric curve estimators, Journal of Computational and Graphical Statistics, 3, 35–56.
Green, P. J. and Silverman, B. W. (1994), Nonparametric Regression and Generalized Linear Models, Chapman and Hall, London.
Härdle, W. (1990), Applied Nonparametric Regression, Cambridge University Press, Boston.
Hastie, T. J. and Tibshirani, R. J. (1990), Generalized Additive Models, Chapman and Hall, London.
Loader, C. (1999), Local Regression and Likelihood, Springer Verlag, Berlin.
Marron, J. S. (1996), A personal view of smoothing and statistics, in Statistical Theory and Computational Aspects of Smoothing, eds. W. Härdle and M. Schimek, 1–9 (with discussion, and rejoinder 103–112).
Silverman, B. W. (1984), Spline smoothing: the equivalent kernel method, Annals of Statistics, 12, 898–916.
Wahba, G. (1991), Spline Models for Observational Data, SIAM, Philadelphia.
Wand, M. P. and Jones, M. C. (1995), Kernel Smoothing, Chapman and Hall, London.


Author information

Authors and Affiliations

Department of Statistics, University of North Carolina, 27599-3260, Chapel Hill, NC
J. S. Marron
Department of Statistics and Applied Probability, National University of Singapore, Singapore, 119260, Singapore
Jin Ting Zhang


Additional information

Marron’s research was supported by the Dept. of Stat. and Appl. Prob., National Univ. of Singapore, and by the National Science Foundation Grant DMS-9971649. Zhang’s research was supported by the National Univ. of Singapore Academic Research grant R-155-000-023-112. The authors thank the Editor, the Associate Editor, and the referees for their invaluable comments and suggestions, which helped improve the article significantly.

Appendix: Derivations of (5),(6), and (7)

First of all, assume X₁, X₂, ⋯, X_n have been sorted so that X₁ < X₂ < ⋯ < X_n. Write f_i = f(X_i) and γ_i = f″(X_i) for the values of f(x) and f″(x) at X_i, i = 1, 2, ⋯, n. Define f = (f₁, ⋯, f_n)^T and γ = (γ₂, ⋯, γ_{n−1})^T. Let h_i = X_{i+1} − X_i, i = 1, 2, ⋯, n − 1. Let Q : n × (n − 2), R : (n − 2) × (n − 2) and K : n × n be the matrices as defined in Green and Silverman (1994, pages 12–13). According to Theorem 2.1 of Green and Silverman (1994, page 13), f is a natural cubic spline with knots at X_i, i = 1, 2, ⋯, n if and only if

$$K=Q R^{-1} Q^{T}, \quad \gamma=R^{-1} Q^{T} \mathbf{f}, \quad \int f^{\prime \prime}(x)^{2} d x=\mathbf{f}^{T} K \mathbf{f}. \tag{11}$$
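The matrices in (11) can be assembled in a few lines. The following is a minimal NumPy sketch, assuming the tridiagonal definitions of Q and R given by Green and Silverman (1994); the function name `band_matrices` and the three-point toy data are illustrative and do not come from the article. Dense matrices are used for readability, although a practical implementation would likely exploit the banded structure.

```python
import numpy as np

def band_matrices(X):
    """Return Q (n x (n-2)), R ((n-2) x (n-2)) and K = Q R^{-1} Q^T
    for sorted, distinct design points X with n >= 3."""
    X = np.asarray(X, dtype=float)
    n, h = len(X), np.diff(X)          # h[i] = X[i+1] - X[i]
    Q = np.zeros((n, n - 2))
    R = np.zeros((n - 2, n - 2))
    for j in range(n - 2):             # column j belongs to interior knot X[j+1]
        Q[j, j] = 1.0 / h[j]
        Q[j + 1, j] = -1.0 / h[j] - 1.0 / h[j + 1]
        Q[j + 2, j] = 1.0 / h[j + 1]
        R[j, j] = (h[j] + h[j + 1]) / 3.0
        if j + 1 < n - 2:              # symmetric off-diagonal entries
            R[j, j + 1] = R[j + 1, j] = h[j + 1] / 6.0
    K = Q @ np.linalg.solve(R, Q.T)
    return Q, R, K

# Toy example (hypothetical data): knots 0, 1, 2 with values 0, 1, 0.
X = np.array([0.0, 1.0, 2.0])
f = np.array([0.0, 1.0, 0.0])
Q, R, K = band_matrices(X)
gamma = np.linalg.solve(R, Q.T @ f)    # second derivatives at the interior knots
roughness = f @ K @ f                  # integral of f''(x)^2 over the knot range
```

For this toy example the natural cubic spline on [0, 1] is 1.5x − 0.5x³, so the second derivative at the middle knot is −3 and the roughness integral is 6, which is what `gamma` and `roughness` return.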

Simple calculation then leads to the following desired formula:

$$\hat{\mathbf{f}}=(W+\lambda K)^{-1} W \mathbf{Y} \equiv A_{\lambda} \mathbf{Y}, \tag{12}$$

with the weight matrix W = diag(w₁, w₂, ⋯, w_n), the hat matrix A_λ = (W + λK)⁻¹ W, and the response vector Y = (Y₁, Y₂, ⋯, Y_n)^T.
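As a quick numerical illustration of (12), the sketch below computes the hat matrix and the fitted values with NumPy. It assumes that the fit minimizes the usual weighted penalized criterion (Y − f)^T W (Y − f) + λ f^T K f, which is the criterion whose minimizer has the form (12). The function name `smoothing_spline_fit` and the toy data are hypothetical, and K is hard-coded for the knots 0, 1, 2, where Q = (1, −2, 1)^T and R = 2/3.

```python
import numpy as np

def smoothing_spline_fit(K, w, Y, lam):
    """Return (f_hat, A) with A = (W + lam*K)^{-1} W and f_hat = A @ Y."""
    W = np.diag(np.asarray(w, dtype=float))
    A = np.linalg.solve(W + lam * K, W)   # solve instead of forming an inverse
    return A @ np.asarray(Y, dtype=float), A

# K = Q R^{-1} Q^T for knots X = (0, 1, 2), worked out by hand.
q = np.array([1.0, -2.0, 1.0])
K = 1.5 * np.outer(q, q)

Y = np.array([0.0, 1.0, 0.0])             # hypothetical responses
w = np.ones(3)                            # equal weights
f_hat, A = smoothing_spline_fit(K, w, Y, lam=1.0)
```

Two properties are easy to check with this sketch: with λ = 0 the fit interpolates Y, and responses that are exactly linear in X are reproduced for every λ because K annihilates linear functions.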

Using (11) and (12), we are now ready to give the matrix formulas for computing $\hat{f}_{\lambda}(x), \hat{f}_{\lambda}^{\prime}(x)$, and $\widehat{sd}\left\{\hat{f}_{\lambda}^{\prime}(x)\right\}$ at a given grid of locations x = [x₁, x₂, ⋯, x_N]^T. By Green and Silverman (1994, pages 22–23), for any x, we can write $\hat{f}(x)$ and $\hat{f}{}^{\prime}(x)$ as linear combinations of $\hat{\mathbf{f}}$ and $\hat{\gamma}$. Let h_i(x) = x − X_i, i = 1, 2, ⋯, n. When x < X₁,

$$\hat{f}(x)=\hat{f}_{1}+h_{1}(x)\left\{\frac{\hat{f}_{2}-\hat{f}_{1}}{h_{1}}-\frac{h_{1}}{6} \hat{\gamma}_{2}\right\}, \quad \hat{f}^{\prime}(x)=\frac{\hat{f}_{2}-\hat{f}_{1}}{h_{1}}-\frac{h_{1}}{6} \hat{\gamma}_{2}.$$

When X_i ≤ x ≤ X_{i+1} for some i = 1, 2, ⋯, n − 1, let $\delta_{i}(x)=[1+\frac{h_{i}(x)}{h_{i}}] \hat{\gamma}_{i+1}+[1-\frac{h_{i+1}(x)}{h_{i}}] \hat{\gamma}_{i}$, with the natural-spline convention $\hat{\gamma}_{1}=\hat{\gamma}_{n}=0$. Then

$$\begin{aligned} \hat{f}(x) &=\frac{h_{i}(x) \hat{f}_{i+1}-h_{i+1}(x) \hat{f}_{i}}{h_{i}}+\frac{h_{i}(x) h_{i+1}(x) \delta_{i}(x)}{6}, \\ \hat{f}^{\prime}(x) &=\frac{\hat{f}_{i+1}-\hat{f}_{i}}{h_{i}}+\frac{h_{i}(x) h_{i+1}(x)(\hat{\gamma}_{i+1}-\hat{\gamma}_{i})}{6 h_{i}}+\frac{\left[h_{i}(x)+h_{i+1}(x)\right] \delta_{i}(x)}{6} \end{aligned}$$

When x > X_n,

$$\hat{f}(x)=\hat{f}_{n}+h_{n}(x)\left\{\frac{\hat{f}_{n}-\hat{f}_{n-1}}{h_{n-1}}+\frac{h_{n-1}}{6} \hat{\gamma}_{n-1}\right\},$$

$$\hat{f}^{\prime}(x)=\frac{\hat{f}_{n}-\hat{f}_{n-1}}{h_{n-1}}+\frac{h_{n-1}}{6} \hat{\gamma}_{n-1}.$$
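The piecewise evaluation can be sketched in NumPy as follows. This is an illustration under stated assumptions, not the authors' code: the function name `eval_natural_spline` is hypothetical, `gamma` is passed with all n entries (zeros at both ends), and outside the knot range the sketch relies on the fact that a natural cubic spline is linear there, so it extrapolates with the slope obtained from the interior derivative formula at X₁ and X_n.

```python
import numpy as np

def eval_natural_spline(X, f, gamma, x):
    """Evaluate a natural cubic spline and its first derivative at x.

    X: sorted knots; f: values at the knots; gamma: second derivatives
    at all n knots, with gamma[0] = gamma[-1] = 0.
    """
    X, f, gamma = (np.asarray(a, dtype=float) for a in (X, f, gamma))
    x = np.atleast_1d(np.asarray(x, dtype=float))
    n, h = len(X), np.diff(X)
    # 0-based interval index i with X[i] <= x <= X[i+1] (clipped at the ends)
    i = np.clip(np.searchsorted(X, x, side="right") - 1, 0, n - 2)
    hi = h[i]
    a = x - X[i]          # h_i(x)
    b = x - X[i + 1]      # h_{i+1}(x), non-positive inside the interval
    delta = (1 + a / hi) * gamma[i + 1] + (1 - b / hi) * gamma[i]
    val = (a * f[i + 1] - b * f[i]) / hi + a * b * delta / 6
    der = ((f[i + 1] - f[i]) / hi
           + a * b * (gamma[i + 1] - gamma[i]) / (6 * hi)
           + (a + b) * delta / 6)
    # Linear tails: slopes equal the spline derivative at the boundary knots.
    slope_left = (f[1] - f[0]) / h[0] - h[0] * gamma[1] / 6
    slope_right = (f[-1] - f[-2]) / h[-1] + h[-1] * gamma[-2] / 6
    left, right = x < X[0], x > X[-1]
    val = np.where(left, f[0] + (x - X[0]) * slope_left, val)
    val = np.where(right, f[-1] + (x - X[-1]) * slope_right, val)
    der = np.where(left, slope_left, np.where(right, slope_right, der))
    return val, der
```

For example, with knots (0, 1, 2), values (0, 1, 0) and second derivatives (0, −3, 0), the spline on [0, 1] is 1.5x − 0.5x³, and the function returns the value 0.6875 and the slope 1.125 at x = 0.5.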

It follows that $\hat{f}(x)$ and $\hat{f}^{\prime}(x)$ can be written respectively as $c^{T} \hat{\mathbf{f}}-d^{T} \hat{\gamma}$ and $\tilde{c}^{T} \hat{\mathbf{f}}-\tilde{d}^{T} \hat{\gamma}$, where $c, \tilde{c}, d$ and $\tilde{d}$ are coefficient vectors depending only on x and X₁, X₂, ⋯, X_n. Let $\hat{f}\left(x_{i}\right)=c_{i}^{T} \hat{\mathbf{f}}-d_{i}^{T} \hat{\gamma}, \hat{f}^{\prime}\left(x_{i}\right)=\tilde{c}_{i}^{T} \hat{\mathbf{f}}-\tilde{d}_{i}^{T} \hat{\gamma}, i=1,2, \cdots, N$. Define C = (c₁, c₂, ⋯, c_N)^T, D = (d₁, d₂, ⋯, d_N)^T, and define $\tilde{C}$ and $\tilde{D}$ similarly. Set $\hat{\mathbf{f}}_{\mathbf{x}}=[\hat{f}(x_{1}), \cdots, \hat{f}(x_{N})]^{T}$ and $\hat{\mathbf{f}}_{\mathbf{x}}^{\prime}=[\hat{f}^{\prime}(x_{1}), \cdots, \hat{f}^{\prime}(x_{N})]^{T}$. Then, using (11) and (12), we have

$$\begin{aligned} \hat{\mathbf{f}}_{\mathbf{x}} &=C \hat{\mathbf{f}}-D \hat{\gamma}=[C-D R^{-1} Q^{T}] \hat{\mathbf{f}}=M \hat{\mathbf{f}}=M A_{\lambda} \mathbf{Y}, \\ \hat{\mathbf{f}}_{\mathbf{x}}^{\prime} &=\tilde{C} \hat{\mathbf{f}}-\tilde{D} \hat{\gamma}=[\tilde{C}-\tilde{D} R^{-1} Q^{T}] \hat{\mathbf{f}}=\tilde{M} \hat{\mathbf{f}}=\tilde{M} A_{\lambda} \mathbf{Y}, \end{aligned}$$

where M = C − DR^{−1}Q^T and $\tilde{M}=\tilde{C}-\tilde{D} R^{-1} Q^{T}$.
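The matrix formulas above might be put together as in the following NumPy sketch. Several things here are assumptions made for illustration: the function names, the restriction of the grid x to the knot range [X₁, X_n] (so only the interior formulas are needed), and the user-supplied covariance matrix `cov_Y` of the responses. Given `cov_Y`, the standard deviations follow from linearity alone, since the derivative estimate is $\tilde{M} A_{\lambda} \mathbf{Y}$; how that covariance is estimated is not covered by this sketch.

```python
import numpy as np

def qr_matrices(X):
    """Q and R as defined in Green and Silverman (1994)."""
    n, h = len(X), np.diff(X)
    Q, R = np.zeros((n, n - 2)), np.zeros((n - 2, n - 2))
    for j in range(n - 2):
        Q[j, j], Q[j + 2, j] = 1 / h[j], 1 / h[j + 1]
        Q[j + 1, j] = -1 / h[j] - 1 / h[j + 1]
        R[j, j] = (h[j] + h[j + 1]) / 3
        if j + 1 < n - 2:
            R[j, j + 1] = R[j + 1, j] = h[j + 1] / 6
    return Q, R

def grid_operators(X, x):
    """Return M, M_tilde and K for grid points x inside [X[0], X[-1]]."""
    X, x = np.asarray(X, dtype=float), np.asarray(x, dtype=float)
    n, N, h = len(X), len(x), np.diff(X)
    i = np.clip(np.searchsorted(X, x, side="right") - 1, 0, n - 2)
    hi, a, b = h[i], x - X[i], x - X[i + 1]      # h_i, h_i(x), h_{i+1}(x)
    rows = np.arange(N)
    C, Ct = np.zeros((N, n)), np.zeros((N, n))
    D, Dt = np.zeros((N, n)), np.zeros((N, n))   # columns for all n knots
    C[rows, i], C[rows, i + 1] = -b / hi, a / hi
    Ct[rows, i], Ct[rows, i + 1] = -1 / hi, 1 / hi
    wi, wi1 = 1 - b / hi, 1 + a / hi             # weights of gamma_i, gamma_{i+1} in delta_i(x)
    D[rows, i], D[rows, i + 1] = -a * b * wi / 6, -a * b * wi1 / 6
    Dt[rows, i] = -(-a * b / (6 * hi) + (a + b) * wi / 6)
    Dt[rows, i + 1] = -(a * b / (6 * hi) + (a + b) * wi1 / 6)
    D, Dt = D[:, 1:-1], Dt[:, 1:-1]              # gamma_1 = gamma_n = 0 are dropped
    Q, R = qr_matrices(X)
    RinvQt = np.linalg.solve(R, Q.T)
    return C - D @ RinvQt, Ct - Dt @ RinvQt, Q @ RinvQt

def sizer_ss_grid(X, Y, w, lam, x, cov_Y):
    """Fitted curve, fitted derivative and derivative sd on the grid x."""
    M, Mt, K = grid_operators(X, x)
    W = np.diag(np.asarray(w, dtype=float))
    A = np.linalg.solve(W + lam * K, W)          # hat matrix A_lambda
    B = Mt @ A                                   # derivative estimate is B @ Y
    sd = np.sqrt(np.einsum("ij,jk,ik->i", B, cov_Y, B))  # sqrt of diag(B cov_Y B^T)
    return M @ A @ Y, B @ Y, sd
```

A SiZer-style map would repeat this computation over a range of λ values and compare the derivative estimate with a multiple of its standard deviation at each grid point; the choice of that multiple is outside the scope of this sketch.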

About this article

Cite this article

Marron, J.S., Zhang, J.T. SiZer for smoothing splines. Computational Statistics 20, 481–502 (2005). https://doi.org/10.1007/BF02741310

Issue Date: September 2005
