Penalized spline smoothing in multivariable survival models with varying coefficients
Introduction
Modeling of survival data is largely dominated by the proportional hazard (PH) model introduced by Cox (1972). Even though the PH model appeals through its simple numerical fitting based on the partial likelihood, the PH assumption often restricts the model in applications, since it requires covariate effects to remain constant over survival time. This assumption has been under major investigation, and numerous papers suggest extensions and testing procedures; see for instance O’Sullivan (1988), O’Quigley and Pessione (1989), Hastie and Tibshirani (1990), Gray (1994), Hess (1994) and Abrahamowicz et al. (1996). For a general overview of estimation and tests in proportional hazard models, we also refer to Lin and Wei (1991), Sasieni (1999) or Grambsch and Therneau (2000). Allowing covariate effects to be dynamic in time leads to a varying coefficient model as generally introduced by Hastie and Tibshirani (1993). Here, constant covariate effects are replaced by smooth but unknown functions. Smooth estimation can then be carried out using, e.g., spline fitting, as in Hastie and Tibshirani (1993) (see also Kooperberg et al., 1995), or by applying local techniques, see e.g. Fan et al. (1997) or Cai and Sun (2003).
Smooth estimation in survival models is usually based on the partial likelihood function. There are, however, two points of criticism that should be raised against the use of the partial likelihood in the context of smoothing. First, in the simple case that covariate effects are in fact constant over time, that is if the PH assumption holds, the cumulative (integrated) hazard function in the likelihood factorizes into the cumulative baseline hazard multiplied by the covariate effects. If the baseline hazard is then estimated by the empirical survivor function, the resulting profile likelihood for the parameters is equivalent to the partial likelihood suggested by Cox. This justification of the partial likelihood is due to Breslow (1972) (see also Cox, 1975, or Wong, 1986). However, if covariate effects do vary with time, that is if the PH assumption is violated, no such factorization of the cumulative hazard exists and, consequently, the partial likelihood has no justification as a profile likelihood function. Second, in partial likelihood estimation the baseline hazard is treated as a nuisance component and is not explicitly estimated. In applications, however, knowledge about the baseline hazard can be of interest, in particular if smooth, non-parametric regression is pursued. For these reasons, it seems worthwhile to work directly with the likelihood function. This is the approach pursued in this paper in order to fit a smooth, non-proportional hazard model. The integrated hazard function in the likelihood is thereby approximated using numerical integration based on a trapezoid approximation. This in turn leads to a simple likelihood function which resembles that of a Poisson model.
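To make the trapezoid-to-Poisson connection concrete, the following is a minimal numerical sketch. The hazard functions and parameter values are illustrative assumptions, not those of the paper; the point is only that each observation's likelihood contribution has Poisson form once the cumulative hazard is replaced by a trapezoid sum over a grid.

```python
import numpy as np

# Hypothetical log-hazard with a time-varying covariate effect:
# log lambda(t; x) = beta0(t) + x * beta1(t); both functions are
# illustrative stand-ins, not the paper's estimates.
def log_hazard(t, x):
    beta0 = -2.0 + 0.1 * t          # baseline log-hazard
    beta1 = 0.8 * np.exp(-0.5 * t)  # covariate effect decaying over time
    return beta0 + x * beta1

# Trapezoid approximation of the cumulative hazard
# Lambda(t) = int_0^t lambda(u) du on a grid 0 = u_0 < ... < u_R = t.
def cum_hazard_trapezoid(t, x, grid_size=200):
    grid = np.linspace(0.0, t, grid_size)
    lam = np.exp(log_hazard(grid, x))
    return np.sum(0.5 * (lam[1:] + lam[:-1]) * np.diff(grid))

# The log-likelihood contribution of one (possibly censored) observation,
# delta * log lambda(t_i; x_i) - Lambda(t_i), has the structure of a
# Poisson log-likelihood with "response" delta.
t_i, delta_i, x_i = 3.0, 1, 1.0
loglik_i = delta_i * log_hazard(t_i, x_i) - cum_hazard_trapezoid(t_i, x_i)
```

Refining the grid makes the approximation error of the cumulative hazard arbitrarily small, so the Poisson-type likelihood can be fitted with standard software for penalized Poisson regression.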
As smoothing technique we employ penalized spline fitting (P-splines). The approach was originally introduced by O’Sullivan (1986), but it finally achieved general recognition with the paper by Eilers and Marx (1996). A comprehensive overview of the current state of the art is found in Ruppert et al. (2003). P-spline smoothing in survival models has been studied in Cai et al. (2002) for baseline hazard smoothing. The underlying idea of P-spline smoothing is to fit a curve using a high-dimensional basis; instead of a simple parametric fit, however, a penalized fit is pursued to guarantee smoothness. The approach resembles standard spline smoothing as discussed, e.g., in Wahba (1978), or, in its generalized form, in Green and Silverman (1994). The major difference is that for spline smoothing the dimension of the corresponding spline basis grows with the sample size, whereas for P-spline smoothing a finite-dimensional basis is used, with the dimension chosen in a rich and generous manner. The approach is numerically very handy. It also has strong links to linear mixed models (see Wand, 2003) and to penalized quasi-likelihood (PQL) estimation in generalized linear mixed models (GLMMs), as discussed in Breslow and Clayton (1993). The connection becomes obvious if the penalty is rewritten as an a priori distribution on the coefficients of the basis. In fact, the smoothing parameter steering the amount of penalization then plays the role of the a priori variance in the resulting GLMM. We utilize this link for smoothing parameter estimation. It will be demonstrated that the PQL approach is numerically simple but fails to estimate reasonable smoothing parameters in low-intensity hazard models. Alternatively, an EM-based procedure as suggested in Booth and Hobert (1999) could be used at the price of increased numerical effort. We suggest a hybrid approach combining the numerically attractive PQL estimates with an Akaike criterion.
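The idea of a rich basis plus a penalty can be sketched in a few lines. The sketch below uses a truncated-line basis with a ridge penalty on the knot coefficients (the variant discussed in Ruppert et al., 2003) rather than the B-spline/difference-penalty variant of Eilers and Marx; the data, knot placement and smoothing parameter are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 200
t = np.sort(rng.uniform(0, 10, n))
y = np.sin(t) + rng.normal(scale=0.3, size=n)  # noisy curve to smooth

# Rich truncated-line basis: intercept, slope, and one column per knot.
# The knot count is generous; smoothness comes from the penalty,
# not from keeping the basis small.
knots = np.linspace(0.5, 9.5, 20)
B = np.column_stack([np.ones(n), t] + [np.maximum(t - k, 0) for k in knots])

# Penalize only the knot coefficients; lam is the smoothing parameter
# that the paper estimates via the GLMM link.
lam = 10.0
D = np.diag([0.0, 0.0] + [1.0] * len(knots))
coef = np.linalg.solve(B.T @ B + lam * D, B.T @ y)
fit = B @ coef
```

Letting lam tend to 0 reproduces the unpenalized high-dimensional fit, while a large lam shrinks the fit towards a straight line; the data-driven choice of lam is exactly the smoothing parameter estimation problem discussed below.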
Our data example concerns the success (survival) of newly founded companies, using the database of the “Munich Founder Study”. The study covers business registrations filed during 1985 and 1986 at the local chambers of commerce in Munich and the surrounding administrative districts. About 5 years later, in 1990, a sample of these businesses was drawn and the founders were subsequently interviewed. Details on the study can be found in Brüderl et al. (1992). The recorded variables include the duration of business as response variable (actual duration time if the business went bankrupt, censored observation if the business is still in the market). Moreover, risk factors were collected, for example whether the company was started with the intention of providing the main income for the founder (yes/no) or whether the business was planned in advance for more than 6 months (yes/no). More details are given later in the paper. The focus of interest is how these initial risk factors influence the chances of success (survival) of a company and, more importantly, how these effects vary with the time being in the market. In particular, we focus on young entrepreneurs aged 30 years and younger, leading to a subsample of 369 firms. Corresponding Kaplan–Meier estimates are shown in Fig. 1. The left plot shows the empirical survivor curves depending on the purpose of the business (main income or not), while the right plot shows the effect of planning on the probability of success. There is clear non-proportionality visible, which will be taken into account in our modeling exercise.
The paper is organized as follows. In Section 2, we first motivate the use of P-splines for fitting non-proportional hazard models. We demonstrate how integrals of the hazard function can be approximated by trapezoid integration, yielding a Poisson-type model. We provide some asymptotic considerations and discuss practical adjustments of the fitting algorithm. In Section 3, we derive the link to GLMMs and discuss the estimation of the smoothing parameter. The data example and simulations are found in Section 4. A discussion finalizes the paper. Technical details are provided in the appendix.
Section snippets
P-spline fitting
Let T_i denote the survival time of the ith individual or observational unit and let C_i be the corresponding right censoring time, i = 1, …, n. We observe t_i = min(T_i, C_i) and define the censoring indicator δ_i = 1 if T_i ≤ C_i and δ_i = 0 otherwise. With x_i we denote the p-dimensional covariate vector for the ith individual, which for simplicity of presentation is assumed to be time constant. The hazard function is then modeled as λ(t; x_i) = exp{β_0(t) + x_iᵀβ(t)}, with exp{β_0(t)} as baseline hazard and β(t) as vector of time-varying covariate effects.
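The varying coefficients are made estimable by expanding each of them in a spline basis. A minimal sketch of this expansion, with a truncated-line basis standing in for the B-spline basis and with purely illustrative coefficient values:

```python
import numpy as np

# Each varying coefficient beta_j(t) is expanded in a finite basis,
# beta_j(t) = sum_k B_k(t) * alpha_jk. A truncated-line basis is used
# here as a stand-in; knots and coefficients are illustrative.
def basis(t, knots):
    t = np.atleast_1d(t)
    return np.column_stack([np.ones_like(t), t] +
                           [np.maximum(t - k, 0) for k in knots])

knots = np.linspace(0.5, 4.5, 8)
K = 2 + len(knots)                        # basis dimension
alpha0 = np.zeros(K); alpha0[0] = -2.0    # baseline: constant log-hazard -2
alpha1 = np.zeros(K); alpha1[0] = 1.0; alpha1[1] = -0.2  # linearly decaying

# log-hazard for covariate value x: log lambda(t; x) = B(t)a0 + x * B(t)a1
def log_lam(t, x):
    Bt = basis(t, knots)
    return Bt @ alpha0 + x * (Bt @ alpha1)
```

Fitting the model then amounts to estimating the basis coefficients alpha_j, with one penalty (and hence one smoothing parameter) per varying coefficient.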
PQL estimation
Penalized spline smoothing has strong affinities to PQL estimation in GLMMs as discussed in Breslow and Clayton (1993) (see also McCulloch and Searle, 2001). For normal response models, this link is illuminated in depth in Wand (2003) (see also Ruppert et al., 2003). For non-normal response models we achieve the link in the following way. We consider the spline coefficients as independent normally distributed random variables with a priori variance proportional to the (generalized) inverse of the penalty matrix. …
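For normal responses, the penalty-to-prior correspondence can be verified directly: the penalized least squares solution coincides with the posterior mean (BLUP) of a mixed model whose variance ratio equals the smoothing parameter. The sketch below demonstrates this numerically with an illustrative design matrix and an identity penalty; the paper's Poisson-type case uses the same correspondence via the PQL (Laplace) approximation rather than exactly.

```python
import numpy as np

rng = np.random.default_rng(7)
n, q = 50, 5
Z = rng.normal(size=(n, q))
y = Z @ rng.normal(size=q) + rng.normal(size=n)

sigma2_eps, sigma2_b = 1.0, 0.25
lam = sigma2_eps / sigma2_b   # smoothing parameter = variance ratio

# P-spline view: minimize ||y - Z b||^2 + lam * b'b  (identity penalty).
b_pen = np.linalg.solve(Z.T @ Z + lam * np.eye(q), Z.T @ y)

# Mixed-model view: b ~ N(0, sigma2_b I); posterior mean (BLUP) of b.
b_blup = sigma2_b * Z.T @ np.linalg.solve(
    Z @ (sigma2_b * Z.T) + sigma2_eps * np.eye(n), y)

# The two solutions coincide, so estimating sigma2_b by (restricted)
# maximum likelihood is equivalent to estimating the smoothing parameter.
```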
Simulation
We simulate survival data for n individuals on a discrete time grid, using a constant drop-out probability of 3% for each time interval from t to t + 1. The two binary covariates x_1 and x_2 are drawn at random. As dynamic effects we include a constant baseline hazard and two time-varying covariate effects. In Fig. 3, we show for one simulation run the principle of the hybrid smoothing parameter selection. Smoothing parameter estimation is started …
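A simulation of this kind can be sketched as follows. The concrete effect functions, covariate probabilities and horizon are illustrative stand-ins (the paper's exact settings were elided in the excerpt above); only the structure — discrete time grid, time-varying log-hazard, 3% drop-out per interval — follows the description.

```python
import numpy as np

rng = np.random.default_rng(3)
n, T = 500, 30

# Two binary covariates; success probabilities are illustrative.
x1 = rng.binomial(1, 0.5, n)
x2 = rng.binomial(1, 0.5, n)
beta0 = -3.0                                   # constant baseline log-hazard
beta1 = lambda t: 1.0 * np.exp(-0.1 * t)       # decaying effect (assumed)
beta2 = lambda t: 0.5 * np.sin(t / 5.0)        # oscillating effect (assumed)

time = np.zeros(n, dtype=int)
event = np.zeros(n, dtype=int)
for i in range(n):
    for t in range(1, T + 1):
        h = np.exp(beta0 + x1[i] * beta1(t) + x2[i] * beta2(t))
        p_event = 1.0 - np.exp(-h)    # event probability in interval t
        if rng.random() < p_event:
            time[i], event[i] = t, 1  # observed failure
            break
        if rng.random() < 0.03:       # 3% drop-out (censoring) per interval
            time[i], event[i] = t, 0
            break
    else:
        time[i], event[i] = T, 0      # administratively censored at T
```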
Discussion
We demonstrated the use of P-splines for fitting non-proportional hazard models. Numerical integration was pursued, leading to a Poisson-type likelihood. Multivariate smoothing parameter selection was carried out by a hybrid procedure utilizing the link between P-spline smoothing and GLMMs. In particular, complicated grid searching was avoided and the routine is numerically simple. A data example demonstrated the new insights that can be gained by allowing hazard functions to be dynamic in time.
References (37)

- Abrahamowicz, M., et al. (1996). Time-dependent hazard ratio: modeling and hypothesis testing with application in lupus nephritis. J. Amer. Statist. Assoc.
- Barndorff-Nielsen, O.E., Cox, D.R. (1989). Asymptotic Techniques for Use in Statistics.
- de Boor, C. (1978). A Practical Guide to Splines.
- Booth, J.G., Hobert, J.P. (1999). Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm. J. Roy. Statist. Soc. Ser. B.
- Breslow, N.E. (1972). Comment on “Regression models and life tables” by D.R. Cox. J. Roy. Statist. Soc. Ser. B.
- Breslow, N.E., Clayton, D.G. (1993). Approximate inference in generalized linear mixed models. J. Amer. Statist. Assoc.
- Breslow, N.E., Lin, X. (1995). Bias correction in generalised linear mixed models with a single component of dispersion. Biometrika.
- Brüderl, J., et al. (1992). Survival chances of newly founded business organizations. Amer. Sociol. Rev.
- Cai, T., et al. (2002). Mixed model-based hazard estimation. J. Comput. Graphical Statist.
- Cai, Z., Sun, Y. (2003). Local linear estimation for time-dependent coefficients in Cox's regression models. Scand. J. Statist.
- Local roughness penalties for regression splines. Comput. Statist.
- Cox, D.R. (1975). Partial likelihood. Biometrika.
- Cox, D.R. (1972). Regression models and life tables (with discussion). J. Roy. Statist. Soc. Ser. B.
- Cox, D.R., Oakes, D. (1984). Analysis of Survival Data.
- Eilers, P.H.C., Marx, B.D. (1996). Flexible smoothing with B-splines and penalties. Statist. Sci.
- Fan, J., et al. (1997). Local likelihood and local partial likelihood in hazard regression. Ann. Statist.
- Grambsch, P.M., Therneau, T.M. (1994). Proportional hazards tests and diagnostics based on weighted residuals (correction: 1995, 82, 668). Biometrika.
- Therneau, T.M., Grambsch, P.M. (2000). Modeling Survival Data.