Elsevier

Handbook of Statistics

Volume 23, 2003, Pages 519-535
Handbook of Statistics

State Space Models for Survival Analysis

https://doi.org/10.1016/S0169-7161(03)23029-7Get rights and content

Publisher Summary

To ease the estimation problem and to estimate the survival probabilities, this chapter proposes a state-space modeling approach by combining stochastic models with statistical models. The Gibbs sampling method and the Markov Chain and Monte Carlo approach (MCMC) can be readily applied to estimate the unknown parameters and the state variables. By using these estimates, the model can be validated and the survival probabilities can be estimated. The chapter illustrates the model and the method by using a birth–death–immigration–illness–cure process that involves stochastic birth–death processes with immigration and the illness and cure processes for a disease such as tuberculosis. It extends this modeling approach to other human diseases such as tuberculosis. This type of modeling approach is definitely useful for other diseases, such as heart disease, and the risk assessment of environmental agents.

Introduction

Many diseases such as AIDS, cancer and infectious diseases are often very complicated biologically. Most of these diseases are complex stochastic processes where it is often very difficult to estimate the unknown parameters, especially in cases where not many data are available. In these cases, it would be very difficult to estimate the survival probabilities. To ease the estimation problem and to estimate the survival probabilities, in this article we propose a state space modeling approach by combining stochastic models with statistical models. Then one can readily apply the Gibbs sampling method and the Markov Chain and Monte Carlo approach (MCMC) to estimate the unknown parameters and the state variables. By using these estimates, one can validate the model and estimate the survival probabilities. We will illustrate the model and the method by using a birth–death–immigration–illness–cure process which involves stochastic birth–death processes with immigration and the illness and cure processes for a disease such as tuberculosis.

Section snippets

The state space models and the generalized Bayesian approach

To illustrate, consider a disease such as tuberculosis which is curable by drugs. Let X(t) be the vector of stochastic processes for key responses of the disease. Then, X(t) is the stochastic model for this disease and in many cases, one can derive stochastic equations for the state variables of the system by using basic biological mechanism of the disease; for some illustrations in cancer and AIDS, see Tan, 2000, Tan, 2002, Tan and Chen, 1998, Tan et al., 2001. If some observed data are

Stochastic modeling of the birth–death–immigration–illness–cure processes

Consider a population of individuals who are at risk for a disease and suppose that some drugs are available to treat the disease. One example is the tuberculosis. In this population, then there are two types of people: Normal healthy people (denote by N1) who do not have the disease and sick people (denote by N2) who have contracted the disease. When the population is at risk for the disease, normal people may contract the disease to become sick via contacts with sick people or disease agents.

A state space model for the birth–death–immigration–illness–cure processes

The state space model of a system is a stochastic model consisting of two sub-models: The stochastic system model which is the stochastic model of the system and the observation model which is the statistical model relating available data from the system to the model. For the birth–death–immigration–illness–cure process, the state variables are {Ni(t),i=1,2} and the stochastic system model is represented by the stochastic differential equations given by (5), (6). Assuming that the number of

The multi-level Gibbs sampling procedures for the birth–death–immigration–illness–cure processes

To implement the multi-level Gibbs sampling method, denote by U(t)={Fi(t),Ri(t),Bi(t),i=1,2} and put U={U(t),t=0,1,…,tM−1}. Then, given X(0) and Θ, PX,UX(0)=i=1tMPX(i)∣X(i−1),U(i−1)PU(i−1)∣X(i−1).

By the above distribution results with i≠j;i,j=1,2: PU(t)∣X(t)=C1(t)i=12giRi(t);tαi(t)Fi(t)bi(t)Bi(t)1−αi(t)−bi(t)Ni(t)−Fi(t)−Bi(t), where C1(t)=∏i=12Ni(t)Fi(t)Ni(t)−Fi(t)Bi(t),PX(t+1)∣X(t),U(t)=∏i=12Ni(t)−Fi(t)−Bi(t)ηi(t)di(t)1−αi(t)−bi(t)ηi(t)1−di(t)1−αi(t)−bi(t)ζi(t), where ηi(t)=Ni

The survival probabilities of normal and sick people

Let Si(t) denote the survival probability that an Ni person at time 0 will survive at least t months when the population is at risk for the disease. Then, under the conditions given in Section 3, one has, to order of o(Δt), Si(t+Δt)=αi(t)ΔtSj(t)+1−di(t)Δt−αi(t)ΔtSi(t),for alli≠j,i,j=1,2.

It follows that the Si(t)'s satisfy the following system of equations: ddtS1(t)=α1(t)S2(t)−α1(t)+d1(t)S1(t),ddtS2(t)=α2(t)S1(t)−α2(t)+d2(t)S2(t).

The initial condition is {Si(0)=1,i=1,2}.

Let S(t)={S1(t),S2(t)}′.

Some illustrative examples

To illustrate the above methods, consider the disease – tuberculosis (TB) which is curable by drugs. Given in Table 1 are the numbers of TB cases in US from 1980 to 1992 reported by CDC together with the total US population sizes over these years (CDC Report, 1993). In this data set, it is clear that the curve of TB cases in US is declining to the lowest level in 1985 and then increases due presumably to the effects of HIV (CDC Report, 1993). To fit this data, we thus assume α1(t)=α1(1) before

Conclusions

In this article, we have developed a state space model for the birth–death–immigration–illness–cure process. We have developed a generalized Bayesian method to estimate the unknown parameters and the state variables, and hence the survival probabilities. The numerical examples indicate that the methods are useful and promising. Of course, more studies are needed to further confirm the usefulness of the method and to check the efficiency of the method.

In the past 5 years, we have developed some

References (28)

  • B. Efron

    The Jackknife, the Bootstrap and Other Resampling Plans

    (1982)
  • A. Gelb

    Applied Optimal Estimation

    (1974)
  • A.C. Harvey

    Forecasting, Structural Time Series Models and the Kalman Filter

    (1994)
  • R.E. Kalman

    A new approach to linear filter and prediction problems

    J. Basic Engrg.

    (1960)
  • Cited by (1)

    View full text