Skip to main content
Log in

Two advanced methods for adjusting the main coefficient in logistic regression

  • Original Paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

A binary disease outcome is commonly modeled with continuous covariates (e.g., biochemical concentration) in medical research, and the corresponding exploration may employ a normal discrimination approach. The covariate relationship affects the estimated association between binary outcome and the interesting covariate. The method of value deviated from a fitted value (fractional polynomial), which is abbreviated as VDFV, may reduce the estimation bias especially when the relationship between the covariates is nonlinear. However, when the extraneous variable relates to the outcome, the pooled data (cases and controls) are replaced by the control data only for the purpose of fitting values. Based on two association patterns, the extraneous variable unrelated to the outcome (I) and that related to the outcome (II), the simulation study reveals that VDFV-p (using pooled data) is reliable, with less bias and a smaller mean square error (MSE) in pattern (I) and that VDFV-c (using control data) shows less bias in pattern (II). The conventional covariate adjustment performs worse in (I) but fairly well in (II). Note that a huge MSE is never observed in VDFV-p or VDFV-c, while this is a common issue related to small sample size or sparse data in logistic regression. Two fetal studies are illustrated—one for pattern (I) and one for pattern (II).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Breslow NE, Day NE (1980) Statistical methods in cancer research volume I: analysis of case-control studies. IARC Scientific Publication, Lyon

    Google Scholar 

  • Casella G, Berger RL (2002) Statistical inference. 2nd ed. Duxbury, Pacific Grove

    Google Scholar 

  • Chen H, Cohen P, Chen S (2007) Biased odds ratios from dichotomization of age. Stat Med 26: 3487–3497

    Article  MathSciNet  Google Scholar 

  • Cohen J (1983) The cost of dichotomization. Appl Psychol Meas 7: 249–253

    Article  Google Scholar 

  • Cohen J, Cohen P (2003) Applied multiple regression/correlation analysis for the behavioral science. 3rd ed. Lawrence Erlbaum Associates, Hillsdale

    Google Scholar 

  • Firth D (1993) Bias reduction of maximum likelihood estimates. Biometrika 80: 27–38

    Article  MathSciNet  MATH  Google Scholar 

  • Fu CY, Yang YW (2009) A comparison of binary models dealing with an extraneous effect relating to the main risk factor, but not relating to the outcome variable. Stat Med 28: 1473–1486

    Article  MathSciNet  Google Scholar 

  • Heinze G (2006) A comparative investigation of methods for logistic regression with separated or nearly separated data. Stat Med 25: 4216–4226

    Article  MathSciNet  Google Scholar 

  • Hosmer DW, Lemeshow S (2000) Applied logistic regression. 2nd ed. Wiley, New York

    Book  MATH  Google Scholar 

  • Hung JH, Fu CY, Chen CY, Chao KC, Hung J (2008) Fetal nasal bone length and Down syndrome during the second trimester in a Chinese population. J Obstet Gynaecol Res 34: 518–523

    Article  Google Scholar 

  • Hung JH, Fu CY, Yuan CC, Chen CL, Yang ML, Shu LP, Wu CC (2003) Nuchal translucency incorporated in a one-stage multifactorial screening model for the prediction of Down syndrome at second trimester pregnancy. Ultrasound Med Biol 29: 1667–1674

    Article  Google Scholar 

  • Kleinbaum DG, Kupper LL, Muller KE, Nizam A (2008) Applied regression analysis and other multivariable methods. Duxbury Press, Belmont

    Google Scholar 

  • Royston P, Altman DG (1994) Regression using fractional polynomials of continuous covariates: parsimonious parametric modeling. Appl Stat 43: 429–467

    Article  Google Scholar 

  • Royston P, Altman DG, Sauerbrei W (2006) Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med 25: 127–141

    Article  MathSciNet  Google Scholar 

  • Selvin S (1995) Practical biostatistical methods. Duxbury Press, Belmont

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chong Yau Fu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, YW., Fu, C.Y. Two advanced methods for adjusting the main coefficient in logistic regression. Comput Stat 28, 199–218 (2013). https://doi.org/10.1007/s00180-011-0294-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-011-0294-9

Keywords

Navigation