Abstract
A binary disease outcome is commonly modeled with continuous covariates (e.g., biochemical concentration) in medical research, and the corresponding exploration may employ a normal discrimination approach. The covariate relationship affects the estimated association between binary outcome and the interesting covariate. The method of value deviated from a fitted value (fractional polynomial), which is abbreviated as VDFV, may reduce the estimation bias especially when the relationship between the covariates is nonlinear. However, when the extraneous variable relates to the outcome, the pooled data (cases and controls) are replaced by the control data only for the purpose of fitting values. Based on two association patterns, the extraneous variable unrelated to the outcome (I) and that related to the outcome (II), the simulation study reveals that VDFV-p (using pooled data) is reliable, with less bias and a smaller mean square error (MSE) in pattern (I) and that VDFV-c (using control data) shows less bias in pattern (II). The conventional covariate adjustment performs worse in (I) but fairly well in (II). Note that a huge MSE is never observed in VDFV-p or VDFV-c, while this is a common issue related to small sample size or sparse data in logistic regression. Two fetal studies are illustrated—one for pattern (I) and one for pattern (II).
Similar content being viewed by others
References
Breslow NE, Day NE (1980) Statistical methods in cancer research volume I: analysis of case-control studies. IARC Scientific Publication, Lyon
Casella G, Berger RL (2002) Statistical inference. 2nd ed. Duxbury, Pacific Grove
Chen H, Cohen P, Chen S (2007) Biased odds ratios from dichotomization of age. Stat Med 26: 3487–3497
Cohen J (1983) The cost of dichotomization. Appl Psychol Meas 7: 249–253
Cohen J, Cohen P (2003) Applied multiple regression/correlation analysis for the behavioral science. 3rd ed. Lawrence Erlbaum Associates, Hillsdale
Firth D (1993) Bias reduction of maximum likelihood estimates. Biometrika 80: 27–38
Fu CY, Yang YW (2009) A comparison of binary models dealing with an extraneous effect relating to the main risk factor, but not relating to the outcome variable. Stat Med 28: 1473–1486
Heinze G (2006) A comparative investigation of methods for logistic regression with separated or nearly separated data. Stat Med 25: 4216–4226
Hosmer DW, Lemeshow S (2000) Applied logistic regression. 2nd ed. Wiley, New York
Hung JH, Fu CY, Chen CY, Chao KC, Hung J (2008) Fetal nasal bone length and Down syndrome during the second trimester in a Chinese population. J Obstet Gynaecol Res 34: 518–523
Hung JH, Fu CY, Yuan CC, Chen CL, Yang ML, Shu LP, Wu CC (2003) Nuchal translucency incorporated in a one-stage multifactorial screening model for the prediction of Down syndrome at second trimester pregnancy. Ultrasound Med Biol 29: 1667–1674
Kleinbaum DG, Kupper LL, Muller KE, Nizam A (2008) Applied regression analysis and other multivariable methods. Duxbury Press, Belmont
Royston P, Altman DG (1994) Regression using fractional polynomials of continuous covariates: parsimonious parametric modeling. Appl Stat 43: 429–467
Royston P, Altman DG, Sauerbrei W (2006) Dichotomizing continuous predictors in multiple regression: a bad idea. Stat Med 25: 127–141
Selvin S (1995) Practical biostatistical methods. Duxbury Press, Belmont
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yang, YW., Fu, C.Y. Two advanced methods for adjusting the main coefficient in logistic regression. Comput Stat 28, 199–218 (2013). https://doi.org/10.1007/s00180-011-0294-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-011-0294-9