A general approach to heteroscedastic linear regression

Leslie, David S.; Kohn, Robert; Nott, David J.

doi:10.1007/s11222-006-9013-8

A general approach to heteroscedastic linear regression

Published: 30 January 2007

Volume 17, pages 131–146, (2007)
Cite this article

Statistics and Computing Aims and scope Submit manuscript

David S. Leslie¹,
Robert Kohn² &
David J. Nott³

632 Accesses
36 Citations
Explore all metrics

Abstract

Our article presents a general treatment of the linear regression model, in which the error distribution is modelled nonparametrically and the error variances may be heteroscedastic, thus eliminating the need to transform the dependent variable in many data sets. The mean and variance components of the model may be either parametric or nonparametric, with parsimony achieved through variable selection and model averaging. A Bayesian approach is used for inference with priors that are data-based so that estimation can be carried out automatically with minimal input by the user. A Dirichlet process mixture prior is used to model the error distribution nonparametrically; when there are no regressors in the model, the method reduces to Bayesian density estimation, and we show that in this case the estimator compares favourably with a well-regarded plug-in density estimator. We also consider a method for checking the fit of the full model. The methodology is applied to a number of simulated and real examples and is shown to work well.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Antoniak C.E. 1974. Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems. The Annals of Statistics 2: 1152–1174.
MATH MathSciNet Google Scholar
Bartels R., Fiebig D.G., and Plumb M.H. 1996. Gas or electricity, which is cheaper?: An econometric approach with application to Australian expenditure data. The Energy Journal 17: 33–58.
Google Scholar
Brooks S.P. and Gelman A. 1998. General methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics 7: 434–455.
Article MathSciNet Google Scholar
Carroll R.J. and Ruppert D. 1988. Transformation and Weighting in Regression. Monographs on Statistics and Applied Probability, Chapman and Hall, London.
Google Scholar
Chan D., Kohn R., Nott D.J., and Kirby C. 2005. Locally adaptive semiparametric estimation of the mean and variance functions in regression models. Forthcoming in Journal of Computational and Graphical Statistics,15: 915–936.
Google Scholar
Cripps E., Kohn R., and Nott D. 2006. Bayesian subset selection and model averaging using a centred and dispersed prior for the error variance. Australian and New Zealand Journal of Statistics 48: 237–252.
Article MATH MathSciNet Google Scholar
Dahl D.B. 2003. An improved merge-split sampler for conjugate Dirichlet process mixture models. Technical Report 1086, Department of Statistics, University of Wisconsin-Madison.
Escobar M.D. and West M. 1995. Bayesian density estimation and inference using mixtures. Journal of the American Statistical Association 90: 577–588.
Article MATH MathSciNet Google Scholar
Ferguson T.S. 1973. A Bayesian analysis of some nonparametric problems. The Annals of Statistics 1: 209–230.
MATH MathSciNet Google Scholar
Gamerman D. 1997. Sampling from the posterior distribution in generalized linear mixed models. Statistics and Computing 7: 57–68.
Article Google Scholar
Green P.J. and Richardson S. 2001. Modelling heterogeneity with and without the Dirichlet process. Scandinavian Journal of Statistics 28: 355–375.
Article MATH MathSciNet Google Scholar
Hanson T. and Johnson W.O. 2002. Modeling regression error with a mixture of Polya trees. Journal of the American Statistical Association 97: 1020–1033.
Article MATH MathSciNet Google Scholar
Hurn M., Justel A., and Robert C.P. 2003. Estimating mixtures of regressions. Journal of Computational and Graphical Statistics 12: 55–79.
Article MathSciNet Google Scholar
Kohn R., Smith M., and Chan D. 2001. Nonparametric regression using linear combinations of basis functions. Statistics and Computing 11: 313–322.
Article MathSciNet Google Scholar
Kottas A. and Gelfand A.E. 2001. Bayesian semiparametric median regression modeling. Journal of the American Statistical Association 96: 1458–1468.
Article MATH MathSciNet Google Scholar
Kottas A. and Krnjajic M. 2005. Bayesian nonparametric modeling in quantile regression. Technical Report 2005-06, UCSC Department of Applied Math and Statistics.
Bayesian semiparametric inference for the accelerated failure time model. Canadian Journal of Statistics 25: 457–472.
Lo A.Y. 1984. On a class of Bayesian nonparametric estimates: I. Denisty estimates. The Annals of Statistics 12: 351–357.
MATH MathSciNet Google Scholar
MacEachern S.N. 1994. Estimating normal means with a conjugate style Dirichlet process prior. Communications in Statistics: Simulation and Computation 7: 727–741.
MathSciNet Google Scholar
Marron J.S. and Tsybakov A.B. 1995. Visual error criteria for qualitative smoothing. Journal of the American Statistical Association 90: 499–507.
Article MATH MathSciNet Google Scholar
Marron J.S. and Wand M.P. 1992. Exact mean integrated squared error. Annals of Statistics 20: 712–736.
MATH MathSciNet Google Scholar
Marshall E.C. and Spiegelhalter D.J. 2003. Approximate cross-validatory predictive checks in disease mapping models. Statistics in Medicine 22: 1649–1660.
Article Google Scholar
Mukhopadhyay S. and Gelfand A.E. 1997. Dirichlet process mixed generalised linear models. Journal of the American Statistical Association 92: 633–639.
Article MATH MathSciNet Google Scholar
Nott D.J. and Leonte D. 2004. Sampling schemes for Bayesian variable selection in generalized linear models. Journal of Computational and Graphical Statistics 13: 362–382.
Article MathSciNet Google Scholar
Richardson S. and Green P.J. 1997. On Bayesian analysis of mixtures with an unknown number of components. Journal of the Royal Statistical Society, B 59: 731–792.
Article MATH MathSciNet Google Scholar
Ruppert D., Wand M.P., and Carroll R.J. 2003. Semiparametric Regression. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press.
Sheather S.J. and Jones M.C. 1991) A reliable data-based bandwidth selection method for kernel density estimation. Journal of the Royal Statistical Society, B 53: 683–690.
MATH MathSciNet Google Scholar
Walker S.G. and Mallick B.K. 1999. Semiparametric accelerated life time model. Biometrics 55: 477–483.
Article MATH MathSciNet Google Scholar
West M. 1992. Hyperparameter estimation in Dirichlet process mixture models. ISDS Discussion paper 92-A03, Duke University.
West M., Müller P., and Escobar M.D. 1994. Hierarchical priors and mixture models, with application in regression and density estimation. In: Smith A. and Freeman P. (Eds.), Aspects of Uncertainty: A tribute to D.V. Lindley, Wiley, New York, pp. 363–386.

Download references

Author information

Authors and Affiliations

Department of Mathematics, University of Bristol, University Walk, Bristol, BS8 1TW, United Kingdom
David S. Leslie
Faculty of Business, University of New South Wales, UNSW, Sydney, 2052, Australia
Robert Kohn
School of Mathematics, University of New South Wales, UNSW, Sydney, 2052, Australia
David J. Nott

Authors

David S. Leslie
View author publications
Search author on:PubMed Google Scholar
Robert Kohn
View author publications
Search author on:PubMed Google Scholar
David J. Nott
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Robert Kohn.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Leslie, D.S., Kohn, R. & Nott, D.J. A general approach to heteroscedastic linear regression. Stat Comput 17, 131–146 (2007). https://doi.org/10.1007/s11222-006-9013-8

Download citation

Published: 30 January 2007
Issue Date: June 2007
DOI: https://doi.org/10.1007/s11222-006-9013-8

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A general approach to heteroscedastic linear regression

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Bayesian variable selection in linear regression models with non-normal errors

Nonparametric Bayesian inferences on the skewed data using a Dirichlet process mixture model

High-dimensional properties for empirical priors in linear regression with unknown error variance

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

A general approach to heteroscedastic linear regression

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Bayesian variable selection in linear regression models with non-normal errors

Nonparametric Bayesian inferences on the skewed data using a Dirichlet process mixture model

High-dimensional properties for empirical priors in linear regression with unknown error variance

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now