Skip to main content
Log in

Latent variable techniques for categorical data

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

Two useful statistical methods for generating a latent variable are described and extended to incorporate polytomous data and additional covariates. Item response analysis is not well-known outside its area of application, mainly because the procedures to fit the models are computer intensive and not routinely available within general statistical software packages. The linear score technique is less computer intensive, straightforward to implement and has been proposed as a good approximation to item response analysis. Both methods have been implemented in the standard statistical software package GLIM 4.0, and are compared to determine their effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Aitkin M. 1996. A general maximum likelihood analysis of overdispersion in generalized linear models. Statistics and Computing 6: 251-262.

    Google Scholar 

  • Aitkin M. 1998. Personal communication.

  • Aitkin M., Anderson D., Francis B., and Hinde J. 1989. Statistical Modelling in GLIM. Oxford University Press, Oxford.

    Google Scholar 

  • Aitkin M. and Francis B. 1995. Fitting overdispersed generalized linear models by non-parametric maximum likelihood. GLIM Newsletter 25: 37-45.

    Google Scholar 

  • Albanese M.T. 1990. Latent variable models for binary response. PhD Thesis. University of London.

  • Albanese M.T. and Knott M. 1990.Twomiss: A Computer Program for Fitting a One-or Two-Factor Logit-Probit Latent Variable Model to Binary Data when Observations may be Missing. London School of Economics and Political Science, London.

    Google Scholar 

  • Bartholomew D.J. 1983. Latent variable models for ordered categorical data. Journal of Econometrics 22: 229-243.

    Google Scholar 

  • Bartholomew D.J. 1984. Scaling binary data using a factor model. Journal of the Royal Statistical Society B 46: 120-123.

    Google Scholar 

  • Bartholomew D.J. and Knott M. 1999. Latent variable models and factor analysis, 2nd edn. Kendall's Library of Statistics 7. Arnold, London.

    Google Scholar 

  • Birnbaum A. 1968. Some latent trait models and their use in inferring an examinee's ability. In: Lord F.M. and Novick M.R. Statistical Theories of Mental Test Scores with Contributions by A. Birnbaum. Addison-Wesley, Reading, MA.

    Google Scholar 

  • Bock R.D. and Aitkin M. 1981. Marginal maximum likelihood estimation of item parameters: Application of an EM Algorithm. Psychometrika 46: 443-459.

    Google Scholar 

  • Carstairs V. and Morris R. 1991. Deprivation and Health in Scotland. Aberdeen University Press.

  • Dempster A.P., Laird N.M., and Rubin D.B. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society B 39: 1-38.

    Google Scholar 

  • Dietz E. and Böhning D. 1995. Statistical inference based on a general model of unobserved heterogeneity. In: Seeber G., Francis B., Hatzinger R., and Steckel-Berger G. (Eds.), Lecture Notes in Statistics, Vol. 104. Springer-Verlag, New-York, pp. 75-82.

    Google Scholar 

  • Goodman L.A. 1979. Simple models for the analysis of association in cross-classifications having ordered categories. JASA 74: 537-552.

    Google Scholar 

  • Green M. 1989. Generalisations of the Goodman association model for the analysis of multi-dimensional contingency tables. Lecture Notes in Statistics 57: 165-171.

    Google Scholar 

  • Green M. 1994. Social indices and factor analysis for categorical variables. In: 9th International Workshop on Statistical Modelling. Exeter University, 11–15 July. Centre for Applied Statistics, Lancaster University.

  • Jöreskog K.G. and Sörbom D. 1996. LISREL 8: User's Reference Guide. Scientific Software International, Hove and London.

    Google Scholar 

  • Laird N.M. 1978. Nonparametric maximum likelihood estimation of a mixing distribution. Journal of the American Statistical Association 73: 805-811.

    Google Scholar 

  • Lancaster G. 1998. The use of socioeconomic indices in assessing risk factors for ill-health. PhD Thesis. Lancaster University.

  • Lauritzen S.L. and Wermuth N. 1989.Graphical models for associations between variables, some of which are qualitative and some quantitative.The Annals of Statistics 17:31-57.

    Google Scholar 

  • Lawley D.N. 1943. On problems connected with item selection and test construction. Proceedings of the Royal Society of Edinburgh 61: 273-287.

    Google Scholar 

  • Lindsay B.G. 1983. The geometry of mixture likelihoods: Part I. A general theory. Annals of Statistics 11: 86-94.

    Google Scholar 

  • Lord F.M. and Novick M.R. 1968.Statistical Theories of Mental Test Scores with Contributions by A. Birnbaum.Addison-Wesley, Reading, MA.

    Google Scholar 

  • McCullagh P. and Nelder J.A. 1983. Generalized Linear Models. Chapman and Hall, London.

    Google Scholar 

  • Meng X. and Rubin D. 1991. Using EM to obtain asymptotic variance-covariance matrices: The SEMalgorithm. Journal of the American Statistical Association 86: 899-909.

    Google Scholar 

  • Muthén B. 1987. LISCOMP: Analysis of Linear Structural Equations with a Comprehensive Measurement Model. Scientific Software, Chicago.

  • Schwarz H.R. 1989. Numerical Analysis: A Comprehensive Introduction. John Wiley and Sons, New York.

    Google Scholar 

  • Shea B.L. 1985. POLYFAC User Guide. London School of Economics, London.

    Google Scholar 

  • Shi J.Q. and Lee S.-Y. 1997. Estimation of factor scores with polytomous data by the EM algorithm. British Journal of Mathematical and Statistical Psychology 50: 215-226.

    Google Scholar 

  • Whittaker J. 1989. Discussion of paper by P.G.M. van der Heijden, A. de Falguerolles, and J. de Leeuw. 1989. A combined approach to contingency table analysis using correspondence analysis and log-linear analysis. Applied Statistics 38: 249-292.

    Google Scholar 

  • Wood A. and Hinde J. 1987. Binomial variance component models with a non-parametric assumption concerning random effects. In: Crouchley R. (Ed.), Longitudinal Data Analysis. Avebury, Aldershot, pp. 110-128.

  • Zhou S. and Maller R.A. 1995. The likelihood ratio test for the presence of immunes in a censored sample. Statistics 27: 181-201.

    Google Scholar 

Download references

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lancaster, G., Green, M. Latent variable techniques for categorical data. Statistics and Computing 12, 153–161 (2002). https://doi.org/10.1023/A:1014886619553

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1014886619553

Navigation