Abstract
In the social, behavioral, and health sciences it is often of interest to identify latent or unobserved groups in the population with the group membership of the individuals depending on a set of observed variables. In particular, we focus on the field of nursing home assessment in which the response variables typically come from the administration of questionnaires made of categorical items. These types of data may suffer from missing values and the use of lengthy questionnaires may be problematic as a large number of items could have a negative impact on the responses. In such a context, we introduce an extended version of the Latent Class (LC) model aimed at dealing with missing values, by assuming a form of latent ignorability. Moreover, we propose an item selection algorithm, based on the LC model, for finding the smallest subset of items providing an amount of information close to that of the initial set. The proposed approach is illustrated through an application to a dataset collected within an Italian project on the quality-of-life of nursing home patients.
Similar content being viewed by others
References
AKAIKE, H. (1973), “Information Theory and an Extension of the Maximum Likelihood Principle”, in Second International Symposium on Information Theory, eds. B.N. Petrov and F. Csaki, Budapest: Akademiai Kiado, pp. 267–281.
BARTOLUCCI, F., LUPPARELLI, M., and MONTANARI, G.E. (2009), “Latent Markov Model for Longitudinal Binary Data: An Application to the Performance Evaluation of Nursing Homes”, The Annals of Applied Statistics, 3, 611–636.
BARTOLUCCI, F., MONTANARI, G.E., and PANDOLFI, S. (2016), “Item Selection by Latent Class-Based Methods: An Application to Nursing Home Evaluation”, Advances in Data Analysis and Classification, 10, 245–262.
BAUM, L.E., PETRIE, T., SOULES, G., and WEISS, N. (1970), “A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains”, The Annals of Mathematical Statistics, 41, 164–171.
BIERNACKI, C., CELEUX, G., and GOVAERT, G. (1999), “An Improvement of the NEC Criterion for Assessing the Number of Clusters in a Mixture Model”, Pattern Recognition Letters, 20, 267–272.
BIERNACKI, C., CELEUX, G., and GOVAERT, G. (2003), “Choosing Starting Values for the EM Algorithm for Getting the Highest Likelihood in Multivariate Gaussian Mixture Models”, Computational Statistics and Data Analysis, 41, 561–575.
CELEUX, G., and SOROMENHO, G. (1996), “An Entropy Criterion for Assessing the Number of Clusters in a Mixture Model”, Journal of Classification, 13, 195–212.
COLLINS, L., and LANZA, S. (2010), Latent Class and Latent Transition Analysis: With Applications in the Social, Behavioral, and Health Sciences, Wiley Series in Probability and Statistics, Hoboken NJ: Wiley.
DEAN, N., and RAFTERY, A.E. (2010), “Latent Class Analysis Variable Selection”, Annals of the Institute of Statistical Mathematics, 62, 11–35.
DEMPSTER, A.P., LAIRD, N.M., and RUBIN, D.B. (1977), “Maximum Likelihood from Incomplete Data via the EM Algorithm”, Journal of the Royal Statistical Society, Series B, 39, 1–38.
DIAS, J.G. (2006), “Model Selection for the Binary Latent Class Model: A Monte Carlo Simulation”, in Data Science and Classification, Berlin, Heidelberg: Springer, pp. 91–99.
FORMANN, A.K. (2007), “Mixture Analysis of Multivariate Categorical Data with Covariates and Missing Entries”, Computational Statistics and Data Analysis, 51, 5236–5246.
FRALEY, C., and RAFTERY, A.E. (2002), “Model-Based Clustering, Discriminant Analysis, and Density Estimation”, Journal of the American Statistical Association, 97, 611–631.
GOODMAN, L.A. (1974), “Exploratory Latent Structure Analysis Using Both Identifiable and Unidentifiable Models”, Biometrika, 61, 215–231.
HAGENAARS, J.A., and MCCUTCHEON, A.L. (2002), Applied Latent Class Analysis, Cambridge UK: Cambridge University Press.
HAREL, O., and SCHAFER, J.L. (2009), “Partial and Latent Ignorability in Missing-Data Problems”, Biometrika, 96, 37–50.
KARLIS, D., and XEKALAKI, E. (2003), “Choosing Initial Values for the EM Algorithm for Finite Mixtures”, Computational Statistics and Data Analysis, 41, 577–590.
KASS, R.E., and RAFTERY, A.E. (1995), “Bayes Factors”, Journal of the American Statistical Association, 90, 773–795.
LATTANZIO, F., MUSSI, C., SCAFATO, E., RUGGIERO, C., DELL’AQUILA, G., PEDONE, C., MAMMARELLA, F., GALLUZZO, L., SALVIOLI, G., SENIN, U., CARBONIN, P.U., BERNABEI, R., and CHERUBINI, A. (2010), “Health Care for Older People in Italy: The U.L.I.S.S.E. Project (un link informatico sui servizi sanitari esistenti per l’anziano – a computerized network on health care services for older people), Journal of Nutrition, Health and Aging, 14, 238–242.
LAZARSFELD, P.F. (1950), “The Logical and Mathematical Foundation of Latent Structure Analysis”, in Measurement and Prediction, eds. S.A. Stouffer, L. Guttman, E.A. Suchman, New York: Princeton University Press.
LAZARSFELD, P.F., and HENRY, N.W. (1968), Latent Structure Analysis, Boston: Houghton Mifflin.
LITTLE, R.J.A. (1995), “Modeling the Drop-Out Mechanism in Repeated-Measures Studies”, Journal of the American Statistical Association, 90, 1112–1121.
LITTLE, R.J.A., and Rubin, D.B. (2002), Statistical Analysis with Missing Data (2nd ed.), Wiley Series in Probability and Statistics, Hoboken NJ: Wiley.
LU, G., and COPAS, J.B. (2004), “Missing at Random, Likelihood Ignorability and Model Completeness”, The Annals of Statistics, 32, 754–765.
NYLUND, K.L., ASPAROUHOV, T., and MUTHÉN, B.O. (2007), “Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study”, Structural Equation Modeling, 14, 535–569.
RUBIN, D.B. (1976), “Inference and Missing Data”, Biometrika, 63, 581–592.
SCHWARZ, G. (1978), “Estimating the Dimension of a Model”, The Annals of Statistics, 6, 461–464.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bartolucci, F., Montanari, G.E. & Pandolfi, S. Latent Ignorability and Item Selection for Nursing Home Case-Mix Evaluation. J Classif 35, 172–193 (2018). https://doi.org/10.1007/s00357-017-9227-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00357-017-9227-9