Skip to main content

Imprecise Sampling Models for Modelling Unobserved Heterogeneity? Basic Ideas of a Credal Likelihood Concept

  • Conference paper
  • First Online:
  • 403 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11142))

Abstract

In this research note, we sketch the idea to use (aspects of) imprecise probability models to handle unobserved heterogeneity in statistical (regression) models. Unobserved heterogeneity (frailty) is a frequent issue in many applications, arising whenever the underlying probability distributions depend on unobservable individual characteristics (like personal attitudes or hidden genetic dispositions). We consider imprecise sampling models where the likelihood contributions depend on individual parameters, varying in an interval (cuboid). Based on this, and a hyperparameter controlling the amount of ambiguity, we directly fit a credal set to the data. We introduce the basic concepts of this credal maximum likelihood approach, sketch first aspects of practical calculation of the resulting estimators by constrained optimization, derive some first general properties and finally discuss some ideas of a data-dependent choice of the hyperparameter.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    We consider here directly the set of parametric models; often in the theory of imprecise the whole convex hull of the generating models is used.

  2. 2.

    In the case of multi-dimensional parameters typically only few components of the parameter vector are taken to vary individually. We do not explicitly distinguish the varying and the constant components of the parameter vector notationally.

  3. 3.

    Typically the models are equivalently formulated in terms of a central parameter \(\nu _{\text{ overall }}\) and individual variations \(\nu _i\) around it with zero mean such that \(\vartheta _i=\nu _{\text{ overall }}+\nu _i\).

  4. 4.

    Some ideas how to choose \(\delta \) in a data-dependent way are sketched in Sect. 4.

  5. 5.

    For the sake of conciseness, the formulation is given in terms of \(f_\vartheta (\cdot )\) only; it is immediately adopted to the regression case relying on \(f_\vartheta (\cdot |x_i)\).

  6. 6.

    Throughout this paper vectorial inequalities are understood as inequalities for all components.

  7. 7.

    This corresponds to the determination of the variance of the distribution in the traditional random effects approaches to unobserved heterogeneity, while, from this perspective, in our approach the range of the random effects has to be specified. This analogy should however not hide the major difference: the traditional approach needs a fixed distributional class for the – unobservable (!) – random effect (typically normal distribution), while our approach refrains from any modelling of the hidden process.

    Therefore, if one looks at an interpretation of our approach in the context of traditional theory, our approach could be termed ‘nonparametric’.

References

  1. Augustin, T., Walter, G., Coolen, F.: Statistical inference. In: Augustin, T., Coolen, F., de Cooman, G., Troffaes, M. (eds.) Introduction to Imprecise Probabilities, pp. 135–189. Wiley, Chichester (2014)

    Google Scholar 

  2. Benavoli, A., Zaffalon, M.: Prior near ignorance for inferences in the k-parameter exponential family. Statistics 49, 1104–1140 (2014)

    Article  MathSciNet  Google Scholar 

  3. Cameron, A., Trivedi, P.: Microeconometrics: Methods and Applications. Cambridge University Press, Cambridge (1991)

    MATH  Google Scholar 

  4. Cattaneo, M.: Empirical interpretation of imprecise probabilities. In: Antonucci, A., Corani, G., Couso, I., Destercke, S. (eds.) Proceedings of the Tenth International Symposium on Imprecise Probability: Theories and Applications, ISIPTA 2017, Lugano. Proceedings of Machine Learning Research, vol. 62, pp. 61–72. PMLR (2017). http://proceedings.mlr.press/v62/cattaneo17a.html

  5. Duchateau, L., Janssen, P.: The Frailty Model. Springer, New York (2007). https://doi.org/10.1007/978-0-387-72835-3

    Book  MATH  Google Scholar 

  6. Fahrmeir, L., Kneib, T., Lang, S., Marx, B.: Regression: Models, Methods and Applications. Springer, Berlin (2013). https://doi.org/10.1007/978-3-642-34333-9

    Book  MATH  Google Scholar 

  7. Huber, P., Strassen, V.: Minimax tests and the Neyman-Pearson lemma for capacities. Ann. Stat. 1, 251–263 (1973)

    Article  MathSciNet  Google Scholar 

  8. Manrique-Vallier, D.: Bayesian population size estimation using Dirichlet process mixtures. Biometrics 72, 1246–1254 (2016)

    Article  MathSciNet  Google Scholar 

  9. Manski, C.: Partial Identification of Probability Distributions. Springer, New York (2003). https://doi.org/10.1007/b97478

    Book  MATH  Google Scholar 

  10. Pawitan, Y.: In All Likelihood: Statistical Modelling and Inference Using Likelihood. Oxford University Press, Oxford (2001)

    MATH  Google Scholar 

  11. Shyamalkumar, N.: Likelihood robustness. In: Insua, D.R., Ruggeri, F. (eds.) Robust Bayesian Analysis, pp. 127–143. Springer, New York (2000). https://doi.org/10.1007/978-1-4612-1306-2_7

    Chapter  Google Scholar 

  12. Stanghellini, E., Vantaggi, B.: Identification of discrete concentration graph models with one hidden binary variable. Bernoulli 19, 1920–1937 (2013)

    Article  MathSciNet  Google Scholar 

  13. Vicard, P.: On the identification of a single-factor model with correlated residuals. Biometrika 87, 199–205 (2000)

    Article  MathSciNet  Google Scholar 

  14. Walley, P.: Statistical Reasoning with Imprecise Probabilities. Chapman & Hall, London (1991)

    Book  Google Scholar 

  15. Walter, G., Augustin, T.: Imprecision and prior-data conflict in generalized Bayesian inference. J. Stat. Theor. Pract. 3, 255–271 (2009)

    Article  MathSciNet  Google Scholar 

  16. Wasserman, L.: All of Statistics: A Concise Course in Statistical Inference. Springer, New York (2004). https://doi.org/10.1007/978-0-387-21736-9

    Book  MATH  Google Scholar 

  17. Wermuth, N., Marchetti, G.: Star graphs induce tetrad correlations: for Gaussian as well as for binary variables. Electr. J. Stat. 8, 253–273 (2014)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgement

I am very grateful to the two anonymous referees for very helpful comments and quite stimulating remarks.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thomas Augustin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Augustin, T. (2018). Imprecise Sampling Models for Modelling Unobserved Heterogeneity? Basic Ideas of a Credal Likelihood Concept. In: Ciucci, D., Pasi, G., Vantaggi, B. (eds) Scalable Uncertainty Management. SUM 2018. Lecture Notes in Computer Science(), vol 11142. Springer, Cham. https://doi.org/10.1007/978-3-030-00461-3_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00461-3_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00460-6

  • Online ISBN: 978-3-030-00461-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics