Abstract
Variable selection is crucial for investigating relationships between variables in regression analysis. However, data are sometimes collected imprecisely and cannot be described by random variables, so classical variable selection methods are invalid. Characterizing such imprecise observations as uncertain variables, this paper presents the uncertain lasso estimate and the de-biased uncertain lasso estimate to select variables and estimate unknown parameters, respectively. Moreover, a cross-validation procedure for choosing the tuning parameter is suggested. Finally, numerical examples are documented to illustrate our methods in detail.
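The uncertain lasso minimizes an expected-distance objective built from uncertainty distributions, which is beyond a short snippet. As a rough illustration of how the \(\ell_1\) penalty performs variable selection, the following sketch runs a classical coordinate-descent lasso on the midpoints of hypothetical interval-valued responses; the function name `lasso_cd` and all data are illustrative assumptions, not the paper's algorithm.

```python
import numpy as np

def soft_threshold(z, t):
    """Soft-thresholding operator, the building block of lasso updates."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def lasso_cd(X, y, lam, n_iter=300):
    """Coordinate descent for (1/(2n))*||y - X b||^2 + lam*||b||_1."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(p):
            # partial residual with feature j's contribution removed
            r = y - X @ b + X[:, j] * b[j]
            rho = X[:, j] @ r / n
            b[j] = soft_threshold(rho, lam) / col_sq[j]
    return b

# Hypothetical imprecise observations: each response is an interval
# [y - 0.1, y + 0.1]; here we simply fit on the interval midpoints.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3))
y_exact = X @ np.array([2.0, 0.0, -1.0])  # true model is sparse
y_lo, y_hi = y_exact - 0.1, y_exact + 0.1
b_hat = lasso_cd(X, (y_lo + y_hi) / 2.0, lam=0.1)
print(b_hat)  # the second coefficient is shrunk to (near) zero
```

The penalty drives the coefficient of the irrelevant second variable toward exactly zero while only slightly shrinking the active coefficients, which is the selection behavior the paper's uncertain lasso transfers to the imprecise-observation setting.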
References
Akaike H (1973) Information theory and an extension of the maximum likelihood principle. In: Petrov, Csaki (eds) Proceedings of 2nd international symposium on information theory. Akademia Kiado, Budapest, pp 267–281
Bisserier A, Boukezzoula R, Galichet S (2009) An interval approach for fuzzy linear regression with imprecise data. In: Proceedings of the joint 2009 international fuzzy systems association world congress and 2009 European society of fuzzy logic and technology conference, Lisbon, Portugal, July 20–24
Cattaneo M, Wiencierz A (2012) Likelihood-based imprecise regression. Int J Approx Reason 53:1137–1154. https://doi.org/10.1016/j.ijar.2012.06.010
Fang L, Liu S, Huang Z (2020) Uncertain Johnson-Schumacher growth model with imprecise observations and \(k\)-fold cross-validation test. Soft Comput 24:2715–2720. https://doi.org/10.1007/s00500-019-04090-4
Ferraro M, Coppi R, Gonzalez G, Colubi A (2010) A linear regression model for imprecise response. Int J Approx Reason 51:759–770. https://doi.org/10.1016/j.ijar.2010.04.003
Geisser S (1974) A predictive approach to the random effect model. Biometrika 61:101–107. https://doi.org/10.2307/2334290
Geisser S (1975) The predictive sample reuse method with applications. J Am Stat Assoc 70:320–328. https://doi.org/10.1080/01621459.1975.10479865
Lio W, Liu B (2018) Residual and confidence interval for uncertain regression model with imprecise observations. J Intell Fuzzy Syst 35:2573–2583. https://doi.org/10.3233/JIFS-18353
Lio W, Liu B (2020) Uncertain maximum likelihood estimation with application to uncertain regression analysis. Soft Comput 24:9351–9360. https://doi.org/10.1007/s00500-020-04951-3
Liu B (2007) Uncertainty theory, 2nd edn. Springer, Berlin
Liu B (2009) Some research problems in uncertainty theory. J Uncertain Syst 3:3–10
Liu B (2010) Uncertainty theory: a branch of mathematics for modeling human uncertainty. Springer, Berlin
Liu B (2012) Why is there a need for uncertainty theory? J Uncertain Syst 6:3–10
Liu B (2015) Uncertainty theory, 4th edn. Springer, Berlin
Liu Z, Jia L (2020) Cross-validation for the uncertain Chapman–Richards growth model with imprecise observations. Int J Uncertain Fuzziness Knowl Based Syst 28:769–783. https://doi.org/10.1142/S0218488520500336
Liu Z, Yang Y (2020) Least absolute deviations estimation for uncertain regression with imprecise observations. Fuzzy Optim Decis Mak 19:33–52. https://doi.org/10.1007/s10700-019-09312-w
Prade H, Serrurier M (2010) Why imprecise regression: a discussion. In: Borgelt C et al (eds) Combining soft computing and statistical methods in data analysis. Advances in intelligent and soft computing, vol 77. Springer, Berlin
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6:461–464. https://doi.org/10.1214/aos/1176344136
Stone M (1974) Cross-validatory choice and assessment of statistical predictions. J R Stat Soc Ser B Stat Methodol 36:111–147. https://doi.org/10.1111/j.2517-6161.1974.tb00994.x
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Stat Methodol 58:267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x
Wang X, Gao Z, Guo H (2012) Delphi method for estimating uncertainty distributions. Inf Int Interdiscip J 15:449–460
Wang X, Peng Z (2014) Method of moments for estimating uncertainty distributions. J Uncertain Anal Appl. https://doi.org/10.1186/2195-5468-2-5
Yao K, Liu B (2018) Uncertain regression analysis: an approach for imprecise observations. Soft Comput 22:5579–5582. https://doi.org/10.1007/s00500-017-2521-y
Ye T, Liu Y (2020) Multivariate uncertain regression model with imprecise observations. J Ambient Intell Human Comput 11:4941–4950. https://doi.org/10.1007/s12652-020-01763-z
Zhang C, Liu Z, Liu J (2020) Least absolute deviations for uncertain multivariate regression model. Int J Gen Syst 49:449–465. https://doi.org/10.1080/03081079.2020.1748615
Acknowledgements
This work was supported by National Natural Science Foundation of China (No. 62073009) and the Program for Young Excellent Talents in UIBE (No. 18YQ06).
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This paper does not contain any studies with human participants or animals performed by any of the authors.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Proof of Corollary 1
According to Definitions 1 and 2, the uncertain lasso estimate and de-biased uncertain lasso estimate for linear regression model (11) solve minimization problems (12) and (13), respectively. Since the regression function \(\beta _{0}+\beta _{1}\tilde{x}_{1i}+\cdots +\beta _{p}\tilde{x}_{pi}\) is increasing with respect to \(\tilde{x}_{ji}\) when \(\beta _{j} > 0\) and decreasing with respect to \(\tilde{x}_{ji}\) when \(\beta _{j} \le 0\) for each i \((i=1,2,\ldots , n)\), the corollary follows from Theorems 1 and 2 immediately. \(\square \)
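The monotonicity step can be sketched explicitly (writing \(\Phi _{ji}^{-1}\) for the inverse uncertainty distribution of \(\tilde{x}_{ji}\), a notational assumption since the main text is not reproduced here): by the operational law of uncertain variables, the inverse uncertainty distribution of \(\beta _{0}+\sum _{j}\beta _{j}\tilde{x}_{ji}\) at level \(\alpha \) is

\[
\beta _{0}+\sum _{j:\,\beta _{j}>0}\beta _{j}\,\Phi _{ji}^{-1}(\alpha )+\sum _{j:\,\beta _{j}\le 0}\beta _{j}\,\Phi _{ji}^{-1}(1-\alpha ),
\]

so the sign of each \(\beta _{j}\) determines whether \(\Phi _{ji}^{-1}\) is evaluated at \(\alpha \) or at \(1-\alpha \) when the minimization problems are written in terms of inverse distributions.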
Proof of Corollary 2
According to Definitions 1 and 2, the uncertain lasso estimate and de-biased uncertain lasso estimate for the regression model (14) solve minimization problems (15) and (16), respectively. It follows from the operational law of uncertain variables (p. 55 of Liu 2015) that the inverse uncertainty distributions of the uncertain variables \(\ln \tilde{y}_{i}\) are \(\ln F_{i}^{-1}(\alpha )\), \(i=1,2,\ldots ,n\), respectively. Since the function \(\beta _{0}+\beta _{1}\tilde{x}_{1i}+\cdots +\beta _{p}\tilde{x}_{pi}\) is increasing with respect to \(\tilde{x}_{ji}\) when \(\beta _{j} > 0\) and decreasing with respect to \(\tilde{x}_{ji}\) when \(\beta _{j} \le 0\) for each i \((i=1,2,\ldots , n)\), the corollary follows from Theorems 1 and 2 immediately. \(\square \)
Cite this article
Liu, Z., Yang, X. Variable selection in uncertain regression analysis with imprecise observations. Soft Comput 25, 13377–13387 (2021). https://doi.org/10.1007/s00500-021-06129-x