Abstract
Predictive models learn relationships between dependent and independent features of a dataset to forecast future outcomes. The point forecasting models aim to predict a single numeric value for future input. Most of the time, point prediction models lead to inaccurate predictions due to the scattering variance of data from the fitted curve. However, the decision-makers often choose a range of plausible estimates leading to range forecasting to overcome human cognitive bias and catastrophic forecasting errors. In this paper, we present a novel range prediction model that predicts a range of most probable outcomes for future input. We first fit a robust nonlinear regression model to the data. Then, we compute the slope of the tangent line at each data point on the nonlinear fitting curve. We introduce an angular confidence interval and use it to generate conical (angular) sectors at each data point on fitted curve. The conical sector produces an arbitrarily shaped range residual interval. Finally, we apply gradient descent optimization to the range residual sum of squares to get the optimum range prediction results. Experiments are performed on four publicly available data sets, and the results show the viability of our approach.
Similar content being viewed by others
Data availability
The datasets analyzed during the current study are available from the corresponding author on request.
References
Almeida AMd, Castel-Branco MM, Falcao A (2002) Linear regression for calibration lines revisited: weighting schemes for bioanalytical methods. J Chromatogr B 774(2):215–222
Bottou L, Curtis FE, Nocedal J (2018) Optimization methods for large-scale machine learning. SIAM Rev 60(2):223–311
Brown AM (2001) A step-by-step guide to non-linear regression analysis of experimental data using a microsoft excel spreadsheet. Comput Methods Programs Biomed 65(3):191–200
Chen SX (1994) Empirical likelihood confidence intervals for linear regression coefficients. J Multivar Anal 49(1):24–40
Das M, Ghosh SK, Chowdary V et al (2016) A probabilistic nonlinear model for forecasting daily water level in reservoir. Water Resour Manage 30:3107–3122
De Vieaux RD, Schumi J, Schweinsberg J et al (1998) Prediction intervals for neural networks via nonlinear regression. Technometrics 40(4):273–282
Del Rio FJ, Riu J, Rius FX (2001) Prediction intervals in linear regression taking into account errors on both axes. J Chemom 15(10):773–788
Donda J (2018) Headbrain-simple linear regression. https://www.kaggle.com/code/codefordata/headbrain-simple-linear-regression/data. Accessed 10 Sept 2020
Fan J, Wu L, Zheng J et al (2021) Medium-range forecasting of daily reference evapotranspiration across china using numerical weather prediction outputs downscaled by extreme gradient boosting. J Hydrol 601:126664
Gneiting T, Balabdaoui F, Raftery AE (2007) Probabilistic forecasts, calibration and sharpness. J R Stat Soc 69(2):243–268
Halperin M (1963) Confidence interval estimation in non-linear regression. J Roy Stat Soc 25(2):330–333
Henley SS, Golden RM, Kashner TM (2020) Statistical modeling methods: challenges and strategies. Biostat Epidemiol 4(1):105–139
Leatherbarrow RJ (1990) Using linear and non-linear regression to fit biochemical data. Trends Biochem Sci 15(12):455–458
Liu K (1988) Measurement error and its impact on partial correlation and multiple linear regression analyses. Am J Epidemiol 127(4):864–874
Magnus JR (1982) Multivariate error components analysis of linear and nonlinear regression models by maximum likelihood. J Econ 19(2–3):239–285
Makridakis S (1994) Time series prediction: forecasting the future and understanding the past. Int J Forecast 10(3):463–466
Mangasarian OL, Street WN, Wolberg WH (1995) Breast cancer diagnosis and prognosis via linear programming. https://EconPapers.repec.org/RePEc:inm:oropre:v:43:y:1995:i:4:p:570-577. Accessed 12 Sept 2020
Martin RF (2000) General deming regression for estimating systematic bias and its confidence interval in method-comparison studies. Clin Chem 46(1):100–104
Mishra VK, Tiwari N, Ajaymon S (2019) Dengue disease spread prediction using twofold linear regression. In: IEEE 9th International Conference on Advanced Computing (IACC), pp 182–187
Nathans LL, Oswald FL, Nimon K (2012) Interpreting multiple linear regression: a guidebook of variable importance. Pract assess Res Eval 17(9):1–19
Park J, Bhuiyan MZA, Kang M et al (2018) Nearest neighbor search with locally weighted linear regression for heartbeat classification. Soft Comput 22:1225–1236
Pham TM (2010) Similar triangles and orientation in plane elementary geometry for coq-based proofs. In: Proceedings of the 2010 ACM Symposium on Applied Computing, pp 1268–1269
Posamentier AS (2002) Advanced Euclidian Geometry: Excursions for Students and Teachers. Springer Science & Business Media, Key College Emeryville, CA
Skouras K, Goutis C, Bramson M (1994) Estimation in linear models using gradient descent with early stopping. Stat Comput 4(4):271–278
T.A. B, G. M, R.J. A (2009) Global, regional, and national fossil-fuel CO2 emissions. Carbon Dioxide Information Analysis Center (CDIAC) Datasets https://doi.org/10.3334/cdiac/00001, https://cir.nii.ac.jp/crid/1360011142940653056
Venema GA (2013) Exploring advanced Euclidean geometry with GeoGebra. Am Math Soc 44:1–6
Williams CK (1998) Prediction with gaussian processes: From linear regression to linear prediction and beyond. Learning in graphical models pp 599–621
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors confirm that they do not have known competing interests of any kind that could influence the work reported in this paper.
Ethical approval
This article does not contain any studies on human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kumar, V., Krishna, P.R. A novel range prediction model using gradient descent optimization and regression techniques. J Ambient Intell Human Comput 14, 14277–14289 (2023). https://doi.org/10.1007/s12652-023-04665-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-023-04665-y