Skip to main content
Log in

Granite porosity prediction under varied thermal conditions using machine learning models

  • Research
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

    We’re sorry, something doesn't seem to be working properly.

    Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Porosity estimation at high granite temperatures is essential for numerous purposes, including natural and enhanced geothermal energy production. However, these methods of determining porosity have some disadvantages, such as being labor-intensive, requiring expensive instrumentation, and taking a significant amount of time, particularly during elevated temperature treatments. This study is vital for advancing geothermal energy applications by addressing the limitations of traditional porosity estimation methods. The study introduces innovative predictive machine learning (ML) models and offers insights into practical applications for improving geothermal reservoir management and sustainability. the datasets divided into six subsets, each with a specific grain size (fine, medium, or coarse) and cooling method furnace cooling (FC) and water cooling (WC). The cooling rate has a significant impact on rock thermal stress. In this study, six ML models—Random Forest (RF), K-Nearest Neighbors (KNN), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), Categorical Boosting (CatBoost), and Light Gradient Boosting Machine (LightGBM)—were employed to predict models for estimating porosity at high temperatures. The multiple evaluation metrics were used to assess the suitability of these algorithms. R-squared values were calculated to assess the models’ goodness-of-fit. In parallel, several error metrics such as root mean squared error (RMSE), mean absolute error (MAE), mean absolute percentage error (MAPE), root mean square relative error (RMSRE), mean absolute relative error (MARE), and percent bias (PBIAS) were evaluated to quantify the accuracy of the predictions. The performance of the ML methods exhibited considerable variability among the different datasets. Among the evaluated models, CatBoost and KNN consistently achieved higher R-squared values and lower error metrics, demonstrating their effectiveness in accurately discerning underlying patterns and reliably predicting porosity. Conversely, RF, XGBoost, and SVM yielded reasonably accurate predictions, albeit with slightly increased variability in their performance metrics across different grain size and cooling condition subsets. LightGBM demonstrates comparatively diminished prediction accuracy, since it fails to capture the complex fluctuations in rock properties over thermal heating and cooling cycles. These findings highlight the superior predictive efficacy of CatBoost and KNN, confirming their reliability as robust tools for modeling porosity in complex datasets with varied geological and environmental factors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21

Similar content being viewed by others

Data availability

No datasets were generated or analysed during the current study.

Abbreviations

FC:

Furnace Cooling 

WC:

Water Cooling

RF:

Random Forest

KNN:

K-Nearest Neighbor

XGBoost:

Extreme Gradient Boosting

SVM:

Support Vector Machine

CatBoost:

Categorical Boosting

LightGBM:

Light Gradient Boosting Machine

RMSE:

Root Mean Squared Error

MAE:

Mean Absolute Error

MAPE:

Mean Absolute Percentage Error

RMSRE:

Root Mean Square Relative Error

MARE:

Mean Absolute Relative Error

PBIAS:

Percent Bias

ML:

Machine Learning

EGS:

Enhanced Geothermal System

LN2:

Liquid Nitrogen

PSO-SVM:

Particle Swarm Optimization Support Vector Machine

AdaBoost:

Adaptive Boosting

CART:

Classification and Regression Trees

RBFNN:

Radial Basis Neural Network

MLR:

Multiple Linear Regression

SVR:

Support Vector Regression

FG:

Fine-Grained

MG:

Medium-Grained

CG:

Coarse-Grained 

EDA:

Exploratory Data Analysis

IQR:

Interquartile range

SHAP:

SHapley Additive exPlanations

PDP:

Partial Dependence Plot

ICE:

Individual Conditional Expectation

References

Download references

Funding

This study was financial supported by a fellowship from the Council of Scientific and Industrial Research (CSIR). The fellowship was awarded to the first author.

Author information

Authors and Affiliations

Authors

Contributions

RD: Research Conceptualization, Experimental investigation, Data Collection, Methodology Development, Data Analysis, Manuscript Writing, Editing and ReviewBP: Data Collection, Methodology Development, Data Analysis, reviewPKG: Research Conceptualization, Experimental investigation, Data Collection, Methodology Development, Data Analysis, Manuscript Writing, Editing and ReviewPG: Data Collection, Methodology Development, Review, Data AnalysisSD: Editing and Review, SupervisionKHS: Editing and Review, SupervisionTNS: Editing and Review, Supervision.

Corresponding author

Correspondence to PK Gautam.

Ethics declarations

Competing interests

The authors declare no competing interests.

Conflict of interest

The authors have declared that there is no conflict of interest in this submitted manuscript.

Additional information

Communicated by: H. Babaie

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dwivedi, R., Prasad, B., Gautam, P. et al. Granite porosity prediction under varied thermal conditions using machine learning models. Earth Sci Inform 18, 211 (2025). https://doi.org/10.1007/s12145-025-01726-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s12145-025-01726-y

Keywords