Abstract
Plentiful and diverse Organic wastes such as green, food, pruning and landscaping waste necessitate upon effective and efficient recycling strategies such as composting. In this article, for the first time, machine learning (ML) models have been applied to model compost generation rate as a function of climatic parameters and organic waste content. The data size contained approximately 864 sample points records of the meteorological parameters (Modern-Era Retrospective Analysis), organic waste (Central Pollution Control Board) and compost yields (Open Government Data) data for 2010–2021. The modelling efforts involved the consideration of MLP and traditional ML algorithms namely k-nearest neighbour (kNN), gradient boosting (GB) and random forest (RF) for prediction and autoregressive integrated moving average (ARIMA) model supplemented ML models for long-term forecasting of the compost generation rate. Model validation resulted in an RMSE of 0.757 and R2 of 0.99 for GB model, and a correlation index of 0.68 between observed and predicted values to thereby outperform all other models. However, forecasted data for ten years after the predicted outcomes resulted in the best performance of ARIMA–MLP model with a standard error of 21.1152 and a CP yield of 74,958 kg. Thereby, the findings affirm upon the evidence for the limitations of the broader application of the empirical approaches and the feasibility of ML algorithms as a potential reconstruction technique for developing robust and accurate region-specific compost prediction and forecasting models to assist integrated circular agricultural system development for a sustainable global future.
Similar content being viewed by others
Data availability
The datasets used and analysed in this study are mentioned as below, and the rest of the predicted datasets will be made available on reasonable request.
[dataset] https://www.soda-pro.com/web-services/meteo-data/merra.
[dataset] https://cpcb.nic.in/uploads/MSW/Waste_generation_Composition.pdf.
[dataset] https://data.gov.in/search?title=compost.
References
Census of India Population report (2011) https://censusindia.gov.in/census.website/data/census-tables
Gerdes P, Gunsilius E (2010) The waste experts: enabling conditions for informal sector integration in solid waste management. In: Management. www.gtz.de/recyclingpartnerships
Central Pollution Control Board. MSW Annual Report (2018–19) Accessed from https://cpcb.nic.in/uploads/MSW/MSW_AnnualReport_2018-19.pdf
Ahluwalia IJ, Patel U (2018) Working paper no. 356. Solid waste management in India an assessment of resource recovery and environmental impact Isher judge Ahluwalia. Indian Council for Research on International Economic Relations, pp 1–48
Joshi R, Ahmed S (2016) Status and challenges of municipal solid waste management in India: a review. Cogent Environ Sci 2(1):1139434
Kumar S, Smith SR, Fowler G, Velis C, Kumar SJ, Arya S, Cheeseman C et al (2017) Challenges and opportunities associated with waste management in India. R Soc Open Sci 4(3):160764
United Nations Environment Programme. Food Index Report (2021) Accessed from https://www.unep.org/resources/report/unep-food-waste-index-report-2021
Yang F, Li Y, Han Y, Qian W, Li G, Luo W (2019) Performance of mature compost to control gaseous emissions in kitchen waste composting. Sci Total Environ 657:262–269
Zhang L, Zeng G, Dong H, Chen Y, Zhang J, Yan M, Huang Z et al (2017) The impact of silver nanoparticles on the co-composting of sewage sludge and agricultural waste: evolutions of organic matter and nitrogen. Biores Tech 230(2017):132–139
Laura F, Tamara A, Müller A, Hiroshan H, Christina D, Serena C (2020) Selecting sustainable sewage sludge reuse options through a systematic assessment framework: methodology and case study in Latin America. J Cleaner Prod 242:118389
El Haggar SM (2005) Rural and developing country solutions. Environmental solutions. Academic Press, Cambridge, pp 313–400
Kujawa S, Nowakowski K, Tomczak RJ, Dach J, Boniecki P, Weres J, Rodríguez Carmona PC et al (2014) Neural image analysis for maturity classification of sewage sludge composted with maize straw. Comput Electron Agric 109:302–310
Hao X, Chang C, Larney FJ, Travis GR (2002) Greenhouse gas emissions during cattle feedlot manure composting. J Environ Qual 31(2):700–700
Tchobanoglous G, Kreith F (1994) Integrated solid waste management: engineering principles and management issues. In: Bulletin of science, technology and society
Nelson VL, Crowe TG, Shah MA, Watson LG (2006) Temperature and turning energy of composting feedlot manure at different moisture contents in southern Alberta. Can Biosyst Eng Genie Biosyst Can 48:31–37
Farmer J, Schaeffer S, Zhang B, An T, Pei J, Zhuang J, Wang J (2018) How soil bacterial communities with seasonal variation respond differently to long-term fertilization and plastic film mulching. Pol J Environ Stud 27(4):1483–1496
Ge G, Li Z, Fan F, Chu G, Hou Z, Liang Y (2010) Soil biological activity and their seasonal variations in response to long-term application of organic and inorganic fertilizers. Plant Soil 326(1):31–44
Kishimoto M, Preechaphan C, Yoshida T, Taguchi H (1987) Simulation of an aerobic composting of activated sludge using a statistical procedure. MIRCEN J Appl Microbiol Biotechnol 3(2):113–123
Kaiser J (1996) Modelling composting as a microbial ecosystem: a simulation approach. Ecol Model 91(1–3):25–37
Nakasaki K, Kato J, Akiyama T, Kubota H (1987) A new composting model and assessment of optimum operation for effective drying of composting material. J Ferment Technol 65(4):441–447
Hall SG (1998) Temperature feedback and control via aeration rate regulation in biological composting systems. Ph.D. Dissertation, Cornell University, Ithaca, USA
Lin YP, Huang GH, Lu HW, He L (2008) Modeling of substrate degradation and oxygen consumption in waste composting processes. Waste Manag 28(8):1375–1385
Stombaugh DP, Nokes SE (1996) Development of a biologically based aerobic composting simulation model. Trans Am Soc Agric Eng 39(1):239–250
Hamelers HVM (2004) Modeling composting kinetics: a review of approaches. Rev Environ Sci Biotechnol 3(4):331–342
Das K, Keener HM (1997) Moisture effect on compaction and permeability in composts. J Environ Eng 123(3):275–281
Haug RT (1993) Compost engineering. In: The library of compost engineering software
Mason IG, Milke MW (2005) Physical modelling of the composting environment: a review. Part 1: reactor systems. Waste Manag 25(5):481–500
Sole-Mauri F, Illa J, Magrí A, Prenafeta-Boldú FX, Flotats X (2007) An integrated biochemical and physical model for the composting process. Biores Technol 98(17):3278–3293
Thomas GV (2004) vermicomposting of coconut leaves by Eudrilus sp. in coastal tract of Kerala. J Plant Crops 32:486–490
Díaz MJ, Eugenio ME, López F, García JC, Yañez R (2012) Neural models for optimizing lignocellulosic residues composting process. Waste Biomass Valorization 3(3):319–331
Gao MJ, Tian JW, Jiang W et al (2007) Research of sludge compost maturity degree modeling method based on wavelet neural network for sewage treatment. In: Li K, Fei M, Irwin GW et al (eds) Bio-inspired computational intelligence and applications, vol 4688. LSMS 2007. Lecture Notes in Computer Science. Springer, Berlin, pp 608–618
Xue W, Hu X, Wei Z, Mei X, Chen X, Xu Y (2019) A fast and easy method for predicting agricultural waste compost maturity by image-based deep learning. Bioresour Technol 290:121761
Randles D, Benjamin R, Martens JP, Heine SJ (2018) Searching for answers in an uncertain world: meaning threats lead to increased working memory capacity. PLoS ONE 13(10):1–22
Singhal A, Gupta AK, Dubey B, Ghangrekar MM (2022) Seasonal characterization of municipal solid waste for selecting feasible waste treatment technology for Guwahati city, India. J Air Waste Manag Assoc 72(2):147–160
Iguyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182
Mendenhall W, Sincich T, Danna M (2011) A second course in statistics, 8th edn
Sammut and Webb (2017) Encyclopedia of machine learning and data mining. In: Encyclopedia of machine learning and data mining
Rawlings JO, Pantula SG, Dickey DA (1999) Applied regression analysis: a research tool, second edition. In: The American statistician, vol 53
Kroese D, Botev Z, Taimre T (2011) Data science and machine learning at scale. Lecture notes in computer science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6911 LNAI(November). https://doi.org/10.1007/978-3-642-23780-5_9
Zaki MJ, Meira MJ (2013) Data mining and analysis: fundamental concepts and algorithms. Accessed from https://books.google.com.tr/books?id=Gh9GAwAAQBAJ&lpg=PR9&dq=Data Mining and analysis: foundations and algorithms
Hastie T, Tibshirani R, Friedman J (2011) The elements of statistical learning: data mining, inference, and prediction, 2nd edn (Springer Series in Statistics) (9780387848570): Trevor Hastie, Robert Tibshirani, Jerome Friedman: Books. In: The elements of statistical learning: dta mining, inference, and prediction
Kuhn M, Johnson K (2013) Applied predictive modeling. Applied predictive modeling. Springer, New York
Yang XS (2019) Introduction to algorithms for data mining and machine learning. Academic press, Cambridge
Kubat M (2017) An introduction to machine learning. An introduction to machine learning. Springer, Cham
Shalev-Shwartz S, Ben-David S (2013) Understanding machine learning: from theory to algorithms. In: Understanding machine learning: from theory to algorithms, vol 9781107057135
Flach P (2020) Data, machine learning: the art and science of algorithms that make sense of. In: ACS catalysis, vol 10
Leao S, Bishop I, Evans D (2001) Assessing the demand of solid waste disposal in urban region by urban dynamics modelling in a GIS environment. Resour Conserv Recycl 33(4):289–313
Chen T, Guestrin C (2016) XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining - KDD ’16 785–794. https://doi.org/10.1145/2939672.2939785
Breiman L, Friedman JH, Olshen RA, Stone CJ (2017) Classification and regression trees. In: Classification and regression trees
Abraham B, Ledolter J (1986) Forecast functions implied by autoregressive integrated moving average models and other related forecast procedures. Int Stat Rev Revue Int Stat 54(1):51
Granger CWJ, Ramanathan R (1984) Improved methods of combining forecasts. J Forecast 3:197–204
Hyndman RJ, Koehler AB, Snyder RD, Grose S (2002) A state space framework for automatic forecasting using. Int J Forecasting 18(3):439–454
Sun W, Huang GH, Zeng G, Qin X, Yu H (2011) Quantitative effects of composting state variables on C/N ratio through GA-aided multivariate analysis. Sci Total Environ 409(7):1243–1254
Khalil AI, Hassouna MS, El-Ashqar HMA, Fawzi M (2011) Changes in physical, chemical and microbial parameters during the composting of municipal sewage sludge. World J Microbiol Biotechnol 27(10):2359–2369
Kashyap A, Borthakur R (2019) Identification of types and source-specific characterization and quantification study of solid waste in Guwahati City, Assam, India. Waste management and resource efficiency. Springer, Singapore, pp 373–384
Venkata Rao R (2016) Jaya: a simple and new optimization algorithm for solving constrained and unconstrained optimization problems. Int J Ind Eng Comput 7(1):19–34
Bianchi L, Dorigo M, Gambardella LM, Gutjahr WJ (2009) A survey on metaheuristics for stochastic combinatorial optimization. Nat Comput 8(2):239–287
Blum C, Roli A (2003) Metaheuristics in combinatorial optimization: overview and conceptual comparison. ACM Comput Surv 35(3):268–308
Box GEP, Jenkins GM (2015) Time series analysis: forecasting and control. Wiley, Hoboken, New Jersey
Singh T, Uppaluri RVS (2022) Machine learning tool-based prediction and forecasting of municipal solid waste generation rate: a case study in Guwahati, Assam, India. Int J Environ Sci Technol. https://doi.org/10.1007/s13762-022-04644-4
Rumelhart DE, Mcclelland JL (1986) Parallel distributed processing: explorations in the microstructure of cognition: volume 1: foundations. MIT Press, Cambridge, Mass
Boniecki P, Dach J, Mueller W, Koszela K, Przybyl J, Pilarski K, Olszewski T (2013) Neural prediction of heat loss in the pig manure composting process. Appl Therm Eng 58(1–2):650–655
Acknowledgements
The authors thankfully acknowledge the Centre for the Environment, Indian Institute of Technology Guwahati, India, Guwahati Municipal Corporation (GMC), Assam, India, and Central Pollution Control Board (CPCB).
Funding
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests.
Author information
Authors and Affiliations
Contributions
TS contributed to data extraction, data pre-processing, investigation, modelling, writing—original draft preparation, writing—review and editing, and validation. RVSU contributed to conceptualization, supervision, validation, visualization, and writing—review and editing.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. The authors declare the following financial interests/personal relationships which may be considered as potential competing interests.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Glossary
- ANN
-
An approach based on artificial neural networks (ANN) is used to detect patterns in data or to simulate dynamic interactions between inputs and outputs
- ARIMA
-
A technique called ARIMA based on statistical concept of correlation, where past data points influence subsequent data points, is used to anticipate or predict future events using a previous time series
- ARIMA–MLP
-
In order to build series linear/nonlinear combination models, the hybrid ARIMA–MLP models are frequently employed linear and nonlinear models
- CNN
-
A deep learning neural network designed for processing organised arrays of data is known as a convolutional neural network
- GB
-
A tree-based machine learning method called gradient boosting is employed for classification and regression applications
- kNN
-
One of the most basic machine learning algorithms, kNN, is mostly employed for classification and regression
- ML
-
Machine learning algorithms construct a model from sample data, referred to as training data, in order to make predictions or decisions without being explicitly programmed to do so
- MLP
-
MLP is a term that can be used ambiguously to refer to any feedforward ANN or specifically to networks made up of many layers of perceptrons
- RF
-
Bagging is an ensemble approach that combines the predictions from all models after fitting them to various subsets of a training dataset
- CPCB
-
Municipal solid waste data collected from Central Pollution Control Board of India (a statutory organization under the Ministry of Environment, Forest and Climate Change)
- MERRA-2
-
Modern-Era Retrospective analysis for Research and Applications provided the relevant meteorological data
- OGD
-
Open Government Data Platform India or data.gov.in is a platform for supporting Open data initiative of Government of India
- CP
-
The dependent variable to predict the Compost Production
- T
-
Meteorological parameter: Temperature
- RH
-
Meteorological parameter: Relative Humidity
- P
-
Meteorological parameter: Precipitation
- WS
-
Meteorological parameter: Wind speed
- OW
-
Organic waste found in municipal solid waste include food, paper, wood, sewage sludge, and yard waste
- CL
-
Confidence Levels is the mean of an estimate +/− the variation in the estimate
- MAE
-
Mean absolute error represents the average of the absolute difference between the actual and predicted values in the dataset
- MSE
-
A measurement of how close a fitted line is to plotted data points
- RMSE
-
The square root of the mean square, or the quadratic mean
- R 2
-
The coefficient of determination or R-squared represents the proportion of the variance in the dependent variable which is explained by the linear regression model
- SSE
-
Sum of squared error is the difference between the observed value and the predicted value
- IOA
-
Index of agreement for the models have been calculated as the ratio between the MSE and the potential error
- PO
-
Parameter optimization in ML is the values when the learning algorithm can change independently as it learns and these values are affected by the choice of hyperparameters we provide
- C/N
-
The ratio of the weight of organic carbon to the weight of total nitrogen in a soil or in organic material
- MC
-
Moisture content (or water content) refers to the weight of the water contained in a compost pile
- MSW
-
Municipal solid waste are made up of waste, organics, and recyclable materials, with the municipality overseeing its disposal.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Singh, T., Uppaluri, R.V.S. Application of ANN and traditional ML algorithms in modelling compost production under different climatic conditions. Neural Comput & Applic 35, 13465–13484 (2023). https://doi.org/10.1007/s00521-023-08404-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-08404-4