Abstract
Forest cover type prediction is used for the forest management organizations. It also get the insight on area of the forest cover up to date and development lack in present time. Classification of the forest area and type of trees could eventually help in maintaining the eco system and to get inference on deforestation. In present scenario this problem gains more attention hence to retain the climate change impact forest cover type and area prediction would help a lot. This paper proposes a novel ensemble machine learning based random de-correlated extra decision tree model for the forest cover type prediction. The tree based classifiers perform well in prediction of the forest cover data. Many researchers use tree based classifiers for the problem. Even though the enhancement of the accuracy seems to be lower in the multi-class classification problem. So, this research proposes the Extra random de-correlated decision tree method for the prediction of the forest cover. The results the multiple de-correlated decision trees are aggregated for the final classification. This proposed method is the ensemble based method. In ensemble machine learning method combines several base optimal results in order to produce one final optimal result. A decision tree follows a simple predictive outcomes based on the series of the cause and effect values. While adopting the decision tree models the user has to follow the factors including the variable on which the decision to be taken and threshold for deciding the class. Instead of depending on one tree for decision making, multiple tree split criteria can be considered. Also these ensemble based machine learning allow to fine tune the predictor variable based on the feature to use and split criteria. The random forest based methods follows the bagging strategy. It has a major role in the split aspect and decision-making aspect in significant manner. This machine learning model decides where to split based on random selection of features. Random forest tree methods have a uniqueness where each split can be done through scrutiny of different features. This paper proposes the ensemble machine learning based random de-correlated extra decision tree model for the forest cover type prediction. This algorithm especially suits the problem for the multiclass classification nature. Forest cover type prediction helps in identifying the wilderness type and total area of the forest predicted and available. The dataset considered for the paper is from the UCI Machine Learning repository. It contains various features including elevation, slop, aspect, vertical and horizontal distance to hydrology, fire points and roadways, hill shade, wilderness area, soil type and cover type. Initially the preprocessing is done in the data set by identifying the missing values, outlier detection and formatting data. Later the exploratory analysis is carried out using the Pearson correlation coefficients aspect. Then three machine learning techniques: Multiclass SVM, Boosting and proposed EMLARDE were deployed. The accuracy of the proposed EMLARDE method outperforms the other two algorithms. The proposed algorithm performs well for this multiclass classification.






Similar content being viewed by others
Data availability statement
No datasets were generated or analysed during the current study.
Abbreviations
- RF:
-
Random forest
- DT:
-
Decision tree
- ANN:
-
Artificial neural network
- DNN:
-
Deep neural network
- LR:
-
Logistics regression
- DFNN:
-
Deep feed-forward neural network
- SFS:
-
Sequential forward selection
- FRA:
-
Forest resources assessment
- SAR:
-
Synthetic aperture radar
- FAO:
-
Food and agriculture organization
References
Akbas, A., Buyrukoglu, S.: Stacking ensemble learning-based wireless sensor network deployment parameter estimation. Arab. J. Sci. Eng. 48(8), 9739–9748 (2023). https://doi.org/10.1007/s13369-022-07365-5
Babu, P.A., Rai, A.K., Ramesh, J.V.N., Nithyasri, A., Sangeetha, S., Kshirsagar, P.R., Rajendran, A., Rajaram, A., Dilipkumar, S.: An explainable deep learning approach for oral cancer detection. J. Electr. Eng. Technol. 19(3), 1837–1848 (2024). https://doi.org/10.1007/s42835-023-01654-1
Branco, P., Torgo, I.S., Ribeiro, R.P.: A survey of predictive modeling on imbalanced domains. ACM Comput. Surv. (CSUR) 49(2), 1–50 (2016). https://doi.org/10.1145/2907070
Buyrukoğlu, S., Akbaş, A.: Machine learning based early prediction of type 2 diabetes: a new hybrid feature selection approach using correlation matrix with heatmap and SFS. Balkan J. Electr. Comput. Eng. 10(2), 110–117 (2022). https://doi.org/10.17694/bajece.973129
Buyrukoğlu, S., Yılmaz, Y., Topalcengiz, Z.: Correlation value determined to increase Salmonella prediction success of deep neural network for agricultural waters. Environ. Monit. Assess. 194(5), 373 (2022). https://doi.org/10.1007/s10661-022-10050-7
Buyrukoğlu, S., Savaş, S.: Stacked-based ensemble machine learning model for positioning footballer. Arab. J. Sci. Eng. 48(2), 1371–1383 (2023). https://doi.org/10.1007/s13369-022-06857-8
Buyrukoğlu, S.: New hybrid data mining model for prediction of Salmonella presence in agricultural waters based on ensemble feature selection and machine learning algorithms. J. Food Saf. 41(4), e12903 (2021). https://doi.org/10.1111/jfs.12903
Buyrukoğlu, G., Buyrukoğlu, S., Topalcengiz, Z.: Comparing regression models with count data to artificial neural network and ensemble models for prediction of generic Escherichia coli population in agricultural ponds based on weather station measurements. Microb. Risk Anal. 19, 100171 (2021). https://doi.org/10.1016/j.mran.2021.100171
Buyrukoğlu, S.: Promising cryptocurrency analysis using deep learning. In: 2021 5th International symposium on multidisciplinary studies and innovative technologies (ISMSIT), pp. 372–376. IEEE (2021). https://doi.org/10.1109/ISMSIT52890.2021.9604721
Chiranjeevi, P., Rajaram, A.: A lightweight deep learning model based recommender system by sentiment analysis. J. Intell. Fuzzy Syst. 44(6), 10537–10550 (2023). https://doi.org/10.3233/JIFS-223871
Doğru, A., Buyrukoğlu, S., Arı, M.: A hybrid super ensemble learning model for the early-stage prediction of diabetes risk. Med. Biol. Eng. Compu. 61(3), 785–797 (2023). https://doi.org/10.1007/s11517-022-02749-z
Gong, J., Kim, H.: Rhsboost: Improving classifification performance in imbalance data. Comput. Stat. Data Anal. 111, 1–13 (2017). https://doi.org/10.1016/j.csda.2017.01.005
Ferreira, L.E.B., Barddal, J.P., Gomes, H.M., Enembreck, F.: Improving credit risk prediction in online peer-to-peer (p2p) lending using imbalanced learning techniques. In: Tools with Artifificial Intelligence (ICTAI), 2017 IEEE 29th International Conference on. IEEE, pp. 175–181 (2017). https://doi.org/10.1109/ICTAI.2017.00037
Ferreira, L.E.B., Gomes, H.M., Bifet, A.: Adaptive andom forests with resampling for imbalanced data streams. International Joint Conference on Neural Networks, pp. 14–19 (2019). https://doi.org/10.1109/IJCNN.2019.8852027
Friedl, M.A., Sulla-Menashe, D., Tan, B., Schneider, A., Ramankutty, N., Sibley, A., Huang, X.: M.: MODIS collection 5 global land cover: algorithm refinements and characterization of new datasets. Remote Sens. Environ. 114, 168–182 (2009). https://doi.org/10.1016/j.rse.2009.08.016
He, H., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009). https://doi.org/10.1109/TKDE.2008.239
Kalpana, R., Subburaj, V., Lokanadham, R., Amudha, K., Beena Bethel, G.N., Shukla, A.K., Kshirsagar, P.R., Rajaram, A.: Internet of things (IOT) based machine learning techniques for wind energy harvesting. Electr. Power Compon. Syst. 14, 1–17 (2023). https://doi.org/10.1080/15325008.2023.2293952
Meenakshi, K., Revathi, M., Harsha, S.S., Tamilarasi, K., Shanthi, T.S., Sugumar, D., Suriyakrishnaan, K., Uma Maheswari, B., Rajaram, A.: Hybrid machine learning approach for trust evaluation to secure MANET from routing attacks. J. Intell. Fuzzy Syst. (2024). https://doi.org/10.3233/JIFS-231918
Oza, N.C.: Online bagging and boosting. Syst., Man Cybern. IEEE Int. Conf. 3, 2340–2345 (2005)
Poloju, N., Rajaram, A.: Transformation with Yolo Tiny Network architecture for multimodal fusion in lung disease classification. Cybern. Syst. 17, 1–22 (2024). https://doi.org/10.1080/01969722.2024.2343992
Qin, Y., Xiao, X., Tang, H., Dubayah, R., Doughty, R., Liu, D., Liu, F., Shimabukuro, Y., Arai, E., Wang, X., Moore, B.: Annual forest and evergreen forest cover maps in the Brazilian Amazon in terms of FAO’s forest definition. Earth Syst. Sci. Data (2023). https://doi.org/10.5194/essd-16-321-2024
Sun, Y., Wong, A.K.C., Kamel, M.S.: Classifification of imbalanced data: a review. Int. J. Pattern Recognit. Artifificial Intell. 23(4), 687–719 (2009). https://doi.org/10.1142/S0218001409007326
Acknowledgements
There is no acknowledgement involved in this work.
Funding
No funding is involved in this work.
Author information
Authors and Affiliations
Contributions
All authors are contributed equally to this work.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Ethics approval and consent to participate
No participation of humans takes place in this implementation process.
Human and animal rights
No violation of Human and Animal Rights is involved.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Guhan, T., Revathy, N. EMLARDE tree: ensemble machine learning based random de-correlated extra decision tree for the forest cover type prediction. SIViP 18, 8525–8536 (2024). https://doi.org/10.1007/s11760-024-03470-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-024-03470-0