Skip to main content

Advertisement

Log in

EMLARDE tree: ensemble machine learning based random de-correlated extra decision tree for the forest cover type prediction

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Forest cover type prediction is used for the forest management organizations. It also get the insight on area of the forest cover up to date and development lack in present time. Classification of the forest area and type of trees could eventually help in maintaining the eco system and to get inference on deforestation. In present scenario this problem gains more attention hence to retain the climate change impact forest cover type and area prediction would help a lot. This paper proposes a novel ensemble machine learning based random de-correlated extra decision tree model for the forest cover type prediction. The tree based classifiers perform well in prediction of the forest cover data. Many researchers use tree based classifiers for the problem. Even though the enhancement of the accuracy seems to be lower in the multi-class classification problem. So, this research proposes the Extra random de-correlated decision tree method for the prediction of the forest cover. The results the multiple de-correlated decision trees are aggregated for the final classification. This proposed method is the ensemble based method. In ensemble machine learning method combines several base optimal results in order to produce one final optimal result. A decision tree follows a simple predictive outcomes based on the series of the cause and effect values. While adopting the decision tree models the user has to follow the factors including the variable on which the decision to be taken and threshold for deciding the class. Instead of depending on one tree for decision making, multiple tree split criteria can be considered. Also these ensemble based machine learning allow to fine tune the predictor variable based on the feature to use and split criteria. The random forest based methods follows the bagging strategy. It has a major role in the split aspect and decision-making aspect in significant manner. This machine learning model decides where to split based on random selection of features. Random forest tree methods have a uniqueness where each split can be done through scrutiny of different features. This paper proposes the ensemble machine learning based random de-correlated extra decision tree model for the forest cover type prediction. This algorithm especially suits the problem for the multiclass classification nature. Forest cover type prediction helps in identifying the wilderness type and total area of the forest predicted and available. The dataset considered for the paper is from the UCI Machine Learning repository. It contains various features including elevation, slop, aspect, vertical and horizontal distance to hydrology, fire points and roadways, hill shade, wilderness area, soil type and cover type. Initially the preprocessing is done in the data set by identifying the missing values, outlier detection and formatting data. Later the exploratory analysis is carried out using the Pearson correlation coefficients aspect. Then three machine learning techniques: Multiclass SVM, Boosting and proposed EMLARDE were deployed. The accuracy of the proposed EMLARDE method outperforms the other two algorithms. The proposed algorithm performs well for this multiclass classification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Data availability statement

No datasets were generated or analysed during the current study.

Abbreviations

RF:

Random forest

DT:

Decision tree

ANN:

Artificial neural network

DNN:

Deep neural network

LR:

Logistics regression

DFNN:

Deep feed-forward neural network

SFS:

Sequential forward selection

FRA:

Forest resources assessment

SAR:

Synthetic aperture radar

FAO:

Food and agriculture organization

References

  1. Akbas, A., Buyrukoglu, S.: Stacking ensemble learning-based wireless sensor network deployment parameter estimation. Arab. J. Sci. Eng. 48(8), 9739–9748 (2023). https://doi.org/10.1007/s13369-022-07365-5

    Article  Google Scholar 

  2. Babu, P.A., Rai, A.K., Ramesh, J.V.N., Nithyasri, A., Sangeetha, S., Kshirsagar, P.R., Rajendran, A., Rajaram, A., Dilipkumar, S.: An explainable deep learning approach for oral cancer detection. J. Electr. Eng. Technol. 19(3), 1837–1848 (2024). https://doi.org/10.1007/s42835-023-01654-1

    Article  Google Scholar 

  3. Branco, P., Torgo, I.S., Ribeiro, R.P.: A survey of predictive modeling on imbalanced domains. ACM Comput. Surv. (CSUR) 49(2), 1–50 (2016). https://doi.org/10.1145/2907070

    Article  Google Scholar 

  4. Buyrukoğlu, S., Akbaş, A.: Machine learning based early prediction of type 2 diabetes: a new hybrid feature selection approach using correlation matrix with heatmap and SFS. Balkan J. Electr. Comput. Eng. 10(2), 110–117 (2022). https://doi.org/10.17694/bajece.973129

    Article  Google Scholar 

  5. Buyrukoğlu, S., Yılmaz, Y., Topalcengiz, Z.: Correlation value determined to increase Salmonella prediction success of deep neural network for agricultural waters. Environ. Monit. Assess. 194(5), 373 (2022). https://doi.org/10.1007/s10661-022-10050-7

    Article  Google Scholar 

  6. Buyrukoğlu, S., Savaş, S.: Stacked-based ensemble machine learning model for positioning footballer. Arab. J. Sci. Eng. 48(2), 1371–1383 (2023). https://doi.org/10.1007/s13369-022-06857-8

    Article  Google Scholar 

  7. Buyrukoğlu, S.: New hybrid data mining model for prediction of Salmonella presence in agricultural waters based on ensemble feature selection and machine learning algorithms. J. Food Saf. 41(4), e12903 (2021). https://doi.org/10.1111/jfs.12903

    Article  Google Scholar 

  8. Buyrukoğlu, G., Buyrukoğlu, S., Topalcengiz, Z.: Comparing regression models with count data to artificial neural network and ensemble models for prediction of generic Escherichia coli population in agricultural ponds based on weather station measurements. Microb. Risk Anal. 19, 100171 (2021). https://doi.org/10.1016/j.mran.2021.100171

    Google Scholar 

  9. Buyrukoğlu, S.: Promising cryptocurrency analysis using deep learning. In: 2021 5th International symposium on multidisciplinary studies and innovative technologies (ISMSIT), pp. 372–376. IEEE (2021). https://doi.org/10.1109/ISMSIT52890.2021.9604721

  10. Chiranjeevi, P., Rajaram, A.: A lightweight deep learning model based recommender system by sentiment analysis. J. Intell. Fuzzy Syst. 44(6), 10537–10550 (2023). https://doi.org/10.3233/JIFS-223871

    Article  Google Scholar 

  11. Doğru, A., Buyrukoğlu, S., Arı, M.: A hybrid super ensemble learning model for the early-stage prediction of diabetes risk. Med. Biol. Eng. Compu. 61(3), 785–797 (2023). https://doi.org/10.1007/s11517-022-02749-z

    Article  Google Scholar 

  12. Gong, J., Kim, H.: Rhsboost: Improving classifification performance in imbalance data. Comput. Stat. Data Anal. 111, 1–13 (2017). https://doi.org/10.1016/j.csda.2017.01.005

    Article  Google Scholar 

  13. Ferreira, L.E.B., Barddal, J.P., Gomes, H.M., Enembreck, F.: Improving credit risk prediction in online peer-to-peer (p2p) lending using imbalanced learning techniques. In: Tools with Artifificial Intelligence (ICTAI), 2017 IEEE 29th International Conference on. IEEE, pp. 175–181 (2017). https://doi.org/10.1109/ICTAI.2017.00037

  14. Ferreira, L.E.B., Gomes, H.M., Bifet, A.: Adaptive andom forests with resampling for imbalanced data streams. International Joint Conference on Neural Networks, pp. 14–19 (2019). https://doi.org/10.1109/IJCNN.2019.8852027

  15. Friedl, M.A., Sulla-Menashe, D., Tan, B., Schneider, A., Ramankutty, N., Sibley, A., Huang, X.: M.: MODIS collection 5 global land cover: algorithm refinements and characterization of new datasets. Remote Sens. Environ. 114, 168–182 (2009). https://doi.org/10.1016/j.rse.2009.08.016

    Article  Google Scholar 

  16. He, H., Garcia, E.A.: Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 21(9), 1263–1284 (2009). https://doi.org/10.1109/TKDE.2008.239

    Article  Google Scholar 

  17. Kalpana, R., Subburaj, V., Lokanadham, R., Amudha, K., Beena Bethel, G.N., Shukla, A.K., Kshirsagar, P.R., Rajaram, A.: Internet of things (IOT) based machine learning techniques for wind energy harvesting. Electr. Power Compon. Syst. 14, 1–17 (2023). https://doi.org/10.1080/15325008.2023.2293952

    Article  Google Scholar 

  18. Meenakshi, K., Revathi, M., Harsha, S.S., Tamilarasi, K., Shanthi, T.S., Sugumar, D., Suriyakrishnaan, K., Uma Maheswari, B., Rajaram, A.: Hybrid machine learning approach for trust evaluation to secure MANET from routing attacks. J. Intell. Fuzzy Syst. (2024). https://doi.org/10.3233/JIFS-231918

    Article  Google Scholar 

  19. Oza, N.C.: Online bagging and boosting. Syst., Man Cybern. IEEE Int. Conf. 3, 2340–2345 (2005)

    Google Scholar 

  20. Poloju, N., Rajaram, A.: Transformation with Yolo Tiny Network architecture for multimodal fusion in lung disease classification. Cybern. Syst. 17, 1–22 (2024). https://doi.org/10.1080/01969722.2024.2343992

    Article  Google Scholar 

  21. Qin, Y., Xiao, X., Tang, H., Dubayah, R., Doughty, R., Liu, D., Liu, F., Shimabukuro, Y., Arai, E., Wang, X., Moore, B.: Annual forest and evergreen forest cover maps in the Brazilian Amazon in terms of FAO’s forest definition. Earth Syst. Sci. Data (2023). https://doi.org/10.5194/essd-16-321-2024

    Article  Google Scholar 

  22. Sun, Y., Wong, A.K.C., Kamel, M.S.: Classifification of imbalanced data: a review. Int. J. Pattern Recognit. Artifificial Intell. 23(4), 687–719 (2009). https://doi.org/10.1142/S0218001409007326

    Article  Google Scholar 

Download references

Acknowledgements

There is no acknowledgement involved in this work.

Funding

No funding is involved in this work.

Author information

Authors and Affiliations

Authors

Contributions

All authors are contributed equally to this work.

Corresponding author

Correspondence to T. Guhan.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Ethics approval and consent to participate

No participation of humans takes place in this implementation process.

Human and animal rights

No violation of Human and Animal Rights is involved.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guhan, T., Revathy, N. EMLARDE tree: ensemble machine learning based random de-correlated extra decision tree for the forest cover type prediction. SIViP 18, 8525–8536 (2024). https://doi.org/10.1007/s11760-024-03470-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-024-03470-0

Keywords