Skip to main content

Advertisement

Log in

Forest data visualization and land mapping using support vector machines and decision trees

  • Research Article
  • Published:
Earth Science Informatics Aims and scope Submit manuscript

Abstract

Forests play a vital role in the regulation of climate, absorption of carbon dioxide, hydrological cycle, conservation of water, soil and biodiversity and help mitigate natural disasters. With the help of various remote sensors, high-resolution satellite images are being collected nowadays, which helps in tackling the global challenges of forest mapping in remote areas. Each landscape will grow different types of trees and in turn substantiate a part of the country’s economy. This paper uses visualization and machine learning (ML) processes to classify the forest land on the terrain dataset composed of the advanced spaceborne thermal emission and reflection radiometer (ASTER) imaging instrument to get the insight of the cumulated data by using Box Plot and Heat Map. The accuracy obtained by utilizing different machine learning techniques like Support Vector Machine (SVM) gives 95.4%, Logistic Regression (LR) gives 94.5%, K-Nearest Neighbor (K-NN) gives 93.7%, Decision Tree (DT) with 89.5%, Stochastic Gradient Descendent (SGD) with 92.4% and CN2 Rule Induction (RI) gives 85.3% are allied which gives appreciable results in forest mapping substantiated the same with confusion matrix and ROC. We also obtained the DT and rules for the considered dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

  • Advanced Spaceborne Thermal Emission and Reflection Radiometer. Wikipedia, Wikimedia Foundation (2020), en.wikipedia.org/wiki/Advanced_Spaceborne_Thermal_Emission_and_Reflection_Radiometer. Accessed 27 Apr 2020

  • Arabameri A, Chen W, Loche M, Zhao X, Li Y, Lombardo L, Cerda A, Pradhan B, Bui DT (2019) Comparison of machine learning models for gully erosion susceptibility mapping. Geosci Front. https://doi.org/10.1016/j.gsf.2019.11.009

  • Bertsimas D, King A (2017) Logistic regression: from art to science. Stat Sci 32(3):367–384

    Article  Google Scholar 

  • Bonilla-Bedoya S, Mora A, Vaca A, Estrella A, Herrera MÁ (2020) Modelling the relationship between urban expansion processes and urban forest characteristics: An application to the Metropolitan District of Quito. Comput Environ Urban Syst 79:101420

    Article  Google Scholar 

  • Chamaecyparis obtusa. Wikipedia, Wikimedia Foundation, 27 April 2020, en.wikipedia.org/wiki/Chamaecyparis_obtusa. Accessed 27 Apr 2020

  • Chang CJ, Li DC, Huang YH, Chen CC (2015) A novel gray forecasting model based on the box plot for small manufacturing data sets. Appl Math Comput 265:400–408

  • Chaves PP, Zuquim G, Ruokolainen K, Doninck JV, Kalliola R, Rivero EG, Tuomisto H (2020) Mapping floristic patterns of trees in peruvian amazonia using remote sensing and machine learning. Remote Sens 12:1523

    Article  Google Scholar 

  • Cox DR (1972) Regression models and life-ss. J R Stat Soc Ser B Methodol 34(2):187–202

    Google Scholar 

  • Deilami K, Khajeh S, Bolhassani N, Jazireeyan I (2012) Application of aster satellite in digital elevation model generation: a review. Res J Environ Earth Sci 4(12):1033–1037

    Google Scholar 

  • Deng Z, Zhu X, Cheng D, Zong M, Zhang S (2016) Efficient kNN classification algorithm for big data. Neurocomputing. 195:143–148

    Article  Google Scholar 

  • Eckert S, Kellenberger T, Itten K (2005) Accuracy assessment of automatically derived digital elevation models from ASTER data in mountainous terrain. Int J Remote Sens 26(9):1943–1957

    Article  Google Scholar 

  • Elovici Y, Braha D (2003) A decision-theoretic approach to data mining. IEEE Trans Syst Man Cybern Syst Hum 33(1):42–51

    Article  Google Scholar 

  • Fukuda M, Iehara T, Matsumoto M (2003) Carbon stock estimates for sugi and hinoki forests in Japan. For Ecol Manag 184(1–3):1–6

    Article  Google Scholar 

  • Ge G, Shi Z, Zhu Y, Yang X, Hao Y (2020) Land use/cover classification in an arid desert-oasis mosaic landscape of China using remote sensed imagery: performance assessment of four machine learning algorithms. Global Ecol Conserv 22:e00971

    Article  Google Scholar 

  • Gu Y, Cheng L (2017) Classification of class overlapping datasets by kernel-MTS method. Int J Innov Comput Inf Control 13(5):1759–1768

    Google Scholar 

  • Gumus E, Kirci P (2018) Selection of spectral features for land cover type classification. Expert Syst Appl 102:27–35

    Article  Google Scholar 

  • Hays J (2009) “Trees in Japan: Allergy-Causing Cedars, Hinoki and Dogwood.” Facts and Details. factsanddetails.com/japan/cat26/sub164/item2912.html. Accessed 27 Apr 2020

  • Hsu YC (2018) Using decision tree algorithm for the detection and decision rule construction of defect on dispensing process (Master’s Thesis, NSYS University)

  • Ketkar N (2017) Introduction to pytorch. InDeep learning with python. Apress, Berkeley, pp 195–208

    Google Scholar 

  • Kraxner F, Schepaschenko D, Fuss S, Lunnan A, Kindermann G, Aoki K, Dürauer M, Shvidenko A, See L (2017) Mapping certified forests for sustainable management-a global tool for information improvement through participatory and collaborative mapping. Forest Policy Econ 83:10–18

    Article  Google Scholar 

  • Landgrebe TC, Duin RP (2008) Efficient multiclass ROC approximation by decomposition via confusion matrix perturbation analysis. IEEE Trans Pattern Anal Mach Intell 30(5):810–822

    Article  Google Scholar 

  • Lem S, Onghena P, Verschaffel L, Van Dooren W (2013) The heuristic interpretation of box plots. Learn Instr 26:22–35

    Article  Google Scholar 

  • Li DC, Huang WT, Chen CC, Chang CJ (2014) Employing box plots to build high-dimensional manufacturing models for new products in TFT-LCD plants. Neurocomputing. 142:73–85

    Article  Google Scholar 

  • Marom ND, Rokach L, Shmilovici A (2010) Using the confusion matrix for improving ensemble classifiers. In 2010 IEEE 26-th convention of electrical and electronics engineers in Israel (pp. 000555-000559). IEEE

  • Masek JG, Hayes DJ, Hughes MJ, Healey SP, Turner DP (2015) The role of remote sensing in process-scaling studies of managed forest ecosystems. For Ecol Manag 355:109–123

    Article  Google Scholar 

  • Mason C, Twomey J, Wright D, Whitman L (2018) Predicting engineering student attrition risk using a probabilistic neural network and comparing results with a backpropagation neural network and logistic regression. Res High Educ 59(3):382–400

    Article  Google Scholar 

  • McKinley DC, Ryan MG, Birdsey RA, Giardina CP, Harmon ME, Heath LS, Houghton RA, Jackson RB, Morrison JF, Murray BC, Pataki DE (2011) A synthesis of current knowledge on forests and carbon storage in the United States. Ecol Appl 21(6):1902–1924

    Article  Google Scholar 

  • Nacson MS, Srebro N, Soudry D (2018) Stochastic gradient descent on separable data: Exact convergence with a fixed learning rate. arXiv preprint arXiv:1806.01796

  • Nguyen VV, Pham BT, Vu BT, Prakash I, Jha S, Shahabi H, Shirzadi A, Ba DN, Kumar R, Chatterjee JM, Tien BD (2019) Hybrid machine learning approaches for landslide susceptibility modeling. Forests 10(2):157

    Article  Google Scholar 

  • Nuzzo RL (2016) The box plots alternative for visualizing quantitative data. PM&R. 8(3):268–272

    Article  Google Scholar 

  • Ohannessian R, Bénet T, Argaud L, Guérin C, Guichon C, Piriou V, Rimmelé T, Girard R, Gerbier-Colomban S, Vanhems P (2017) Heat map for data visualization in infection control epidemiology: an application describing the relationship between hospital-acquired infections, simplified acute physiological score II, and length of stay in adult intensive care units. Am J Infect Control 45(7):746–749

    Article  Google Scholar 

  • Oliveira AC, Botega LC, Saran JF, Silva JN, Melo JO, Tavares MF, Neris VP (2019) Crowdsourcing, data and information fusion and situation awareness for emergency management of forest fires: the project DF100Fogo (FDWithoutFire). Comput Environ Urban Syst 77:101172

    Article  Google Scholar 

  • Othman O (2018) Pruning methods for rule induction (Doctoral dissertation, University of Salford)

  • Prasetio RT, Ripandi E (2019 Apr 22) Optimasi Klasifikasi jenis hutan menggunakan deep learning berbasis optimize selection. J Inf Secur 6(1):100–106

    Google Scholar 

  • Rabah M, El-Hattab A, Abdallah M (2017) Assessment of the most recent satellite based digital elevation models of Egypt. NRIAG J Astron Geophys 6(2):326–335

    Article  Google Scholar 

  • Saa AA (2016) Educational data mining & students’ performance prediction. Int J Adv Comput Sci Appl 7(5):212–220

    Google Scholar 

  • Sabanci K, Ünlerşen MF, Polat K (2016) Classification of different forest types with machine learning algorithms. Res Rural Dev 1

  • Shabani S, Pourghasemi HR, Blaschke T (2020) Forest stand susceptibility mapping during harvesting using logistic regression and boosted regression tree machine learning models. Global Ecol Conserv 22:e00974

    Article  Google Scholar 

  • Sujatha R, Ezhilmaran D (2013) Evaluation of classifiers to enhance model selection. Int J Comput Sci Eng Technol 4(01)

  • Susmaga R (2004) Confusion matrix visualization. In intelligent information processing and web mining. Springer, Berlin, Heidelberg, pp 107–116

    Book  Google Scholar 

  • Swe SM, Sett KM (2019) Approaching rules induction: CN2 algorithm in categorizing of biodiversity. Int J Trend Sci Res Dev 3(4):1581–1584

    Google Scholar 

  • Toutin T, Cheng P (2002) Comparison of automated digital elevation model extraction results using along-track ASTER and across-track SPOT stereo images. Opt Eng Bellingham Int Soc Opt Eng 41(9):2102–2106

    Google Scholar 

  • UCI (2015) Machine Learning Repository: Forest Type Mapping Data Set, archive.ics.uci.edu/ml/datasets/Forest+type+mapping. Accessed 27 Apr 2020

  • Walker S, Khan W, Katic K, Maassen W, Zeiler W (2020) Accuracy of different machine learning algorithms and added-value of predicting aggregated-level energy performance of commercial buildings. Energy Build 209:109705

    Article  Google Scholar 

  • Wang Y, Ziv G, Adami M, Mitchard E, Batterman SA, Buermann W, Marimon BS, Junior BH, Reis SM, Rodrigues D, Galbraith D (2019) Mapping tropical disturbed forests using multi-decadal 30 m optical satellite imagery. Remote Sens Environ 221:474–488

    Article  Google Scholar 

  • Wang H, Zhang L, Yin K, Luo H, Li J (2020) Landslide identification using machine learning. Geosci Front. https://doi.org/10.1016/j.gsf.2020.02.012

  • Xing W, Bei Y (2019) Medical health big data classification based on KNN classification algorithm. IEEE Access 8:28808–28819

    Article  Google Scholar 

  • Yale University (2018) Japanese Red-Cedar (Sugi) | Yale Nature Walk, naturewalk.yale.edu/trees/cupressaceae/cryptomeria-japonica/japanese-red-cedar-sugi-83. Accessed 27 Apr 2020

  • Yamaguchi Y, Kahle AB, Tsu H, Kawakami T, Pniel M (1998) Overview of advanced spaceborne thermal emission and reflection radiometer (ASTER). IEEE Trans Geosci Remote Sens 36(4):1062–1071

    Article  Google Scholar 

  • Yang P, Zhong Z, Du L, Li F, Li H, Hu K, Chen Z, Zhu Y, Zhang W, Huang F, Ye X, Wang C, Ye Z, Qi J, Dong H, Li X, Nguyen QD., Han Y, Kijlstra A (2018) Visual Outcome and Development of a Decision Tree Algorithm for Predicting Systemic Diseases Associated with Uveitis. Available at SSRN: https://ssrn.com/abstract=3294108

  • Yuksel ME, Basturk NS, Badem H, Caliskan A, Basturk A (2018) Classification of high resolution hyperspectral remote sensing data using deep neural networks. J Intell Fuzzy Syst 34(4):2273–2285

    Article  Google Scholar 

  • Zhang D (2019) Support vector machine. In fundamentals of image data mining. Springer, Cham, pp 179–205

    Book  Google Scholar 

  • Zhang P, Yin ZY, Jin YF, Chan TH, Gao FP (2020) Intelligent modelling of clay compressibility using hybrid meta-heuristic and machine learning algorithms. Geosci Front. https://doi.org/10.1016/j.gsf.2020.02.014

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to D. Jude Hemanth.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Radhakrishnan, S., Lakshminarayanan, A.S., Chatterjee, J.M. et al. Forest data visualization and land mapping using support vector machines and decision trees. Earth Sci Inform 13, 1119–1137 (2020). https://doi.org/10.1007/s12145-020-00492-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12145-020-00492-3

Keywords

Navigation