Abstract
Geographical thematic mapping based on spatial information can effectively support scientific decision-making in Geosciences. To obtain finer spatial decision information, this paper proposes a geo-parcel-based thematic mapping methodology for evaluating cash crop planting suitability using C5.0 decision tree (DT). In this study, geo-parcels are utilized as basic mapping units. Multi-source data are firstly employed to increase geo-parcel units’ attributes and a decision table then is constructed under a multi-attribute index system. Next, rules are mined using a C5.0 DT algorithm according to local geo-parcels in this decision table. Finally, rules are referred as thematic-distinguishing knowledge for inferential mapping in global geo-parcels. A case study of sugarcane planting suitability evaluation is conduct based on the proposed methodology. The experimental results showed that the cross-validation accuracy of the rules is 81.34% and the sum of the very suitable area and suitable area in the generated evaluation map is close to that of historical selected high-yield and high-sugar-content sugarcane bases, which indicated that the mapping result is in good agreement with the actual selection situation. These also demonstrate the effectiveness of our method and thus may be extended to other domains requiring fine geographical thematic mapping of cash crop planting suitability.
Similar content being viewed by others
References
Atijosan A, Muibi K, Ogunyemi S, Adewoyin J, Badru R, Alaga A, Shaba A (2015) Agricultural land suitability assessment using fuzzy logic and geographic information system techniques. Int J Scientific Res Sci Technol 1(5):113–118
Bagherzadeh A, Gholizadeh A (2017) Parametric-based neural networks and TOPSIS modeling in land suitability evaluation for alfalfa production using GIS. Model Earth Syst Environ 3(1):2
Banai R (1993) Fuzziness in geographical information systems: contributions from the analytic hierarchy process. Int J Geogr Inf Sci 7:315–329
Bargiela A, Pedrycz W (2002) Granular computing: an introduction. Kluwer Academic Publishers, Dordrecht
Baroudy AAE (2016) Mapping and evaluating land suitability using a GIS-based model. Catena 140:96–104
Barros RC, Basgalupp MP, de Carvalho ACPLF, Freitas AA (2012) A survey of evolutionary algorithms for decision-tree induction. IEEE Trans Syst Man Cybern C 42:291–312
Blaschke T (2010) Object based image analysis for remote sensing. ISPRS J Photogramm Remote Sens 65:2–16
Blaschke T, Lang S, Lorup E, Strobl J, Zeil P (2008) Object-oriented image processing in an integrated GIS/remote sensing environment and perspectives for environmental applications. In: Cremers A, Greve K (eds) Environmental information for planning, politics and the public. Metropolis Verlag, Marburg, pp 555–570
Blaschke T, Strobl J (2001) What’s wrong with pixels? Some recent developments interfacing remote sensing and GIS. GIS-Zeitschrift für Geoinformations Systeme 14(6):12–17
Blaschke T, Hay GJ, Kelly M, Lang S, Hofmann P, Addink E, Feitosa RQ, van der Meer F, van der Werff H, van Coillie F, Tiede D (2014) Geographic object-based image analysis: towards a new paradigm. ISPRS J Photogramm Remote Sens 87:180–191
Bozdağ A, Yavuz F, Günay AS (2016) AHP and GIS based land suitability analysis for Cihanbeyli (Turkey) county. Environ Earth Sci 75(9):1–15
Breiman L (2001) Random forests. Mach Learn 45:5–32
Burrough PA, McDonnell RA (1998) Principles of geographical information systems. Oxford University Press, Oxford
Carver SJ (1991) Integrating multi-criteria evaluation with geographical information systems. Int J Geogr Inf Sci 5:321–339
Chakhar S, Mousseau V (2008) Spatial multicriteria decision making. Int J Geogr Inf Sci 22:175–191
Chen TQ, Guestrin C (2016) XGBoost: A scalable tree boosting system. ArXiv e-prints. https://arxiv.org/abs/1603.02754
Chen Y, Yu J, Khan S (2010) Spatial sensitivity analysis of multi-criteria weights in GIS-based land suitability evaluation. Environ Model Softw 25:1582–1591
Eastman JR, Kyem PAK, Toledano J (1993) GIS and decision making. The United Nations Institute for Training and Research
Eastman JR (1997) Idrisi for Windows (Version 2.0): Tutorial Exercises. Graduate School of Geography-Clark University, Worcester County, Massachusetts
Elaalem M, Comber A, Fisher P (2011) A comparison of fuzzy AHP and ideal point methods for evaluating land suitability. Trans GIS 15(3):329–346
Friedl MA, Brodley CE (1997) Decision tree classification of land cover from remotely sensed data. Remote Sens Environ 61:399–409
Hailu AH, Kibret K, Gebrekidan H (2013) Land suitability evaluation for rainfed production of barley and wheat at Kabe subwatershed, northeastern Ethiopia. Am J Res Commun 1:296–318
Hastie T, Tibshirani R, Friedman J (2009) Elements of statistical learning: data mining, inference and prediction, 2nd edn. Springer, Berlin
He YB, Luca O, Wang YM (2009) Land suitability evaluation of food crops in vulnerably ecological area based up-scaling method. Agric Sci Technol 10(6):168–174
Hengl T, Mendes de Jesus J, Heuvelink GBM, Ruiperez-Gonzalez M, Kilibarda M, Blagotić A, Wei SG, Wright MN, Geng X, Bauer-Marschallinger B, Guevara MA, Vargas R, RMacMillan RA, Batjes NH, Leenaars JG, Ribeiro E, Wheeler I, Mantel S, Kempen B (2017) SoilGrids250m: global gridded soil information based on machine learning. PLoS One 12(2):e0169748
Heuvelink GBM, Brus D, Hengl T, Kempen B, Leenaars RGM (2016) Uncertainty quantification of interpolated maps derived from observations with different accuracy levels. In Proceedings of Spatial Accuracy 2016, Montpellier, 44–51
Heywood I, Oliver J, Tomlinson S (1995) Building an exploratory multi-criteria modeling environment for spatial decision support. In Proceedings of EGIS/MARI’ 94. Paris, France, 632–639
James G, Witten D, Hastie T, Tibshirani R (2013) An introduction to statistical learning with applications in R. Springer, New York, Heidelberg, Dordrecht and Lon-don
Jankowski P (1995) Integrating geographical information systems and multiple criteria decision making methods. Int J Geogr Inf Sci 9:251–273
Jansson J (2016) Decision tree classification of products using C5.0 and prediction of workload using time series analysis (Ph.D Thesis). KTH Royal Institute of Technology, Stockholm, Sweden
Jiang H, Eastman JR (2000) Application of fuzzy measures in multi-criteria evaluation in GIS. Int J Geogr Inf Sci 14:173–184
John H, Retchie JT (1991) Modeling plant and soil systems. Madison, Wisconsin, Society of Agronomy, Crop Science Society of American and Soil Science Society of America
Karger DN, Conrad O, Böhner J, Kawohl T, Kreft H, Soria-Auza RW, Zimmermann N, Linder HP, Kessler M (2016) Climatologies at high resolution for the earth’s land surface areas. World Data Center for Climate, http://www.wdc-climate.de
Karpatne A, Jiang Z, Vatsavai RR, Shekhar S (2016) Monitoring land-cover changes: a machine-learning perspective. IEEE Geosc Rem Sen M 4:8–21
Khaleghi B, Khamis A, Karray FO (2013) Multisensor data fusion: a review of the state-of-the-art. Inform Fusion 14:28–44
Kheir RB, Greve MH, Bøcher PK, Greve MB, Larsen R, McCloy K (2010) Predictive mapping of soil organic carbon in wet cultivated lands using classification-tree based models: the case study of Denmark. J Environ Manag 91:1150–1160
Li YR (2010) Modern sugarcane science. China Agriculture Press, Beijing
Lodwick WA, Monson W, Svoboda L (1990) Attribute error and sensitivity analysis of map operations in geographical information systems: suitability analysis. Int J Geogr Inform Syst 4:413–428
Malczewski J (1999) GIS and muti-criteria decision analysis. Wiley, New York
Malczewski J (2006) GIS-based multicriteria decision analysis: a survey of the literature. Int J Geogr Inf Sci 20:703–726
Mennis J, Guo D (2009) Spatial data mining and geographic knowledge discovery: an introduction. Comput Environ Urban 33:403–408
Mosadeghi R, Warnken J, Tomlinson R, Mirfenderesk H (2015) Comparison of fuzzy-AHP and AHP in a spatial multi-criteria decision making model for urban land-use planning. Comput Environ Urban Syst 49:54–65
Openshaw S, Abrahart RJ (2000) Geo-computation. Taylor & Francis, Oxford
Pal M, Mather PM (2003) An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens Environ 86:554–565
Pereira JMC, Duckstein L (2007) A multiple criteria decision-making approach to GIS-based land suitability evaluation. Int J Geogr Inf Sci 7:407–424
Qi F, Zhu AX (2003) Knowledge discovery from soil maps using inductive learning. Int J Geogr Inf Sci 17:771–795
Qi F, Zhu AX (2011) Comparing three methods for modeling the uncertainty in knowledge discovery from area-class soil maps. Comput Geosci 37:1425–1436
Qin ZL, Kong LZ, Li XH, Mo XX (2015) Analysis on the competitiveness of Guangxi sugarcane and cane sugar industry. J South Agr 46(4):722–728
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann Publishers, San Francisco
Quinlan JR (2001) See5: An informal tutorial. http://www.rulequest.com (verified November 29, 2012)
Reshmidevi TV, Eldho TI, Jana R (2009) A GIS-integrated fuzzy rule-based inference system for land suitability evaluation in agricultural watersheds. Agric Syst 101(1):101–109
Romano G, Sasso PD, Liuzzi GT, Gentile F (2015) Multi-criteria decision analysis for land suitability mapping in a rural area of southern Italy. Land Use Policy 48:131–143
Romero A, Gatta C, Camps-Valls G (2016) Unsupervised deep feature extraction for remote sensing image classification. IEEE Trans Geosci Remote Sens 54(3):1349–1362
Saaty TL (1980) The analytic hierarchy process. McGraw-Hill, New York
Shen KY, Tzeng GH (2015) A decision rule-based soft computing model for supporting financial performance improvement of the banking industry. Soft Comput 19(4):859–874
Sultan D (2013) Assessment of irrigation land suitability and development of map for the fogera catchment using GIS, South Gondar. Asian J Agri Rural Dev 3:7–17
Tang Y, Jin B, Zhang YQ (2005) Granular support vector machines with association rules mining for protein homology prediction. Artif Intell Med 35:121–134
Taghizadeh-Mehrjardi R, Sarmadian F, Minasny B, Triantafilis J, Omid M (2014) Digital mapping of soil classes using decision tree and auxiliary data in the Ardakan region, Iran. Arid Land Res Manag 28:147–168
Whiteside TG, Boggs GS, Maier SW (2011) Comparing object-based and pixel-based classifications for mapping savannas. Int J Appl Earth Obs Geoinf 13:884–893
Wu WZ, Leung Y (2011) Theory and applications of granular labelled partitions in multi-scale decision tables. Inform Sciences 181:3878–3897
Xie GX, Zeng ZK, Li YX, Qin ZL, Lan ZB, Zhang XL, Xie FQ, Su QQ, Zhang JM (2017a) Sugarcane planting suitability evaluation based on block unit. J South Agr 48(2):361–367
Xie J, Yang M, Li J, Zheng Z (2017b) Rule acquisition and optimal scale selection in multi-scale formal decision contexts and their applications to smart city. Future Gener Comp Sy 73(1):1–30
Yang YP, Huang QT, Wu W, Luo JC, Gao LJ, Dong W, Wu TJ, Hu XD (2017) Geo-parcel based crop identification by integrating high spatial-temporal resolution imagery from multi-source satellite data. Remote Sens 9(12):1298 1-20
Yao JT, Yao YY (2002) A granular computing approach to machine learning. In Proceedings of the 1st International Conference on Fuzzy Systems and Knowledge Discovery (FSKD’02 FSKD’02), Yishun, Singapore, 732–736
Zheng Y (2015) Methodologies for cross-domain data fusion: an overview. IEEE Trans Big Data 1:16–34
Zhou J, Civco DL (1996) Using genetic learning neural networks for spatial decision making in GIS. Photogramm Eng Rem S 62:1287–1295
Zhu AX (1999) A personal construct-based knowledge acquisition process for natural resource mapping. Int J Geogr Inf Sci 13(2):119–141
Zhu AX (2008) Rule-based mapping. The Handbook of Geographic Information Science (Wilson JP and Fotheringham AS eds.), Blackwell Publishing, Oxford, 273–291
Zhu AX, Zhang GM, Wang W, Xiao W, Huang ZP, Dunzhu GS, Ren GP, Qin CZ, Yang L, Pei T, Yang ST (2015) A citizen data-based approach to predictive mapping of spatial variation of natural phenomena. Int J Geogr Inf Sci 29:1864–1886
Zhu AX, Lawrence EB, Barry D, Thomas JN (1996) Automated soil inference under fuzzy logic. Ecol Model 90(2):123–145
Acknowledgements
This work was partially funded by the National Natural Science Foundation of China (Grant No: 41631179, 41601437); National Key Research and Development Program (Grant No: 2017YFB0503600); Natural Science Basic Research Plan in Shaanxi Province of China (Grant No: 2017JQ4002); Open Projects of Key Laboratory of Spatial Data Mining & Information Sharing of Ministry of Education, Fuzhou University (Grant No: 2018LSDMIS03), and State Key Laboratory of Geo-information Engineering (No. SKLGIE2017-Z-4-3); Special Fund for Basic Scientific Research of Central Colleges in Chang’an University (Grant No: 310812163504).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by: Hassan Babaie
Rights and permissions
About this article
Cite this article
Wu, T., Dong, W., Luo, J. et al. Geo-parcel-based geographical thematic mapping using C5.0 decision tree: a case study of evaluating sugarcane planting suitability. Earth Sci Inform 12, 57–70 (2019). https://doi.org/10.1007/s12145-018-0360-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-018-0360-8