Skip to main content

Spatial-SMOTE: An Approach for Handling Class Imbalance in Spatial Time Series Data

  • Conference paper
  • First Online:
Pattern Recognition and Machine Intelligence (PReMI 2021)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13102))

  • 223 Accesses

Abstract

Data imbalance handling is important to have unbiased learning during model training of classification tasks. Synthetic minority over-sampling technique (SMOTE) is primarily used for data imbalance handling. Conventional SMOTE algorithm and its variants mainly deal with upsampling the class data that is based solely on the amplitude values of features and their neighbors. However, the spatio-temporal data corresponding to satellite remote sensing images, comes with the additional location information, i.e. longitude and latitude. This has to be incorporated in the data upsampling case in order to have semantically and physically useful data. Hence, we propose a new pipeline named, ‘Spatial-SMOTE’ to upsample the data by retaining the significance of spatial distribution aspect in the overall upsampling process. The effectiveness of this approach is shown on land-use land-cover classification task using the time series data for a particular study area. We identified the relation between different classes based on their semantic distances and formulated two cases- one with high semantic distance and other with low. It is to be observed that the proposed method of Spatial-SMOTEing the minority class works well for both the cases. We also tested the effectiveness of the proposed approach over synthetically induced class imbalances for both low and high semantically differing classes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Talukdar, S., et al.: Land-use land-cover classification by machine learning classifiers for satellite observation - a review. Remote Sens. 12(7), 1135 (2020)

    Article  Google Scholar 

  2. Thabtah, F., et al.: Data imbalance in classification: experimental evaluation. Inf. Sci. 513, 429–441 (2020)

    Article  MathSciNet  Google Scholar 

  3. Chawla, N.V., et al.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)

    Article  Google Scholar 

  4. Galar, M., et al.: A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches. IEEE Trans. SMC, Part C 42(4), 463–484 (2011)

    Google Scholar 

  5. He, H.: Garcia: learning from imbalanced data. TKDE 21(9), 1263–1284 (2009)

    Google Scholar 

  6. Han, H., et al: Borderline-SMOTE: a new over-sampling method in imbalanced data sets learning. In: International Conference on Intelligent Computing, pp. 878–887 (2005)

    Google Scholar 

  7. Tang, B., He, H.: KernelADASYN: kernel based adaptive synthetic data generation for imbalanced learning. In: CEC, pp. 664–671 (2015)

    Google Scholar 

  8. Barua, S., et al.: MWMOTE-majority weighted minority oversampling technique for imbalanced data set learning. TKDE 26(2), 405–425 (2012)

    Google Scholar 

  9. Vega, C., et al.: A new small area estimation algorithm to balance between statistical precision and scale. IJAEOG 97, 102303 (2021)

    Google Scholar 

  10. Creswell, A., et al.: Generative adversarial networks: an overview. IEEE Signal Process. Maga. 35(1), 53–65 (2018)

    Article  Google Scholar 

  11. Bhattacharjee, S., Ghosh, S.K., Chen, J.: Spatial interpolation. In: Semantic Kriging for Spatio-temporal Prediction. SCI, vol. 839, pp. 19–41. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-8664-0_2

    Chapter  Google Scholar 

  12. Musin, O.R.: Properties of the delaunay triangulation. In: 13th Annual Symposium on Computational Geometry, pp. 424–426 (1997)

    Google Scholar 

  13. Shukla, K., et al.: Mapping spatial distribution of particulate matter using kriging and inverse distance weighting at supersites of megacity Delhi. Sustain. Cities Soc. 54, 101997 (2020)

    Article  Google Scholar 

  14. TiSeLaC dataset. https://www.timeseriesclassification.com/description.php?Dataset=Tiselac. Accessed 16 June 2020

  15. Das, G.: SemBnet: a semantic bayesian network for multivariate prediction of meteorological time series data. Pattern Recogn. Lett. 93, 192–201 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rahul Dasharath Gavas .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gavas, R.D., Ghosh, S.K., Pal, A. (2024). Spatial-SMOTE: An Approach for Handling Class Imbalance in Spatial Time Series Data. In: Ghosh, A., King, I., Bhattacharyya, M., Sankar Ray, S., K. Pal, S. (eds) Pattern Recognition and Machine Intelligence. PReMI 2021. Lecture Notes in Computer Science, vol 13102. Springer, Cham. https://doi.org/10.1007/978-3-031-12700-7_50

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-12700-7_50

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-12699-4

  • Online ISBN: 978-3-031-12700-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics