Skip to main content

Ensemble of Fully Convolutional Neural Networks with End-to-End Learning for Small Object Semantic Segmentation

  • Conference paper
  • First Online:
Robot Intelligence Technology and Applications 7 (RiTA 2022)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 642))

  • 768 Accesses

Abstract

Small object segmentation is important due to its vast application potential for biomedical and surveillance. This paper describes several ensemble methods that combine different fully convolutional neural networks (FCNNs) for accurate semantic segmentation of aerial images. One problem with a single model for accurate small object segmentation is that the small object classes are suppressed by large object classes, which can increase overfitting problems and hinder generalization performance. Furthermore, using the single model for segmentation has particular strengths and weaknesses, and the segmentation accuracy changes with the different features used for training. Thus, this paper describes several ensemble methods that combine multiple fully convolutional networks for accurate semantic segmentation of unmanned aerial vehicles (UAVs) images. The proposed method starts with performing object localization for each single FCNN model. After that, each model’s channels of feature map outputs are combined to guide the training of the ensemble segmentation network. We have performed experiments on UAVid benchmark dataset. The results show that the proposed ensemble approach outperforms the single model that directly uses the channels of the single model’s feature map outputs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abdullah, A., Ting, W.E.: Orientation and scale based weights initialization scheme for deep convolutional neural networks. Asia-Pac. J. Inf. Technol. Multimed. 9, 103–112 (2020)

    Google Scholar 

  2. Albashish, D.: Ensemble of adapted convolutional neural networks (CNN) methods for classifying colon histopathological images. PeerJ Comput. Sci. 8, e1031 (2022)

    Article  Google Scholar 

  3. Albashish, D., Al-Sayyed, R.M.H., Abdullah, A., Ryalat, M.H., Almansour, N.A.: Deep CNN model based on VGG16 for breast cancer classification. In: 2021 International Conference on Information Technology (ICIT), pp. 805–810 (2021)

    Google Scholar 

  4. Brown, G., Wyatt, J.L., Harris, R., Yao, X.: Diversity creation methods: a survey and categorisation. Inf. Fusion 6, 5–20 (2005)

    Article  Google Scholar 

  5. Corinna Cortes and Vladimir Naumovich Vapnik: Support-vector networks. Mach. Learn. 20, 273–297 (2004)

    MATH  Google Scholar 

  6. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), vol. 1, pp. 886–893 (2005)

    Google Scholar 

  7. Dong, R., Pan, X., Li, F.: DenseU-net-based semantic segmentation of small objects in urban remote sensing images. IEEE Access 7, 65347–65356 (2019)

    Article  Google Scholar 

  8. Dong, Y., Du, B., Zhang, L.P.: Target detection based on random forest metric learning. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 8, 1830–1838 (2015)

    Article  Google Scholar 

  9. Ibrahim, Z., Diah, N.M., Azmi, M.E., Abdullah, A., Zin, N.A.M.: Real-time mobile application for handwritten digit recognition using MobileNet. In: Mahyuddin, N.M., Mat Noor, N.R., Mat Sakim, H.A. (eds.) Proceedings of the 11th International Conference on Robotics, Vision, Signal Processing and Power Applications. LNEE, vol. 829, pp. 1003–1008. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-8129-5_153

    Chapter  Google Scholar 

  10. Inglada, J.: Automatic recognition of man-made objects in high resolution optical remote sensing images by SVM classification of geometric image features. ISPRS J. Photogramm. Remote. Sens. 62, 236–248 (2007)

    Article  Google Scholar 

  11. Kisantal, M., Wojna, Z., Murawski, J., Naruniec, J., Cho, K.: Augmentation for small object detection. ArXiv, abs/1902.07296 (2019)

    Google Scholar 

  12. Li, M., Zang, S., Zhang, B., Li, S., Changshan, W.: A review of remote sensing image classification techniques: the role of spatio-contextual information. Eur. J. Remote Sens. 47, 389–411 (2014)

    Article  Google Scholar 

  13. Lin, T.-Y., Dollár, P., Girshick, R.B., He, K., Hariharan, B., Belongie, S.J.: Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936–944 (2017)

    Google Scholar 

  14. Noh, H., Hong, S., Han, B.: Learning deconvolution network for semantic segmentation. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1520–1528 (2015)

    Google Scholar 

  15. Ojala, T., Pietikäinen, M., Mäenpää, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24, 971–987 (2002)

    Article  MATH  Google Scholar 

  16. Ozer, S., et al.: Supervised and unsupervised methods for prostate cancer segmentation with multispectral MRI. Med. phys. 37(4), 1873–83 (2010)

    Article  Google Scholar 

  17. Pires, C., Damas, B.D., Bernardino, A.: An efficient cascaded model for ship segmentation in aerial images. IEEE Access 10, 31942–31954 (2022)

    Article  Google Scholar 

  18. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. ArXiv, abs/1505.04597 (2015)

    Google Scholar 

  19. Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3431–3440 (2015)

    Google Scholar 

  20. Siswantoro, J., Prabuwono, A.S., Abdullah, A., bin Idrus, B.: Automatic image segmentation using sobel operator and k-means clustering: a case study in volume measurement system for food products. In: 2015 International Conference on Science in Information Technology (ICSITech), pp. 13–18 (2015)

    Google Scholar 

  21. Tan, M., Le, Q.V.: EfficientNet: rethinking model scaling for convolutional neural networks. ArXiv, abs/1905.11946 (2019)

    Google Scholar 

  22. Trémeau, A., Borel, N.: A region growing and merging algorithm to color segmentation. Pattern Recognit. 30, 1191–1203 (1997)

    Article  Google Scholar 

  23. Viola, P.A., Jones, M.J.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, vol. 1, p. I (2001)

    Google Scholar 

  24. Wu, G., et al.: A stacked fully convolutional networks with feature alignment framework for multi-label land-cover segmentation. Remote. Sens. 11, 1051 (2019)

    Article  Google Scholar 

  25. Wu, G., et al.: Automatic building segmentation of aerial imagery using multi-constraint fully convolutional networks. Remote. Sens. 10, 407 (2018)

    Article  Google Scholar 

  26. Mushrif, M.M., Dubey, Y.K.: FCM clustering algorithms for segmentation of brain MR images. Adv. Fuzzy Syst. 2016, 1–14 (2016)

    MathSciNet  Google Scholar 

  27. Zhang, Z.V., Tang, M., Cobzas, D., Zonoobi, D., Jägersand, M., Jaremko, J.L.: End-to-end detection-segmentation network with ROI convolution. In: 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), pp. 1509–1512 (2018)

    Google Scholar 

Download references

Acknowledgment

This work has been supported by the University Kebangsaan Malaysia Research Grant GUP, GUP-2021-063 and Malaysia’s Ministry of Higher Education Fundamental Research Grant FRGS/1/2019/ICT02/UKM/02/8.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Azizi Abdullah .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lam, K.L., Abdullah, A., Albashish, D. (2023). Ensemble of Fully Convolutional Neural Networks with End-to-End Learning for Small Object Semantic Segmentation. In: Jo, J., et al. Robot Intelligence Technology and Applications 7. RiTA 2022. Lecture Notes in Networks and Systems, vol 642. Springer, Cham. https://doi.org/10.1007/978-3-031-26889-2_12

Download citation

Publish with us

Policies and ethics