Ensemble of Fully Convolutional Neural Networks with End-to-End Learning for Small Object Semantic Segmentation

Lam, Ken Lun; Abdullah, Azizi; Albashish, Dheeb

doi:10.1007/978-3-031-26889-2_12

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 642)

Included in the following conference series:

International Conference on Robot Intelligence Technology and Applications

Abstract

Small object segmentation is important because of its wide range of potential applications in biomedical imaging and surveillance. A single model struggles to segment small objects accurately because the small object classes are suppressed by large object classes, which can worsen overfitting and hinder generalization. In addition, each single model has its own strengths and weaknesses, and its segmentation accuracy varies with the features used for training. This paper therefore describes several ensemble methods that combine multiple fully convolutional neural networks (FCNNs) for accurate semantic segmentation of unmanned aerial vehicle (UAV) images. The proposed method first performs object localization with each individual FCNN model. The channels of each model's feature map outputs are then combined to guide the training of the ensemble segmentation network. Experiments on the UAVid benchmark dataset show that the proposed ensemble approach outperforms a single model that directly uses the channels of its own feature map outputs.
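The abstract does not say how the channels of the individual models' feature maps are combined. As a rough illustration only, the sketch below assumes the simplest option: concatenate the per-class score channels of each model and fuse them with a 1x1 convolution, whose weights would be learned end to end in a real system. All function names, the toy score maps, and the fixed averaging weights are hypothetical and are not taken from the paper.

```python
# Hypothetical sketch: fuse per-class score maps from several segmentation
# models by channel concatenation followed by a 1x1 convolution.
# Score maps are nested lists shaped [channels][height][width].

def concat_channels(score_maps):
    """Stack the channels of every model's output into one feature map."""
    fused = []
    for model_output in score_maps:
        fused.extend(model_output)
    return fused


def conv1x1(feature_map, weights, bias):
    """Apply a 1x1 convolution: a per-pixel linear map over channels.

    weights is shaped [out_channels][in_channels]; bias is [out_channels].
    """
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    output = []
    for w_row, b in zip(weights, bias):
        plane = [
            [
                b + sum(w * channel[y][x] for w, channel in zip(w_row, feature_map))
                for x in range(width)
            ]
            for y in range(height)
        ]
        output.append(plane)
    return output


def argmax_labels(score_map):
    """Pick the highest-scoring class at every pixel."""
    height = len(score_map[0])
    width = len(score_map[0][0])
    return [
        [
            max(range(len(score_map)), key=lambda c: score_map[c][y][x])
            for x in range(width)
        ]
        for y in range(height)
    ]


# Two toy "models", each emitting 2 class channels on a 2x2 image.
model_a = [[[0.9, 0.2], [0.6, 0.4]],   # class 0 scores
           [[0.1, 0.8], [0.4, 0.6]]]   # class 1 scores
model_b = [[[0.7, 0.6], [0.3, 0.1]],
           [[0.3, 0.4], [0.7, 0.9]]]

stacked = concat_channels([model_a, model_b])  # 4 channels in total

# Assumption: in an end-to-end ensemble these weights would be trained.
# Here they are fixed to a plain average of the matching class channels.
weights = [[0.5, 0.0, 0.5, 0.0],
           [0.0, 0.5, 0.0, 0.5]]
bias = [0.0, 0.0]

fused_scores = conv1x1(stacked, weights, bias)
labels = argmax_labels(fused_scores)
print(labels)
```

In this toy case the two models disagree at two of the four pixels, and the fused scores settle each disagreement in favour of the more confident model. A trained fusion layer could instead learn class-specific weights, for example trusting one model more on small object classes.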

Acknowledgment

This work was supported by Universiti Kebangsaan Malaysia Research Grant GUP-2021-063 and by the Malaysian Ministry of Higher Education Fundamental Research Grant FRGS/1/2019/ICT02/UKM/02/8.

Author information

Authors and Affiliations

Center for Artificial Intelligence Technology, Universiti Kebangsaan Malaysia, 43600, Bandar Baru Bangi, Malaysia
Ken Lun Lam & Azizi Abdullah
Computer Science Department, Prince Abdullah bin Ghazi Faculty of Information and Communication Technology, Al-Balqa Applied University, Salt, Jordan
Dheeb Albashish

Corresponding author

Correspondence to Azizi Abdullah.

Editor information

Editors and Affiliations

School of Information and Communication Technology, Griffith University, Southport, Australia
Jun Jo
Department of Aerospace Engineering, KAIST, Daejeon, Korea (Republic of)
Han-Lim Choi
School of Information and Communication Technology, Griffith University, Southport, Australia
Marde Helbig
Department of Mechanical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, Korea (Republic of)
Hyondong Oh
Department of Mechanical Engineering, KAIST, Daejeon, Korea (Republic of)
Jemin Hwangbo
Department of Mechanical Engineering, KAIST, Daejeon, Korea (Republic of)
Chang-Hun Lee
School of Information and Communication Technology, Griffith University, Southport, Australia
Bela Stantic

About this paper

Cite this paper

Lam, K.L., Abdullah, A., Albashish, D. (2023). Ensemble of Fully Convolutional Neural Networks with End-to-End Learning for Small Object Semantic Segmentation. In: Jo, J., et al. Robot Intelligence Technology and Applications 7. RiTA 2022. Lecture Notes in Networks and Systems, vol 642. Springer, Cham. https://doi.org/10.1007/978-3-031-26889-2_12

DOI: https://doi.org/10.1007/978-3-031-26889-2_12
Published: 01 March 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26888-5
Online ISBN: 978-3-031-26889-2
eBook Packages: Intelligent Technologies and Robotics (R0)
