An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet

Chen, Bolin; Zhao, Tiesong; Zhou, Liping; Yang, Jing; Liu, Jiahui; Lin, Liqun

doi:10.1007/s00034-021-01784-9

An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet

Published: 14 July 2021

Volume 41, pages 372–394, (2022)
Cite this article

Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Bolin Chen¹,
Tiesong Zhao^1,2,
Liping Zhou¹,
Jing Yang¹,
Jiahui Liu¹ &
…
Liqun Lin¹

346 Accesses
1 Citation
Explore all metrics

Abstract

These past six years have witnessed that segmentation algorithms make a breakthrough due to the development of deep-based semantic segmentation networks, which can realize classification from image-level to pixel-level. However, when segmenting a small-batch, complex-features and obvious-gap database, these semantic segmentation networks will lead to segmentation ambiguities in some boundaries between foreground (i.e. the main segmentation body) and its background with limited image detail expression. To solve such problem, this paper proposes an optimized segmentation scheme for ambiguous pixels based on Atrous-ResFCN and MFR-DenseNet, namely ARMD. Firstly, masking algorithm and adaptive box mechanism are applied in pre-processed KITTI segmentation database to construct the classification database, which will be used to train ambiguities judgment model. In addition, Atrous-ResFCN-8s and Atrous-ResFCN-16s are proposed and further combined to determine segmentation ambiguities with different-level segmentation abilities. Finally, MFR-DenseNet is further migrated to optimize these ambiguous pixels with effective threshold selection. Experimental results demonstrate that our proposed ARMD algorithm is beneficial to improve segmentation accuracy, where the highest MIoU and PA are able to reach 88.18% and 95.88%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 5

Fig. 6

SFSM: sensitive feature selection module for image semantic segmentation

Article 26 September 2022

Image Semantic Segmentation Based on Fully Convolutional Neural Network and CRF

Object Boundary Guided Semantic Segmentation

Data Availability Statement

Some or all data, models or code generated or used during this research are available from the corresponding author by request.

References

R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua, S. Susstrunk, Slic superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2274–2282 (2012)
Article Google Scholar
R. Adams, L. Bischof, Seeded region growing. IEEE Trans. Pattern Anal. Mach. Intell. 16(6), 641–647 (1994)
Article Google Scholar
V. Badrinarayanan, A. Kendall, R. Cipolla, Segnet: a deep convolutional encoder–decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Article Google Scholar
L. Bottou, Large-scale machine learning with stochastic gradient descent, in Proceedings of COMPSTAT’2010, pp. 177–186 (2010)
Y. Boykov, O. Veksler, R. Zabih, Fast approximate energy minimization via graph cuts, in Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 1, pp. 377–384 (1999)
J. Canny, A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 8(6), 679–698 (1986)
Article Google Scholar
B. Chen, T. Zhao, J. Liu, L. Lin, Multipath feature recalibration densenet for image classification. Int. J. Mach. Learn. Cybern. 12, 651–660 (2021)
Article Google Scholar
H. Chen, Y. Li, D. Su, Multi-modal fusion network with multi-scale multi-path and cross-modal interactions for rgb-d salient object detection. Pattern Recognit. 86, 376–385 (2019)
Article Google Scholar
L. Chen, G. Papandreou, I. Kokkinos, K. Murphy, A.L. Yuille, Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2018)
Article Google Scholar
L. Chen, G. Papandreou, F. Schroff, H. Adam, Rethinking atrous convolution for semantic image segmentation. CoRR (2017). arXiv:1706.05587
L. Chen, Y. Zhu, G. Papandreou, F. Schroff, H. Adam, Encoder–decoder with atrous separable convolution for semantic image segmentation, in Proceedings of European Conference on Computer Vision(ECCV), pp. 833–851 (2018)
D. Comaniciu, P. Meer, Mean shift analysis and applications, in Proceedings of the Seventh IEEE International Conference on Computer Vision, vol. 2, pp. 1197–1203 (1999)
A. Geiger, P. Lenz, R. Urtasun, Are we ready for autonomous driving? The kitti vision benchmark suite, in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 3354–3361 (2012)
R.M. Haralick, L.G. Shapiro, Image segmentation techniques. Comput. Vis. Graph. Image Process. 29(1), 100–132 (1985)
Article Google Scholar
R.M. Haralick, L.G. Shapiro, Survey: Image segmentation techniques. Comput. Vis. Graph. Image Process. 29(1), 100–132 (1985)
Article Google Scholar
J.A. Hartigan, M.A. Wong, A k-means clustering algorithm. Appl. Stat. 28(1), 1979 (1979)
Article Google Scholar
K. He, X. Zhang, S. Ren, J. Sun, Delving deep into rectifiers: surpassing human-level performance on imagenet classification, in Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 1026–1034 (2015)
K. He, X. Zhang, S. Ren, J. Sun, Deep residual learning for image recognition, in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)
J. Hu, L. Shen, G. Sun, Squeeze-and-excitation networks, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141 (2018)
G. Huang, Z. Liu, L.V. Der Maaten, K.Q. Weinberger, Densely connected convolutional networks, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2261–2269 (2017)
A. Krizhevsky, I. Sutskever, G.E. Hinton, Imagenet classification with deep convolutional neural networks, in Neural Information Processing Systems, pp. 1097–1105 (2012)
J. Long, E. Shelhamer, T. Darrell, Fully convolutional networks for semantic segmentation, in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition(CVPR), pp. 3431–3440 (2015)
A. Mikolajczyk, M. Grochowski, Data augmentation for improving deep learning in image classification problem, in International Interdisciplinary Phd Workshop, pp. 117–122 (2018)
W.K. Newey, Adaptive estimation of regression models via moment restrictions. J. Econom. 38(3), 301–339 (1988)
Article MathSciNet Google Scholar
N.R. Pal, S.K. Pal, A review on image segmentation techniques. Pattern Recognit. 26(9), 1277–1294 (1993)
Article Google Scholar
O. Ronneberger, P. Fischer, T. Brox, T.: U-net: Convolutional networks for biomedical image segmentation, in Medical Image Computing and Computer Assisted Intervention, pp. 234–241 (2015)
C. Rother, V. Kolmogorov, A. Blake, Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. 23(3), 309–314 (2004)
Article Google Scholar
J. Shi, J. Malik, Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 888–905 (2000)
Article Google Scholar
K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale image recognition, in 3rd International Conference on Learning Representations ICLR (2015)
M. Tang, L. Gorelick, O. Veksler, Y. Boykov, Grabcut in one cut, in 2013 IEEE International Conference on Computer Vision, pp. 1769–1776 (2013)
A.S. Ventura, J.A. Borrego, S. Solorza, Adaptive nonlinear correlation with a binary mask invariant to rotation and scale. Opt. Commun. 339, 185–193 (2015)
Article Google Scholar
S. Vicente, V. Kolmogorov, C. Rother, Graph cut based image segmentation with connectivity priors, in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2008)
L. Xu, W. Li, D. Schuurmans, Fast normalized cut with linear constraints, in 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2866–2873 (2009)
H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, Pyramid scene parsing network, in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR), pp. 6230–6239 (2017)
S. Zhu, X. Xia, Q. Zhang, K. Belloulata, An image segmentation algorithm in image processing based on threshold segmentation, in 2007 Third International IEEE Conference on Signal-Image Technologies and Internet-Based System, pp. 673–678 (2007)

Download references

Author information

Authors and Affiliations

Fujian Key Lab for Intelligent Processing and Wireless Transmission of Media Information, College of Physics and Information Engineering, Fuzhou University, Fuzhou, Fujian, China
Bolin Chen, Tiesong Zhao, Liping Zhou, Jing Yang, Jiahui Liu & Liqun Lin
Peng Cheng Laboratory, Shenzhen, Guangdong, China
Tiesong Zhao

Authors

Bolin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Tiesong Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Liping Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Jing Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jiahui Liu
View author publications
You can also search for this author in PubMed Google Scholar
Liqun Lin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Liqun Lin.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, B., Zhao, T., Zhou, L. et al. An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet. Circuits Syst Signal Process 41, 372–394 (2022). https://doi.org/10.1007/s00034-021-01784-9

Download citation

Received: 10 September 2020
Revised: 25 June 2021
Accepted: 25 June 2021
Published: 14 July 2021
Issue Date: January 2022
DOI: https://doi.org/10.1007/s00034-021-01784-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet

Abstract

Access this article

Similar content being viewed by others

SFSM: sensitive feature selection module for image semantic segmentation

Image Semantic Segmentation Based on Fully Convolutional Neural Network and CRF

Object Boundary Guided Semantic Segmentation

Data Availability Statement

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Optimized Segmentation Scheme for Ambiguous Pixels Based on Improved FCN and DenseNet

Abstract

Access this article

Similar content being viewed by others

SFSM: sensitive feature selection module for image semantic segmentation

Image Semantic Segmentation Based on Fully Convolutional Neural Network and CRF

Object Boundary Guided Semantic Segmentation

Data Availability Statement

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation