research-article

A Deeplab-Based Segmentation Network for Screw Images

Authors:

Ping LouAuthors Info & Claims

ICMLSC '21: Proceedings of the 2021 5th International Conference on Machine Learning and Soft Computing

Pages 84 - 89

https://doi.org/10.1145/3453800.3453816

Published: 18 June 2021 Publication History

Abstract

Aiming at the problem that the screwdriver cannot be precisely embedded in the screw groove area during automatic screw removal, we propose an image semantic segmentation model fused with a lightweight convolutional neural network. Based on the classic DeeplabV3 model, a lightweight MobileNetV2 structure is used to replace original feature extractor, and its unique spatial pyramid structure is used for multi-scale fusion of the convolution feature of screw head image, and adding a dual attention module to extract the high-dimensional feature to reduce the loss of detail in segmentation. Finally, deconvolution is used to restore the resolution through the improved decoding network. By comparing our method with the state-of-the-art semantic segmentation network, It turns out that our method has better segmentation performance, with the mIoU up to 94.6%, and the testing time of a picture is 0.12ms, which can meet the demand of real-time task.

References

[1]

Borjigin, S., Sahoo, P. K. 2019. Color image segmentation based on multi-level Tsallis–Havrda–Charvát entropy and 2D histogram using PSO algorithms. Pattern Recognition, 92, 107-118.

[2]

Roy, P., Goswami, S., Chakraborty, S., Azar, A. T., Dey, N. 2014. Image segmentation using rough set theory: a review. International Journal of Rough Sets and Data Analysis (IJRSDA), 1(2), 62-74.

[3]

He, K., Zhang, X., Ren, S., Sun, J. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, 770-778.

[4]

Long, J., Shelhamer, E., Darrell, T. 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3431-3440).

[5]

Lin, T. Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S. (2017). Feature pyramid networks for object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2117-2125).

[6]

Chen, L. C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A. L. (2017). Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE transactions on pattern analysis and machine intelligence, 40(4), 834-848.

[7]

Chen, L. C., Papandreou, G., Schroff, F., Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587.

[8]

Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J. (2017). Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2881-2890).

[9]

Yu, F., Koltun, V. (2015). Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122.

[10]

Wang, P., Chen, P., Yuan, Y., Liu, D., Huang, Z., Hou, X., Cottrell, G. (2018, March). Understanding convolution for semantic segmentation. In 2018 IEEE winter conference on applications of computer vision (WACV) (pp. 1451-1460). IEEE.

[11]

Badrinarayanan, V., Kendall, A., Cipolla, R. (2017). Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence, 39(12), 2481-2495.

[12]

Li, H., Xiong, P., An, J., Wang, L. (2018). Pyramid attention network for semantic segmentation. arXiv preprint arXiv:1805.10180.

[13]

Zhang, Z., Zhang, X., Peng, C., Xue, X., Sun, J. (2018). Exfuse: Enhancing feature fusion for semantic segmentation. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 269-284).

Digital Library

[14]

Zhao, H., Zhang, Y., Liu, S., Shi, J., Change Loy, C., Lin, D., Jia, J. (2018). Psanet: Point-wise spatial attention network for scene parsing. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 267-283).

Digital Library

[15]

Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., Liu, W. (2019). Ccnet: Criss-cross attention for semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision (pp. 603-612).

[16]

Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., Sang, N. (2018). Bisenet: Bilateral segmentation network for real-time semantic segmentation. In Proceedings of the European conference on computer vision (ECCV) (pp. 325-341).

Digital Library

[17]

Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., Lu, H. (2019). Dual attention network for scene segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3146-3154).

[18]

Chollet, F. (2017). Xception: Deep learning with depthwise separable convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1251-1258).

[19]

He, K., Zhang, X., Ren, S., Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).

[20]

Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., Chen, L. C. (2018). Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4510-4520).

[21]

Lin, T. Y., Goyal, P., Girshick, R., He, K., Dollár, P. (2017). Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision (pp. 2980-2988).

Cited By

Deshpande SVenugopal VKumar MAnand S(2024)Deep learning-based image segmentation for defect detection in additive manufacturing: an overviewThe International Journal of Advanced Manufacturing Technology10.1007/s00170-024-14191-6134:5-6(2081-2105)Online publication date: 17-Aug-2024
https://doi.org/10.1007/s00170-024-14191-6

Recommendations

Semi-supervised Semantic Segmentation of Cataract Surgical Images based on DeepLab v3+
ICCDA '21: Proceedings of the 2021 5th International Conference on Compute and Data Analysis

Microscopic surgical image analysis is very important in surgical skill analysis, workflow recognition, and autonomous robotic surgery. Semantic segmentation of microscopic image is a prerequisite. Currently, supervised deep convolutional neural network ...
DASGC-Unet: An Attention Network for Accurate Segmentation of Liver CT Images
Abstract
The precise segmentation of lesions can assist doctors to complete efficient disease diagnosis. Unet is widely used in the field of medical image segmentation due to its excellent feature fusion ability. However, the deep network based on Unet has ...
IFCM Based Segmentation Method for Liver Ultrasound Images

In this paper we have proposed an iterative Fuzzy C-Mean (IFCM) method which divides the pixels present in the image into a set of clusters. This set of clusters is then used to segment a focal liver lesion from a liver ultrasound image. Advantage of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICMLSC '21: Proceedings of the 2021 5th International Conference on Machine Learning and Soft Computing

January 2021

178 pages

ISBN:9781450387613

DOI:10.1145/3453800

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 June 2021

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

National Natural Science Foundation of China

Conference

ICMLSC '21

ICMLSC '21: 2021 The 5th International Conference on Machine Learning and Soft Computing

January 29 - 31, 2021

Da Nang, Viet Nam

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
52
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 12 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Deshpande SVenugopal VKumar MAnand S(2024)Deep learning-based image segmentation for defect detection in additive manufacturing: an overviewThe International Journal of Advanced Manufacturing Technology10.1007/s00170-024-14191-6134:5-6(2081-2105)Online publication date: 17-Aug-2024
https://doi.org/10.1007/s00170-024-14191-6

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten