VNet: a versatile network to train real-time semantic segmentation models on a single GPU

Li, Wenxing; Lin, Ning; Zhang, Mingzhe; Lu, Hang; Chen, Xiaoming; Li, Xiaowei

doi:10.1007/s11432-020-2971-8

VNet: a versatile network to train real-time semantic segmentation models on a single GPU

Letter
Published: 05 August 2021

Volume 65, article number 139105, (2022)
Cite this article

Science China Information Sciences Aims and scope Submit manuscript

Wenxing Li^1,2,
Ning Lin^2,3,
Mingzhe Zhang²,
Hang Lu^2,3,
Xiaoming Chen^2,3 &
…
Xiaowei Li^2,3

117 Accesses
2 Citations
Explore all metrics

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Paszke A, Chaurasia A, Kim S, et al. Enet: a deep neural network architecture for real-time semantic segmentation. 2016. ArXiv:1606.02147
Zhao H S, Qi X J, Shen X Y, et al. ICNet for real-time semantic segmentation on high-resolution images. In: Proceedings of European Conference on Computer Vision, 2018. 405–420
Li H, Kadav A, Durdanovic I, et al. Pruning filters for efficient ConvNets. 2016. ArXiv:1608.08710
Cordts M, Omran M, Ramos S, et al. The cityscapes dataset for semantic urban scene understanding. In: Proceedings of Computer Vision and Pattern Recognition, 2016. 3213–3223
Gomez A N, Ren M, Urtasun R, et al. The reversible residual network: backpropagation without storing activations. In: Proceedings of Advances in Neural Information Processing Systems, 2017. 2214–2224
Chen L C, Papandreou G, Schroff F, et al. Rethinking atrous convolution for semantic image segmentation. 2017. ArXiv:1706.05587
Zhao H S, Shi J P, Qi X J, et al. Pyramid scene parsing network. In: Proceedings of Computer Vision and Pattern Recognition, 2017. 2881–2890
Romera E, Alvarez J M, Bergasa L M, et al. ERFNet: efficient residual factorized ConvNet for real-time semantic segmentation. IEEE Trans Intell Transp Syst, 2018, 19: 263–272
Article Google Scholar

Download references

Acknowledgements

This work was supported by National Key R&D Program of China (Grant No. 2018YFA0701500), Strategic Priority Research Program of CAS (Grant No. XDB44000000), Beijing Academy of Artificial Intelligence (BAAI), National Natural Science Foundation of China (Grant No. 61532017), and CARCH Innovation Project (Grant No. CARCH4506).

Author information

Authors and Affiliations

College of Computer Science and Technology, Guizhou University, Guiyang, 550025, China
Wenxing Li
State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China
Wenxing Li, Ning Lin, Mingzhe Zhang, Hang Lu, Xiaoming Chen & Xiaowei Li
University of Chinese Academy of Sciences, Beijing, 100049, China
Ning Lin, Hang Lu, Xiaoming Chen & Xiaowei Li

Authors

Wenxing Li
View author publications
You can also search for this author in PubMed Google Scholar
Ning Lin
View author publications
You can also search for this author in PubMed Google Scholar
Mingzhe Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Hang Lu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoming Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaowei Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Hang Lu, Xiaoming Chen or Xiaowei Li.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, W., Lin, N., Zhang, M. et al. VNet: a versatile network to train real-time semantic segmentation models on a single GPU. Sci. China Inf. Sci. 65, 139105 (2022). https://doi.org/10.1007/s11432-020-2971-8

Download citation

Received: 23 March 2020
Revised: 11 June 2020
Accepted: 28 June 2020
Published: 05 August 2021
DOI: https://doi.org/10.1007/s11432-020-2971-8

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

VNet: a versatile network to train real-time semantic segmentation models on a single GPU

Access this article

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation