References
Paszke A, Chaurasia A, Kim S, et al. Enet: a deep neural network architecture for real-time semantic segmentation. 2016. ArXiv:1606.02147
Zhao H S, Qi X J, Shen X Y, et al. ICNet for real-time semantic segmentation on high-resolution images. In: Proceedings of European Conference on Computer Vision, 2018. 405–420
Li H, Kadav A, Durdanovic I, et al. Pruning filters for efficient ConvNets. 2016. ArXiv:1608.08710
Cordts M, Omran M, Ramos S, et al. The cityscapes dataset for semantic urban scene understanding. In: Proceedings of Computer Vision and Pattern Recognition, 2016. 3213–3223
Gomez A N, Ren M, Urtasun R, et al. The reversible residual network: backpropagation without storing activations. In: Proceedings of Advances in Neural Information Processing Systems, 2017. 2214–2224
Chen L C, Papandreou G, Schroff F, et al. Rethinking atrous convolution for semantic image segmentation. 2017. ArXiv:1706.05587
Zhao H S, Shi J P, Qi X J, et al. Pyramid scene parsing network. In: Proceedings of Computer Vision and Pattern Recognition, 2017. 2881–2890
Romera E, Alvarez J M, Bergasa L M, et al. ERFNet: efficient residual factorized ConvNet for real-time semantic segmentation. IEEE Trans Intell Transp Syst, 2018, 19: 263–272
Acknowledgements
This work was supported by National Key R&D Program of China (Grant No. 2018YFA0701500), Strategic Priority Research Program of CAS (Grant No. XDB44000000), Beijing Academy of Artificial Intelligence (BAAI), National Natural Science Foundation of China (Grant No. 61532017), and CARCH Innovation Project (Grant No. CARCH4506).
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Li, W., Lin, N., Zhang, M. et al. VNet: a versatile network to train real-time semantic segmentation models on a single GPU. Sci. China Inf. Sci. 65, 139105 (2022). https://doi.org/10.1007/s11432-020-2971-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11432-020-2971-8