ODCN: Optimized Dilated Convolution Network for 3D Shape Segmentation

Qian, Likuan; Lian, Yuanfeng; Wei, Qian; Wu, Shuangyuan; Zhang, Jianbin

doi:10.1007/978-3-030-31726-3_32

ODCN: Optimized Dilated Convolution Network for 3D Shape Segmentation

Likuan Qian¹⁶,
Yuanfeng Lian¹⁶,
Qian Wei¹⁷,
Shuangyuan Wu¹⁶ &
…
Jianbin Zhang¹⁶

Conference paper
First Online: 31 October 2019

1857 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11859))

Abstract

3D shape segmentation is a vital and fundamental issue in 3D shape analysis tasks, and the multi-view paradigm is one of practical approaches to solve it. The typical multi-view paradigm contains an image-based convolutional neural network (CNN) for effective view-based semantic segmentation. To improve the accuracy of multi-view paradigm, this paper presents a new dilated convolution network called Optimized Dilated Convolution Network (ODCN). We derive a novel network architecture by using the gradient descent with momentum algorithm to minimize some objective functions related to neural network propagation. In addition, the dilated convolution, which increases the resolution of output feature maps without reducing the receptive field of network, is adopted for semantic segmentation. Experimental results verify that the proposed method achieves better performance over other state-of-the-art methods.

Student as first author.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Guo, K., Zou, D., Chen, X.: 3d mesh labeling via deep convolutional neural networks. ACM Trans. Graph. 35(1), 3:1–3:12 (2015)
Article Google Scholar
Kalogerakis, E., Averkiou, M., Maji, S., Chaudhuri, S.: 3D shape segmentation with projective convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3779–3788 (2017)
Google Scholar
Wang, P., Gan, Y., Shui, P.: 3D shape segmentation via shape fully convolutional networks. Comput. Graph. 70, 128–139 (2018)
Article Google Scholar
Shu, Z., et al.: Scribble based 3D shape segmentation via weakly-supervised learning. IEEE Trans. Vis. Comput. Graph. 1, 1 (2019)
Article Google Scholar
Wu, Z., Wang, Y., Shou, R., Chen, B., Liu, X.: Unsupervised co-segmentation of 3D shapes via affinity aggregation spectral clustering. Comput. Graph. 37(6), 628–637 (2013)
Article Google Scholar
Sander, P.V., Snyder, J., Gortler, S.J., Hoppe, H.: Texture mapping progressive meshes. In: Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, pp. 409–416. ACM (2001)
Google Scholar
Shalom, S., Shapira, L., Shamir, A., Cohen-Or, D.: Part analogies in sets of objects. In: Proceedings of the 1st Eurographics Conference on 3D Object Retrieval, pp. 33–40 (2008)
Google Scholar
Zuckerberger, E., Tal, A., Shlafman, S.: Polyhedral surface decomposition with applications. Comput. Graph. 26(5), 733–743 (2002)
Article Google Scholar
He, C., Wang, C.: A survey on segmentation of 3D models. Wirel. Pers. Commun. 102(4), 3835–3842 (2018)
Article MathSciNet Google Scholar
Chen, X., Golovinskiy, A., Funkhouser, T.: A benchmark for 3D mesh segmentation. In: ACM Transactions on Graphics, pp. 73:1–73:12. ACM (2009)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)
Google Scholar
Yu, F., Koltun, V.: Multi-scale context aggregation by dilated convolutions. arXiv:1511.07122 (2015)
Su, H., Maji, S., Kalogerakis, E., Learned-Miller, E.: Multi-view convolutional neural networks for 3d shape recognition. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 945–953 (2015)
Google Scholar
Le, T., Bui, G., Duan, Y.: A multi-view recurrent neural network for 3D mesh segmentation. Comput. Graph. 66, 103–112 (2017)
Article Google Scholar
Yu, F., Koltun, V., Funkhouser, T.: Dilated residual networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 472–480 (2017)
Google Scholar
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Semantic image segmentation with deep convolutional nets and fully connected CRFs. arXiv:1412.7062 (2014)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556 (2014)
Bertsekas, D.P.: Nonlinear programming. J. Oper. Res. Soc. 48(3), 334 (1997)
Article Google Scholar
Qian, N.: On the momentum term in gradient descent learning algorithms. Neural Netw. 12(1), 145–151 (1999)
Article MathSciNet Google Scholar
Gal, R., Cohen-Or, D.: Salient geometric features for partial shape matching and similarity. ACM Trans. Graph. 25(1), 130–150 (2006)
Article Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 4, 509–522 (2002)
Article Google Scholar
Polyak, B.: Some methods of speeding up the convergence of iteration methods. USSR Comput. Math. Math. Phys. 4(5), 1–17 (1964)
Article Google Scholar
Kalogerakis, E., Hertzmann, A., Singh, K.: Learning 3D mesh segmentation and labeling. ACM Trans. Graph. 29(4), 102 (2010)
Article Google Scholar
Wang, Y., Gong, M., Wang, T., Cohen-Or, D., Zhang, H., Chen, B.: Projective analysis for 3D shape segmentation. ACM Trans. Graph. 32(6), 192:1–192:12 (2013)
MathSciNet Google Scholar
Phong, B.T.: Illumination for computer generated pictures. Commun. ACM 18(6), 311–317 (1975)
Article Google Scholar
Li, H., Yang, Y., Chen, D., Lin, Z.: Optimization algorithm inspired deep neural network structure design. arXiv:1810.01638 (2018)
Krahenbuhl, P., Koltun, V.: Effcient inference in fully connected crfs with gaussian edge potentials. In: Advances in Neural Information Processing Systems, pp. 109–117 (2011)
Google Scholar
Wang, Y., Asafi, S., Van Kaick, O., Zhang, H., Cohen-Or, D., Chen, B.: Active co-analysis of a set of shapes. ACM Trans. Graph. 31(6), 165 (2012)
Article Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 27 (2011)
Article Google Scholar
Shui, P., et al.: 3D shape segmentation based on viewpoint entropy and projective fully convolutional networks fusing multi-view features. In: The 24th International Conference on Pattern Recognition, pp. 1056–1061 (2018)
Google Scholar
Nesterov, Y.: A method for unconstrained convex minimization problem with the rate of convergence 0(1/k²). Sov. Math. Doklady 27(2), 372–376 (1983)
Google Scholar

Download references

Acknowledgments

This work was supported by National Key R&D Program of China (2016YFC0303707).

Author information

Authors and Affiliations

China University of Petroleum-Beijing, Beijing, 102200, China
Likuan Qian, Yuanfeng Lian, Shuangyuan Wu & Jianbin Zhang
CNPC Beijing Richfit Information Technology Co., LTD., Beijing, 102200, China
Qian Wei

Authors

Likuan Qian
View author publications
You can also search for this author in PubMed Google Scholar
Yuanfeng Lian
View author publications
You can also search for this author in PubMed Google Scholar
Qian Wei
View author publications
You can also search for this author in PubMed Google Scholar
Shuangyuan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Jianbin Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yuanfeng Lian .

Editor information

Editors and Affiliations

School of EECS, Peking University, Beijing, China
Zhouchen Lin
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Liang Wang
Nanjing University of Science and Technology, Nanjing, China
Jian Yang
Xidian University, Xi’an, China
Guangming Shi
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
Institute of Artificial Intelligence, Xi’an Jiaotong University, Xi’an, China
Nanning Zheng
Chinese Academy of Sciences, Beijing, China
Xilin Chen
Northwestern Polytechnical University, Xi’an, China
Yanning Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qian, L., Lian, Y., Wei, Q., Wu, S., Zhang, J. (2019). ODCN: Optimized Dilated Convolution Network for 3D Shape Segmentation. In: Lin, Z., et al. Pattern Recognition and Computer Vision. PRCV 2019. Lecture Notes in Computer Science(), vol 11859. Springer, Cham. https://doi.org/10.1007/978-3-030-31726-3_32

Download citation

DOI: https://doi.org/10.1007/978-3-030-31726-3_32
Published: 31 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31725-6
Online ISBN: 978-3-030-31726-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics