SVP-Classifier: Single-View Point Cloud Data Classifier with Multi-view Hallucination

Mohammadi, Seyed Saber; Wang, Yiming; Taiana, Matteo; Morerio, Pietro; Del Bue, Alessio

doi:10.1007/978-3-031-06430-2_2

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13232))

Included in the following conference series:

International Conference on Image Analysis and Processing

2061 Accesses

Abstract

We address single-view 3D shape classification with partial Point Cloud Data (PCD) inputs. Conventional PCD classifiers achieve the best performance when trained and evaluated with complete 3D object scans. However, they all experience a performance drop when trained and evaluated on partial single-view PCD. We propose a Single-View PCD Classifier (SVP-Classifier), which first hallucinates the features of other viewpoints covering the unseen part of the object with a Conditional Variational Auto-Encoder (CVAE). It then aggregates the hallucinated multi-view features with a multi-level Graph Convolutional Network (GCN) to form a global shape representation that helps to improve the single-view PCD classification performance. With experiments on the single-view PCDs generated from ModelNet40 and ScanObjectNN, we prove that the proposed SVP-Classifier outperforms the best single-view PCD-based methods, after they have been retrained on single-view PCDs, thus reducing the gap between single-view methods and methods that employ complete PCDs. Code and datasets are available: https://github.com/IIT-PAVIS/SVP-Classifier.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 89.00; Price excludes VAT (USA)

Softcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

PVFNet: Point-View Fusion Network for 3D Shape Recognition

FuseNet: a multi-modal feature fusion network for 3D shape classification

Article 26 July 2024

Three-stage generative network for single-view point cloud completion

Article 08 October 2021

References

An, J., Cho, S.: Variational autoencoder based anomaly detection using reconstruction probability. Spec. Lect. IE 2(1), 1–18 (2015)
Google Scholar
Angelina Uy, M., Pham, Q.H., Hua, B.S., Thanh Nguyen, D., Yeung, S.K.: Revisiting point cloud classification: a new benchmark dataset and classification model on real-world data. arXiv, arXiv-1908 (2019)
Google Scholar
Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)
Article Google Scholar
Dong, G., Liao, G., Liu, H., Kuang, G.: A review of the autoencoder and its variants: a comparative perspective from target recognition in synthetic-aperture radar images. IEEE Geosci. Remote Sens. Mag. 6(3), 44–68 (2018)
Article Google Scholar
Ioannidou, A., Chatzilari, E., Nikolopoulos, S., Kompatsiaris, I.: Deep learning advances in computer vision with 3D data: a survey. ACM Comput. Surv. (CSUR) 50(2), 1–38 (2017)
Article Google Scholar
Kingma, D., Welling, M.: Auto-encoding variational Bayes. In: ICLR 2014 (2014)
Google Scholar
Li, Y., Bu, R., Sun, M., Wu, W., Di, X., Chen, B.: PointCNN: convolution on x-transformed points. Adv. Neural Inf. Process. Syst. 31, 820–830 (2018)
Google Scholar
Mohammadi, S.S., Wang, Y., Del Bue, A.: PointView-GCN: 3D shape classification with multi-view point clouds. In: 2021 IEEE International Conference on Image Processing (ICIP), pp. 3103–3107. IEEE (2021)
Google Scholar
Qi, C.R., Liu, W., Wu, C., Su, H., Guibas, L.J.: Frustum PointNets for 3D object detection from RGB-D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 918–927 (2018)
Google Scholar
Qi, C.R., Su, H., Mo, K., Guibas, L.J.: PointNet: deep learning on point sets for 3D classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 652–660 (2017)
Google Scholar
Qi, C.R., Su, H., Nießner, M., Dai, A., Yan, M., Guibas, L.J.: Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5648–5656 (2016)
Google Scholar
Qi, C.R., Yi, L., Su, H., Guibas, L.J.: PointNet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in Neural Information Processing Systems, pp. 5099–5108 (2017)
Google Scholar
Sohn, K., Lee, H., Yan, X.: Learning structured output representation using deep conditional generative models. Adv. Neural Inf. Process. Syst. 28, 3483–3491 (2015)
Google Scholar
Vishwanath, K.V., Gupta, D., Vahdat, A., Yocum, K.: ModelNet: towards a datacenter emulation environment. In: Proceedings of the IEEE Ninth International Conference on Peer-to-Peer Computing, pp. 81–82. IEEE (2009)
Google Scholar
Wang, Y., Carletti, M., Setti, F., Cristani, M., Bue, A.D.: Active 3D classification of multiple objects in cluttered scenes. In: Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, pp. 2602–2610 (2019)
Google Scholar
Wang, Y., Sun, Y., Liu, Z., Sarma, S.E., Bronstein, M.M., Solomon, J.M.: Dynamic graph CNN for learning on point clouds. ACM Trans. Graph. (TOG) 38(5), 1–12 (2019)
Article Google Scholar
Xu, Y., Fan, T., Xu, M., Zeng, L., Qiao, Yu.: SpiderCNN: deep learning on point sets with parameterized convolutional filters. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11212, pp. 90–105. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01237-3_6
Chapter Google Scholar
Yuniarti, A., Suciati, N.: A review of deep learning techniques for 3D reconstruction of 2D images. In: 2019 12th International Conference on Information & Communication Technology and System (ICTS), pp. 327–331. IEEE (2019)
Google Scholar
Zhao, Y., Birdal, T., Deng, H., Tombari, F.: 3D point capsule networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1009–1018 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Marine, Electrical, Electronic and Telecommunications Engineering, University of Genoa, Genoa, Italy
Seyed Saber Mohammadi
Pattern Analysis and Computer Vision (PAVIS), Italian Institute of Technology, Genoa, Italy
Seyed Saber Mohammadi, Yiming Wang, Matteo Taiana, Pietro Morerio & Alessio Del Bue
Deep Visual Learning (DVL), Fondazione Bruno Kessler, Trento, Italy
Yiming Wang

Authors

Seyed Saber Mohammadi
View author publications
You can also search for this author in PubMed Google Scholar
Yiming Wang
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Taiana
View author publications
You can also search for this author in PubMed Google Scholar
Pietro Morerio
View author publications
You can also search for this author in PubMed Google Scholar
Alessio Del Bue
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Seyed Saber Mohammadi .

Editor information

Editors and Affiliations

Boston University, Boston, MA, USA
Stan Sclaroff
National Research Council, Lecce, Italy
Cosimo Distante
National Research Council, Lecce, Italy
Marco Leo
University of Catania, Catania, Italy
Giovanni M. Farinella
Technische Universität München, Garching, Germany
Federico Tombari

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 12054 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mohammadi, S.S., Wang, Y., Taiana, M., Morerio, P., Del Bue, A. (2022). SVP-Classifier: Single-View Point Cloud Data Classifier with Multi-view Hallucination. In: Sclaroff, S., Distante, C., Leo, M., Farinella, G.M., Tombari, F. (eds) Image Analysis and Processing – ICIAP 2022. ICIAP 2022. Lecture Notes in Computer Science, vol 13232. Springer, Cham. https://doi.org/10.1007/978-3-031-06430-2_2

Download citation

DOI: https://doi.org/10.1007/978-3-031-06430-2_2
Published: 17 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06429-6
Online ISBN: 978-3-031-06430-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

SVP-Classifier: Single-View Point Cloud Data Classifier with Multi-view Hallucination

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

PVFNet: Point-View Fusion Network for 3D Shape Recognition

FuseNet: a multi-modal feature fusion network for 3D shape classification

Three-stage generative network for single-view point cloud completion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 12054 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

SVP-Classifier: Single-View Point Cloud Data Classifier with Multi-view Hallucination

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

PVFNet: Point-View Fusion Network for 3D Shape Recognition

FuseNet: a multi-modal feature fusion network for 3D shape classification

Three-stage generative network for single-view point cloud completion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 12054 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation