
High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation

  • Conference paper
Neural Information Processing (ICONIP 2023)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14451)


  • The original version of this chapter was revised: In the PDF version of this chapter, both Fig. 4 and Fig. 5 erroneously showed the same figure. A correction to this chapter can be found at https://doi.org/10.1007/978-981-99-8073-4_46

Abstract

Applying deep learning techniques to point cloud analysis has emerged as a prominent research direction. However, insufficient integration of spatial and feature information within point clouds, together with the unbalanced class distributions of real-world datasets, has hindered progress. Given the success of self-attention mechanisms in numerous domains, we apply the High-Resolution Self-Attention (HRSA) module as a plug-and-play solution for point cloud segmentation. The proposed HRSA module preserves high-resolution internal representations in both the spatial and feature dimensions. Additionally, we introduce the Fair Loss, which rebalances the gradients of dominant and weak classes to address the unbalanced class distributions of real-world datasets and improve the network's inference capability. Both modules integrate seamlessly into an MLP-based architecture tailored for large-scale point cloud processing, yielding a new segmentation network called PointHR. PointHR achieves mIoU scores of 69.8% and 74.5% on S3DIS Area-5 and 6-fold cross-validation, respectively. Combined with a significantly smaller number of parameters, these results make PointHR highly competitive in point cloud semantic segmentation.
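The abstract does not reproduce the paper's exact Fair Loss formulation. As a rough, hypothetical sketch of the underlying idea it describes — boosting the gradient contribution of weak (rare) classes relative to dominant ones — a class-imbalance-aware loss can be illustrated with inverse-frequency weighting of a cross-entropy-style objective. The function names and the weighting scheme below are illustrative assumptions, not the paper's method:

```python
import math
from collections import Counter

def class_weights(labels):
    """Inverse-frequency weights (illustrative, not the paper's Fair Loss):
    rare classes get larger weights, so their per-point gradient contribution
    is amplified relative to dominant classes."""
    counts = Counter(labels)
    total = len(labels)
    # weight_c is proportional to total / count_c, normalized to mean 1
    raw = {c: total / n for c, n in counts.items()}
    mean = sum(raw.values()) / len(raw)
    return {c: w / mean for c, w in raw.items()}

def weighted_nll(probs, labels, weights):
    """Class-weighted negative log-likelihood over predicted probabilities.

    probs:   list of per-point probability vectors (one float per class)
    labels:  ground-truth class index for each point
    weights: per-class weight dict, e.g. from class_weights()
    """
    return sum(-weights[y] * math.log(p[y])
               for p, y in zip(probs, labels)) / len(labels)
```

With 8 points of class 0 and 2 points of class 1, `class_weights` yields weights 0.4 and 1.6, so misclassifying the rare class costs four times as much as misclassifying the dominant one; this is the general effect (if not the precise mechanism) the Fair Loss is described as achieving.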


Change history

  • 15 December 2023

    A correction has been published.


Acknowledgement

This work was supported in part by the Heilongjiang Provincial Science and Technology Program under Grant 2022ZX01A16, and in part by the Sichuan Science and Technology Program under Grant 2022YFG0148.

Author information


Corresponding author

Correspondence to Qiang Li.


Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper


Cite this paper

Liu, Q., Lu, J., Li, Q., Huang, B. (2024). High-Resolution Self-attention with Fair Loss for Point Cloud Segmentation. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Lecture Notes in Computer Science, vol 14451. Springer, Singapore. https://doi.org/10.1007/978-981-99-8073-4_27


  • DOI: https://doi.org/10.1007/978-981-99-8073-4_27


  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-8072-7

  • Online ISBN: 978-981-99-8073-4

  • eBook Packages: Computer Science (R0)
