Skip to main content

DGAT-net: Dynamic Graph Attention for 3D Point Cloud Semantic Segmentation

  • Conference paper
  • First Online:
Advanced Intelligent Computing Technology and Applications (ICIC 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14872))

Included in the following conference series:

  • 604 Accesses

Abstract

When performing processing tasks such as semantic segmentation on point cloud data, one of the biggest challenges is how to effectively capture both the local details as well as the long-range dependencies in the data. Existing processing methods often struggle to deal with both types of information simultaneously, which leads to limited performance when processing complex scenarios. To address this problem, this paper proposes a dynamic graph attention network DGAT-net. First, a dynamic graph is constructed using a feature similarity and spatial relationship-based approach to capture the local structure and long-range semantic dependencies in point cloud data. Dynamic edge sampling is also added to update the dynamic graph to reduce computational complexity. A spatial self-attention mechanism and a graph attention mechanism are introduced to capture local detail information and global dependencies, respectively, to further enhance the model’s ability to understand complex spatial relationships. Moreover, the fusion of these two attention mechanisms combined with the gating mechanism are dynamically selected to adapt to different data characteristics and task requirements. Experimental results on public point cloud datasets show that our method achieves superior performance in semantic segmentation tasks, validating its effectiveness and practicality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Armeni, I., et al.: 3D semantic parsing of large-scale indoor spaces. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1534–1543 (2016)

    Google Scholar 

  2. Behley, J., et al.: Se-mantickitti: a dataset for semantic scene understanding of lidar sequences. In: Proceedingsof the IEEE/CVF International Conference on Computer Vision, pp. 9297–9307 (2019)

    Google Scholar 

  3. Chen, Y., Dai, X., Liu, M., Chen, D., Yuan, L., Liu, Z.: Dynamic convolution: attention overconvolution kernels. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11030–11039 (2020)

    Google Scholar 

  4. Du, Z., Ye, H., Cao, F.: A novel local-global graph convolutional method for point cloud semantic segmentation. IEEE Trans. Neural Netw. Learn. Syst. 35(4), 4798–4812 (2022)

    Article  Google Scholar 

  5. Hong, J., Kim, K., Lee, H.: Faster dynamic graph CNN: faster deep learning on 3D point cloud data. IEEE Access 8, 190529–190538 (2020)

    Article  Google Scholar 

  6. Hu, Q., et al.: Randla-net: efficient semantic segmentation of large-scale point clouds. In: Proceedings of theIEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11108–11117 (2020)

    Google Scholar 

  7. Jing, W., Zhang, W., Li, L., Di, D., Chen, G., Wang, J.: Agnet: an attention-based graph network for point cloud classification and segmentation. Remote Sens. 14(4), 1036 (2022)

    Article  Google Scholar 

  8. Landrieu, L., Simonovsky, M.: Large-scale point cloud semantic segmentation with super-point graphs. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4558–4567 (2018)

    Google Scholar 

  9. Lu, D., Zhou, J., Gao, K.Y., Du, J., Xu, L., Li, J.: Dynamic clustering transformer network for point cloud segmentation. Int. J. Appl. Earth Obs. Geoinf. 128, 103791 (2024)

    Google Scholar 

  10. Milioto, A., Vizzo, I., Behley, J., Stachniss, C.: Rangenet++: fast and accurate lidar semanticsegmentation. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4213–4220. IEEE (2019)

    Google Scholar 

  11. Ngo, T.D., Hua, B.S., Nguyen, K.: Isbnet: a 3D point cloud instance segmentation network with instance-aware sampling and box-aware dynamic convolution. In: Proceedings ofthe IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13550–13559 (2023)

    Google Scholar 

  12. Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: deep hierarchical feature learning on pointsets in a metric space (2017)

    Google Scholar 

  13. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. arXiv preprintarXiv:1804.02767 (2018)

  14. Shuai, H., Xu, X., Liu, Q.: Backward attentive fusing network with local aggregation classifier for 3D point cloud semantic segmentation. IEEE Trans. Image Process. 30, 4973–4984 (2021)

    Article  Google Scholar 

  15. Tatarchenko, M., Park, J., Koltun, V., Zhou, Q.Y.: Tangent convolutions for dense predictionin 3D. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3887–3896 (2018)

    Google Scholar 

  16. Tian, Z., Shen, C., Chen, H.: Conditional convolutions for instance segmentation. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) ECCV 2020. LNCS, vol. 12346, pp. 282–298. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-58452-8_17

    Chapter  Google Scholar 

  17. Wang, Y., Sun, Y., Liu, Z., Sarma, S.E., Bronstein, M.M., Solomon, J.M.: Dynamic graph CNN for learning on point clouds. ACM Trans. Graph. (TOG) 38(5), 1–12 (2019)

    Article  Google Scholar 

  18. Wu, W., Qi, Z., Fuxin, L.: Pointconv: deep convolutional networks on 3D point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9621–9630 (2019)

    Google Scholar 

  19. Zhai, Z., Zhang, X., Yao, L.: Multi-scale dynamic graph convolution network for point clouds classification. IEEE Access 8, 65591–65598 (2020)

    Article  Google Scholar 

  20. Zhang, K., et al.: Linked dynamicgraph CNN: learning through point cloud by linking hierarchical features. In: 2021 27th International Conference on Mechatronics and Machine Vision in Practice (M2VIP), pp. 7–12. IEEE (2021)

    Google Scholar 

  21. Zhang, Y., et al.: Polarnet: an improvedgrid representation for online lidar point clouds semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9601–9610 (2020)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yujie Miao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Miao, Y., Yi, X., Guan, N., Lu, H. (2024). DGAT-net: Dynamic Graph Attention for 3D Point Cloud Semantic Segmentation. In: Huang, DS., Pan, Y., Zhang, Q. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science, vol 14872. Springer, Singapore. https://doi.org/10.1007/978-981-97-5612-4_22

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-5612-4_22

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-5611-7

  • Online ISBN: 978-981-97-5612-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics