Abstract
Dynamic point cloud (DPC) represents a realistic 3D scene in motion and has a wide range of applications. Compressing point clouds has become crucial for storing and transmitting such data. Video-based point cloud compression (V-PCC) developed by the Moving Picture Expert Group can achieve remarkable performance in DPC compression. However, it also introduces issues of point reduction and coordinate distortion in the decoded DPC. In this paper, we present a 3D-based framework as a post-processing tool for the V-PCC decoder, which complements decoded DPC and performs coordinate adjustment. In particular, we propose a neighbor-based interpolation method to recover the missing points based on the coordinates in decoded DPC. Then, to minimize the coordinate distortion in interpolation, we design a sparse fully convolutional networks, 3D Minkowski Unet, to perform coordinate adjustment. Considering the variation of data size for DPC, we propose a cube-based patch generation method to enable the scalability of the proposed framework. The experiment results demonstrate that the proposed framework obtains significant performance in complementing reduced coordinates in both objective and subjective evaluation .







Similar content being viewed by others
Data Availability
No datasets were generated or analysed during the current study.
References
Jang, E.S., Preda, M., Mammou, K., Tourapis, A.M., Kim, J., Graziosi, D.B., Rhyu, S., Budagavi, M.: Video-based point-cloud-compression standard in mpeg: From evidence collection to committee draft [standards in a nutshell]. IEEE Signal Process. Mag. 36(3), 118–123 (2019)
Cao, C., Preda, M., Zaharia, T.: 3d point cloud compression: a survey. In: The 24th international conference on 3D web technology, pp. 1–9. (2019)
Liu, H., Yuan, H., Liu, Q., Hou, J., Liu, J.: A comprehensive study and comparison of core technologies for mpeg 3-d point cloud compression. IEEE Trans. Broadcast. 66(3), 701–717 (2019)
Gumhold, S., Kami, Z., Isenburg, M., Seidel, H.-P.: Predictive point-cloud compression. In: ACM SIGGRAPH 2005 Sketches, pp. 137. (2005)
Kammerl, J., Blodow, N., Rusu, R.B., Gedikli, S., Beetz, M., Steinbach, E.: Real-time compression of point cloud streams. In: 2012 IEEE international conference on robotics and automation, pp. 778–785 (2012). IEEE
Guo, Y., Wang, H., Hu, Q., Liu, H., Liu, L., Bennamoun, M.: Deep learning for 3d point clouds: a survey. arxiv 2019. arXiv preprint arXiv:1912.12033
Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3d classification and segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 652–660. (2017)
Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Adv Neural Inf. Process. Syst. 30 (2017)
Graham, B., Engelcke, M., Van Der Maaten, L.: 3d semantic segmentation with submanifold sparse convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 9224–9232. (2018)
Choy, C., Gwak, J., Savarese, S.: 4d spatio-temporal convnets: Minkowski convolutional neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 3075–3084. (2019)
Xing, J., Yuan, H., Hamzaoui, R., Liu, H., Hou, J.: Gqe-net: a graph-based quality enhancement network for point cloud color attribute. IEEE Trans. Image Process. 32, 6303–6317 (2023). https://doi.org/10.1109/TIP.2023.3330086
Cao, K., Cosman, P.: Denoising and inpainting for point clouds compressed by v-pcc. IEEE Access 9, 107688–107700 (2021)
Akhtar, A., Gao, W., Li, L., Li, Z., Jia, W., Liu, S.: Video-based point cloud compression artifact removal. IEEE Trans. Multimed. 24, 2866–2876 (2021)
Liu, Y.-L., Chou, H.-S., Lee, M.-Z., Chan, M.-L., Lin, T.-L., Chen, C.-A., Chen, S.-L.: Point cloud inpainting based on delaunay triangulation. In: 2023 Asia Pacific signal and information processing association annual summit and conference (APSIPA ASC), pp. 1525–1529 (2023). IEEE
Li, J., Zhou, J., Xiong, Y., Chen, X., Chakrabarti, C.: An adjustable farthest point sampling method for approximately-sorted point cloud data. In: 2022 IEEE workshop on signal processing systems (SiPS), pp. 1–6 (2022). IEEE
Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.: Inception-v4, inception-resnet and the impact of residual connections on learning. Proc. AAAI Conf. Artif Intell. (2017). https://doi.org/10.1609/aaai.v31i1.11231
Chen, L.-C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Anal. Mach. Intell. 40(4), 834–848 (2017)
Eon, B., Harrison, T., Myers, P., A., Chou: 8i Voxelized Full Bodies - A Voxelized Point Cloud Dataset,. MPEG/JPEG (2017)
Schwarz, S., Martin-Cocher, G., Flynn, D., Budagavi, M.: Common test conditions for point cloud compression. Document ISO/IEC JTC1/SC29/WG11 w17766, Ljubljana, Slovenia (2018)
Li, Z., Bao, J., Liu, Y., Au Yeung, S.-K., Zhu, S., Hung, K., Khan, M.A.: Sparse fully convolutional network for video-based point cloud compression color enhancement. In: 2023 6th artificial intelligence and cloud computing conference (AICCC), pp. 66–73. (2023)
Author information
Authors and Affiliations
Contributions
All authors whose names appear on the submission made substantial contributions to the conception or design of the work; Z.L. wrote the main manuscript text and prepared all the materials for manuscript. All authors reviewed the manuscript and revised it critically for important intellectual content.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, Z., Bao, J., Liu, Y. et al. Complement decoded point cloud with coordinate adjustment for video-based point cloud compression. SIViP 19, 48 (2025). https://doi.org/10.1007/s11760-024-03602-6
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-024-03602-6