Abstract
High-quality 3D model abstraction is needed in many graphics or 3D vision tasks to improve the rendering efficiency, increase transmission speed or reduce space occupation. Traditional simplification algorithms for 3D models rely heavily on the mesh topology and ignore objects’ overall structure during optimization. Learning-based methods are then proposed to form an end-to-end regression system for abstraction. However, existing learning-based methods have difficulty representing shapes with hollow or concave structures. We propose a self-supervised learning-based abstraction method for 3D meshes to solve this problem. Our system predicts the positive and negative primitives, where positive primitives are to match the inside part of the shape, and negative primitives represent the hollow region of the shape. More specifically, the bool difference between positive primitives and the object is fed to a network using Iteration error feedback mechanism to predict the negative primitives, which crop the positive primitives to create hollow or concave structures. In addition, we design a new separation loss to prevent a negative primitive from overlapping the object too much. We evaluate the proposed method on the ShapeNetCore dataset by Chamfer Distance and Intersection over Union. The results show that our positive–negative abstraction schema outperforms the baselines.
Similar content being viewed by others
Data Availability
We use two public datasets, both of them are available online. The ShapeNetCore dataset [25] is available at https://shapenet.org/. The animal models collected by Tulsiani et al. [4] are published by the authors at https://github.com/shubhtuls/volumetricPrimitives/issues/7 and are available at https://www.dropbox.com/s/2p6mccei43x7bvk/quadrapeds.tar.gz?dl=0 &file_subpath=.
References
Hoppe, H., DeRose, T., Duchamp, T., McDonald, J., Stuetzle, W.: Mesh optimization. In: Proceedings of the 20th Annual Conference on Computer Graphics and Interactive Techniques. SIGGRAPH ’93, pp. 19–26. Association for Computing Machinery, New York, NY, USA (1993). https://doi.org/10.1145/166117.166119
Garland, M., Heckbert, P.S.: Surface simplification using quadric error metrics. In: Proceedings of the 24th Annual Conference on Computer Graphics and Interactive Techniques, pp. 209–216 (1997)
Yumer, M.E., Kara, L.B.: Co-abstraction of shape collections. ACM Trans. Graph. 31(6), 1–11 (2012). https://doi.org/10.1145/2366145.2366185
Tulsiani, S., Su, H., Guibas, L.J., Efros, A.A., Malik, J.: Learning shape abstractions by assembling volumetric primitives. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2635–2643 (2017)
Sun, C.-Y., Zou, Q.-F., Tong, X., Liu, Y.: Learning adaptive hierarchical cuboid abstractions of 3D shape collections. ACM Trans. Graph. 38(6), 1–13 (2019). https://doi.org/10.1145/3355089.3356529
Zhao, X., Wang, H., Zhang, B., Yang, Y.-J., Hu, R.: Learning cuboid abstraction of 3D shapes via iterative error feedback. Comput.-Aided Des. 141, 103092 (2021). https://doi.org/10.1016/j.cad.2021.103092
Maturana, D., Scherer, S.: Voxnet: a 3D convolutional neural network for real-time object recognition. In: 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 922–928 (2015). https://doi.org/10.1109/IROS.2015.7353481
Paschalidou, D., Ulusoy, A.O., Geiger, A.: Superquadrics revisited: Learning 3D shape parsing beyond cuboids. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10344–10353 (2019)
Yang, K., Chen, X.: Unsupervised learning for cuboid shape abstraction via joint segmentation from point clouds. ACM Trans. Graph. 40(4), 1–11 (2021). https://doi.org/10.1145/3450626.3459873
Li, L., Sung, M., Dubrovina, A., Yi, L., Guibas, L.J.: Supervised fitting of geometric primitives to 3d point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2652–2660 (2019)
Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: deep hierarchical feature learning on point sets in a metric space. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R. (eds.) Advances in Neural Information Processing Systems, vol. 30, pp. 5099–5108. Curran Associates Inc., USA (2017)
Yan, S., Yang, Z., Ma, C., Huang, H., Vouga, E., Huang, Q.: Hpnet: deep primitive segmentation using hybrid representations. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2753–2762 (2021)
Sharma, G., Liu, D., Maji, S., Kalogerakis, E., Chaudhuri, S., Měch, R.: Parsenet: a parametric surface fitting network for 3D point clouds. In: Vedaldi, A., Bischof, H., Brox, T., Frahm, J.-M. (eds.) Computer Vision—ECCV 2020, pp. 261–276. Springer, Cham (2020)
Lê, E.-T., Sung, M., Ceylan, D., Mech, R., Boubekeur, T., Mitra, N.J.: Cpfn: cascaded primitive fitting networks for high-resolution point clouds. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7457–7466 (2021)
Huang, J., Zhang, Y., Sun, M.: Primitivenet: primitive instance segmentation with local primitive embedding under adversarial metric. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 15343–15353 (2021)
Guo, J., Liu, Y., Song, X., Liu, H., Zhang, X., Cheng, Z.: Line-based 3d building abstraction and polygonal surface reconstruction from images. IEEE Transactions on Visualization and Computer Graphics, 1–15 (2022). https://doi.org/10.1109/TVCG.2022.3230369
Birdal, T., Busam, B., Navab, N., Ilic, S., Sturm, P.: A minimalist approach to type-agnostic detection of quadrics in point clouds. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3530–3540 (2018)
Wu, Q., Xu, K., Wang, J.: Constructing 3D CSG models from 3D raw point clouds. Comput. Graph. Forum 37(5), 221–232 (2018). https://doi.org/10.1111/cgf.13504
Li, Y., Wu, X., Chrysathou, Y., Sharf, A., Cohen-Or, D., Mitra, N.J.: Globfit: consistently fitting primitives by discovering global relations. In: ACM SIGGRAPH 2011 Papers. SIGGRAPH ’11, pp. 1–12. Association for Computing Machinery, New York, NY, USA (2011). https://doi.org/10.1145/1964921.1964947
Liu, H.-T.D., Jacobson, A.: Cubic stylization. ACM Trans. Graph. 38(6), 1–10 (2019). https://doi.org/10.1145/3355089.3356495
Huang, J., Jiang, T., Shi, Z., Tong, Y., Bao, H., Desbrun, M.: l1-based construction of polycube maps from complex shapes. ACM Trans. Graph. 33(3), 1–11 (2014). https://doi.org/10.1145/2602141
Liu, Z., Freeman, W.T., Tenenbaum, J.B., Wu, J.: Physical primitive decomposition. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19 (2018)
Yin, K., Gao, J., Shugrina, M., Khamis, S., Fidler, S.: 3dstylenet: creating 3D shapes with geometric and texture style variations. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 12456–12465 (2021)
Chen, X., Tang, J., Li, C.: Progressive 3D shape abstraction via hierarchical CSG tree. In: Second International Workshop on Pattern Recognition, vol. 10443, pp. 205–210 (2017). SPIE
Chang, A.X., Funkhouser, T., Guibas, L., Hanrahan, P., Huang, Q., Li, Z., Savarese, S., Savva, M., Song, S., Su, H., Xiao, J., Yi, L., Yu, F.: ShapeNet: an Information-Rich 3D Model Repository. Technical Report arXiv:1512.03012 [cs.GR], Stanford University—Princeton University—Toyota Technological Institute at Chicago (2015)
Funding
This work was supported by National Natural Science Foundation of China (62072366) and Key R &D project of Shaanxi Province (2021QFY01-03HZ).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhao, X., Li, H. & Wang, H. Learning shape abstraction by cropping positive cuboid primitives with negative ones. Vis Comput 39, 3585–3595 (2023). https://doi.org/10.1007/s00371-023-02943-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-023-02943-6