Abstract
In the field of 3D object detection, the point-based method faces significant limitations due to the need to process large-scale collections of irregular point clouds, resulting in reduced inference speed. To address this issue, we propose an efficient single-scale single-stage 3D object detection algorithm called SS-3DSSD. Our method eliminates the time-consuming multi-scale feature extraction module used in PointNet++ and adopts an efficient single-scale feature extraction method based on neighborhood-attention, significantly improving the model’s inference speed. Additionally, we introduce a learning-based sampling method to overcome the limited receptive fields of single-scale methods and a multi-level context feature grouping module to meet varying feature requirements at different levels. On the KITTI test set, our method achieves an inference speed of 66.7 frames per second on the RTX 2080Ti, with an average precision of 81.35% for the moderate difficulty car category. This represents a better balance between inference speed and detection accuracy, offering promising implications for real-time 3D object detection applications.
This work was supported by the Key Research and Development Program in Shaanxi Province of China (No. 2022GY-080).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen, Q., Wang, Y., Yang, T., Zhang, X., Cheng, J., Sun, J.: You only look one-level feature. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 13034–13043 (2021)
Chen, Y., Liu, S., Shen, X., Jia, J.: Fast point r-cnn. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9775–9784 (2019)
He, C., Zeng, H., Huang, J., Hua, X.S., Zhang, L.: Structure aware single-stage 3D object detection from point cloud. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2020)
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
He, Q., Wang, Z., Zeng, H., Zeng, Y., Liu, Y.: Svga-net: sparse voxel-graph attention network for 3D object detection from point clouds. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 36, pp. 870–878 (2022)
Jiang, T., Song, N., Liu, H., Yin, R., Gong, Y., Yao, J.: Vic-net: voxelization information compensation network for point cloud 3d object detection. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), pp. 13408–13414 (2021). https://doi.org/10.1109/ICRA48506.2021.9561597
Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., Beijbom, O.: Pointpillars: fast encoders for object detection from point clouds. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12697–12705 (2019)
Li, J., et al.: 3D iou-net: iou guided 3d object detector for point clouds. arXiv preprint arXiv:2004.04962 (2020)
Liu, Z., Zhao, X., Huang, T., Hu, R., Zhou, Y., Bai, X.: Tanet: robust 3D object detection from point clouds with triple attention. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 34, pp. 11677–11684 (2020)
Pang, S., Morris, D., Radha, H.: Fast-clocs: fast camera-lidar object candidates fusion for 3d object detection. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pp. 187–196 (January 2022)
Qi, C.R., Litany, O., He, K., Guibas, L.J.: Deep hough voting for 3d object detection in point clouds. In: proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9277–9286 (2019)
Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: deep learning on point sets for 3d classification and segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 652–660 (2017)
Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: deep hierarchical feature learning on point sets in a metric space. arXiv preprint arXiv:1706.02413 (2017)
Shi, S., et al.: Pv-rcnn: point-voxel feature set abstraction for 3D object detection. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10529–10538 (2020)
Shi, S., Wang, X., Li, H.: Pointrcnn: 3D object proposal generation and detection from point cloud. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 770–779 (2019)
Shi, S., Wang, Z., Shi, J., Wang, X., Li, H.: From points to parts: 3d object detection from point cloud with part-aware and part-aggregation network. IEEE Trans. Pattern Anal. Mach. Intell. 43(8), 2647–2664 (2020)
Shi, W., Rajkumar, R.R.: Point-gnn: graph neural network for 3d object detection in a point cloud. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (June 2020)
Yan, Y., Mao, Y., Li, B.: Second: sparsely embedded convolutional detection. Sensors 18(10), 3337 (2018)
Yang, Z., Sun, Y., Liu, S., Jia, J.: 3dssd: point-based 3d single stage object detector. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11040–11048 (2020)
Yang, Z., Sun, Y., Liu, S., Shen, X., Jia, J.: Std: sparse-to-dense 3D object detector for point cloud. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1951–1960 (2019)
Yin, T., Zhou, X., Krähenbühl, P.: Center-based 3D object detection and tracking. arXiv preprint arXiv:2006.11275 (2020)
Yoo, J.H., Kim, Y., Kim, J., Choi, J.W.: 3D-CVF: generating joint camera and lidar features using cross-view spatial feature fusion for 3D object detection. In: European Conference on Computer Vision, pp. 720–736. Springer (2020)
Zhang, Y., Chen, J., Huang, D.: Cat-det: contrastively augmented transformer for multimodal 3d object detection. In: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 898–907 (2022)
Zhang, Y., Huang, D., Wang, Y.: Pc-rgnn: point cloud completion and graph neural network for 3D object detection. ArXiv:2012.10412 (2020)
Zheng, W., Tang, W., Chen, S., Jiang, L., Fu, C.W.: Cia-ssd: confident iou-aware single-stage object detector from point cloud. In: Proceedings of the AAAI Conference on Artificial Intelligence.,vol. 35, pp. 3555–3562 (2021)
Zhou, Y., Tuzel, O.: Voxelnet: end-to-end learning for point cloud based 3D object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4490–4499 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tang, W., Li, Y., Li, Y., Dong, B. (2024). Efficient Point-Based Single Scale 3D Object Detection from Traffic Scenes. In: Liu, Q., et al. Pattern Recognition and Computer Vision. PRCV 2023. Lecture Notes in Computer Science, vol 14426. Springer, Singapore. https://doi.org/10.1007/978-981-99-8432-9_13
Download citation
DOI: https://doi.org/10.1007/978-981-99-8432-9_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8431-2
Online ISBN: 978-981-99-8432-9
eBook Packages: Computer ScienceComputer Science (R0)