Skip to main content
Log in

Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection

  • Neural Computing in Next Generation Virtual Reality Technology
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

3D point clouds are important for the reconstruction of environment. However, comparing to the artificial VR scene representation methods, 3D point clouds are more difficult to correspond to real scenes. In this paper, a method for detecting keypoints and describing scale invariant point feature of 3D point clouds is proposed. To detect, we first select keypoints as the saliency points with fast changing speed along with all principal directions of the searching area of the point cloud. The searching area is a searching keyscale which represents the unique scale size of the point cloud. Then, the descriptor is encoded based on the shape of a border or silhouette of an object to be detected or recognized. We also introduce a vote-casting-based 3D multi-scale object detection method. Experimental results based on synthetic data, real data and vote-casting scheme show that we can easily deal with the different tasks without additional information.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18

Similar content being viewed by others

References

  1. Furukawa Y, Ponce J (2007) Accurate, dense, and robust multi-view stereopsis. In: CVPR, pp 1362–1376

    Google Scholar 

  2. Snavely N, Seitz SM, Szeliski R (2008) Modeling the world from internet photo collections. In: IJCV, vol 80, pp 189–210

  3. Witkin P (1981) Andrew Recovering surface shape and orientation from texture. In: Artificial intelligence, vol 17, pp 17–45

  4. Wang F, Lu M (2014) Robust particle tracker via Markov Chain Monte Carlo posterior sampling. In: Multimedia Tools and Applications, vol 72, pp 573–589

  5. Wang F, Li P, Li X, Lu M (2016) Ordered over-relaxation based Langevin Monte Carlo sampling for visual tracking. In: Neurocomputing, in press

    Google Scholar 

  6. Izadi S, Kim D, Hilliges O, Molyneaux D, Newcombe R, Kohli P, Shotton J, Hodges S, Freeman D, Davison A, Fitzgibbon A (2011) Kinectfusion: real-time 3D reconstruction and interaction using a moving depth camera. In: Proceedings of the ACM symposium on User interface software and technology (UIST)

  7. Lowe DG (2004) Distinctive image features from scale invariant keypoints. In: IJCV, vol 60, pp 91–110

  8. Johnson AE, Hebert M (1998) Surface matching for object recognition in complex three-dimensional scenes. In: Image and Vision Computing, vol 16, pp 635–651

  9. Johnson AE, Hebert M (1999) Using spin images for efficient object recognition in cluttered 3d scene. In: PAMI, vol 21, pp 433–449

  10. Steder B, Rusu RB, Konolige K, Burgard W (2010) Narf: 3d range image features for object recognition. In: Workshop on Defining and Solving Realistic Perception Problems in Personal Robotics at IROS

  11. Buch AG, Yang Y, Kruger N, Petersen HG (2014) In search of inliers: 3D correspondence by local and global voting. In: CVPR

  12. Fang Y, Xie J, Dar G, Wang M, Zhu F, Xu T (2015) E wang, 3D deep shape descriptor. In: CVPR

  13. Yu F, Xiao JX, Funkhouser T (2015) Semantic alignment of liDAR data at city scale. In: CVPR

  14. Rusu RB, Marton ZC, Blodow N, Beetz M (2008) Persistent point feature histograms for 3D point clouds. In: Proc. of IAS10, vol 16, pp 635–651

  15. Rusu RB, Blodow N, Beetz M (2009) Fast point feature histograms (FPFH) for 3D registration. In: Proc. of ICRA2009

  16. Frome A, Huber D, Kolluri R, Blow T, Malik J (2004) Recognizing objects in range data using regional point descriptors. In: ECCV

  17. Lazebnik S, Schmid C, Ponce J (2005) A sparse texture representation using local affine regions. In: PAMI, vol 27, pp 1265–1278

  18. Guo Y, Sohel F, Bennamoun M, Lu M, Wan J (2013) Rotational projection statistics for 3D local surface description and object recognition. In: IJCV, vol 105, pp 63–86

  19. Rusu RB, Bradski G, Thibaux R, Hsu J (2010) Fast 3D recognition and pose using the viewpoint feature histogram. In: IROS

  20. Rusu RB, Holzbach A, Beetz M (2009) Detecting and segmenting objects for mobile manipulation. In: ICCV

  21. Wohlkinger W, Vincze M (2011) Ensemble of shape functions for 3D object classification. In: ROBIO

  22. Bogo F, Romero J, Loper M, Black MJ (2014) FAUST: Dataset And evaluation for 3D mesh registration. In: CVPR

  23. Huang X, Zhang B, Masuda T, Ikeuchi K (2014) Robust 3D features for matching between distorted range scans captured by moving systems. In: CVPR

  24. Scovanner P, Ali S, Shah M (2007) A 3-dimensional sift descriptor and its application to action recognition. In: ACM Multimedia, pp 357–360

  25. Knopp J, Prasad M, Willems G, Timofte R, Gool LV (2010) Hough Transform and 3D SURF for robust three dimensional classication. In: Proc of ECCV2010

  26. Cheung W, Hamarneh G (2009) n-sift: n-dimensional scale invariant feature transform. In: Trans. IP, vol 18, pp 2012–2021

  27. Darom T, Keller Y (2012) Scale invariant feature for 3D Mesh models. In: IEEE Trans. Image Process, vol 21, pp 2758– 2769

  28. Lin B, Tamaki T, Raytchev B, Kaneda K, Ichii K (2013) Scale ratio ICP for 3D point clouds with different scales. In: ICIP

  29. Lin B, Tamaki T, Zhao F, Raytchev B, Kaneda K, Ichii K (2014) Scale Alignment of 3D Point Clouds with Different Scales. In: MVAP, vol 25, pp 1989–2002

  30. Mellado N, Dellepiane M, Scopigno R (2016) Relative scale estimation and 3D registration of multi-modal geometry using Growing Least Squares. In: IEEE Transactions on Visualization and Computer Graphics. in press

  31. Lin B, Tamaki T, Slomp M, Raytchev B, Kaneda K, Ichii K (2013) 3D keypoints detection from a 3D point cloud for real-time camera tracking. In: IEEJ transactions on electronics, Information and Systems, vol 133, pp 84–90

  32. Aubry M, Maturana D, Efros AA, Russell BC, Sivic J (2014) Scening 3D charis: exemplar part-based 2D-3D alignment using a large dataset of CAD models. In: CVPR

  33. Crivellaro A, Lepetit V (2014) Robust 3D tracking with descriptor fields. In: CVPR

  34. Tombari F, Salti S, Stefano LD (2010) Unique signatures of histograms for local surface description. In: ECCV2010, pp 356–369

  35. Salti S, Tombari F, Stefano LD (2014) SHOT: Unique signatures of histograms for surface and texture description. In: CVIU, vol 125, pp 251–264

  36. Wu C, Clipp B, Li X, Frahm J-M, Pollefeys M (2008) 3d model matching with viewpoint invariant patches (vips). In: CVPR, pp 1–8

  37. Rodola E, Albarelli A, Bergamasco F, Torsello A (2013) A scale independent selection process for 3D object recognition in cluttered scenes. In: IJCV, vol 102, pp 129–145

  38. Unnikrishnan R, Hebert M (2008) Multi-scale interest regions from unorganized point clouds. In: Proc. of Workshop on Search in 3D (S3D), pp 1–8

  39. Zaharescu A, Boyer E, Varanasi K (2009) R Horaud, Surface feature detection and description with applications to mesh matching. In: Proc. of International Conference on Computer Vision and Pattern Recognition (CVPR), pp 373–380

  40. Castellani U, Cristani M (2008) S Fantoni, Sparse points matching by combining 3D mesh saliency with statistical descriptors. In: Proc. of Computer Graphics Forum, pp 643–652

  41. Zhong Y (2009) Intrinsic shape signatures: A shape descriptor for 3D object recognition. In: ICCV Workshops

  42. Malik S, Belongie J, Puzicha J (2002) Shape matching and object recognition using shape contexts. In: PAMI, vol 24, pp 509–522

  43. Dey TK, Li G, Sun J (2005) Normal estimation for point clouds: a comparison study for a Voronoi based method. In: Proc. of the Second Eurographics/IEEE VGTC conference on Point-Based Graphics (SPBG2005), pp 39–46

  44. Leibe B, Leonardis A, Schiele B (2004) Combined object categorization and segmentation with an implicit shape model. In: ECCV

  45. Fischler M, Bolles R (1981) Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. In: Commun ACM, vol 24, pp 381–395

  46. The Stanford 3D Scanning Repository. [Online]. Available at: http://www.graphics.stanford.edu/data/3Dscanrep/

  47. Li B, Lu Y, Godil A, Schreck T, Bustos B, Ferreira A, Furuya T, Fonseca MJ, Johan H, Matsuda T, Ohbuchi R, Pascoal PB, Saavedra JM (2014) A comparison of methods for sketch-based 3D shape retrieval. In: Computer Vision and Image Understanding, vol 119, pp 57–80

  48. Anand A, Koppula HS, Joachims T, Saxena A (2012) Contextually guided semantic labeling and search for 3D point clouds. In: IJRR

Download references

Acknowledgments

This work was supported by Liaoning Province Doctoral Research Foundation, 201601315; Dalian Youth Science and Technology Foundation, No. 2015R092; National Natural Science Foundation Granted, No.61300082 and Liaoning Natural Science Foundation, No. 2015020015.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Baowei Lin.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lin, B., Wang, F., Zhao, F. et al. Scale invariant point feature (SIPF) for 3D point clouds and 3D multi-scale object detection. Neural Comput & Applic 29, 1209–1224 (2018). https://doi.org/10.1007/s00521-017-2964-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-017-2964-1

Keywords

Navigation