Skip to main content
Log in

Stacked convolutional auto-encoders for surface recognition based on 3d point cloud data

  • Original Article
  • Published:
Artificial Life and Robotics Aims and scope Submit manuscript

Abstract

This paper addresses the problem of feature extraction for 3d point cloud data using a deep-structured auto-encoder. As one of the most focused research areas in human–robot interaction (HRI), the vision-based object recognition is very important. To recognize object using the most common geometry feature, surface condition that can be obtained from 3d point cloud data could decrease the error during the HRI. In this research, the surface normal vectors are used to convert 3D point cloud data to a surface-condition-feature map, and a sub-route stacked convolution auto-encoder (sCAE) is designed to classify the difference between the surfaces. The result of the trained filters and the classification of sCAE shows the surface-condition-feature and the specified sCAE are very effective in the variation of surface condition.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Guo Y, Bennamoun M, Sohel F, Lu M, Wan J (2014) 3D object recognition in cluttered Scenes with local surface features: a survey. IEEE Trans Pattern Anal Mach Intell 36(11):2270–2287

  2. Xie J, Fang Y, Zhu F, Wong E (2015) DeepShape: Deep learned shape descriptor for 3D shape matching and retrieval. In: Proc. of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, 2015, pp 1275–1283

  3. Fukuda H, Mori S, Kobayashi Y, Kuno Y, Kachi D (2014) Object recognition based on human description ontology for service robots. In: Proc. of IECON 2014-40th Annual Conference, Dallas, USA, pp 4051–4056

  4. Kimura A, Yonetani R, Hirayama T (2013) Computational models of human visual attention and their implementations: a survey. IEICE Trans Inf Syst E96-D(3):562–578

  5. Borji A, Itti L (2013) State-of-the-art in visual attention modeling. IEEE Trans Pattern Anal Mach Intell 35(1):185–207

  6. OuYang D, Feng H-Y (2005) On the normal vector estimation for point cloud data from smooth surfaces. CAD 37(10):1071–1079

    Google Scholar 

  7. Kalogerakis E, Nowrouzezahrai D, Simari P, Singh K (2009) Extracting lines of curvature from noisy point clouds. CAD 41(4):282–292

    Google Scholar 

  8. Hoppe H, DeRose T, Duchamp T, McDonald J, Stuetzle W (1992) Surface reconstruction from unorganized points. ACM SIGGRAPH Comput Gr 26(2):71–78

    Article  Google Scholar 

  9. Ebert S, Musgrave F, Peachey D, Perlin K, Worley S (2003) Texturing & modeling: a procedural approach. Morgan Kaufmann Publish Inc., San Fransico

    Google Scholar 

  10. Perlin K (1985) An image synthesizer. ACM Siggraph Comput Gr 19(3):287–296

    Article  Google Scholar 

  11. Merigot Q, Ovsjanikov M, Guibas LJ (2011) Voronoi-based curvature and feature estimation from point clouds. IEEE Trans Vis Comput Gr 17(6):743–756

  12. Vincent P, Larochelle H, Lajoie I, Bengio Y, Manzagol PA (2010) Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion. J Mach Learn Res 11(3):3371–3408

    MathSciNet  MATH  Google Scholar 

  13. Masci J, Meier U, Cirean D, Schmidhuber J (2011) Stacked convolutional auto-encoders for hierarchical feature extraction. In: Proc. of Artificial Neural Networks and Machine Learning–ICANN 2011, vol 6791 LNCS, no. PART 1, pp 52–59

  14. Bastien F, Lamblin P, Pascanu R, Bergstra J, Goodfellow I, Bergeron A, Bouchard N, Warde-Farley D, Bengio Y (2012) Theano: new features and speed improvements. In: Proc. of Deep Learning and Unsupervised Feature Learning NIPS, (2012) Workshop. Lake Tahoe, USA

  15. Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127

    Article  MathSciNet  MATH  Google Scholar 

  16. Baldi P (2012) Autoencoders, unsupervised learning, and deep architectures. Unsupervised Transf Learn Chall Mach Learn 7:37–50

    Google Scholar 

  17. Socher R, Huval B, Bath B, Manning D, Ng A (2012) Convolutional-recursive deep learning for 3d object classification. In: Proc. of Advances in Neural Information Processing Systems, pp 665–673

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maierdan Maimaitimin.

Additional information

This work was presented in part at the 21st International Symposium on Artificial Life and Robotics, Beppu, Oita, January 20–22, 2016.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Maimaitimin, M., Watanabe, K. & Maeyama, S. Stacked convolutional auto-encoders for surface recognition based on 3d point cloud data. Artif Life Robotics 22, 259–264 (2017). https://doi.org/10.1007/s10015-017-0350-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10015-017-0350-9

Keywords

Navigation