Skip to main content

Sequence Alignment for RGB-D and Motion Capture Skeletons

  • Conference paper
Image Analysis and Recognition (ICIAR 2013)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7950))

Included in the following conference series:

Abstract

RGB-D skeletons are nowadays commonly used e.g. for gesture recognition, and so their accuracy and stability have significant influence on further processing. Skeletons obtained with motion capture are considerably more accurate and can be used to assess the quality of RGB-D skeleton extraction algorithms. In this paper, we record motion sequences with both a Kinect RGB-D sensor and a full motion capture system and align the generated skeletons by subsequence dynamic time warping with a varied step size. To evaluate the alignment, we propose two measures: the minimum overall distance between feature vectors and the distance of transformed skeletons. Experimental results show that our proposed method provides a better alignment between skeletons than the comparison methods. The proposed technique can also be used for content-based retrieval from large motion capture databases.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bautista, M., Hernández-Vela, A., Ponce, V., Perez-Sala, X., Baró, X., Pujol, O., Angulo, C., Escalera, S.: Probability-based dynamic time warping for gesture recognition on RGB-D data. In: International Workshop on Depth Image Analysis, Tsukuba Science City, Japan (2012)

    Google Scholar 

  2. Deng, L., Leung, H., Gu, N., Yang, Y.: Automated recognition of sequential patterns in captured motion streams. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds.) WAIM 2010. LNCS, vol. 6184, pp. 250–261. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  3. Horn, B.K.P.: Closed-form solution of absolute orientation using unit quaternions. Journal of the Optical Society of America A 4(4), 629–642 (1987)

    Article  MathSciNet  Google Scholar 

  4. Kar, A.: Skeletal tracking using Microsoft Kinect. Methodology 1, 1–11 (2010)

    Google Scholar 

  5. Krishnamurthy, S.N.: Human Detection and Extraction using Kinect Depth Images. Master’s thesis, Bournemouth University, the United Kingdom (2011)

    Google Scholar 

  6. Lai, K., Konrad, J., Ishwar, P.: A gesture-driven computer interface using Kinect. In: Proc. Image Analysis and Interpretation (SSIAI 2012), pp. 185–188 (2012)

    Google Scholar 

  7. Menache, A.: Understanding motion capture for computer animation and video games. Morgan Kaufmann Pub. (2000)

    Google Scholar 

  8. Müller, M.: Information Retrieval for Music and Motion. Springer (2007)

    Google Scholar 

  9. Shen, W., Xiao, S., Jiang, N., Liu, W.: Unsupervised human skeleton extraction from Kinect depth images. In: Proc. 4th International Conference on Internet Multimedia Computing and Service (ICIMCS 2012), New York, USA, pp. 66–69 (2012)

    Google Scholar 

  10. Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: Proc. Computer Vision and Pattern Recognition (June 2011)

    Google Scholar 

  11. Vieira, A., Lewiner, T., Schwartz, W., Campos, M.: Distance matrices as invariant features for classifying MoCap data. In: 21st International Conference on Pattern Recognition (ICPR), Tsukuba, Japan (2012)

    Google Scholar 

  12. Xia, L., Chen, C.C., Aggarwal, J.K.: Human detection using depth information by Kinect. In: Workshop on Human Activity Understanding from 3D Data in conjunction with CVPR (HAU3D), Colorado Springs, USA (2011)

    Google Scholar 

  13. Xia, L., Chen, C.C., Aggarwal, J.K.: View invariant human action recognition using histograms of 3D joints. In: CVPR Workshops, pp. 20–27. IEEE (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, X., Koskela, M. (2013). Sequence Alignment for RGB-D and Motion Capture Skeletons. In: Kamel, M., Campilho, A. (eds) Image Analysis and Recognition. ICIAR 2013. Lecture Notes in Computer Science, vol 7950. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39094-4_72

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-39094-4_72

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-39093-7

  • Online ISBN: 978-3-642-39094-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics