KSM Based Machine Learning for Markerless Motion Capture

KSM Based Machine Learning for Markerless Motion Capture

Therdsak Tangkuampien, David Suter
Copyright: © 2010 |Pages: 33
ISBN13: 9781605669007|ISBN10: 1605669008|ISBN13 Softcover: 9781616922160|EISBN13: 9781605669014
DOI: 10.4018/978-1-60566-900-7.ch005
Cite Chapter Cite Chapter

MLA

Tangkuampien, Therdsak, and David Suter. "KSM Based Machine Learning for Markerless Motion Capture." Machine Learning for Human Motion Analysis: Theory and Practice, edited by Liang Wang, et al., IGI Global, 2010, pp. 74-106. https://doi.org/10.4018/978-1-60566-900-7.ch005

APA

Tangkuampien, T. & Suter, D. (2010). KSM Based Machine Learning for Markerless Motion Capture. In L. Wang, L. Cheng, & G. Zhao (Eds.), Machine Learning for Human Motion Analysis: Theory and Practice (pp. 74-106). IGI Global. https://doi.org/10.4018/978-1-60566-900-7.ch005

Chicago

Tangkuampien, Therdsak, and David Suter. "KSM Based Machine Learning for Markerless Motion Capture." In Machine Learning for Human Motion Analysis: Theory and Practice, edited by Liang Wang, Li Cheng, and Guoying Zhao, 74-106. Hershey, PA: IGI Global, 2010. https://doi.org/10.4018/978-1-60566-900-7.ch005

Export Reference

Mendeley
Favorite

Abstract

A marker-less motion capture system, based on machine learning, is proposed and tested. Pose information is inferred from images captured from multiple (as few as two) synchronized cameras. The central concept of which, we call: Kernel Subspace Mapping (KSM). The images-to-pose learning could be done with large numbers of images of a large variety of people (and with the ground truth poses accurately known). Of course, obtaining the ground-truth poses could be problematic. Here we choose to use synthetic data (both for learning and for, at least some of, testing). The system needs to generalizes well to novel inputs:unseen poses (not in the training database) and unseen actors. For the learning we use a generic and relatively low fidelity computer graphic model and for testing we sometimes use a more accurate model (made to resemble the first author). What makes machine learning viable for human motion capture is that a high percentage of human motion is coordinated. Indeed, it is now relatively well known that there is large redundancy in the set of possible images of a human (these images form som sort of relatively smooth lower dimensional manifold in the huge dimensional space of all possible images) and in the set of pose angles (again, a low dimensional and smooth sub-manifold of the moderately high dimensional space of all possible joint angles). KSM, is based on the KPCA (Kernel PCA) algorithm, which is costly. We show that the Greedy Kernel PCA (GKPCA) algorithm can be used to speed up KSM, with relatively minor modifications. At the core, then, is two KPCA’s (or two GKPCA’s) - one for the learning of pose manifold and one for the learning image manifold. Then we use a modification of Local Linear Embedding (LLE) to bridge between pose and image manifolds.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.