Articulated motion reconstruction from feature points
Introduction
Visual interpretation of non-rigid articulated motion has recently seen a renaissance in computer vision and pattern recognition. The motivation for extending motion analysis from rigid objects to non-rigid articulated objects [1], [2], and especially to human motion [3], [4], [5], is driven by potential applications such as human–computer interaction, surveillance, entertainment and medical studies. A large body of research on structure and motion analysis uses feature-based methods, whether parametrised by points, lines, curves or surfaces. Among these, a concise feature-point representation, which usefully abstracts the underlying movement, typically serves as an essential or intermediate correspondence towards the end-product of motion and structure recovery [6], [7], [8].
In the context of vision cues via feature-point representation, the spatio–temporal information is reduced to a sequence of unidentified points moving over time. To determine the subject's structure, and therefore its underlying skeletal-style movements for the purpose of high-level recognition, two fundamental problems must be solved: feature-point tracking and feature-point identification. Tracking feature points in successive frames has been investigated extensively in the literature [9], [10], [11], [12]. However, the identities of the subject's feature points are not obtainable from inter-frame tracking alone.
Feature-point identification requires the determination of which point in an observed data frame corresponds to which point in its model, thus allowing recovery of structure. The task addresses the difficult problem of automatic model matching and identification, crucial at the start of tracking or on resumption from tracking loss. Currently, most tracking approaches simplify the problem to incremental pose estimation, relying on manual model fitting at the start of tracking, or on an assumption of initial pose similarity and alignment to the model, or on pre-knowledge of a specific motion from which to infer an initial pose [5], [13]. In this sense, the general recovery of non-rigid articulated motion solely from feature points still remains an open problem. There is a relative dearth of algorithmic self-initialisation for articulated motion reconstruction from only a collection of sparse feature points.
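At its core, feature-point identification is a combinatorial assignment problem. The following sketch (our own illustration, not the algorithm proposed in this paper; the function name `identify_points` is hypothetical) matches a small observed point set to a model by exhaustive search over permutations, and it assumes the observed data are already roughly aligned with the model, which is exactly the simplifying assumption the paper seeks to remove:

```python
from itertools import permutations

def identify_points(model, observed):
    """Exhaustive one-to-one identification: return the permutation of
    observed-point indices that minimises total squared distance to the
    model points.  Feasible only for small point sets; shown purely to
    make the combinatorial nature of the problem concrete."""
    n = len(model)
    best_perm, best_cost = None, float("inf")
    for perm in permutations(range(n)):
        cost = sum((model[i][0] - observed[perm[i]][0]) ** 2
                   + (model[i][1] - observed[perm[i]][1]) ** 2
                   for i in range(n))
        if cost < best_cost:
            best_cost, best_perm = cost, perm
    return best_perm   # best_perm[i] = observed index matched to model point i

model = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
observed = [(0.02, 1.01), (0.01, -0.01), (0.98, 0.03)]   # shuffled, noisy
print(identify_points(model, observed))   # -> (1, 2, 0)
```

In practice the factorial search is replaced by a polynomial-time assignment solver, and, as the paper argues, the alignment assumption itself must be dropped for self-initialisation.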
Motivated by these observations, we present a dynamic segment-based hierarchical point matching (DSHPM) algorithm to address self-initialising articulated motion reconstruction from sparse feature points. The articulated motion we consider is general, segmentally jointed, freeform movement. The motion of each segment can be considered rigid or nearly rigid, but the motion of the object as a whole is high-dimensionally non-rigid. In our work, the articulated model of an observed subject is known a priori, suggesting a model-based approach. As a general solution to the problem, the algorithm assumes only the availability of feature-point motion data, such as obtained in our experiments via a marker-based motion capture system. We make neither the usual simplifying assumption of model-pose similarity nor that of a restricted motion class for tracking initialisation, nor do we require noise-free data. The algorithm aims to establish one-to-one matches between the model point-set and its freeform motion data, so as to reconstruct the underlying articulated movements in buffered real-time.
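Because each segment moves (nearly) rigidly, per-segment pose recovery reduces to fitting a rigid transform between matched point sets. As a hedged 2-D illustration (the paper works in 3-D, where the analogous step is usually solved via an SVD; the function name `fit_rigid_2d` is our own), the closed-form least-squares rotation-plus-translation fit is:

```python
import math

def fit_rigid_2d(src, dst):
    """Least-squares rigid transform (rotation theta + translation (tx, ty))
    mapping 2-D point set src onto dst: closed-form 2-D Procrustes fit."""
    n = len(src)
    cx_s = sum(p[0] for p in src) / n
    cy_s = sum(p[1] for p in src) / n
    cx_d = sum(p[0] for p in dst) / n
    cy_d = sum(p[1] for p in dst) / n
    num = den = 0.0
    for (sx, sy), (dx, dy) in zip(src, dst):
        sx -= cx_s; sy -= cy_s          # centre both point sets
        dx -= cx_d; dy -= cy_d
        num += sx * dy - sy * dx        # "cross" term
        den += sx * dx + sy * dy        # "dot" term
    theta = math.atan2(num, den)
    # translation carries the rotated src centroid onto the dst centroid
    tx = cx_d - (cx_s * math.cos(theta) - cy_s * math.sin(theta))
    ty = cy_d - (cx_s * math.sin(theta) + cy_s * math.cos(theta))
    return theta, tx, ty

# a 90-degree rotation about the origin: theta ~ pi/2, translation ~ (0, 0)
theta, tx, ty = fit_rigid_2d([(0, 0), (1, 0), (0, 2)], [(0, 0), (0, 1), (-2, 0)])
```

The residual of such a fit also provides a natural score for accepting or rejecting a candidate segment match under the near-rigidity assumption.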
Related work
The problem of automatically identifying feature points to retrieve underlying articulated movement can be inherently difficult for a number of reasons: (1) the possibility of globally high dimensionality to depict the articulated structure; (2) relaxation of segment rigidity to allow for limited distortion; (3) data corruption due to missing (occluded) and extra (via the process of feature extraction) data; (4) unrestricted and arbitrary poses in freeform movements; (5) requirements of
Framework of the model-based DSHPM algorithm
The generic task under consideration arose from the need to identify feature-point data so as to reconstruct the underlying skeletal structure of freeform articulated motion. We assume the data capture rate is sufficiently high, as demanded in most real-world applications. This allows feature-point trajectories to be obtained across successive frames. However, the identities of the feature points (or trajectories) are not known.
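Under the high capture-rate assumption, inter-frame displacements are small, so trajectories can be formed by linking each point to its nearest neighbour in the next frame. The following sketch (our own illustration with a hypothetical name `track_frame`, not the paper's pre-tracking procedure) shows the idea in its simplest greedy form:

```python
def track_frame(prev_pts, curr_pts, max_move):
    """Greedy nearest-neighbour linking of points between two consecutive
    frames.  With a sufficiently high capture rate, inter-frame
    displacements are small and the links are usually unambiguous;
    a production tracker would resolve ambiguities globally."""
    links, used = {}, set()
    for i, (px, py) in enumerate(prev_pts):
        best_j, best_d = None, max_move
        for j, (cx, cy) in enumerate(curr_pts):
            if j in used:
                continue
            d = ((px - cx) ** 2 + (py - cy) ** 2) ** 0.5
            if d < best_d:
                best_d, best_j = d, j
        if best_j is not None:     # points with no candidate within max_move are dropped
            links[i] = best_j
            used.add(best_j)
    return links                   # maps previous-frame index -> current-frame index

print(track_frame([(0, 0), (5, 5)], [(5.1, 5.0), (0.1, 0.0)], max_move=1.0))
# -> {0: 1, 1: 0}
```

Note the greedy scheme is order-dependent and can fail under occlusion or crossing trajectories, which is why the references on motion correspondence cited above pursue more robust formulations.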
The DSHPM algorithm
Identification is carried out hierarchically segment by segment in a chosen key-frame containing e.g. over 90% of the model points (Section 4.2), or in a key-frame range when necessary, taking advantage of a dynamic scheme (Section 4.3). In order to make temporal coherence of motion cues exploitable and reduce the search space, feature-point pre-tracking and pre-segmentation are carried out prior to segmental identification (Section 4.1).
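The key-frame criterion above — a frame in which, e.g., over 90% of the model points are present — can be sketched as a simple visibility test. This is an illustrative fragment (the function name `choose_key_frame` is ours), assuming each frame is represented by the list of points observed in it:

```python
def choose_key_frame(frames, n_model, threshold=0.9):
    """Return the index of the first frame in which at least `threshold`
    of the model's points are visible, or None if no frame qualifies.
    Each frame is simply the list of points observed in it."""
    for idx, pts in enumerate(frames):
        if len(pts) >= threshold * n_model:
            return idx
    return None

# 10-point model; three frames seeing 7, 9 and 10 points respectively
frames = [list(range(7)), list(range(9)), list(range(10))]
print(choose_key_frame(frames, n_model=10))   # -> 1
```

When no single frame passes the threshold, the dynamic scheme of Section 4.3 falls back to a key-frame range rather than a single frame.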
Experiments
The algorithm has been implemented in Matlab. We tested it on articulated models, such as humans and robot manipulators, with various low densities and distributions of feature points. Human motion, a typical articulated motion whose segments are only nearly rigid, makes the identification task more difficult than robot manipulator motion with rigid segments. To reflect this challenge, we report in this section experimental results on real-world human motion capture and its
Conclusion
The proposed dynamic segment-based hierarchical point matching (DSHPM) algorithm addresses a general and currently open problem in pattern recognition: non-rigid articulated motion reconstruction from low-density feature points. The algorithm has a crucial self-initialisation phase of pose estimation, benefiting from our previous work [35], [37]. In the context of a dynamic sequence, the DSHPM algorithm integrates key-frame-based dynamic hierarchical matching with inter-frame tracking to
Acknowledgements
All model and motion data used in our experiments were obtained by a marker-based optical motion capture system—Vicon-512, installed at the Department of Computer Science, UWA. Some motion trials analysed in this paper were captured for the game project “Dance: UK” in collaboration with Broadsword Interactive Ltd. [39].
References (39)

- et al., A survey of medical image registration, IEEE Eng. Med. Biol. Mag., 1998.
- et al., A survey of computer vision-based human motion capture, Comput. Vision Image Understanding, 2001.
- et al., Recent developments in human motion analysis, Pattern Recognition, 2003.
- Reconstruction of articulated objects from point correspondences in a single uncalibrated image, Comput. Vision Image Understanding, 2000.
- et al., A survey of free-form object representation and recognition techniques, Comput. Vision Image Understanding, 2001.
- et al., Efficient algorithms for robust feature matching, Pattern Recognition, 1999.
- et al., A parametric deformable model to fit unstructured 3D data, Comput. Vision Image Understanding, 1998.
- et al., A new point matching algorithm for non-rigid registration, Comput. Vision Image Understanding, 2003.
- The measurement of human motion: a comparison of commercially available systems, Human Movement Sci., 1999.
- et al., Reconstruction of segmentally articulated structure in freeform movement with low density feature points, Image and Vision Comput., 2004.
- An efficient implementation of Reid's multiple hypothesis tracking algorithm and its evaluation for the purpose of visual tracking, IEEE Trans. Pattern Anal. Mach. Intell.
- A generalized S-D assignment algorithm for multisensor–multitarget state estimation, IEEE Trans. Aerosp. Electron. Syst.
- Resolving motion correspondence for densely moving points, IEEE Trans. Pattern Anal. Mach. Intell.
- Feature point correspondence between consecutive frames based on genetic algorithm, Int. J. Robot. Autom.
About the Author—BAIHUA LI received the B.S. and M.S. degrees in electronic engineering from Tianjin University, China and the Ph.D. degree in computer science from the University of Wales, Aberystwyth in 2003. She is a Lecturer in the Department of Computing and Mathematics, Manchester Metropolitan University, UK. Her current research interests include computer vision, pattern recognition, human motion tracking and recognition, 3D modelling and animation.
About the Author—QINGGANG MENG received the B.S. and M.S. degrees in electronic engineering from Tianjin University, China and the Ph.D. degree in computer science from the University of Wales, Aberystwyth in 2003. He is a Lecturer in the Department of Computer Science, Loughborough University, UK. His research interests include biologically/psychologically inspired robot learning and control, machine vision and service robotics.
About the Author—HORST HOLSTEIN received the B.S. degree in Mathematics from the University of Southampton, UK, in 1963, and the Ph.D. degree in rheology from the University of Wales, Aberystwyth, UK, in 1981. He is a Lecturer in the Department of Computer Science, University of Wales, Aberystwyth, UK. His research interests include motion tracking, computational bioengineering and geophysical gravi-magnetic modelling.