ABSTRACT
Human activity datasets collected under natural conditions are an important source of data. Since these contain multiple activities in unscripted sequence, temporal segmentation of multimodal datasets is an important precursor to recognition and analysis. Manual segmentation is prohibitively time consuming and unsupervised approaches for segmentation are unreliable since they fail to exploit the semantic context of the data. Gathering labels for supervised learning places a large workload on the human user since it is relatively easy to gather a mass of unlabeled data but expensive to annotate. This paper proposes an active learning approach for segmenting large motion capture datasets with both small training sets and working sets. Support Vector Machines (SVMs) are learned using an active learning paradigm; after the classifiers are initialized with a small set of labeled data, the users are iteratively queried for labels as needed. We propose a novel method for initializing the classifiers, based on unsupervised segmentation and clustering of the dataset. By identifying and training the SVM with points from pure clusters, we can improve upon a random sampling strategy for creating the query set. Our active learning approach improves upon the initial unsupervised segmentation used to initialize the classifier, while requiring substantially less data than a fully supervised method; the resulting segmentation is comparable to the latter while requiring significantly less effort from the user.
- O. Arikan, D. Forsyth and J. O'Brien, Motion synthesis from annotations. Proceedings of SIGGRAPH 22(3):402--408, 2003. Google ScholarDigital Library
- J. Barbic, A. Safonova, J. Pan, C. Faloutsos, J. Hodgins and N. Pollard, Segmenting motion capture data into distinct behaviors. Proceedings of Graphics Interface, 62:185--194, 2004. Google ScholarDigital Library
- O. Chapelle, V. Sindhwani and S. Keerthi, Optimization techniques for semi--supervised support vector machines. The Journal of Machine Learning Research 9:203--233, 2008. Google ScholarDigital Library
- S. Canu, Y. Grandvalet, V. Guigue and A. Rakotomamonjy, SVM and Kernel Methods Matlab Toolbox. Perception Systemes et Information, INSA de Rouen, Rouen, France, 2005.Google Scholar
- F. Frade, J. Hodgins, A. Bargteil, X. Artal, J. Macey, A. Castells and J. Beltran, Guide to the CMU Multimodal Activity Databse. Robotics Institute, Carnegie Mellon University, CMU-RI-TR-08-22, Pittsburgh PA, 2008.Google Scholar
- V. Sindhwani and P. Niyogi and M. Belkin, Beyond the point cloud: frame transductive to semi-supervised learning, Proceedings of the International Conference on Machine Learning (ICML), 2005. Google ScholarDigital Library
- S. Tong and D. Koller, Support vector machine active learning with applications to text classification, The Journal of Machine Learning Research 245--66, 2002. Google ScholarDigital Library
- F. Zhou and F. Frade and J. Hodgins, Aligned cluster analysis for temporal segmentation of human motion, Proceedings of the IEEE Conference on Automatic Face and Gesture Recognition, 2008.Google ScholarCross Ref
Index Terms
- An active learning approach for segmenting human activity datasets
Recommendations
Combining active learning and semi-supervised for improving learning performance
ISABEL '11: Proceedings of the 4th International Symposium on Applied Sciences in Biomedical and Communication TechnologiesIn many learning tasks, there are abundant unlabeled samples but the number of labeled training samples is limited, because labeling the samples requires the efforts of human annotators and expertise. There are three major techniques for labeling the ...
Consistency-Based Semi-supervised Active Learning: Towards Minimizing Labeling Cost
Computer Vision – ECCV 2020AbstractActive learning (AL) combines data labeling and model training to minimize the labeling cost by prioritizing the selection of high value data that can best improve model performance. In pool-based active learning, accessible unlabeled data are not ...
Semi-supervised Dictionary Active Learning for Pattern Classification
Pattern Recognition and Computer VisionAbstractGathering labeled data is one of the most time-consuming and expensive tasks in supervised machine learning. In practical applications, there are usually quite limited labeled training samples but abundant unlabeled data that is easy to collect. ...
Comments