research-article

Learning reduced-order feedback policies for motion skills

Authors:
Kai Ding

University of British Columbia

University of British Columbia
View Profile

,
Libin Liu

Microsoft Research Asia

Microsoft Research Asia
View Profile

,
Michiel van de Panne

University of British Columbia

University of British Columbia
View Profile

,
KangKang Yin

National University of Singapore

National University of Singapore
View Profile

SCA '15: Proceedings of the 14th ACM SIGGRAPH / Eurographics Symposium on Computer AnimationAugust 2015Pages 83–92https://doi.org/10.1145/2786784.2786802

Published:07 August 2015Publication History

SCA '15: Proceedings of the 14th ACM SIGGRAPH / Eurographics Symposium on Computer Animation

Pages 83–92

ABSTRACT

We introduce a method for learning low-dimensional linear feedback strategies for the control of physics-based animated characters around a given reference trajectory. This allows for learned low-dimensional state abstractions and action abstractions, thereby reducing the need to rely on manually designed abstractions such as the center-of-mass state or foot-placement actions. Once learned, the compact feedback structure allow simulated characters to respond to changes in the environment and changes in goals. The approach is based on policy search in the space of reduced-order linear output feedback matrices. We show that these can be used to replace or further reduce manually-designed state and action abstractions. The approach is sufficiently general to allow for the development of unconventional feedback loops, such as feedback based on ground reaction forces. Results are demonstrated for a mix of 2D and 3D systems, including tilting-platform balancing, walking, running, rolling, targeted kicks, and several types of ball-hitting tasks.

Supplemental Material

Available for Download

zip

p83-ding.zip (74.6 MB)

References

Barbič, J., da Silva, M., and Popović, J. 2009. Deformable object animation using reduced optimal control. In ACM SIGGRAPH 2009 papers, ACM, SIGGRAPH '09, 53:1--53:9. Google ScholarDigital Library
Berniker, M., Jarc, A., Bizzi, E., and Tresch, M. C. 2009. Simplified and effective motor control based on muscle synergies to exploit musculoskeletal dynamics. Proceedings of the National Academy of Sciences 106, 18, 7601--7606.Google ScholarCross Ref
Box2D., http://box2d.org/.Google Scholar
Burke, J., Lewis, A., and Overton, M. 2003. A nonsmooth, nonconvex optimization approach to robust stabilization by static output feedback and low-order controllers. In Proc. ROCOND 2003.Google Scholar
Burns, J. A., and King, B. B. 1998. A reduced basis approach to the design of low-order feedback controllers for nonlinear continuous systems. Journal of Vibration and Control 4, 3, 297--323.Google ScholarCross Ref
Coros, S., Beaudoin, P., and van de Panne, M. 2010. Generalized biped walking control. ACM Transctions on Graphics 29, 4, Article 130. Google ScholarDigital Library
da Silva, M., Abe, Y., and Popović, J. 2008. Interactive simulation of stylized human locomotion. ACM Trans. Graph. 27 (August), 82:1--82:10. Google ScholarDigital Library
David, J., and De Moor, B. 1994. Designing reduced order output feedback controllers using a potential reduction method. American Control Conference, 1994 1, 845--849.Google ScholarCross Ref
de Lasa, M., Mordatch, I., and Hertzmann, A. 2010. Feature-Based Locomotion Controllers. ACM Transactions on Graphics 29, 3. Google ScholarDigital Library
de Oliveira, M., and Geromel, J. 1997. Numerical comparison of output feedback design methods. In American Control Conference, 1997. Proceedings of the 1997, vol. 1, 72--76 vol.1.Google Scholar
Hansen, N. 2006. The CMA evolution strategy: a comparing review. In Towards a new evolutionary computation. Advances on estimation of distribution algorithms, J. Lozano, P. Larranaga, I. Inza, and E. Bengoetxea, Eds. Springer, 75--102.Google Scholar
Jain, S., and Liu, C. K. 2011. Modal-space control for articulated characters. ACM Trans. Graph. 30 (October), 118:1--118:12. Google ScholarDigital Library
James, D. L., and Pai, D. K. 2002. Dyrt: dynamic response textures for real time deformation simulation with graphics hardware. ACM Trans. Graph. 21 (July), 582--585. Google ScholarDigital Library
Kry, P. G., Reveret, L., Faure, F., and Cani, M.-P. 2009. Modal locomotion: Animating virtual characters with natural vibrations. Computer Grahics Forum 28, 2.Google ScholarCross Ref
Kwon, T., and Hodgins, J. K. 2010. Control systems for human running using an inverted pendulum model and a reference motion capture sequence. The ACM SIGGRAPH / Eurographics Symposium on Computer Animation (SCA 2010). Google ScholarDigital Library
Lall, S., Marsden, J. E., and Glavaški, S. 2002. A subspace approach to balanced truncation for model reduction of nonlinear control systems. International journal of robust and nonlinear control 12, 6, 519--535.Google Scholar
Laszlo, J., van de Panne, M., and Eugene, F. 1996. Limit cycle control and its application to the animation of balancing and walking. In SIGGRAPH '96, ACM, 155--162. Google ScholarDigital Library
Lee, Y., Kim, S., and Lee, J. 2010. Data-driven biped control. ACM Trans. Graph. 29 (July), 129:1--129:8. Google ScholarDigital Library
Lewis, F., and Syrmos, V. 1995. Optimal control. A Wiley-Interscience publication. J. Wiley.Google Scholar
Liu, L., Yin, K., van de Panne, M., Shao, T., and Xu, W. 2010. Sampling-based contact-rich motion control. ACM Transctions on Graphics 29, 4, Article 128. Google ScholarDigital Library
Liu, L., Yin, K., van de Panne, M., and Guo, B. 2012. Terrain runner: control, parameterization, composition, and planning for highly dynamic motions. ACM Trans. Graph. 31, 6, 154. Google ScholarDigital Library
Liu, L., Yin, K., Wang, B., and Guo, B. 2013. Simulation and control of skeleton-driven soft body characters. ACM Transactions on Graphics (TOG) 32, 6, 215. Google ScholarDigital Library
Macchietto, A., Zordan, V., and Shelton, C. R. 2009. Momentum control for balance. ACM Trans. Graph. 28 (July), 80:1--80:8. Google ScholarDigital Library
Mordatch, I., de Lasa, M., and Hertzmann, A. 2010. Robust Physics-Based Locomotion Using Low-Dimensional Planning. ACM Transactions on Graphics 29, 3. Google ScholarDigital Library
Muico, U., Lee, Y., Popović, J., and Popović, Z. 2009. Contact-aware nonlinear control of dynamic characters. ACM Transactions on Graphics 28, 3. Google ScholarDigital Library
ODE. Open dynamics engine, http://www.ode.org/.Google Scholar
Raibert, M. H., and Hodgins, J. K. 1991. Animation of dynamic legged locomotion. SIGGRAPH Comput. Graph. 25 (July), 349--358. Google ScholarDigital Library
Safonova, A., Hodgins, J. K., and Pollard, N. S. 2004. Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces. ACM Trans. Graph. 23 (Aug.), 514--521. Google ScholarDigital Library
Scherer, C., Gahinet, P., and Chilali, M. 1997. Multiobjective output-feedback control via lmi optimization. Automatic Control, IEEE Transactions on 42, 7, 896--911.Google ScholarCross Ref
Slotine, J.-J. E., Li, W., et al. 1991. Applied nonlinear control, vol. 60. Prentice-Hall Englewood Cliffs, NJ.Google Scholar
Sok, K. W., Kim, M., and Lee, J. 2007. Simulating biped behaviors from human motion data. ACM Trans. Graph. 26 (July). Google ScholarDigital Library
Tan, J., Gu, Y., Liu, C. K., and Turk, G. 2014. Learning bicycle stunts. ACM Transactions on Graphics (TOG) 33, 4, 50. Google ScholarDigital Library
Treuille, A., Lewis, A., and Popović, Z. 2006. Model reduction for real-time fluids. ACM Transactions on Graphics 25, 3 (July), 826--834. Google ScholarDigital Library
Tsai, Y.-Y., Lin, W.-C., Cheng, K. B., Lee, J., and Lee, T.-Y. 2010. Real-time physics-based 3d biped character animation using an inverted pendulum model. IEEE Transactions on Visualization and Computer Graphics 16 (March), 325--337. Google ScholarDigital Library
Wang, J. M., Fleet, D. J., and Hertzmann, A. 2009. Optimizing walking controllers. ACM Trans. Graph. 28 (December), 168:1--168:8. Google ScholarDigital Library
Wang, J. M., Fleet, D. J., and Hertzmann, A. 2010. Optimizing walking controllers for uncertain inputs and environments. ACM Trans. Graph. 29 (July), 73:1--73:8. Google ScholarDigital Library
Ye, Y., and Liu, C. K. 2010. Optimal feedback control for character animation using an abstract model. ACM Trans. Graph. 29 (July), 74:1--74:9. Google ScholarDigital Library
Yin, K., Loken, K., and van de Panne, M. 2007. Simbicon: Simple biped locomotion control. ACM Trans. Graph. 26, 3, Article 105. Google ScholarDigital Library

Index Terms

Learning reduced-order feedback policies for motion skills
1. Computing methodologies
  1. Computer graphics
    1. Animation

Recommendations

Reduced-order observer-based output feedback control of nonlinear time-delay systems with prescribed performance

This paper studies the problem of output feedback control for a class of nonlinear time-delay systems with prescribed performance. The system is in the form of triangular structure with unmodelled dynamics. First, we introduce a reduced-order observer ...
Read More
Partial-state feedback multivariable MRAC and reduced-order designs
Abstract
This paper develops a new model reference adaptive control (MRAC) framework using partial-state feedback for solving a multivariable adaptive output tracking problem. The developed MRAC scheme has full capability to deal with plant ...
Read More
Real-time motion retargeting to highly varied user-created morphologies

Character animation in video games---whether manually keyframed or motion captured---has traditionally relied on codifying skeletons early in a game's development, and creating animations rigidly tied to these fixed skeleton morphologies. This paper ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SCA '15: Proceedings of the 14th ACM SIGGRAPH / Eurographics Symposium on Computer Animation
August 2015
193 pages
ISBN:9781450334969
DOI:10.1145/2786784
Conference Chairs:
Jernej Barbič
University of Southern California
,
Zhigang Deng
University of Houston
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 7 August 2015
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Badges
- Best Paper
Author Tags
character animation
control
human simulation
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate183of487submissions,38%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 19
  Total Citations
  View Citations
- 266
  Total Downloads
- Downloads (Last 12 months)23
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Learning reduced-order feedback policies for motion skills

SCA '15: Proceedings of the 14th ACM SIGGRAPH / Eurographics Symposium on Computer Animation

ABSTRACT

Supplemental Material

Available for Download

References

Cited By

Index Terms

Recommendations

Reduced-order observer-based output feedback control of nonlinear time-delay systems with prescribed performance

Partial-state feedback multivariable MRAC and reduced-order designs

Real-time motion retargeting to highly varied user-created morphologies