Abstract
It is widely recognised that compliant actuation is advantageous to robot control once dynamic tasks are considered. However, the benefit of intrinsic compliance comes with high control complexity. Specifically, coordinating the motion of a system through a compliant actuator and finding a task-specific impedance profile that leads to better performance is known to be non-trivial. Here, we propose an optimal control formulation to compute the motor position commands, and the associated time-varying torque and stiffness profiles. To demonstrate the utility of the approach, we consider an “explosive” ball-throwing task where exploitation of the intrinsic dynamics of the compliantly actuated system leads to improved task performance (i.e., distance thrown). In this example we show that: (i) the proposed control methodology is able to tailor impedance strategies to specific task objectives and system dynamics, (ii) the ability to vary stiffness can be exploited to achieve better performance, (iii) in systems with variable physical compliance, the present formulation enables exploitation of the energy storage capabilities of the actuators to improve task performance. We illustrate these in numerical simulations, and in hardware experiments on a two-link variable stiffness robot.
Similar content being viewed by others
Notes
This is because on compliant actuators the joint torques do not correspond to the motor torques directly.
Where .
While in the present paper, the actuator torque is assumed to be position dependent (as is the case in the majority of VS actuators), the formulation remains valid for cases where the torque is velocity dependent (e.g., due to viscoelastic forces).
This is indeed the case in many athletic disciplines (e.g., shot put, discus throw, hammer throw, javelin throw, high jump) executed through explosive actions.
If the control inputs are saturated (i.e., restricted with “hard constraints” (7)), the corresponding feedback gains may be set to zero, as suggested in Li and Todorov (2007). Alternatively, one could utilise penalty terms to embed the inequality constraints in the objective function (8), see Stengel (1994).
This control law is only optimal in the neighbourhood of the optimal open-loop motion generated by: u=u ∗(t).
P q ∈ℝm×n, D q ∈ℝm×n, P θ ∈ℝm×m and D θ ∈ℝm×m.
Note that the feedback correction on u ∗ is a PD-control performed with optimal position and velocity gains.
Mechanically Adjustable Compliance and Controllable Equilibrium Position Actuators.
In the present case the stiffness matrix has diagonal elements only. Off-diagonal elements in the stiffness matrix appear if the configuration change on one joint induces torque change on an another joint (e.g., see a human arm model of Mussa-Ivaldi et al. (1985) that incorporates bi-articular muscles).
As an example, human peak performance as characterised by the rotation speed of the shoulder during a baseball pitch of a professional pitcher, is between 6900–9800∘/s (Herman 2007). This kind of high-performance task execution is not in the scope of present robotic systems.
There is a transient at the start of the movement as the pre-tensioning motors move to the optimal (but fixed) commanded positions.
The potential energy stored by the linear springs in the present actuators is computed as: where F is the spring force while K s =diag(κ 1,κ 2) is the matrix of the spring stiffness constants.
During antagonistic co-activation in humans, muscles do no mechanical work but consume metabolic energy.
References
Alexander, R. M., & Bennet-Clark, H. C. (1977). Storage of elastic strain energy in muscle and other tissues. Nature, 265, 114–117.
Anderson, F. C., & Pandy, M. G. (2001). Dynamic optimization of human walking. Journal of Biomechanical Engineering, 123, 381–390.
Anderson, R., & Spong, M. (1988). Hybrid impedance control of robotic manipulators. IEEE Journal of Robotics and Automation, 4(5), 549–556.
Bellman, R. (1957). Dynamic programming. Princeton: Princeton University Press.
Betts, J. T. (1998). Survey of numerical methods for trajectory optimization. AIAA Journal of Guidance, Control and Dynamics, 21(2), 193–207.
Bicchi, A., & Tonietti, G. (2004). Fast and soft arm tactics: dealing with the safety-performance trade-off in robot arms design and control. IEEE Robotics and Automation Magazine, 11, 22–33.
Bingham, G. P. (1988). Task-specific devices and the perceptual bottleneck. Journal of Human Movement Science, 7, 255–264.
Bobrow, J. E., Dubowsky, S., & Gibson, J. S. (1985). Time-optimal control of robotic manipulators along specified paths. International Journal of Robotics Research, 4(3), 3–17.
Braun, D. J., Howard, M., & Vijayakumar, S. (2011). Exploiting variable stiffness in explosive movement tasks. In Proceedings of robotics: science and systems, Los Angeles, CA, USA.
Bryson, A. E., & Ho, Y. C. (1975). Applied optimal control. Washington: Hemisphere/Wiley.
Burdet, E., Osu, R., Franklin, D. W., Milner, T. E., & Kawato, M. (2001). The central nervous system stabilizes unstable dynamics by learning optimal impedance. Nature, 414, 446–449.
Cho, A. (2004). To throw farther, waste energy. Science, 306(5693), 42–43.
Chowdhary, A., & Challis, J. (1999). Timing accuracy in human throwing. Journal of Theoretical Biology, 201(4), 219–229.
Collins, J. J. (1995). The redundant nature of locomotor optimization laws. Journal of Biomechanics, 28(3), 251–267.
English, C. E. (1999a). Implementation of variable joint stiffness through antagonistic actuation using rolamite springs. Mechanism and Machine Theory, 341, 27–40.
English, C. E. (1999b). Mechanics and stiffness limitations of a variable stiffness actuator for use in prosthetic limbs. Mechanism and Machine Theory, 341, 7–25.
Flash, T., & Hogan, N. (1985). The coordination of arm movements: an experimentally confirmed mathematical model. Journal of Neuroscience, 5, 1688–1703.
Garabini, M., Passaglia, A., Belo, F. A. W., Salaris, P., & Bicchi, A. (2011). Optimality principles in variable stiffness control: the VSA hammer. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, San Francisco, USA.
Haddadin, S., Weis, M., Wolf, S., & Albu-Schäffer, A. (2011). Optimal control for maximizing link velocity of robotic variable stiffness joints. In Proceedings of the 18th IFAC world congress, Part 1 (Vol. 18).
Ham, R. V., Vanderborght, B., Damme, M. V., Verrelst, B., & Lefeber, D. (2007). MACCEPA, the mechanically adjustable compliance and controllable equilibrium position actuator: design and implementation in a biped robot. Robotics and Autonomous Systems, 55(10), 761–768.
Herman, I. P. (2007). Physics of the human body. Berlin: Springer.
Hill, A. V. (1938). The heat of shortening and the dynamic constants of muscle. Proceedings of the Royal Society B, 126, 136–195.
Hogan, N. (1984). Adaptive control of mechanical impedance by coactivation of antagonist muscles. IEEE Transactions on Automatic Control, AC-29(8), 681–690.
Hogan, N. (1985). Impedance control: an approach to manipulation. ASME Journal of Dynamic Systems, Measurement and Control, 107, 1–24.
Hurst, J. W., Chestnutt, J., & Rizzi, A. A. (2010). The actuator with mechanically adjustable series compliance. IEEE Transactions on Robotics, 26(4), 597–606.
Ikeura, R., Moriguchi, T., & Mizutani, K. (2002). Optimal variable impedance control for a robot and its application to lifting an object with a human. In Proceedings of the IEEE international workshop on robot and human interactive communication.
Jacobson, D. H., & Mayne, D. Q. (1970). Differential dynamic programming. New York: Elsevier.
Johansson, R., & Spong, M. (1994). Quadratic optimisation of impedance control. In Proceedings of the IEEE international conference on robotics and automation, San Diego, CA, USA (pp. 616–621).
Jöris, H. J. J., van Muyen, A. J. E., van Ingen Schenau, H. C. G., & Kemper, G. J. (1985) Force, velocity and energy flow during the overarm throw in female handball players. Journal of Biomechanics, 18(6), 409–414.
Kim, B. S., & Song, J. B. (2010). Hybrid dual actuator unit: A design of a variable stiffness actuator based on an adjustable moment arm mechanism. In Proceedings of the IEEE international conference on robotics and automation, Anchorage, Alaska, USA (pp. 1655–1660).
Kirk, D. E. (1970). Optimal control theory: an introduction. New York: Prentice-Hall.
Koganezawa, K., Watanabe, Y., & Shimizu, N. (1999). Antagonistic muscle-like actuator and its application to multi-dof forearm prosthesis. Advanced Robotics, 12(7–8), 771–789.
Komi, P. V. (1992). Stretch-shortening cycle. The encyclopaedia of sports medicine. In Strength and power in sport, Oxford: Blackwell Scientific.
Lagoudakis, M. G., & Parr, R. (2003). Least-squares policy iteration. Journal of Machine Learning Research, 4, 1107–1149.
Laurin-Kovitz, K. F., Colgate, J. E., & Carnes, S. D. R. (1991). Design of components for programmable passive impedance. In Proceedings of the IEEE international conference on robotics and automation (Vol. 2, pp. 1476–1481).
Li, W., & Todorov, E. (2004). Iterative linear-quadratic regulator design for nonlinear biological movement systems. In Proceedings of the 1st international conference on informatics in control, automation and robotics (Vol. 1, pp. 222–229).
Li, W., & Todorov, E. (2007). Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system. International Journal of Control, 80(9), 1439–1453.
Matinfar, M., & Hashtrudi-Zaad, K. (2005). Optimisation-based robot compliance control: Geometric and linear quadratic approaches. International Journal of Robotics Research, 24(8), 645–656.
Mettin, U., Shiriaev, A. S., Freidovich, B., & Sampei, M. (2010). Optimal ball pitching with an underactuated model of a human arm. In Proceedings of the IEEE international conference on robotics and automation, Anchorage, Alaska, USA (pp. 5009–5014).
Migliore, S. A., Brown, E. A., & DeWeerth, S. P. (2007). Novel nonlinear elastic actuators for passively controlling robotic joint compliance. Journal of Mechanical Design, 129(4), 406–412.
Mitrovic, D., Klanke, S., & Vijayakumar, S. (2010). From motor learning to interaction learning in robots: adaptive optimal feedback control with learned internal dynamics models. SCI (Vol. 264). Berlin: Springer.
Mitrovic, D., Klanke, S., & Vijayakumar, S. (2011). Learning impedance control of antagonistic systems based on stochastic optimization principles. International Journal of Robotics Research, 30(2), 1–18.
Morita, T., & Sugano, S. (1995). Design and development of a new robot joint using a mechanical impedance adjuster. In Proceedings of the IEEE international conference on robotics and automation, Nagoya, Japan (Vol. 3, pp. 2469–2475).
Mussa-Ivaldi, F. A., Hogan, N., & Bizzi, E. (1985). Neural, mechanical, and geometric factors subserving arm posture in humans. Journal of Neuroscience, 5, 2732–2743.
Nelson, W. L. (1983). Physical principles for economies of skilled movements. Biological Cybernetics, 46(2), 135–147.
Newton, R. U., Kraemer, W. J., Hakkinen, K., Humphries, B. J., & Murphy, A. J. (1996). Kinematics, kinetics and muscle activation during explosive upper body movements. Journal of Applied Biomechanics, 12, 31–43.
Paluska, D., & Herr, H. (2006). The effect of series elasticity on actuator power and work output: implications for robotic and prosthetic joint design. Robotics & Autonomous Systems, 54, 667–673.
Pandy, M., Zajac, F., Sim, E., & Levine, W. (1990). An optimal control model for maximum-height human jumping. Journal of Biomechanical Engineering, 23, 1185–1198.
Pandy, M., Garner, B., & Anderson, F. (1995). Optimal control of non-ballistic muscular movements: a constraint-based performance criterion for rising from a chair. Journal of Biomechanical Engineering, 117, 15–26.
Peters, J., & Schaal, S. (2006). Policy gradient methods for robotics. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Beijing, China (pp. 2219–2225).
Pontryagin, L. S., Boltyanskii, V. G., Gamkrelidze, R. V., & Mishchenko, E. F. (1962). The mathematical theory of optimal processes. New York: Wiley.
Putnam, C. (1993). Sequential motions of body segments in striking and throwing skills: descriptions and explanations. Journal of Biomechanics, 26(1), 125–135.
Schenau, G., Bobbert, M. F. & de Haan, A. (1997). Mechanics and energetics of the strech-shortening cycle: a stimulating discussion. Journal of Applied Biomechanics, 13, 484–496.
Shen, X., & Goldfarb, M. (2007). Simultaneous force and stiffness control of a pneumatic actuator. Journal of Dynamic Systems, Measurement, and Control, 129(4), 425–434.
Shoji, T., Nakaura, S., & Sampei, M. (2010). Throwing motion control of the springed pendubot via unstable zero dynamics. In Proceedings of the IEEE international conference on control applications: multi-conference on systems and control, Yokohama, Japan (pp. 1602–1607).
Siciliano, B., & Khatib, O. (2008). Handbook of robotics. Berlin: Springer.
van Soest, A. J., & Bobbert, M. F. (1993). The contribution of muscle properties in the control of explosive movements. Biological Cybernetics, 69, 195–204.
Stengel, R. F. (1994). Optimal control and estimation. New York: Dover.
Todorov, E. (2004). Optimality principles in sensorimotor control. Nature Neuroscience, 7(9), 907–915.
Tonietti, G., Schiavi, R., & Bicchi, A. (2005). Design and control of a variable stiffness actuator for safe and fast physical human/robot interaction. In Proceedings of the IEEE international conference on robotics and automation, Barcelona, Spain (pp. 526–531).
Uemura, M., & Kawamura, S. (2009). Resonance-based motion control method for multi-joint robot through combining stiffness adaptation and iterative learning control. In Proceedings of the IEEE international conference on robotics and automation, Kobe, Japan (pp. 1543–1548).
Uno, Y., Kawato, M., & Suzuki, R. (1989). Formation and control of optimal trajectories in human multijoint arm movements: minimum torque-change model. Biological Cybernetics, 61, 89–101.
Vanderborght, B., Verrelst, B. Ham, R. V., Damme, M. V., Lefeber, D., Duran, B. M. Y., & Beyl, P. (2006). Exploiting natural dynamics to reduce energy consumption by controlling the compliance of soft actuators. International Journal of Robotics Research, 25(4), 343–358.
Verrelst, B., Ham, V., Vanderborght, B., Vermeulen, J., Lefeber, D., & Daerden, F. (2005). Exploiting adaptable passive behaviour to influence natural dynamics applied to legged robots. Robotica, 23(2), 149–158.
Wilson, A. M., Watson, J. C., & Lichtwark, G. A. (2003). A catapult action for rapid limb protraction. Nature, 421, 35–36.
Winters, J. M., & Stark, L. (1985). Analysis of fundamental human movement patterns through the use of in-depth antagonistic muscle models. IEEE Transactions on Biomedical Engineering, 32, 826–839.
Wolf, S., & Hirzinger, G. (2008). A new variable stiffness design: matching requirements of the next robot generation. In Proceedings of the IEEE international conference on robotics and automation, Pasadena, CA, USA (pp. 1741–1746).
Zinn, M., Khatib, O., Roth, B., & Salisbury, J. (2004). Playing it safe. IEEE Robotics & Automation Magazine, 11(2), 12–21.
Acknowledgements
This work was funded by the EU Seventh Framework Programme (FP7) as part of the STIFF project. The authors gratefully acknowledge this support. We would like to thank Alexander Enoch for his work on the hardware design and Andrius Sutas for his contribution to the control interface. In addition, we thank Dr. Jun Nakanishi and Dr. Takeshi Mori for fruitful discussions regarding this work.
Author information
Authors and Affiliations
Corresponding author
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
(MPG 39.9 MB)
Rights and permissions
About this article
Cite this article
Braun, D., Howard, M. & Vijayakumar, S. Optimal variable stiffness control: formulation and application to explosive movement tasks. Auton Robot 33, 237–253 (2012). https://doi.org/10.1007/s10514-012-9302-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10514-012-9302-3