Towards a learnt neural body schema for dexterous coordination of action in humanoid and industrial robots

Bhat, Ajaz Ahmad; Akkaladevi, Sharath Chandra; Mohan, Vishwanathan; Eitzinger, Christian; Morasso, Pietro

doi:10.1007/s10514-016-9563-3

Published: 04 April 2016

Volume 41, pages 945–966, (2017)
Autonomous Robots

Ajaz Ahmad Bhat ORCID: orcid.org/0000-0002-6992-8224¹,
Sharath Chandra Akkaladevi²,
Vishwanathan Mohan¹,
Christian Eitzinger² &
…
Pietro Morasso¹

Abstract

During any goal-oriented behavior, the dual processes of generating dexterous actions and anticipating the consequences of potential actions must seamlessly alternate. This article presents a unified neural framework for the generation and forward simulation of goal-directed actions and validates the architecture through diverse experiments on humanoid and industrial robots. The basic idea is that actions are consequences of a simulation process that animates the internal model of the body (namely the body schema) in the context of intended goals/constraints. Specific focus is on (a) Learning: how the internal model of the body can be acquired by any robotic embodiment and extended to coordinated tools; (b) Configurability: how diverse forward/inverse models of action can be ‘composed’ at runtime by coupling/decoupling different body (body $+$ tool) chains with task-relevant goals and constraints represented as multi-referential force fields; and (c) Computational simplicity: how both the synthesis of motor commands to coordinate highly redundant systems and the ensuing forward simulations are realized through well-posed computations without kinematic inversions. The performance of the neural architecture is demonstrated through a range of motor tasks on the 53-DoF iCub robot and two industrial robots performing real-world assembly, with emphasis on dexterity, accuracy, speed, obstacle avoidance, multiple task-specific constraints and task-based configurability. The relevance of the proposed framework is also discussed in the context of other ideas in motor control, such as the Equilibrium Point Hypothesis, Optimal Control and Active Inference, and of emerging studies from neuroscience.

Notes

1. The difference between body image and body schema is disputed and somewhat fuzzy. For our purpose we assume that they are two sides of the same coin: the former stresses the static component, mainly based on proprioceptive information, whereas the latter is related to the dynamic synergy formation function.
2. The condition for a bounded acceleration, $\partial ^{2}\xi /\partial t^{2}=\beta \gamma ^{2}(\xi (1-\xi ))^{2\beta -1}(1-2\xi )$, at the equilibrium points is $0.5<\beta <1$. The jerk of $\xi (t)$, $\partial ^{3}\xi /\partial t^{3}=\beta \gamma ^{3}(\xi (1-\xi ))^{3\beta -2}\{(2\beta -1)(1-2\xi )^{2}-2\xi (1-\xi )\}$, imposes the additional restriction $2/3< \beta <1$ for bounded jerk.
3. Non-Lipschitzian systems have point attractors of infinite stability, in the sense that the gradient of their Lyapunov function diverges at the equilibrium point; a consequence is that they reach equilibrium in finite time (the equilibrium is a terminal attractor). Here $\partial \dot{\xi }/\partial \xi =\beta \gamma (\xi (1-\xi ))^{\beta -1}(1-2\xi )$ and, as $\beta <1$, the magnitude of this expression tends to $\infty $ at the equilibrium points.
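As a sketch of where the two bounds on $\beta $ come from, the higher derivatives can be obtained from the TBG equation $\dot{\xi }=\gamma (\xi (1-\xi ))^{\beta }$ of the Appendix by the chain rule, since $\ddot{\xi }=(\partial \dot{\xi }/\partial \xi )\,\dot{\xi }$ and likewise for the jerk. This working is added here for illustration and is not quoted from the article:

$$\begin{aligned} \frac{\partial \dot{\xi }}{\partial \xi }&=\beta \gamma (\xi (1-\xi ))^{\beta -1}(1-2\xi )\\ \ddot{\xi }&=\frac{\partial \dot{\xi }}{\partial \xi }\,\dot{\xi }=\beta \gamma ^{2}(\xi (1-\xi ))^{2\beta -1}(1-2\xi )\\ \dddot{\xi }&=\frac{\partial \ddot{\xi }}{\partial \xi }\,\dot{\xi }=\beta \gamma ^{3}(\xi (1-\xi ))^{3\beta -2}\{(2\beta -1)(1-2\xi )^{2}-2\xi (1-\xi )\} \end{aligned}$$

Each expression carries a power of $\xi (1-\xi )$, which vanishes at both equilibrium points. A negative exponent therefore makes the term diverge there: the acceleration stays bounded when $2\beta -1$ is not negative, and the jerk when $3\beta -2$ is not negative, which is the origin of the thresholds $1/2$ and $2/3$.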

References

Arimoto, S., et al. (2005). Natural resolution of ill-posedness of inverse kinematics for redundant robots: A challenge to Bernstein’s degrees-of-freedom problem. Advanced Robotics, 19(4), 401–434.
Asatryan, D. G., & Feldman, A. G. (1965). Functional tuning of the nervous system with control of movements or maintenance of a steady posture. Biophysics, 10, 925–935.
Baillieul, J., & Martin, D. P. (1990). Resolution of kinematic redundancy. Proceedings of Symposia in Applied Mathematics, 41, 49–89.
Balestrino, A., De Maria, G., & Sciavicco, L. (1984). Robust control of robotic manipulators. In Proceedings of the 9th IFAC world congress (Vol. 5, pp. 2435–2440).
Bekey, G., & Goldberg, K. Y. (Eds.). (2012). Neural networks in robotics (Vol. 202). Berlin: Springer.
Bernstein, N. (1935). The problem of the interrelationships between coordination and localization. Retrieved November 13th, 2015 from http://www.cns.nyu.edu/~bijan/courses/sm10/Readings/Glimcher/Problem%20of%20the%20Interrelation%20of%20Coor%20and%20Local%20-%20PGArt.pdf.
Bernstein, N. (1967). The coordination and regulation of movements. Oxford: Pergamon Press.
Bhat, A. A., & Mohan, V. (2015). How iCub learns to imitate use of a tool quickly by recycling the past knowledge learnt during drawing. In Biomimetic and biohybrid systems (pp. 339–347). Berlin: Springer.
Bizzi, E., & Polit, A. (1978). Processes controlling arm movements in monkeys. Science, 201, 1235–1237.
Bryson, E. (1999). Dynamic optimization. Menlo Park, CA: Addison Wesley Longman.
Buss, S. R., & Kim, J.-S. (2005). Selectively damped least squares for inverse kinematics. Journal of Graphics Tools, 10(3), 37–49.
Cai, H., Werner, T., & Matas, J. (2013). Fast detection of multiple textureless 3-D objects. In Computer vision systems (pp. 103–112). Berlin: Springer.
DARWIN D9.4. (2014). Deliverable D9.4: Third year demonstrators and evaluation report. EC FP7 project DARWIN Grant No. 270138. Retrieved November 10th, 2015 from http://darwin-project.eu/wp-content/uploads/2010/07/D94_Y3_Demonstrators_Evaluation_v3.0.pdf.
DARWIN D9.5. (2015). Deliverable D9.5: Industrial assembly demonstrator and final evaluation. EC FP7 project DARWIN Grant No. 270138. Retrieved November 10th, 2015 from http://darwin-project.eu/wp-content/uploads/2010/07/D95_Y4_Demonstrators_Evaluation.pdf.
De Luca, A., & Oriolo, G. (1991). Issues in acceleration resolution of robot redundancy. In Third IFAC symposium on robot control (pp. 93–98).
De Luca, A., Oriolo, G., & Siciliano, B. (1992). Robot redundancy resolution at the acceleration level. Laboratory Robotics and Automation, 4, 97–106.
Featherstone, R. (1987). Robot Dynamics Algorithms. Dordrecht: Kluwer.
Featherstone, R., & Khatib, O. (1997). Load independence of the dynamically consistent inverse of the Jacobian matrix. International Journal of Robotics Research, 16(2), 168–170.
Flash, T., & Hogan, N. (1985). The coordination of arm movements: an experimentally confirmed mathematical model. Journal of Neuroscience, 5, 1688–1703.
Frey, S. H., & Gerry, V. E. (2006). Modulation of neural activity during observational learning of actions and their sequential orders. Journal of Neuroscience, 26, 13194–13201.
Friston, K. (2010). The free-energy principle: A unified brain theory? Nature Reviews Neuroscience, 11, 127–138.
Friston, K. (2011). What is optimal about motor control? Neuron, 72(3), 488–498.
Gallese, V., & Lakoff, G. (2005). The brain’s concepts: The role of the sensory-motor system in reason and language. Cognitive Neuropsychology, 22(3), 455–479.
Gallese, V., & Sinigaglia, C. (2011). What is so special about Embodied Simulation. Trends in Cognitive Sciences, 15(11), 512–519.
Grafton, S. T. (2009). Embodied cognition and the simulation of action to understand others. Annals of the New York Academy of Sciences, 1156, 97–117.
Graziano, M. S. A., & Botvinick, M. M. (2002). How the brain represents the body: Insights from neurophysiology and psychology. In W. Prinz & B. Hommel (Eds.), Common mechanisms in perception and action: Attention and performance (pp. 136–157). Oxford: Oxford University Press.
Guigon, E. (2011). Models and architectures for motor control: Simple or complex? In F. Danion & M. L. Latash (Eds.), Motor control (pp. 478–502). Oxford: Oxford University Press.
Haggard, P., & Wolpert, D. M. (2005). Disorders of body schema. In H. J. Freund, M. Jeannerod, M. Hallett, & R. Leiguarda (Eds.), Higher-order motor disorders: From neuroanatomy and neurobiology to clinical neurology (pp. 261–271). Oxford: Oxford University Press.
Head, H., & Holmes, G. (1911). Sensory disturbances in cerebral lesions. Brain, 34, 102–254.
Hollerbach, J. M., & Suh, K. C. (1987). Redundancy resolution of manipulators through torque optimization. IEEE Journal of Robotics and Automation, 3(4), 308–316.
Hsu, P., Hauser, J., & Sastry, S. (1989). Dynamic control of redundant manipulators. Journal of Robotic Systems, 6(2), 133–148.
Iriki, A., Tanaka, M., & Iwamura, Y. (1996). Coding of modified body schema during tool use by macaque postcentral neurones. Neuroreport, 7, 2325–2330.
Jordan, M. I. (1990). Motor learning and the degrees of freedom problem. In M. Jeannerod (Ed.), Attention and performance XIII. Hillsdale, NJ: Lawrence Erlbaum Associates Inc.
Jordan, M. I., & Rumelhart, D. E. (1992). Forward models: Supervised learning with a distal teacher. Cognitive Science, 16(3), 307–354.
Khatib, O. (1987). A unified approach for motion and force control of robot manipulators: The operational space formulation. IEEE Journal of Robotics and Automation, 3(1), 43–53.
Khatib, O., et al. (2004). Human-centered robotics and interactive haptic simulation. International Journal of Robotics Research, 23(2), 167–478.
Kranczioch, C., Mathews, S., Dean, J. A., & Sterr, A. (2009). On the equivalence of executed and imagined movements. Human Brain Mapping, 30, 3275–3286.
Lashley, K. S. (1933). Integrative function of the cerebral cortex. Physiological Reviews, 13(1), 1–42.
Lee, S., & Kil, R. M. (1990, June). Robot kinematic control based on bidirectional mapping neural network. In 1990 IJCNN international joint conference on neural networks, 1990 (pp. 327–335). New York: IEEE.
Lewis, F. W., Jagannathan, S., & Yesildirak, A. (1998). Neural network control of robot manipulators and non-linear systems. Boca Raton: CRC Press.
Li, S., Chen, S., Liu, B., Li, Y., & Liang, Y. (2012). Decentralized kinematic control of a class of collaborative redundant manipulators via recurrent neural networks. Neurocomputing, 91, 1–10.
Liégeois, A. (1977). Automatic supervisory control of the configuration and behavior of multibody mechanisms. IEEE Transactions on Systems, Man and Cybernetics, 7(12), 868–871.
Lourakis, M., & Zabulis, X. (2013). Model-based pose estimation for rigid objects. In Computer vision systems (pp. 83–92). Berlin: Springer.
Maravita, A., & Iriki, A. (2004). Tools for the body (schema). Trends in Cognitive Sciences, 8, 79–86.
Mel, B. W. (1988). MURPHY: A robot that learns by doing. In Neural information processing systems (pp. 544–553).
Mohan, V., & Morasso, P. (2011). Passive motion paradigm: An alternative to optimal control. Frontiers in Neurorobotics, 5, 4.
Mohan, V., Morasso, P., Metta, G., & Sandini, G. (2009). A biomimetic, force-field based computational model for motion planning and bimanual coordination in humanoid robots. Autonomous Robots, 27, 291–301.
Mohan, V., Morasso, P., Zenzeri, J., Metta, G., Chakravarthy, V. S., & Sandini, G. (2011). Teaching a humanoid robot to draw ‘Shapes’. Autonomous Robots, 31(1), 21–53.
Mussa-Ivaldi, F. A., Morasso, P., & Zaccaria, R. (1988). Kinematic networks. A distributed model for representing and regularizing motor redundancy. Biological Cybernetics, 60, 1–16.
Nakamura, Y., & Hanafusa, H. (1986). Inverse kinematics solutions with singularity robustness for robot manipulator control. Journal of Dynamic Systems, Measurement, and Control, 108, 163–171.
Nakamura, Y., & Hanafusa, H. (1987). Optimal redundancy control of robot manipulators. International Journal of Robotics Research, 6(1), 32–42.
Nakanishi, J., Cory, R., Mistry, M., Peters, J., & Schaal, S. (2008). Operational space control: A theoretical and empirical comparison. The International Journal of Robotics Research, 27(6), 737–757.
Nguyen, L., Patel, R. V., & Khorasani, K. (1990, June). Neural network architectures for the forward kinematics problem in robotics. In 1990 IJCNN international joint conference on neural networks (pp. 393–399). New York: IEEE.
Peters, J., & Schaal, S. (2008). Learning to control in operational space. The International Journal of Robotics Research, 27(2), 197–212.
Pickering, M. J., & Clark, A. (2014). Getting ahead: Forward models and their role in cognitive architecture. Trends in Cognitive Sciences, 18(9), 451–456.
Salaün, C., Padois, V., & Sigaud, O. (2009, October). Control of redundant robots using learned models: An operational space control approach. In IROS 2009 IEEE/RSJ international conference on intelligent robots and systems, 2009 (pp. 878–885). New York: IEEE.
Scott, S. (2004). Optimal feedback control and the neural basis of volitional motor control. Nature Reviews Neuroscience, 5, 534–546.
Senda, K. (1999). Quasioptimal control of space redundant manipulators. AIAA Guidance, Navigation, and Control Conference, 3, 1877–1885.
Sentis, L., & Khatib, O. (2005). Synthesis of wholebody behaviors through hierarchical control of behavioral primitives. International Journal of Humanoid Robotics, 2(4), 505–518.
Sevdalis, V., & Keller, P. E. (2011). Captured by motion: Dance, action understanding, and social cognition. Brain & Cognition, 77, 231–236.
Todorov, E. (2006). Optimal control theory. In K. Doya, et al. (Eds.), Bayesian brain: Probabilistic approaches to neural coding (pp. 269–298). Cambridge, MA: MIT Press.
Umiltà, M. A., Escola, L., Intskirveli, I., Grammont, F., Rochat, M., Caruana, F., et al. (2008). When pliers become fingers in the monkey motor system. Proceedings of the National Academy of Sciences of the United States of America, 105(6), 2209–13.
Wampler, C. W. (1986). Manipulator inverse kinematic solutions based on vector formulations and damped least squares methods. IEEE Transactions on Systems, Man, and Cybernetics, 16, 93–101.
Whitney, D. E. (1969). Resolved motion rate control of manipulators and human prostheses. IEEE Transactions on Man Machine Systems, 10(2), 47–53.
Wolovich, W. A., & Elliot, H. (1984). A computational technique for inverse kinematics. In Proceedings of the 23rd IEEE conference on decision and control (pp. 1359–1363).
Zak, M. (1991). Terminal chaos for information processing in neurodynamics. Biological Cybernetics, 64, 343–351.


Acknowledgments

The work presented in this article is supported by the Robotics, Brain and Cognitive Sciences Department of IIT, the EU FP7 Project DARWIN (www.darwin-project.eu, Grant No. FP7-270138) and a US Dept. of Defense Grant (W911QY-12-C0078).

Author information

Authors and Affiliations

Robotics, Brain and Cognitive Science Department, Istituto Italiano di Tecnologia, Via Morego 30, 16163, Genoa, Italy
Ajaz Ahmad Bhat, Vishwanathan Mohan & Pietro Morasso
Robotics and Assistive Systems, PROFACTOR GmbH, Im Stadtgut A2, 4407, Steyr-Gleink, Austria
Sharath Chandra Akkaladevi & Christian Eitzinger


Corresponding author

Correspondence to Ajaz Ahmad Bhat.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (mp4 19871 KB)

Supplementary material 2 (mp4 26014 KB)

Supplementary material 3 (mp4 21926 KB)

Appendix: A neural implementation of time base generator

A time base generator (TBG) is a scalar dynamical system in the normalized variable $\xi $ given by:

$$\dot{\xi } = \gamma (\xi (1-\xi ))^{\beta }, \qquad \beta \in (0,1), \tag{5}$$

where $\xi (t)$ is a smooth sigmoid from $\xi (0) =0$ to $\xi (t_f ) =1$, with a bell-shaped velocity profile and a desired finite movement duration $t_f $. The system has two equilibrium points, an unstable one at $\xi =0$ and a stable one at $\xi =1$; consequently, the system always converges to $\xi =1$. The time history of the TBG can be regulated using $\beta $. The $\gamma $ parameter has a dual function: it controls the convergence time, and it resets the TBG to make it excitable for subsequent activation cycles. As regards the exponent $\beta $, it can be shown (Footnote 2) that the condition $\beta >2/3$ is essential for the third derivative of $\xi (t)$ (jerk) to be defined at $t=0$ and $t= t_f$. Under these conditions, the dynamics of the system are non-Lipschitzian (Footnote 3), i.e. the equilibrium configurations do not satisfy the Lipschitz condition for ODEs, since $|\partial \dot{\xi }/\partial \xi |\rightarrow \infty $. This implies that the equilibrium point is a terminal attractor, and systems with terminal attractor dynamics always converge in finite time (Zak 1991).
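The behavior described above can be checked with a minimal numerical sketch of Eq. 5. It assumes forward-Euler integration and a small positive start value `xi0`; both are choices made here for illustration, since a start at exactly $\xi =0$ sits on the unstable equilibrium and would never leave it numerically. The function name `simulate_tbg` and the parameter values are hypothetical.

```python
import numpy as np


def simulate_tbg(gamma, beta, xi0=1e-6, dt=1e-4, t_max=10.0):
    """Forward-Euler integration of d(xi)/dt = gamma * (xi * (1 - xi))**beta."""
    n_steps = int(round(t_max / dt))
    t = np.arange(n_steps + 1) * dt
    xi = np.empty(n_steps + 1)
    xi[0] = xi0  # assumption: tiny kick away from the unstable point xi = 0
    for k in range(n_steps):
        x = min(max(xi[k], 0.0), 1.0)  # keep the state inside [0, 1]
        velocity = gamma * (x * (1.0 - x)) ** beta
        # Clamp at 1 so the discrete step cannot overshoot the attractor.
        xi[k + 1] = min(x + dt * velocity, 1.0)
    return t, xi


if __name__ == "__main__":
    t, xi = simulate_tbg(gamma=2.0, beta=0.75)
    speed = np.diff(xi) / (t[1] - t[0])
    print("final xi:", xi[-1])
    print("xi at peak speed:", xi[np.argmax(speed)])
```

With these illustrative values the trajectory rises monotonically to $\xi =1$, and the speed peaks where $\xi (1-\xi )$ is largest, i.e. near $\xi =0.5$, which gives the bell-shaped profile.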

To derive the convergence time, let us consider a simpler dynamical system:

$$\begin{aligned} \dot{\xi }&= \gamma \xi ^{\beta },\\ t_f&= \int _0^{t_f} dt=\int _0^1 \frac{d\xi }{\gamma \xi ^{\beta }}=\frac{1}{\gamma (1-\beta )} \end{aligned}$$

Once again, the equilibrium point is a terminal attractor: the convergence time is always finite and can be specified precisely through the constant $\gamma =1/(t_f (1-\beta ))$.
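The convergence-time result for the simplified system can be illustrated with a short sketch. It relies on the closed-form solution $\xi (t)=(\gamma (1-\beta )t)^{1/(1-\beta )}$ for $\xi (0)=0$, which is worked out here and not stated in the text; the function names and the values $t_f=1.5$, $\beta =0.75$ are hypothetical.

```python
import numpy as np


def gamma_for_duration(t_f, beta):
    """Gain that makes the simplified system reach xi = 1 at time t_f."""
    return 1.0 / (t_f * (1.0 - beta))


def xi_closed_form(t, gamma, beta):
    """Solution of d(xi)/dt = gamma * xi**beta with xi(0) = 0 (assumed form)."""
    return (gamma * (1.0 - beta) * t) ** (1.0 / (1.0 - beta))


if __name__ == "__main__":
    beta, t_f = 0.75, 1.5
    gamma = gamma_for_duration(t_f, beta)
    print("gamma:", gamma)
    print("xi(t_f):", xi_closed_form(t_f, gamma, beta))

    # Check that the closed form satisfies the ODE via central differences.
    t = np.linspace(0.1, 1.4, 20)
    h = 1e-6
    lhs = (xi_closed_form(t + h, gamma, beta)
           - xi_closed_form(t - h, gamma, beta)) / (2.0 * h)
    rhs = gamma * xi_closed_form(t, gamma, beta) ** beta
    print("max ODE residual:", np.max(np.abs(lhs - rhs)))
```

The sketch covers only the simplified system $\dot{\xi }=\gamma \xi ^{\beta }$. For the full TBG of Eq. 5 the integrand contains $(\xi (1-\xi ))^{\beta }$ instead of $\xi ^{\beta }$, so the relation between $\gamma $ and $t_f$ differs by a factor that depends on $\beta $.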

Remarkably, the above dynamical system can be approximated using a reciprocal inhibition network consisting of two neurons. A single neural element is an integrate-and-fire neuron composed of a multiplier, an integrator and a power function. In the integrate-and-fire model, input spikes are multiplied by their respective synaptic weights, summed and integrated over time. If the integral exceeds a threshold, the neuron fires and the integration restarts. The functionality in this case can be expressed as:

$$\begin{aligned} \dot{\xi }_i =\prod w_i \zeta _i \end{aligned}$$

where

$$\begin{aligned} {\zeta _i }=\xi _i ^{\beta } \end{aligned}$$

The reciprocal inhibition network of two neurons modeling the TBG is shown in Fig. 12.

The dynamic behavior of the two neurons can be written as

$$\begin{aligned}&\dot{\xi }_{1}=-\gamma {\xi }_{1}^{\beta }{\xi }_2^{\beta }=-\dot{\xi }_{2}\\&{\xi }_{1}(t)+ {\xi }_2 (t) =1\\&\therefore \dot{\xi }_{2}=\gamma {\xi }_1 ^{\beta }{\xi }_2 ^{\beta }=\gamma {\xi }_2 ^{\beta }(1-\xi _2 )^{\beta }=\gamma (\xi _2 (1-\xi _2 ))^{\beta } \end{aligned}$$

This is the same as Eq. 5.
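The equivalence with Eq. 5 can also be checked numerically. The following is a minimal sketch of the two-unit network as a pair of coupled rate equations, integrated with forward Euler; it does not model spikes or thresholds, and the function name, the start value `xi2_0` and the step size are assumptions made for illustration.

```python
import numpy as np


def simulate_reciprocal_pair(gamma, beta, xi2_0=1e-6, dt=1e-4, t_max=10.0):
    """Euler integration of xi1' = -gamma * xi1**beta * xi2**beta = -xi2'."""
    n_steps = int(round(t_max / dt))
    xi1 = np.empty(n_steps + 1)
    xi2 = np.empty(n_steps + 1)
    xi2[0] = xi2_0          # assumption: small initial activity in unit 2
    xi1[0] = 1.0 - xi2_0    # so that xi1 + xi2 = 1 at the start
    for k in range(n_steps):
        flow = gamma * (xi1[k] ** beta) * (xi2[k] ** beta)
        # Never transfer more than unit 1 still holds (keeps xi1 >= 0).
        step = min(dt * flow, xi1[k])
        xi1[k + 1] = xi1[k] - step
        xi2[k + 1] = xi2[k] + step
    return xi1, xi2


if __name__ == "__main__":
    xi1, xi2 = simulate_reciprocal_pair(gamma=2.0, beta=0.75)
    print("max |xi1 + xi2 - 1|:", np.max(np.abs(xi1 + xi2 - 1.0)))
    print("final xi2:", xi2[-1])
```

Because whatever leaves one unit enters the other, the sum $\xi _1+\xi _2$ stays at 1 up to rounding, and $\xi _2$ then follows the single-variable TBG dynamics.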

To perform any reaching movement, several joints (shoulder, elbow, wrist, fingers) move cooperatively, forming a synergy in a flexible and dynamic fashion. While groups of fingers may operate synergistically when playing a guitar chord, individual fingers are controlled when playing a lead. One of the basic problems of motor control is to understand how neural control structures quickly and flexibly organize and engage different parts of the body schema to cooperate synergistically in a movement sequence. The above TBG can be used to dynamically couple and decouple synergies in different ways based on the task specification. In sum, by selecting two parameters of the TBG ($t_f$ and $\beta $), a family of time-varying signals can be generated. From the point of view of real-time implementation, it is possible to use any scalar function of time satisfying the properties described above, or a look-up table.
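The look-up-table option mentioned above might be sketched as follows. As a stand-in for the TBG of Eq. 5, the table is filled with the quintic $10s^{3}-15s^{4}+6s^{5}$ in normalized time $s$, chosen here only because it goes from 0 to 1 with a bell-shaped velocity profile; it is not the article's TBG. The names `build_table`, `tbg_lookup` and `t_onset`, and the two example synergies, are hypothetical.

```python
import numpy as np


def build_table(n=1001):
    """Tabulate a smooth 0-to-1 profile over normalized time s in [0, 1]."""
    s = np.linspace(0.0, 1.0, n)
    xi = 10.0 * s**3 - 15.0 * s**4 + 6.0 * s**5  # stand-in profile (assumption)
    return s, xi


def tbg_lookup(t, t_onset, t_f, table):
    """Read the timing signal for a movement starting at t_onset, lasting t_f."""
    s_grid, xi_grid = table
    # Normalize time and hold the signal at 0 before onset, at 1 after the end.
    s = np.clip((np.asarray(t, dtype=float) - t_onset) / t_f, 0.0, 1.0)
    return np.interp(s, s_grid, xi_grid)


if __name__ == "__main__":
    table = build_table()
    t = np.linspace(0.0, 3.0, 7)
    # Two hypothetical synergies sharing one table, with different timing.
    arm = tbg_lookup(t, t_onset=0.0, t_f=2.0, table=table)
    fingers = tbg_lookup(t, t_onset=1.0, t_f=1.5, table=table)
    print("arm:    ", np.round(arm, 3))
    print("fingers:", np.round(fingers, 3))
```

One table can serve several synergies, each with its own onset and duration, which is one simple way to stagger or overlap groups of joints in time.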

About this article

Cite this article

Bhat, A.A., Akkaladevi, S.C., Mohan, V. et al. Towards a learnt neural body schema for dexterous coordination of action in humanoid and industrial robots. Auton Robot 41, 945–966 (2017). https://doi.org/10.1007/s10514-016-9563-3

Received: 10 March 2015
Accepted: 17 March 2016
Published: 04 April 2016
Issue Date: April 2017
DOI: https://doi.org/10.1007/s10514-016-9563-3
