Elsevier

Neural Networks

Volume 35, November 2012, Pages 54-69
Neural Networks

A bio-inspired kinematic controller for obstacle avoidance during reaching tasks with real robots

https://doi.org/10.1016/j.neunet.2012.07.010Get rights and content

Abstract

This paper describes a redundant robot arm that is capable of learning to reach for targets in space in a self-organized fashion while avoiding obstacles. Self-generated movement commands that activate correlated visual, spatial and motor information are used to learn forward and inverse kinematic control models while moving in obstacle-free space using the Direction-to-Rotation Transform (DIRECT). Unlike prior DIRECT models, the learning process in this work was realized using an online Fuzzy ARTMAP learning algorithm. The DIRECT-based kinematic controller is fault tolerant and can handle a wide range of perturbations such as joint locking and the use of tools despite not having experienced them during learning. The DIRECT model was extended based on a novel reactive obstacle avoidance direction (DIRECT-ROAD) model to enable redundant robots to avoid obstacles in environments with simple obstacle configurations. However, certain configurations of obstacles in the environment prevented the robot from reaching the target with purely reactive obstacle avoidance. To address this complexity, a self-organized process of mental rehearsals of movements was modeled, inspired by human and animal experiments on reaching, to generate plans for movement execution using DIRECT-ROAD in complex environments. These mental rehearsals or plans are self-generated by using the Fuzzy ARTMAP algorithm to retrieve multiple solutions for reaching each target while accounting for all the obstacles in its environment. The key aspects of the proposed novel controller were illustrated first using simple examples. Experiments were then performed on real robot platforms to demonstrate successful obstacle avoidance during reaching tasks in real-world environments.

Introduction

The future of industrial robots is progressing towards systems that have a great deal of flexibility in handling various tasks in a self-organized and safe manner. One such task is in the area of intelligent robotic assembly. This task requires the robot to reach and grasp complex objects and manipulate them while satisfying many requirements: the robot must be able to reach for any point in its workspace from any other point with varying speed requirements; the robot must be able to avoid collisions with objects in its environment; it must be able to tolerate unexpected contingencies and perform reliably despite changes in sensor geometry and changes in joint mobility; it must be able to shape its hand around the object to enable stable grasps. In this study the problem of collision avoidance in highly complex environments is developed by drawing inspiration from biological systems.

Previous work has demonstrated obstacle avoidance in complex environments using a path planning approach (Schwarzer & Saha, 2005). In this framework, paths are generated by a path planner, and subsequently evaluated by a separate process to determine collisions with obstacles in the environment (Barraquand et al., 1997, Sanchez and Latombe, 2003). By exhaustively searching through the space of possible paths, a candidate non-collision path is selected for execution. In contrast, path selection and avoidance is an integrated process in human and animal reaching behaviors, and forms the basis of bio-inspired approaches to obstacle avoidance and movement planning.

Another approach to obstacle avoidance is based on modifying path plans created with attractor dynamics (Iossifidis & Schoner, 2007). The controller is developed on the basis of a readily available attractor dynamics model that provides a defined trajectory for the end-effector to reach a target dependent on robot geometry and absolute joint angles. This trajectory serves as the attractor. The influence of the obstacles is superimposed on the attractor dynamics to modify the trajectory. There are three key differences between this approach and the work described in this paper. First, the controller in Iossifidis and Schoner (2007) requires a specific trajectory to be defined in order to reach a target. The trajectory for the novel controller described in this paper is automatically generated by only using directions to targets and thus is less dependent on system parameters and also much easier to implement. Second, the controller in Iossifidis and Schoner (2007) is dependent on absolute joint angles to define the attractor dynamics while the controller described in this work learns a mapping between the changes in the joint angles and changes in direction to the target. Third, the influence of the obstacles modifies the changes in joint angles in this work, while in Iossifidis and Schoner (2007) obstacles modify the absolute joint angles. Thus, the bio-inspired controller described in this paper scales well to robots with more degrees of freedom while being robust to unexpected faults and contingencies.

A prior work that explicitly addresses the obstacle avoidance problem in complex environments is described in Mel (1990). This bio-inspired approach performs a random exploration of arm movements “mentally” using robot models. Each arm configuration that is generated is evaluated for its ability to reach the target while avoiding obstacles. If none of the random movements from the present configuration is successful, then the system “backtracks” to a previous configuration and the search process is continued. This approach creates jerky movements that are arguably unnatural. Furthermore, it is unclear if any subsequent movement smoothing could be applied to avoid collisions.

The approach described in this paper has similarities to Mel (1990) in that “mental rehearsals” (also called lookahead planning) are performed to decide on a plan to avoid obstacles. The mental rehearsal approach employs a perceptual process that is used to generate via-points or intermediate targets during reaching tasks. The via-points are tested for suitability to reach both the target and the initial arm configuration via a reactive obstacle avoidance controller. This approach obviates the need for “backtracking” (Mel, 1990) and is more efficient and practical. Another advantage is that arm movements are smooth due to seamless target switching from via-point to target.

The rest of the paper is organized as follows. Section 2 first describes the basic DIRECT model, followed by a description of the DIRECT-ROAD model for reactive avoidance. Next, the limitation of using the reactive controller in more complex obstacle and target configurations is motivated with an example of a local minima situation. Computer simulations are described to demonstrate obstacle avoidance in complex environments for a planar redundant robot arm. In Section 3, the overall system architecture is described. This is followed in Section 4 by a description of various experiments that were performed on these platforms to demonstrate examples of obstacle avoidance during reaching tasks in complex real-world environments. Section 5 describes related research while concluding remarks are provided in Section 6.

Section snippets

DIRECT controller

The DIRECT model (Bullock, Grossberg, & Guenther, 1993) uses a mapping of desired movement directions in task space into joint rotations (Fig. 1). The model automatically compensates for externally imposed constraints on effector motion. The use of a directional mapping for movement control is closely related to robotic controllers that utilize a generalized inverse of the Jacobian matrix (Baillieul et al., 1984, Hollerbach, 1982, Mussa-Ivaldi and Hogan, 1991). For notational convenience, all

Overall system architecture

The system for implementing the DIRECT-ROAD model on real robotic platforms consists of three main modules: the visual perception module, the DIRECT-ROAD controller and the robot movement execution module, as shown in Fig. 9. The visual perception module takes in stereo images from the stereo camera and processes them using an object recognition model (Section 4.1). The object recognition model is built using templates from an object database. In addition to the object recognition model, there

Experiments

Experiments in learning and control using DIRECT were first performed on the ST Robotics R17 platform. Fuzzy ARTMAPs were used to learn the forward and inverse models for four DOFs on the robot (waist rotate, shoulder swing, elbow swing, and hand swing) in 3D task space. Additionally, experiments in learning and control using DIRECT and DIRECT-ROAD were also performed on a custom robot platform developed by the Shadow Robot Company, consisting of a Shadow Hand mounted on a custom pneumatically

Discussion

The robustness and fault tolerance exhibited by the real robot systems trained using the self-organizing neural model is akin to behaviors by biological systems wherein reliability and fault tolerance are cornerstones of performance. These key features ensure the survivability of biological systems when faced with previously unforeseen environments and disturbances. It seems that by learning the appropriate transform between various sensory modalities, robotic systems can also exhibit fault

Conclusions

In this study, a novel kinematic controller was developed, inspired by human and animal behaviors that are capable of controlling redundant robot arms to avoid obstacles while reaching for targets. The core of the kinematic controller for trajectory generation is based on a self-organizing neural model that is capable of learning to reach 3D targets by self-generated vision and motor signals during action-perception cycles. The approach in this study integrates the process of trajectory

Acknowledgments

We would like acknowledge the support of a Collaborative Research grant CR07C202 at HRL for this research work. S.G. was supported in part by CELEST, an NSF Science of Learning Center (SBE-0354378), and by the SyNAPSE program of DARPA (HR0011-09-C-0001).

References (58)

  • S. Grossberg

    Towards a unified theory of neocortex: laminar cortical circuits for vision and cognition

    Progress in Brain Research

    (2007)
  • J.M. Hollerbach

    Computers, brains, and the control of movement

    Trends in Neurosciences

    (1982)
  • M. Mishkin et al.

    Contribution of striate inputs to the visuospatial functions of parieto-preoccipital cortex in monkeys

    Behavioural Brain Research

    (1982)
  • L.H. Snyder et al.

    Intention-related activity I the posterior-parietal cortex: a review

    Vision Research

    (2000)
  • N. Srinivasa et al.

    A head–neck–eye system that learns fault-tolerant saccades to 3-D targets using a self-organizing neural model

    Neural Networks

    (2008)
  • D. Wienke et al.

    Adaptive resonance theory based neural network for supervised chemical pattern recognition (Fuzzy ARTMAP) part 1: theory and network properties

    Chemometrics and Intelligent Laboratory Systems

    (1996)
  • D. Wolpert et al.

    Multiple paired forward and inverse models for motor control

    Neural Networks

    (1998)
  • G. Amis et al.

    Default ARTMAP 2

  • G. Amis et al.

    Self-supervised ARTMAP

    Neural Networks

    (2009)
  • Baillieul, J., Hollerbach, J., & Brockett, R.W. (1984). Programming and control of kinematically redundant...
  • J. Barraquand et al.

    A random sampling scheme for path planning

    International Journal of Robotics Research

    (1997)
  • R. Bhattacharyya et al.

    Parietal reach region encodes reach depth using retinal disparity and vergence angle signals

    Journal of Neurophysiology

    (2009)
  • D. Bullock et al.

    Cortical networks for control of voluntary arm movements under variable force conditions

    Cerebral Cortex

    (1998)
  • D. Bullock et al.

    A self-organizing neural model of motor equivalent reaching and tool use by a multijoint arm

    Journal of Cognitive Neuroscience

    (1993)
  • Carpenter, G.A. (2003). Default ARTMAP. In Proceedings of the international joint conference on neural networks,...
  • G.A. Carpenter et al.

    Biased ART: a neural architecture that shifts attention toward previously disregarded features following an incorrect prediction

    Neural Networks

    (2010)
  • G.A. Carpenter et al.

    Adaptive resonance theory

  • G.A. Carpenter et al.

    Fuzzy ARTMAP: a neural network architecture for incremental supervised learning of analog multidimensional maps

    IEEE Transactions on Neural Networks

    (1992)
  • P. Cisek et al.

    Neural activity in primary motor and dorsal pre-motor cortex in reaching tasks with the contralateral versus ipsilateral arm

    Journal of Neurophysiology

    (2003)
  • Cited by (8)

    • Fronto-parietal mirror neuron system modeling: Visuospatial transformations support imitation learning independently of imitator perspective

      2019, Human Movement Science
      Citation Excerpt :

      It has been proposed that the STS-IPL-IFG pathway acts as an inverse model that computes the motor commands for producing a given desired action (Miall, 2003). It has also been suggested that the motor/premotor regions may implement inverse models for self-intended reaching movements (Bullock, Grossberg, & Guenther, 1993; Gentili, Oh, Kregling, & Reggia, 2016; Gentili, Oh et al., 2015; Guenther & Micci-Barreca, 1997; Li, Padoa-Schioppa, & Bizzi, 2001; Molina-Vilaplana & Coronado, 2006; Molina-Vilaplana et al., 2007; Padoa-Schioppa, Li, & Bizzi, 2004; Srinivasa, Bhattacharyya, Sundareswara, Lee, & Grossberg, 2012; Tin & Poon, 2005). Thus, here we postulate that the IFG performs the inverse computation that maps a desired action such as an action to imitate, into the corresponding neural drive that allows imitating the observed action.

    • Deeply-learnt damped least-squares (DL-DLS) method for inverse kinematics of snake-like robots

      2018, Neural Networks
      Citation Excerpt :

      Finally, conclusions of this study and future works are given in Section 6. Solving IK problems in serial link robots have been based on several methods (Chung, Youm, & Chung, 1994; Elgazzar, 1985; Jamali, Khan, & Rahman, 2011; Kostic, Hensen, de Jager, & Steinbuch, 2002; Kucuk & Bingul, 2014; Makondo, Claassens, Tlale, & Braae, 2012; Ren et al., 2017; Sardana et al., 2013; Sheng et al., 2006; Srinivasa et al., 2012; Tchon, 2008; Toshani & Farrokhi, 2014; Yahya et al., 2011; Yahya, Mohamed, Moghavvemi, & Yang, 2009). According to Omisore, Han, Ren, Zhang, and Wang (2017), these methods can be simply categorized as algebraic and iterative approaches.

    • Cognitive models of the perception-action cycle: A view from the brain

      2013, Proceedings of the International Joint Conference on Neural Networks
    View all citing articles on Scopus
    1

    Authors contributed equally.

    View full text