Monitoring techniques for an online neuro-adaptive controller

https://doi.org/10.1016/j.jss.2006.03.048Get rights and content

Abstract

The appeal of biologically inspired soft computing systems such as neural networks in complex systems comes from their ability to cope with a changing environment. Unfortunately, adaptability induces uncertainty that limits the applicability of static analysis to such systems. This is particularly true for systems with multiple adaptive components or systems with multiple types of learning operation.

This work builds a paradigm of dynamic analysis for a neuro-adaptive controller where different types of learning are to be employed for its online neural networks. We use support vector data description as the novelty detector to detect unforeseen patterns that may cause abrupt system functionality changes. It differentiates transients from failures based on the duration and degree of novelties. Further, for incremental learning, we utilize Lyapunov functions to assess real-time performance of the online neural networks. For quasi-online learning, we define a confidence measure, the validity index, to be associated with each network output. Our study on the NASA F-15 Intelligent Flight Control System demonstrates that our novelty detection tool effectively filters out transients and detects failures; and our light-weight monitoring techniques supply sufficient evidence for an insightful validation.

Introduction

Adaptive systems are those systems whose functionality evolves over time in response to changes in the environment in which it operates. If learning and adaptation are allowed to occur after the system is deployed, the system is called online adaptive system Mili et al. (2003). The use of biologically inspired soft computing systems for online adaptation to counter the changes in system environments has revolutionized the operation of real-time automation and control applications. Neural networks are one of the most popular learning paradigms employed in online adaptive systems. Because the learning algorithms behind these computational architectures are usually derived from error/risk minimization theories, the computations are complex and the learning process is non-linear.

Applications of adaptive computing in safety critical systems are on the rise. These applications provide fault-tolerant control capabilities, automated maintenance of distributed networks, or enhance implementations of high security devices. Different research communities use different terms to describe online adaptive computing. For example, in computer networks automated modification of internal variables in traffic routing, possibly through judicious application of machine learning algorithms, is called the parameter adaptation paradigm McKinley et al. (2004). The Flight Control has become one of the most promising emerging applications of real-time adaptive control. The mechanisms and algorithms for achieving fault tolerant system features through adaptivity are termed software-enabled control, adaptive control augmentation or intelligent flight control.

When learning software is used in online adaptation, its behavior has direct consequence on the performance of systems into which it is embedded. Therefore, it is necessary to understand, predict and assure the behavior of the adaptive learner before its actual deployment. And while many experiments, especially in the aerospace domain, have demonstrated the potential of online adaptive computing, the lack of verification and validation procedures represents a serious problem barring their widespread use.

Our previous research using formal methods on certain families of neural networks suggests that environmental changes (learning data) have a significant impact on system behavior Mili et al. (2003). Abrupt and abnormal environmental changes are likely to generate previously unseen learning behaviors, because judicious enumeration of all environmental conditions is impossible. Therefore, we believe that the most promising paradigm for validating the performance of adaptive systems is their continual monitoring. The monitoring data collected in real time is then used to analyze system properties of interest, most typically related to convergence and stability of the learning algorithm Yerramalla et al. (2003).

Therefore, the most meaningful questions related to the monitoring approaches to validation of online adaptive systems are the following two:

  • What to monitor?

  • How to analyze monitored data?

This paper provides our answers to these questions. The data stream entering the adaptive computational element, such as the neural network, is monitored to detect anomalous data patterns which indicate significant environmental changes. The internal structure of the neural network is monitored and analyzed with respect to the convergent behavior that usually indicates successful learning. Furthermore, the network predictions, i.e., its output data stream, is monitored and statistically analyzed for variations as an indication of their volatility over time.

Our research further indicates that the choice of the methods for system validation is phenomenological. In other words, we encountered systems where, depending on the design choices, not all the above mentioned techniques are feasible. Therefore, while these techniques are complementary, the feasibility of their application in the analysis of adaptive learning varies. This paper presents three monitoring and analysis techniques, demonstrates their partial applicability to two types of learning modes, both investigated in the context of an experimental adaptive flight control scheme.

The Intelligent Flight Control System (IFCS) was developed by NASA with the primary goal to “flight evaluate control concepts that incorporate emerging soft computing algorithms to provide an extremely robust aircraft capable of handling multiple accident and/or an off-nominal flight scenario” Boyd et al., 2001, Jorgensen, 1991. The diagram in Fig. 1 shows the architectural overview of NASA’s IFCS implementation using Online Learning Neural Network (OLNN). The control concept can be briefly described as follows. Notable discrepancies from the outputs of the Baseline Neural Network and the Real-time Parameter Identification (PID), either due to a change in the aircraft dynamics (loss of control surface, aileron, stabilator) or due to sensor noise/failure, are accounted by the OLNN.

The primary goal of OLNN is to accomplish in-flight accommodation of discrepancies, commonly known as Stability and Control Derivative errors. Derivative errors indicate conditions that fall outside the scope of traditional (linearized) control gain look-up tables. When OLNN performs adaptation, its behavior has a direct consequence on the performance of the flight control system. In such a safety-critical application, it is necessary to understand and predict the behavior of the NN. The goal of validating NN-based online component is to provide a means to detect novel (abnormal) conditions entering the OLNN, to investigate the NN’s stability behavior during adaptation, and to interpret the results of the analysis so that it ensures safe operation.

One concern in this process is the presence of noise in sensor data as well as temporary variations in environmental data which we may wish the neural network to tolerate without disruption of the training process. In any validation method, it is necessary to distinguish between such transient disturbances which do not persist or may not impact performance and true failures which do persist or have a major impact on the system. The transient nature of a disturbance may not become apparent except over time and the true impact may only emerge after an adaptive learner has been trained, but it is still necessary to respond to extreme variability as quickly as possible. The primary contribution of the current work is to observe that an integrated approach to validation is needed in which monitoring occurs at multiple time scales so that both immediate analysis of real-time learning and buffered long-term failure indicators are used.

The rest of the paper is organized as follows. In Section 2 we give a summary of the adaptive component of the IFCS. This section also reviews of the underlying structure of the neural network, describes the flight simulator used to generate test data and lists the failure modes used in the ensuing experiments. Section 3 introduces novelty detection techniques which discriminate between permanent failure events and transient failure events before their data representations reach the input layer of the neural network. The same section discusses the accuracy of this method. Sections 4 Validating Type I online learning, 5 Validating Type II online learning outline two techniques for analyzing and validating the real-time behavior of the neural network used by the IFCS. Each technique addresses a different type of the learning algorithm. We also describe experimental results by applying the techniques to two simulated failure modes. Finally, we conclude with a summary and list our future research directions in Section 6.

Section snippets

The online learning neural network

In IFCS, a special type of neural network, the Dynamic Cell Structure (DCS), is deployed for online learning. The DCS neural network is a dynamically growing structure. It can be seen as a special case of Self-Organizing Maps(SOMs), originally introduced by Kohonen in 1981 as biologically inspired adaptive vector quantization algorithms suitable for unsupervised feature extraction Kohonen (1990). An SOM essentially constructs a structured representation of the presented input data using

Novelty detection

A novelty detection technique serves as a filter that disallows the transient conditions from entering the DCS network. In general, novelty detection techniques require beforehand knowledge of both nominal and off-nominal flight domains. However, for a flight control system it is impossible to anticipate all possible failure situations. As a one-class classification tool, Support Vector Data Description (SVDD) technique is derived from Support Vector learning theory by Tax and Duin, 1999a, Tax

Validating Type I online learning

Following the elimination of transients, DCS neural network is exposed to training data. The goal of the monitoring techniques is to extract numerical indicators of the quality of learning. Of course, learning quality is a very abstract term. But, if the DCS network can be considered a dynamical system, techniques that analyze convergence and stability of its adaptation can help. This is the motivation behind the approach described in this section. To remind the reader, Type I learning implies

Validating Type II online learning

For Type II online learning of the DCS network, we propose a different monitoring technique. It is important to notice here that stability monitors would not provide an effective monitoring mechanism for Type II learning. In Type I, every cycle of the control loop introduces a new data point on which the DCS network trains. Therefore, stability monitors are designed to be very sensitive and capture rapid changes in the behavior of the learning algorithm. In Type II learning, the buffer which

Summary and discussion

In our experience, the architectures and design of many adaptive systems suggest a multi-tiered approach to their analysis. Due to their complexity and the dynamics of their change, the problem of performance validation is complicated. No single aspect of the adaptive system is likely to be a reliable indicator of correctness of operation in non-deterministic environments.

This paper outlines the development of three types of functional performance monitoring techniques that can be used to

Yan Liu is a senior research scientist in the Software and System Engineering Research Lab at Motorola Inc. She received the BS degree in computer science from Wuhan University, China, and the MS and Ph.D. degrees in computer science from West Virginia University. Dr. Liu’s research interests include software verification and validation, software process improvement, adaptive systems, and intelligent computing.

References (26)

  • T. Martinetz et al.

    Topology representing networks

    Neural Networks

    (1994)
  • D.M.J. Tax et al.

    Support vector domain description

    Pattern Recognition Letters

    (1999)
  • K. Bennet et al.

    Support vector machines: hype or hallelujah?

    SIGKDD Explorations

    (2000)
  • Boyd, M.A., Schumann, J., Brat, G., Giannakopoulou, D., Cukic, B., Mili, A., 2001. Validation and verification process...
  • J. Bruske et al.

    Dynamic cell structure learns perfectly topology preserving map

    Neural Computation

    (1995)
  • Darrah, M., Taylor, B., Skias, S., 2004. Rule extraction from Dynamic CEll Structure neural networks used in a safety...
  • Gupta, P., Schumann, J., 2004. A tool for verification and validation of neural network based adaptive controllers for...
  • Institute for Scientific Research, Inc., 2001. Dynamic cell structure neural network report for the Intelligent Flight...
  • Jorgensen, C.C., 1991. Feedback linearized aircraft control using dynamic cell structures. World Automation Congress...
  • T. Kohonen

    The self-organizing map

    Proceedings of the IEEE

    (1990)
  • Y. Liu et al.

    Predicting with Confidence – An improved Dynamic Cell Structure

    (2005)
  • P.K. McKinley et al.

    Composing adaptive software

    IEEE Computer

    (2004)
  • A. Mili et al.

    Towards the verification and validation of on-line learning adaptive systems

  • Cited by (6)

    • Software cybernetics in BPM: Modeling software behavior as feedback for evolution by a novel discovery method based on augmented event logs

      2017, Journal of Systems and Software
      Citation Excerpt :

      Based on the proofs of algorithm λ+’s mining ability, it is obvious that λ+ can handle more sub-structures than other existing mining algorithms (i.e., α, α+, λ, α++, α# and β) by using the augmented input event logs in discovering running behavior of software systems for the usage in software evolution in software cybernetics. It was implied in Liu et al. (2006) that the efficiency of online system analysis and evolution in software cybernetics is greatly important. So it is very important to improve the efficiency of software execution process discovery techniques which helps to analyze software systems.

    • On the role of model-driven engineering in adaptive systems

      2016, 2016 IEEE 11th Colombian Computing Conference, CCC 2016 - Conference Proceedings
    • Software engineering for self-adaptive systems: A research roadmap

      2009, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

    Yan Liu is a senior research scientist in the Software and System Engineering Research Lab at Motorola Inc. She received the BS degree in computer science from Wuhan University, China, and the MS and Ph.D. degrees in computer science from West Virginia University. Dr. Liu’s research interests include software verification and validation, software process improvement, adaptive systems, and intelligent computing.

    Bojan Cukic is an associate professor in the Lane Department of Computer Science and Electrical Engineering at West Virginia University, where he chairs Biometrics curriculum. He also serves as the co-director of the Center for Identification Technology Research, a National Science Foundation Industry – University Research Center. Dr. Cukic’s research interests include software engineering for high-assurance systems, fault tolerant computing, information assurance and biometrics. He received a US National Science Foundation CAREER award and a Tycho Brahe award from the NASA Office of Safety and Mission Assurance. Dr. Cukic is a member of the editorial board of Empirical Software Engineering journal and served as a guest editor of IEEE Software magazine in 2005. He received his Ph.D. in Computer Science from the University of Houston.

    Edgar Fuller is an Assistant Professor of Mathematics at West Virginia University. He currently works on applications of topology and geometry to the study of autonomous learners, especially those in mission critical applications. He was the co-recipient of the Tyco Brahe award from the NASA Office of Safety and Mission Assurance in 2005. His other interests include applications of computational geometry to knot theory, the analysis of non-deterministic systems, bioinformatics, and mathematics education. He currently is the Principal Investigator for a US Department of Education funded Math and Science Partnership in the state of West Virginia. He received his Ph.D. from the University of Georgia in the area of differential geometry and topology.

    Sampath Yerramalla is currently with Sensor Research and Development Corporation (SRDCorp.) at Maine, where he is a Senior Research Scientist. He is a leading researcher in the area of Sensor Signal Processing at SRDCorp, and serves as the project lead for a US Department of Defense, DoD sponsored program on sensor development for detecting material breakthrough applications. He is an IEEE Student Travel awardee for 2004 and a co-recipient of Tycho Brahe award from NASA Office of Safety and Mission Assurance. His research interests include Digital Signal Processing, Artificial Intelligence, Adaptive Systems, and Control Theory. Yerramalla earned his Ph.D. at West Virginia University in Electrical Engineering, and holds a Masters Degree in Control Systems Engineering from West Virginia University Institute of Technology.

    Srikanth Gururajan is currently a Graduate Research Assistant and a Ph.D. candidate in Aerospace Engineering at the Department of Mechanical and Aerospace Engineering, West Virginia University, Morgantown, WV. He is currently involved in a NASA sponsored research project for the Intelligent Flight Control System (IFCS) program. He received his Master’s degree in Mechanical Engineering from West Virginia University, Morgantown, WV in 1999 and his Bachelor’s degree in Instrumentation and Control Engineering from the University of Madras, India in 1997. Gururajan’s research interests include Neural Networks, Adaptive Flight Control Systems, Distributed and Parallel Computing, Real Time Operating Systems and UAV Design, Construction, Instrumentation and Flight Testing.

    View full text