Learning fuzzy classifier systems for multi-agent coordination

https://doi.org/10.1016/S0020-0255(01)00149-9

Abstract

We present ELF, a learning fuzzy classifier system (LFCS), and its application to the field of learning autonomous agents. In particular, we will show how this kind of reinforcement learning system can be successfully applied to learn both behaviors and their coordination for autonomous agents. We will discuss the importance of a knowledge representation approach based on fuzzy sets, which reduces the search space without losing the required precision. Moreover, we will show how we have applied ELF to learn distributed coordination among agents that can exchange information with each other. The experimental validation has been carried out on software agents interacting in a real-time task.

Introduction

In the last decade, we have worked on the definition of learning classifier systems (LCS) [1], [2], [3] whose rules (or classifiers) are implemented as fuzzy rules (learning fuzzy classifier systems (LFCS)) [4], [5], [6]. This makes it possible to match real-valued inputs while preserving most of their informative content, whereas traditional LCS lose part of this information, since their classifiers match real-valued input with rules whose antecedents are composed of interval-valued variables. Moreover, at each activation cycle, an LFCS composes the outputs of different fuzzy rules to obtain a real-valued output, while in a traditional LCS a single rule is selected, proposing an output from a small set of discrete values. In Section 2 we briefly introduce the context where evolutionary learning of fuzzy rules (ELF) was developed, and then describe the principles on which it is based. In other papers we have discussed the application of ELF to real and simulated robotic agents [7], [8], [9], and some of the methodological aspects related to a whole class of LFCS [6], [10]. Here, we focus on a challenging task, simulated robotic soccer, in the distributed RoboCup environment [11], which can be summarized as follows. We have a set of autonomous agents, i.e., entities governed by software programs, able to perform a task autonomously in an environment perceived through a set of sensors. The behaviors shown by each agent are obtained by the interaction of relatively simple and independent behavior modules [12]. One of the problems of this behavior-based approach is the composition of concurrent behaviors, which is usually designed by hand. Elsewhere we have shown how ELF can learn single behavior modules [7], and the selection/composition of behavior modules [8], [9]. In Section 3, we propose another, more complete, solution to this problem, which also takes into account the role of interaction with other agents in the optimization of the behavior blend.
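The contrast drawn above between crisp and fuzzy matching can be sketched in a few lines. The triangular membership functions, the three-rule base, and the weighted-average defuzzification below are illustrative assumptions, not ELF's actual implementation: the point is only that several fuzzy rules fire in parallel on one real-valued input and their crisp outputs are blended, rather than one interval-matched rule winning outright.

```python
from dataclasses import dataclass

def tri(x, a, b, c):
    """Triangular membership function with support [a, c] and peak at b."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x < b else (c - x) / (c - b)

@dataclass
class FuzzyRule:
    antecedent: tuple   # (a, b, c) of a triangular fuzzy set on the input
    output: float       # crisp consequent value proposed by the rule

def fuzzy_inference(rules, x):
    """Compose the outputs of all matching rules, each weighted by its
    degree of matching (a simple weighted-average defuzzification)."""
    weights = [tri(x, *r.antecedent) for r in rules]
    total = sum(weights)
    if total == 0.0:
        return None  # no rule matches the input
    return sum(w * r.output for w, r in zip(weights, rules)) / total

# Hypothetical rule base: one real-valued input in [0, 1], one real output.
rules = [
    FuzzyRule((0.0, 0.0, 0.5), output=-1.0),  # "low"  -> steer left
    FuzzyRule((0.0, 0.5, 1.0), output=0.0),   # "mid"  -> go straight
    FuzzyRule((0.5, 1.0, 1.0), output=1.0),   # "high" -> steer right
]

# Both "low" and "mid" fire at x = 0.25 (degree 0.5 each), so the output
# is an intermediate value rather than one rule's discrete proposal.
print(fuzzy_inference(rules, 0.25))  # → -0.5
```

An interval-valued crisp classifier would instead select the single rule whose interval contains 0.25 and emit its output unchanged, which is the information loss the text refers to.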

An agent may have to interact with other agents to perform its task. An important aspect of its global behavior concerns the modality of this interaction, and this issue is complex enough that learning may play a key role in addressing it. In this paper we will show how ELF can be applied in a simulated robotic soccer context, where agents can communicate to improve their limited perception of an environment in which they have to interact with agents that may oppose them. This happens in a real-time, dynamic environment (provided by RoboCup [11]) with hard technological constraints, which will be described in Section 3. In Section 4, we present experimental results showing the effectiveness of ELF in learning a distributed policy for the interacting agents.

Section snippets

ELF: a learning fuzzy classifier system

Since the beginning of the last decade, genetic algorithms have been proposed to learn fuzzy rule bases [13], [14]. Most of the different approaches (for a review, see [4]) can be classified into two main categories, corresponding to those introduced for crisp LCS. In the Pittsburgh approach [15], a chromosome (i.e., an element of the population of solutions optimized by the genetic algorithm) represents a whole rule base [16], [17]. In this case, the rule base is globally optimized, but each
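A minimal sketch of the Pittsburgh-style encoding described above, in which one chromosome carries a complete rule base that is evaluated as a whole. The rule representation (a fuzzy-set index paired with an action index), the stand-in fitness function, and the one-point crossover are illustrative assumptions; in the real setting fitness would come from running the rule base on the task.

```python
import random

# Pittsburgh-style encoding: one chromosome = one complete rule base.
# Each rule pairs a fuzzy-set index for the input with an action index.
N_RULES, N_SETS, N_ACTIONS = 5, 3, 4

def random_chromosome():
    return [(random.randrange(N_SETS), random.randrange(N_ACTIONS))
            for _ in range(N_RULES)]

def fitness(chromosome):
    """Evaluate the *whole* rule base on the task. A placeholder score
    is used here, since the real evaluation runs the environment."""
    return sum(action for _, action in chromosome)

def crossover(p1, p2):
    """One-point crossover at rule boundaries: offspring inherit whole
    rules, so every chromosome remains a well-formed rule base."""
    cut = random.randrange(1, N_RULES)
    return p1[:cut] + p2[cut:]

population = [random_chromosome() for _ in range(20)]
best = max(population, key=fitness)   # selection acts on entire rule bases
```

In the contrasting Michigan-style approach (the one ELF builds on), each individual is a single rule and the population itself is the rule base, so credit must be assigned to individual rules rather than to whole chromosomes.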

The application

The application we have developed belongs to the field of robotic soccer; in particular, we have chosen the RoboCup Simulation League [11] as a test bed, because it offers an interesting and difficult problem and makes it possible to share knowledge among teams and to compare results, given its wide adoption in the academic world [21], [22], [23]. The nature of the RoboCup Simulation League environment imposes many constraints that an agent must satisfy to optimize its performance. First of all,

Experimental results

All the experiments that we present are structured as follows: each learning session lasts 10,000 action cycles and is divided into trials that terminate when either a final state is reached or after a maximum number of action cycles, usually fixed at 100. The reinforcement program is the only part of the learning system that differs from experiment to experiment; we describe the reinforcement program using the following notation: IPE represents the individual progress estimator, GPE
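The session/trial structure described above can be sketched as follows. Only the 10,000-cycle session length and the 100-cycle trial limit come from the text (the session length is reduced here to keep the sketch fast); the toy environment, the toy agent, and the equal IPE/GPE weighting in the reinforcement program are illustrative assumptions standing in for the RoboCup simulator and the paper's actual estimators.

```python
import random

SESSION_CYCLES = 1_000    # 10,000 in the experiments; reduced for the sketch
MAX_TRIAL_CYCLES = 100    # trial length limit used in the experiments

class ToyEnv:
    """Stand-in for the RoboCup simulator: a 1-D random walk to a goal."""
    def reset(self):
        self.pos = 0
        return self.pos
    def step(self, action):
        self.pos += action
        final = abs(self.pos) >= 10          # "final state" ends the trial
        return self.pos, final

class ToyAgent:
    def act(self, state):
        return random.choice([-1, 1])
    def reinforce(self, reward):
        pass                                 # rule-strength update goes here

def reinforcement_program(state, ipe_weight=0.5):
    """Blend of an individual progress estimator (IPE) and a global
    progress estimator (GPE); the 50/50 weighting is an assumption."""
    ipe = abs(state) / 10                    # hypothetical per-agent progress
    gpe = 0.0                                # team-level term, stubbed out
    return ipe_weight * ipe + (1 - ipe_weight) * gpe

def run_session(env, agent):
    """One learning session: a fixed cycle budget framed in trials."""
    cycles, trials = 0, 0
    while cycles < SESSION_CYCLES:
        state = env.reset()
        for _ in range(MAX_TRIAL_CYCLES):
            state, final = env.step(agent.act(state))
            agent.reinforce(reinforcement_program(state))
            cycles += 1
            if final or cycles >= SESSION_CYCLES:
                break
        trials += 1
    return trials, cycles

trials, cycles = run_session(ToyEnv(), ToyAgent())
```

Since every trial consumes at most 100 of the session's cycles, a session always contains at least `SESSION_CYCLES / MAX_TRIAL_CYCLES` trials; trials that reach a final state early simply make room for more.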

Conclusions

We have presented in this paper ELF, a learning fuzzy classifier system that can be successfully applied to many learning tasks. Here we have discussed the issues related to co-evolutionary learning, where different agents should learn a cooperative task in an environment with strong real-time constraints, in which other agents oppose the achievement of their goals. To face this problem we have addressed the need for an appropriate knowledge model, aimed at reducing the search

Acknowledgements

This research was partially supported by the Politecnico di Milano Research Grant "Development of autonomous agents through machine learning", and partially by the project "CERTAMEN", co-funded by the Italian Ministry of University and Scientific and Technological Research. We thank some former students who dedicated a significant part of their lives to work on this topic, in particular F. Giorlandino, A.M. Carpani, and G. Calegari.

References (30)

  • A. Bonarini, Anytime learning and adaptation of hierarchical fuzzy logic behaviors, Adaptive Behavior Journal (1997)
  • A. Bonarini, Reinforcement distribution to fuzzy classifiers: a methodology to extend crisp algorithms, in: IEEE...
  • R.A. Brooks, A robust layered control system for a mobile robot, IEEE Journal of Robotics and Automation (1986)
  • M. Valenzuela-Rendón, The fuzzy classifier system: a classifier system for continuously varying variables
  • C.L. Karr, L.M. Freeman, D.L. Meredith, Improved fuzzy process control of spacecraft autonomous rendezvous using a...