Distributed agents for cost-effective monitoring of critical success factors

https://doi.org/10.1016/S0167-9236(02)00113-6Get rights and content

Abstract

Business managers should promptly respond to important events (e.g. exceptions) that happen on a set of critical success factors (CSF). A CSF monitoring system is thus essential in capturing the events for the managers. It monitors the information items concerning the CSF. Once an update is detected, critical events may be validated, logged, and signaled for the manager. Since CSF monitoring is often time-critical and mission-critical, a CSF monitoring system should be cost-effective: It should detect updates in a timely, complete, and robust manner without incurring heavy loading to related information servers (e.g. query overheads) and the Intranet (e.g. communication overheads). To achieve that, the monitoring tasks should be properly distributed and coordinated on the Intranet. We propose a multiagent CSF monitoring paradigm, CSFMonitor, in which distributed agents share a collective goal of cost-effective CSF monitoring. An experiment on monitoring real-world financial CSF is conducted. The delivery of CSFMonitor to businesses may robustly provide more complete and timelier information without causing serious problems to the original information processing in businesses.

Introduction

Management by exceptions (MBE) has been widely adopted in business administration. It suggests managers to promptly respond to the exceptions happening on a predefined set of critical success factors (CSF), without being involved with the tedious activities of monitoring and validation [3], [16]. CSF monitoring is thus essential to the realization of MBE. Conceptually, a CSF consists of a critical information item and a validation procedure. The critical information item may be maintained by internal or external entities (e.g. database systems, web servers, and file servers) [14] and updated at any time [32]. Once an update on the information item is detected, the validation procedure is triggered to determine whether an exception occurs, and if so, corresponding managers are notified. By promptly responding to the exceptions happening on the CSF, the managers may maximize the wealth of their businesses.

In this paper, we explore how multiagent systems may support CSF monitoring so that managers may promptly respond to a higher percentage of exceptions. CSF monitoring is actually a kind of information monitoring, since exceptions may be detected only by monitoring the updates of their corresponding information items. However, since CSF monitoring is often more time-critical and mission-critical, it is associated with particular requirements on timely, complete, and robust (fault tolerant) update detection. That is, a CSF monitoring system should detect a higher percentage of updates in a timely manner without being crashed by any single fault or failure.

The requirements significantly increase the cost of CSF monitoring. We thus focus on the design of multiagent systems for cost-effective CSF monitoring, which aims to maximize the quality but minimize the cost of CSF monitoring. On the quality part, we are concerned with the timeliness, completeness, and robustness of update detection. On the cost part, we are concerned with the loading incurred by the monitor to the related information servers, the Intranet, and the Internet. The amount of the cost should be properly controlled; otherwise, the delivery of the system will make all the servers and the Intranet exhausted, which, in turn, deteriorates the performance of original information processing in businesses [23]. Similar situations may be found in various domains such as information inquiry through the Internet [21]. A cost-effective CSF monitor should maximize timeliness, completeness, and robustness of update detection, while simultaneously minimize the cost it incurs.

Major challenges of cost-effective CSF monitoring lie on the trade-off between the quality concerns and the cost concern. For example, timely and complete monitoring often calls for frequent inquiry to information servers, which will incur heavy loading (cost) to the servers and the Intranet. As another example, consider robust monitoring, which may be achieved by distributing the monitoring tasks (i.e. the tasks of monitoring individual CSF) to multiple machines. In that case, a fault on a machine will not cause serious problems to CSF monitoring. However, intensive communications among the distributed tasks should be avoided for not incurring a heavy burden to the Intranet.

Previous information monitoring techniques did not tackle the challenges, making them unsuitable for CSF monitoring. They often relied on a centralized site to monitor all information items [9], [24], [26], [27], [30]. This method cannot be robust, since a fault on the centralized site could stop all monitoring tasks, and thus cause a great loss to the managers. Moreover, previous techniques often predefined a frequency to periodically check for information updates. This method could not fulfill the requirements of CSF monitoring either, since critical information items may be updated at any time. They may even be updated frequently at a particular time (e.g. the daytime of some particular days in a month) but infrequently at another time. Since periodical monitoring does not estimate the timing of information updates, it may waste a great amount of efforts without finding information updates. On the other hand, the timing of information updates cannot be hypothesized by traditional forecasting techniques (e.g. moving average) either, since the monitoring system cannot know the happening time of the updates. (Note: Only the servers that maintain the information items may know the happening time. Unfortunately, in most cases, they cannot provide such information to the monitoring system.) No reliable history data may be sampled in order to forecast the happening time of the next update.

We tackle the challenges by delegating a monitoring task to an agent, which may be distributed to various sites on the Intranet. The distributed setting contributes three main benefits to CSF monitoring: load balancing, bandwidth saving, and robust monitoring. This is because the agents may be distributed according to the loading (for load balancing) and the location (for bandwidth saving) of each site. A fault on a site will not stop all the monitoring tasks (i.e. robust monitoring). Therefore, under such a distributed setting, we explore how individual agents may learn to monitor their CSF and collaborate with each other so that a limited amount of resources (i.e. query transactions and Intranet bandwidths) may be directed to those agents that are more likely to detect information updates. The amount of the communications required for the collaboration should be controlled as well, since intensive communications among the distributed agents may cause serious problems to both the efficiency of the collaboration and the loading of the Intranet.

We first survey the environments and requirements of distributed cost-effective CSF monitoring for decision support (ref. Section 2). Based on the requirements, a multiagent paradigm CSFMonitor is developed (ref. Section 3). Each agent learns to properly work with each other to achieve cost-effective CSF monitoring. Cost-effectiveness of CSFMonitor is investigated in an experiment that simulates real-world environments of financial CSF (ref. Section 4). In the experiment, CSFMonitor significantly outperforms state-of-the-art monitoring techniques. The framework is then comprehensively evaluated from the viewpoints of its related work and future work 5 Discussion, 6 Future work. We finally conclude that effective decision making may be supported by the delivery of CSFMonitor, which aims to robustly and promptly report more events happening on CSF without causing problems to original information processing in businesses.

Section snippets

Distributed cost-effective CSF monitoring

Distributed cost-effective CSF monitoring aims to achieve cost-bounded, timely, complete, and robust monitoring of CSF for business managers.

Distributed agents for cost-effective CSF monitoring

A multiagent model CSFMonitor is developed for fulfilling the requirements of distributed cost-effective CSF monitoring.

Experiment

Experiments, which simulated real-world environments for financial decision support, were designed to investigate the performance of CSFMonitor. The experiments were conducted on PCs with independent and concurrent processes of two types. One was for simulating the update behaviors of CSF (ref. Section 4.1); while the other was for simulating the operations of various kinds of monitoring systems (ref. Section 4.2).

Discussion

The agents in CSFMonitor support distributed cost-effective CSF monitoring by a simple and effective coordination protocol. In this section, we discuss major contributions of the framework with respect to its related work.

Future work

CSFMonitor does not consider the dynamic distribution of agents to the Intranet. As mentioned above, the distribution of agents may contribute the benefits of fault tolerance, load balancing, and bandwidth saving. However, there might exist trade-offs among the benefits as well. For example, to save bandwidths, the agents may be hosted in the site near to the machines that maintain the information items being monitored. If the site is quite busy at the current moment, load balancing is

Conclusion

It is a must for managers to effectively capture those critical events that happen on CSF. However, managers often have difficulties in monitoring the CSF by themselves all the time. Therefore, a multiagent system is helpful in monitoring the CSF on behalf of the managers. Once an agent detects a status update of a CSF, it validates and handles the events happening on the CSF. By the support from the agents, the managers may focus their attention on other important jobs while simultaneously

Acknowledgements

This research was supported by the National Science Council of the Republic of China under grants 88-2213-E-216-003 and NSC 89-2213-E-216-003.

Rey-Long Liu is currently an associate professor of the Department of Information Management, ChungHua University, Taiwan. He received his PhD degree in Computer Science from the National TsingHua University, Taiwan, 1994. His research interest lies on the development and application of intelligent information technology to businesses information management, with a special focus on adaptive information systems. His main areas of interest include intelligent multiagent systems, information

References (33)

  • B Fazlollahi et al.

    Adaptive decision support systems

    Decis. Support Syst.

    (1997)
  • L Volonino et al.

    Using EIS to respond to dynamic business conditions

    Decis. Support Syst.

    (1995)
  • A Anderson et al.

    Integer programming for combinatorial auction winner determination

  • M Andersson et al.

    Time–quality tradeoffs in reallocative negotiation with combinatorial contract types

  • A.A Atkinson et al.

    Planning and control

  • R Azoulay-Schwartz et al.

    Assessing usage patterns to improve data allocation via auctions

  • M Barbuceanu

    Coordinating agents by role-based social constraints and conversation plans

  • R.P Bonasso et al.

    Experiences with an architecture for intelligent, reactive agents

  • C Brown et al.

    AI on the WWW supply and demand agents

    IEEE Expert

    (Aug. 1995)
  • H Chen et al.

    An intelligent personal spider (agent) for dynamic Internet/Intranet searching

    Decis. Support Syst.

    (1998)
  • Comshare, Comshare Decision, http://www.comshare.com,...
  • J Cuena et al.

    Distributed models for decision support

  • K Decker et al.

    MACRON: An Architecture for Multi-Agent Cooperative Information Gathering

  • E.H Durfee et al.

    The search for coordination: knowledge-guided abstraction and search in a hierarchical behavior space

  • M.N Frolick et al.

    Using EISs for environmental scanning

    Inf. Syst. Manage.

    (1997)
  • S Giroux

    Open reflective agents

  • Cited by (4)

    Rey-Long Liu is currently an associate professor of the Department of Information Management, ChungHua University, Taiwan. He received his PhD degree in Computer Science from the National TsingHua University, Taiwan, 1994. His research interest lies on the development and application of intelligent information technology to businesses information management, with a special focus on adaptive information systems. His main areas of interest include intelligent multiagent systems, information retrieval, data mining, machine learning, and knowledge management systems.

    Yun-Ling Lu is currently a master student of the Department of Information Management, ChungHua University, Taiwan. Her research interests include knowledge management, intelligent multiagent systems, and information retrieval.

    View full text