Embedding decision-analytic control in a learning architecture

https://doi.org/10.1016/0004-3702(91)90008-8Get rights and content

Abstract

An autonomous agent's control problem is often formulated as the attempt to minimize the expected cost of accomplishing a goal. This paper presents a three-dimensional view of the control problem that is substantially more realistic. The agent's control policy is assessed along three dimensions: deliberation cost, execution cost, and goal value. The agent must choose which goal to attend to as well as which action to take. Our control policy seeks to maximize satisfaction by trading execution cost and goal value while keeping deliberation cost low. The agent's control decisions are guided by the MU heuristic—choose the alternative whose marginal expected utility is maximal. Thus, when necessary, the agent will prefer easily-achieved goals to attractive but difficult-to-attain alternatives. The MU heuristic is embedded in an architecture with record-keeping and learning capabilities. The architecture offers its control module expected utility and expected cost estimates that are gradually refined as the agent accumulates experience. A programmer is not required to supply that knowledge, and the estimates are provided without recourse to distributional assumptions.

References (39)

  • T. Dean

    Decision-theoretic control of inference for time-critical applications

    (1990)
  • T. Dean et al.

    An analysis of time-dependent planning

  • J. Doyle

    Big problems for artificial intelligence

    AI Mag.

    (1988)
  • J. Doyle

    Rationality and its roles in reasoning (extended abstract)

  • O. Etzioni et al.

    A comparative analysis of chunking and decision-analytic control

  • M.R. Fehling et al.

    Adaptive planning and search

  • E.A. Feigenbaum et al.

    Performance of a reading task by an elementary perceiving and memorizing program

    Behav. Sci.

    (1963)
  • M.R. Garey et al.
  • P. Haddawy et al.

    Issues in decision-theoretic planning: symbolic goals and numeric utilities

    (1990)
  • Cited by (34)

    • A deliberative scheduling technique for a real-time agent architecture

      2006, Engineering Applications of Artificial Intelligence
      Citation Excerpt :

      This set of algorithms or methods is used to make tradeoffs in computation time vs. quality and may have different performance characteristics in different environmental situations. There are several works on the multiple-method approach such as the works shown by Garvey and Lesser (1993), Bonnisone and Halverson (1990) and Etzioni (1991), they all focus on designing a solution that uses all available resources to maximize the quality within the available time based on the existence of multiple methods for many tasks. The multiple-method approach has at least two potential advantages over an anytime algorithm approach (Garvey and Lesser, 1994).

    • Deriving consensus in multiagent systems

      1996, Artificial Intelligence
    View all citing articles on Scopus
    View full text