Abstract
This paper develops the formal framework of ant programming with the goal of gaining a deeper understanding on ant colony optimization and, more in general, on the principles underlying the use of an iterated Monte Carlo approach for the multi-stage solution of combinatorial optimization problems. Ant programming searches for the optimal policy of a multi-stage decision problem to which the original combinatorial problem is reduced. In order to describe ant programming we adopt on the one hand concepts of optimal control, and on the other hand the ant metaphor suggested by ant colony optimization. In this context, a critical analysis is given of notions such as state, representation, and sequential decision process under incomplete information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Beckers, J. L. Deneubourg, and S. Goss. Trails and U-turns in the selection of the shortest path by the ant Lasius Niger. Journal of Theoretical Biology, 159:397–415, 1992.
R. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, USA, 1957.
D. P. Bertsekas. Dynamic Programming and Optimal Control. Athena Scientific, Belmont, MA, USA, 1995. Vols. I and II.
M. Birattari, G. Di Caro, and M. Dorigo. For a formal foundation of the Ant Programming approach to combinatorial optimization. Part 1: The problem, the representation, and the general solution strategy. Technical Report TR-H-301, ATR Human Information Processing Research Laboratories, Kyoto, Japan, 2000.
V. Boltyanskii. Optimal Control of Discrete Systems. John Wiley & Sons, New York, NY, USA, 1978.
M. Dorigo. Optimization, Learning and Natural Algorithms (in Italian). PhD thesis, Dipartimento di Elettronica, Politecnico di Milano, Milan, Italy, 1992.
M. Dorigo, E. Bonabeau, and G. Theraulaz. Ant algorithms and stigmergy. Future Generation Computer Systems, 16(8):851–871, 2000.
M. Dorigo and G. Di Caro. The ant colony optimization meta-heuristic. In D. Corne, M. Dorigo, and F. Glover, editors, New Ideas in Optimization, pages 11–32. McGraw-Hill, NewYork, NY, USA, 1999.
M. Dorigo, G. Di Caro, and T. Stützle (Editors). Special issue on “Ant Algorithms”. Future Generation Computer Systems, 16(8), 2000.
M. Dorigo, G. Di Caro, and L. M. Gambardella. Ant algorithms for distributed discrete optimization. Artificial Life, 5(2):137–172, 1999.
M. Dorigo, V. Maniezzo, and A. Colorni. The ant system:An autocatalytic optimizing process. Technical Report 91-016 Revised, Dipartimento di Elettronica, Politecnico di Milano, Milan, Italy, 1991.
M. Dorigo, M. Zlochin, N. Meuleau, and M. Birattari. Updating ACO pheromones using stochastic gradient ascent and cross-entropy methods. In S. Cagnoni, J. Gottlieb, E. Hart, M. Middendorf, and R. Raidl, editors, EvoCOP 2002: Applications of Evolutionary Computing, volume 2279 of Lecture Notes in Computer Science, pages 21–30. Springer-Verlag, Heidelberg, Germany, 2002
P. P. Grassé. La reconstruction du nid et les coordinations interindividuelles chez bellicositermes natalensis et cubitermes sp. La théorie de la stigmergie: essai d’interprétation du comportement des termites constructeurs. Insectes Sociaux, 6:41–81, 1959.
W. Gutjahr. A graph-based ant system and its convergence. Special issue on Ant Algorithms, Future Generation Computer Systems, 16(8):873–888, 2000.
N. Meuleau and M. Dorigo. Ant colony optimization and stochastic gradient descent. Artificial Life, 8(2):103–121, 2002.
T. Stützle and M. Dorigo. A short convergence proof for a class of ACO algorithms. IEEE Transactions on Evolutionary Computation, 6(4), 2002, in press.
R. S. Sutton and A. G. Barto. Reinforcement Learning. An Introduction. MIT Press, Cambridge, MA, USA, 1998.
C. J. C. H. Watkins. Learning from Delayed Rewards. PhD thesis, King’s College, Cambridge, United Kingdom, 1989.
S. D. Whitehead and D. H. Ballard. Learning to perceive and act. Machine Learning, 7(1):45–83, 1991.
L. Zadeh and C. Desoer. Linear System Theory. McGraw-Hill, NewYork, NY, USA, 1963.
M. Zlochin, M. Birattari, N. Meuleau, and M. Dorigo. Model-base search for combinatorial optimization. Technical Report TR/IRIDIA/2001-15, IRIDIA, Université Libre de Bruxelles, Brussels, Belgium, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Birattari, M., Di Caro, G., Dorigo, M. (2002). Toward the Formal Foundation of Ant Programming. In: Dorigo, M., Di Caro, G., Sampels, M. (eds) Ant Algorithms. ANTS 2002. Lecture Notes in Computer Science, vol 2463. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45724-0_16
Download citation
DOI: https://doi.org/10.1007/3-540-45724-0_16
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44146-5
Online ISBN: 978-3-540-45724-4
eBook Packages: Springer Book Archive