Abstract
There are numerous applications of multi-agent systems like disaster management [1], sensor networks [2], traffic control [3] and scheduling problems [4] where agents should coordinate to achieve a common goal. In most of these cases a centralized solution is inefficient because of the scale and the complexity of the problems and thus distributed solutions are required.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kitano, H., Tadokoro, S., Noda, I., Matsubara, H., Takahashi, T., Shinjou, A., Shimada, S.: Robocup rescue: Search and rescue in large-scale disasters as a domain for autonomous agents research. In: Proc. of IEEE Conf. on System, Man and Cybernetics, 5 pages. (1999)
Kho, J., Rogers, A., Jennings, N.R.: Decentralized control of adaptive sampling in wireless sensor networks. ACM Trans. Sen. Netw. 5(3), 1–35 (2009)
van Leeuwen, P., Hesselink, H., Rohling, J.: Scheduling aircraft using constraint satisfaction. Electr. Notes Theor. Comput. Sci. 76 (2002)
Stranjak, A., Dutta, P.S., Ebden, M., Rogers, A., Vytelingum, P.: A multi-agent simulation system for prediction and scheduling of aero engine overhaul. In: AAMAS 2008: Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems, pp. 81–88 (2008)
Wolpert, D.H., Tumer, K.: An introduction to collective intelligence. Technical report, NASA (1999)
Arslan, G., Marden, J.R., Shamma, J.S.: Autonomous vehicle-target assignment: A game-theoretical formulation. Journal of Dynamic Systems, Measurement, and Control 129(5), 584–596 (2007)
Crites, R.H., Barto, A.: Improving elevator performance using reinforcement learning. Advances in Neural Information Processing Systems 8 (1996)
Littman, M.: Markov games as a framework for multiagent reinforcement learning. In: Proceedings of the Eleventh International Conference of Machine learning (1994)
Uther, W., Veloso, M.: Adversarial reinforcement learning. Technical report, Carnegie Mellon University (1997)
Bowling, M., Veloso, M.: Multiagent learning using a variable learning rate. Artificial Intelligence 136, 215–250 (2002)
Fudenberg, D., Levine, D.: The theory of Learning in Games. The MIT Press, Cambridge (1998)
Monderer, D., Shapley, L.: Potential games. Games and Economic Behavior 14, 124–143 (1996)
Wolpert, D.H., Strauss, C.E.M., Rajnarayan, D.: Advances in distributed optimization using probability collectives. Advances in Complex Systems (ACS) 9(04), 383–436 (2006)
Leslie, D.S., Collins, E.: Generalised weakened fictitious play. Games and Economic Behavior 56(2), 285–298 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Smyrnakis, M., Leslie, D.S. (2010). Convergence of Probability Collectives with Adaptive Choice of Temperature Parameters. In: Blum, C., Battiti, R. (eds) Learning and Intelligent Optimization. LION 2010. Lecture Notes in Computer Science, vol 6073. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13800-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-13800-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13799-0
Online ISBN: 978-3-642-13800-3
eBook Packages: Computer ScienceComputer Science (R0)