Market-Based Dynamic Task Allocation Using Heuristically Accelerated Reinforcement Learning

Gurzoni, José Angelo; Tonidandel, Flavio; Bianchi, Reinaldo A. C.

doi:10.1007/978-3-642-24769-9_27

José Angelo Gurzoni Jr.²¹,
Flavio Tonidandel²¹ &
Reinaldo A. C. Bianchi²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7026))

Included in the following conference series:

Portuguese Conference on Artificial Intelligence

1474 Accesses
6 Citations

Abstract

This paper presents a Multi-Robot Task Allocation (MRTA) system, implemented on a RoboCup Small Size League team, where robots participate of auctions for the available roles, such as attacker or defender, and use Heuristically Accelerated Reinforcement Learning to evaluate their aptitude to perform these roles, given the situation of the team, in real-time.

The performance of the task allocation mechanism is evaluated and compared in different implementation variants, and results show that the proposed MRTA system significantly increases the team performance, when compared to pre-programmed team behavior algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bianchi, R.A.C., Ribeiro, C., Costa, A.: Accelerating autonomous learning by using heuristic selection of actions. Journal of Heuristics 14, 135–168 (2008)
Article MATH Google Scholar
Browning, B., Bruce, J., Bowling, M., Veloso, M.: STP: Skills, tactics and plays for multi-robot control. IEEE Journal of Control and Systems Engineering 219, 33–52 (2005)
Google Scholar
Bruce, J., Zickler, S., Licitra, M., Veloso, M.: Cmdragons: Dynamic passing and strategy on a champion robot soccer team. In: Proceedings of the IEEE Int. Conf. on Robotics and Automation (ICRA), Pasadena, CA (2008)
Google Scholar
Celiberto Jr., L.A., Ribeiro, C.H.C., Costa, A.H.R., Bianchi, R.A.C.: Heuristic Reinforcement Learning Applied to RoboCup Simulation Agents. In: Visser, U., Ribeiro, F., Ohashi, T., Dellaert, F. (eds.) RoboCup 2007: Robot Soccer World Cup XI. LNCS (LNAI), vol. 5001, pp. 220–227. Springer, Heidelberg (2008)
Chapter Google Scholar
Dias, M.B., Zlot, R.M., Zinck, M.B., Gonzalez, J.P., Stentz, A.T.: A versatile implementation of the traderbots approach for multirobot coordination. In: Int. Conf. on Intelligent Autonomous Systems (2004)
Google Scholar
Dias, M., Zlot, R., Kalra, N., Stentz, A.: Market-based multirobot coordination: A survey and analysis. Proceedings of the IEEE 94(7), 1257–1270 (2006)
Article Google Scholar
Gerkey, B., Matarić, M.: Sold!: auction methods for multirobot coordination. IEEE Transactions on Robotics and Automation 18(5), 758–768 (2002)
Article Google Scholar
Gerkey, B.P., Matarić, M.J.: Multi-robot task allocation: analyzing the complexity and optimality of key architectures. In: Proceedings of IEEE Int. Conf. on Robotics and Automation, ICRA 2003, vol. 3, pp. 3862–3868 (September 2003)
Google Scholar
Gerkey, B.P., Matarić, M.J.: A formal analysis and taxonomy of task allocation in multi-robot systems. Int. Journal of Robotics Research 23(9), 939–954 (2004)
Article Google Scholar
Kose, H., Tatlidede, U., Mericli, C., Kaplan, K., Akin, H.L.: Q-learning based market-driven multi-agent collaboration in robot soccer. In: Proceedings of the Turkish Symposium on Artificial Intelligence and Neural Networks, pp. 219–2228 (2004)
Google Scholar
Kyrylov, V.: Balancing Gains, Risks, Costs, and Real-Time Constraints in the Ball Passing Algorithm for the Robotic Soccer. In: Lakemeyer, G., Sklar, E., Sorrenti, D.G., Takahashi, T. (eds.) RoboCup 2006: Robot Soccer World Cup X. LNCS (LNAI), vol. 4434, pp. 304–313. Springer, Heidelberg (2007)
Chapter Google Scholar
Parker, L.E., Tang, F.: Building multirobot coalitions through automated task solution synthesis. Proceedings of the IEEE 94(7), 1289–1305 (2006)
Article Google Scholar
Parker, L.E.: Distributed intelligence: Overview of the field and its application in multi-robot systems. Journal of Physical Agents 2(1), 5–14 (2008); special issue on Multi-Robot Systems
Google Scholar
Sandholm, T., Suri, S.: Improved algorithms for optimal winner determination in combinatorial auctions and generalizations. In: Proceedings of the Seventeenth National Conf. on Artificial Intelligence, pp. 90–97 (2000)
Google Scholar
Stone, P., Sutton, R.S., Kuhlmann, G.: Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior 13(3), 165–188 (2005)
Article Google Scholar
Sukthankar, G., Sycara, K.: Robust recognition of physical team behaviors using spatio-temporal models. In: AAMAS 2006: Proceedings of the Fifth Int. Joint Conf. on Autonomous Agents and Multiagent Systems, pp. 638–645. ACM (2006)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Tang, F., Parker, L.E.: A complete methodology for generating multi-robot task solutions using asymtre-d and market-based task allocation. In: 2007 IEEE Int. Conf. on Robotics and Automation, pp. 3351–3358 (April 2007)
Google Scholar
Taylor, M.E., Stone, P.: Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research 10(1), 1633–1685 (2009)
MathSciNet MATH Google Scholar
Vail, D., Veloso, M.: Feature selection for activity recognition in multi-robot domains. In: AAAI 2008, Twenty-third Conf. on Artificial Intelligence (2008)
Google Scholar
Watkins, C.J.C.H.: Learning from Delayed Rewards. Ph.D. thesis, University of Cambridge (1989)
Google Scholar
Weigel, T., Auerbach, W., Dietl, M., Dümler, B., Gutmann, J.-S., Marko, K., Müller, K., Nebel, B., Szerbakowski, B., Thiel, M.: CS Freiburg: Doing the Right Thing in a Group. In: Stone, P., Balch, T., Kraetzschmar, G.K. (eds.) RoboCup 2000. LNCS (LNAI), vol. 2019, p. 52. Springer, Heidelberg (2001)
Chapter Google Scholar
Werger, B., Mataric, M.J.: Broadcast of local eligibility for multi-target observation. In: 5th Int. Symposium on Distributed Autonomous Robotic Systems (DARS), pp. 347–356 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Centro Universitário da FEI, São Bernardo do Campo, Brazil
José Angelo Gurzoni Jr., Flavio Tonidandel & Reinaldo A. C. Bianchi

Authors

José Angelo Gurzoni Jr.
View author publications
You can also search for this author in PubMed Google Scholar
Flavio Tonidandel
View author publications
You can also search for this author in PubMed Google Scholar
Reinaldo A. C. Bianchi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculdade de Ciências, Departamento de Informática, GUESS/LabMAg/Universidade de Lisboa, Campo Grande, 749-016, Lisboa, Portugal
Luis Antunes
Department of Computer Science and Engineering, INESC-ID, Instituto Superior Técnico, IST, Avenida Rovisco Pais, 1049-001, Lisboa, Portugal
H. Sofia Pinto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gurzoni, J.A., Tonidandel, F., Bianchi, R.A.C. (2011). Market-Based Dynamic Task Allocation Using Heuristically Accelerated Reinforcement Learning. In: Antunes, L., Pinto, H.S. (eds) Progress in Artificial Intelligence. EPIA 2011. Lecture Notes in Computer Science(), vol 7026. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24769-9_27

Download citation

DOI: https://doi.org/10.1007/978-3-642-24769-9_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24768-2
Online ISBN: 978-3-642-24769-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics