Abstract
This article describes how an experiment to train an agent to perform a task, which had originally failed, was made successful by incorporating a contextual structure that decomposed the tasks into contexts through Context-based Reasoning. The task involved a simulation of a crane that was used by a human operator to move boxes from arbitrary locations throughout a wide area to a designated drop off location in the environment. Initial attempts to teach an agent how to perform the task through observation in a context-free manner yielded poor performance. However, when the task to be learned was decomposed into separate contexts and the agents learned each context independently, the performance improved significantly. The paper describes the process that enabled the improvements achieved and discusses the tests and results that demonstrated the improvement.
Similar content being viewed by others
References
Stein G, Gonzalez A J (2011) Building high-performing human-like tactical agents through observation and experience. IEEE Trans Syst Man Cybern B 41(3):792–804
Fernlund H, Gonzalez A J, Georgiopoulos M, DeMara R F (2006) Learning tactical human behavior through observation of human performance. IEEE Trans Syst Man Cybern B 36(1):128–140
Johnson C L, Gonzalez A J (2014) Learning collaborative behavior by observation. Expert Syst Appl 41:2316–2328
Sammut C, Hurst S, Kedzier D, Michie D (1992) Learning to fly. In: Proceedings of the ninth international conference on machine learning. Aberdeen, pp 335–339
Ontañón S, Bonnette K, Mahindrakar P, Gómez-Martín MA, Long K, Radhakrishnan J, Shah R, Ram A (2009) Learning from human demonstrations for real-time case-based planning. IJCAI-09 workshop on learning structural knowledge from observations
Bentivegna D, Atkeson C (2001) Learning from observation using primitives. In: Proceedings of the IEEE international conference on robotics & automation. Seoul
Lee S, Chen J (1994) Skill Learning from observations. Proc IEEE Conf Robot Autom 4:3245–3250
Quinlan J R (1986) Induction of decision trees. Mach Learn 1:81–106
Breiman L, Friedman J H, Olshen R A, Stone C J (1984) Classification and regression trees. Wadsworth International Group, Belmont
Friedman J H (1991) Multivariate adaptive regression splines. Ann Stat 19:1–41
Pagallo G, Haussler D (1990) Boolean feature discovery in empirical learning. Mach Learn 5:71–99
Furnkranz J (1999) Separate-and-conquer rule learning. Artif Int Rev 13:3–54
Bostrom H (1995) Covering vs. divide-and-conquer for top-down induction of logic programs. In: Proceedings of the international joint conference on artificial intelligence
Bostrom H, Asker L (1999) Combining divide-and-conquer and separate-and-conquer for efficient and effective. In: Ninth international workshop on inductive logic programming. Bled, Slovenia
Bshouty N H (1996) Simple learning algorithms using divide-and-conquer. Comput Complex 6:174–194
Cohen W W (1993) Efficient pruning methods for separate-and-conquer rule learning methods. In: Proceedings of the 13th international joint conference on artificial intelligence. Chambery
Dietterich TG (2000) The divide-and-conquer manifesto. In: Proceedings of the conference on algorithmic learning theory
Gonzalez A J, Ahlers R (1998) Context-based representation of intelligent behaviour in training simulations. Trans Soc Comput Simul Int 15(4):153–166
Romaniuk S G, Hall L O (1993) Divide and conquer neural networks. Neural Netw 6(8):1105–1116
Frosyniotis D, Stafylopatis A, Likas A (2003) A divide-and-conquer method for multi-net classifiers. Pattern Anal Appl 6:32–40
Stach W, Kurgan L, Pedrycz W (2010) A divide and conquer method for learning large fuzzy cognitive maps. Fuzzy Sets Syst 161:2515–2532
Fu H-C, Lee Y-P, Chiang C-C, Pao H-T (2001) Divide-and-conquer learning and modular perceptron networks. IEEE Trans Neural Netw 12(2):250–263
Bentivegna D, Atkeson C (2001) Learning from observation using primitives. In: Proceedings of the IEEE international conference on robotics & automation. Seoul
Floyd M W, Esfandiari B, Lam K (2008) A case-based reasoning approach to imitating robocup players. In: Proceedings of the twenty-first international Florida artificial intelligence research society (FLAIRS). pp 251–256
Oñón S (2012) Case acquisition strategies for case-based reasoning in real-time strategy games. In: Proceedings of FLAIRS 2012 Conference
Cohen W W (1999) Context-sensitive learning methods for text categorization. ACM Trans Inf Syst 17(2):141–173
Grobelnik M, Mladenić D, Leban G, Štajner T (2011) Machine learning techniques for understanding context and process. In: Warren P, Davies J, Simperl E (eds) Context and semantics for knowledge management. Springer, pp 127–145
Lieberman H, Selker T (2000) Out of context: computer systems that adapt to, and learn from, context. IBM Syst J 39(3/4):617–632
Mannini A, Sabatini A M (2010) Machine learning methods for classifying human physical activity from on-body accelerometers. Sensors 10(2):1154–1175
Johnson C, Gonzalez L (2014) Learning collaborative behavior by observation. Exert Syst Appl 41:2316–2328
Stensrud B S, Gonzalez A J (2008) Discovery of high-level behavior from observation of human performance in a strategic game. IEEE Trans Syst Man Cybern B 38(3):855–874
Trinh V C, Gonzalez A J (2013) Identifying contexts from observed human performance. IEEE Trans Human Mach Syst 43(4):359–370
Gonzalez A J, Georgiopoulos M, DeMara R F (2007) Maintaining coherence among entities states in a distributed multi-agent system. J Def Model Simul (JDMS) 4(2):147–172
Sidani T A, Gonzalez A J (2000) A framework for learning implicit expert knowledge through observation. Trans Soc Comput Simul 17(2):54–72
Stanley K, Miikkulainen R (2002) Evolving neural networks through augmenting topologies. Evol Comput 10(2):99–127
Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of the IEEE international conference on neural networks. p 4
Li P, Xiao H (2014) An improved quantum-behaved particle swarm optimization algorithm. Appl Int 40(3):479–496
Hasanzadeh M, Meybodi M R, Ebadzadeh M M (2013) Adaptive cooperative particle swarm optimizer. Appl Int 39(2):397–420
Wang H, Zhao X, Wang K, Xia K, Tu X (2014) Cooperative velocity updating model based particle swarm optimization. Appl Int 40(2):322–342
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Stein, G., Gonzalez, A.J. Learning in context: enhancing machine learning with context-based reasoning. Appl Intell 41, 709–724 (2014). https://doi.org/10.1007/s10489-014-0550-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-014-0550-0