Abstract
How to improve the behavioral decision-making ability and adaptability to unknown environments is of great importance for an agent. The traditional decision-making methods usually suffer from long training time, due to the large amount of training samples, or low adaptability to the unknown environments, or lack of the continuous learning capacities, etc. In response to these problems, this work proposes a novel motivated developmental network (MDN) to improve the decision-making ability of the agent. During the environment exploration, if the agent encounters an unknown environment, new layers and neurons are dynamically inserted to the MDN, according to the task requirements. Through the interaction between internal neurons and the inserted new neurons, the agent can autonomously develop and learn in the unknown environments without training data, but the behavioral decision-making at this stage is random. To further improve the agent’s decision-making ability, in the off-task process, through the gated self-organization mechanism, the agent can selectively recall the specific knowledge in its “brain”, and the MDN will transfer and generate a large amount of new data, according to the recalled knowledge, then the new layers and neurons will be inserted to memorize the new knowledge. Hence the knowledge base of the MDN becomes more and more complete, thereby improving its decision-making ability and adaptability to the new environment. To demonstrate the performance of the MDN model, a mobile robot navigation in different environments are executed. The experimental results illustrate that the agent can not only autonomously learn in static environments, but also has better decision-making ability in unknown dynamic environment, i.e., better adaptability to the new environment. Comparison with other algorithms further demonstrate the potential of the proposed MDN model.
Similar content being viewed by others
References
Li, H., Savkin, A.V.: An algorithm for safe navigation of mobile robots by a sensor network in dynamic cluttered industrial environments. Robot. Comput. Integr. Manuf. 54, 65–82 (2018)
Qiu, Q., Fan, Z., Meng, Z., Zhang, Q., Cong, Y., Li, B., Wang, N., Zhao, C.: Extended ackerman steering principle for the coordinated movement control of a four wheel drive agricultural mobile robot. Comput. Electron. Agric. 152, 40–50 (2018)
Buyurgan, N., Lehlou, N.: A terrain risk assessment method for military surveillance applications for mobile assets. Comput. Ind. Eng. 88, 88–99 (2015)
Sword, C.M.: Viable alternative mine operating system: A novel underwater robotic excavation system for flooded open-cut mines. Energy Procedia 125, 50–55 (2017)
Salzmann-Erikson, M., Erikssonm, H: Absorbability: applicability and availability in nursing and care robots. A thematic analysis of twitter postings. Telematics Inform. 35(5), 1553–1560 (2018)
Sanguino, T.J.M.: 50 years of rovers for planetary exploration. A retrospective review for future directions. Robot. Auton. Syst. 94, 172–185 (2017)
Bai, L., Guan, J., Chen, X., Hou, J., Duan, W.: An optional passive/active transformable wheel-legged mobility concept for search and rescue robots. Robot. Auton. Syst. 107, 145–155 (2018)
Calzado, J., Lindsay, A., Chen, C., Samuels, G., Olszewska, J.I: Sami: Interactive, multi-sense robot architecture. In: Proceedings of 22nd IEEE International Conference on Intelligent Engineering Systems, June 21–23, pp 317–322, Las Palmas de Gran Canaria, Spain (2018)
Sekiguchi, S., Yorozu, A., Kuno, K., Okada, M., Takahashi, M.: Human-friendly control system design for two-wheeled service robot with optimal control approach. Robot. Auton. Syst. 131, 1–16 (2020)
Mohanta, J.C., Keshari, A.: A knowledge based fuzzy-probabilistic roadmap method for mobile robot navigation. Appl. Soft Comput. 79, 391–409 (2019)
Hacohen, S., Shoval, S., Shvalb, N.: Applying probability navigation function in dynamic uncertain environments. Robot. Auton. Syst. 87, 237–246 (2017)
Goto, Y., Fujita, M., Nide, N.: Impletation of 3-valued paraconsistent logic programming towards decision making system of agents. J. Syst. Sci. Syst. Eng. 27(3), 323–339 (2018)
Rath, A.K., Das, D. R., Parhi, H. C., Muni, M.K., Kumar, P.B.: Analysis and use of fuzzy intelligent technique for navigation of humanoid robot in obstacle prone zone. Def. Technol. 14(6), 677–682 (2018)
Turnwald, A., Wollherr, D.: Human-like motion planning based on game theoretic decision making. Int. J. Soc. Robot. 11, 151–170 (2019)
Liu, P., Yu, H., Cang, S.: Optimized adaptive tracking control for an underactuated vibro-driven capsule system. Nonlinear Dyn. 94, 1803–1817 (2018)
Liu, P., Huda, M.N., Tang, Z., Sun, L.: A self-propelled robotic system with a visco-elastic joint: dynamics and motion analysis. Eng. Comput. 36, 655–669 (2020)
Ahmed, S.A., Topalov, A.V., Shakev, N.G., Popov, V.L.: Model-free detection and following of moving objects by an omnidirectional mobile robot using 2d range data. IFAC PapersOnLine 51(22), 226–231 (2018)
Magro, A.V., Manso, L.J., Macharet, D.G., Bustos, P.: Socially aware robot navigation system in human-populated and interactive environments based on an adaptive spatial density function and space affordances. Pattern Recogn. Lett. 118, 72–84 (2019)
Abeyrathna, K.D., Granmo, O.C., Yakovlev, R., Shafikand, A., Goodwin, M.: A novel multi-step finite-state automaton for arbitrarily deterministic tsetlin machine learning. In: Proc Artificial Intelligence XXXVII: 40th SGAI International Conference on Artificial Intelligence, December 15–17, pp 108–114, Cambridge, UK (2020)
Aleluya, E.R.M., Zamayla, A.D., Tamula, S.L.M.: Decision-making system of soccer-playing robots using finite state machine based on skill hierarchy and path planning through bezier polynomials. Procedia Comput. Sci. 135, 230–237 (2018)
Chen, X., Tian, G., Miao, Y.: Driving rule acquisition and decision algorithm to unmanned vehicle in urban traffic. Trans. Bjing Inst. Technol. 37(5), 491–496 (2017)
Klose, P., Mester, R.: Simulated autonomous driving in a realistic driving environment using deep reinforcement learning and a deterministic finite state machine. In: Proceedings of the 2nd International Conference on Applications of Intelligent Systems, pp 1–6 (2019)
Li, J., Tan, Y.: A probabilistic finite state machine based strategy for multi-target search using swarm robotics. Appl. Soft Comput. 77, 467–487 (2019)
Olszewska, J.I., Toman, J.: Open: New path-planning algorithm for real-world complex environment. In: Proceedings of the International Conference on Innovative Techniques and Applications of Artificial Intelligence, pp 237–244 (2016)
Boloor, A., Garimella, K., He, X., Gill, C.: Attacking vision-based perception in end-to-end autonomous driving models. J. Syst. Archit. 110, 1–13 (2020)
Liang, Y., Yan, Z., Zhang, Q., Liang, H., Ji, X., Liu, Y., Liu, R.: A decision-making model based on basal ganglia account of action prediction. In: Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, December 6–8, pp 1705–1710, Dali, Yunnan, China (2019)
Wang, C., Zhang, X., Cong, L., Li, J., Zhang, J.: Research on intelligent collision avoidance decision-making of unmanned ship in unknown environments. Evol. Syst. 10, 649–658 (2019)
Zheng, Z., Wu, X., Weng, J: Emergent neural turing machine and its visual navigation. Neural Netw. 10, 116–130 (2019)
Wang, D., Hu, Y., Ma, T.: Mobile robot navigation with the combination of supervised learning in cerebellum and reward-based learning in basal ganglia. Cogn. Syst. Res. 59, 1–14 (2020)
Wang, D., Wang, H., Liu, L.: Unknown environment exploration of multi-robot system with the fordpso. Swarm Evol. Comput. 26, 157–174 (2016)
Gao, W., Tang, Q., Ye, B., Yang, Y., Yao, J.: An enhanced heuristic ant colony optimization for mobile robot path planning. Soft Comput. 24, 6139–6150 (2020)
Faridi, A.Q., Sharma, S., Shukla, A., Tiwari, R., Dhar, J.: Multi-robot multi-target dynamic path planning using artificial bee colony and evolutionary programming in unknown environment. Intell. Serv. Robot. 11, 171–186 (2018)
Ding, H.: Motion path planning of soccer training auxiliary robot based on genetic algorithm in fixed-point rotation environment. J. Ambient Intell. Humanized Comput. https://doi.org/10.1007/s12652-020-01877-4 (2020)
Rao, D.C., Kabat, M.R., Das, P.K., Jena, P.K.: Cooperative navigation planning of multiple mobile robots using improved krill herd. Arab. J. Sci. Eng. 43, 7869–7891 (2018)
Gonzalez-Billandon, J., Sciutti, A., Sandini, G., Rea, F.: Towards a cognitive architecture for self-supervised transfer learning for objects detection with a humanoid robot. In: Proceedings of 2020 Joint IEEE 10th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), Oct 26–30, pp 1–8, Valparaiso, Chile (2020)
Hou, S., Dong, B., Wang, H., Wu, G.: Inspection of surface defects on stay cables using a robot and transfer learning. Autom. Constr. 119, 1–14 (2020)
Carlucho, I., Paula, M.D., Acosta, G.G.: An adaptive deep reinforcement learning approach for mimo pid control of mobile robots. ISA Trans. 102, 280–294 (2020)
Bing, Z., Lemke, C., Cheng, L., Huang, K., Knol, A.: Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning. Neural Netw. 129, 323–333 (2020)
Cuayahuitl, H.: A data-efficient deep learning approach for deployable multimodal social robots. Neurocomputing 396, 587–598 (2020)
Rincon, L., Coronado, E., Law, C., Venture, G.: Adaptive cognitive robot using dynamic perception with fast deep-learning and adaptive on-line predictive control. In: Proceedings of IFToMM World Congress on Mechanism and Machine Science, pp 2429–2438 (2019)
Liu, P., Yu, H., Cang, S.: Adaptive neural network tracking control for underactuated systems with matched and mismatched disturbances. Nonlinear Dyn. 98, 1447–1464 (2019)
Bryndin, E.: Development of sensitivity and active behavior of cognitive robot by means artificial intelligence. Int. J. Robot. Res. Dev. 10(1), 1–11 (2020)
Goel, A.K., Fitezerald, T., Parashar, P.: Analogy and Meta Reasoning: Cognitive Strategies for Robot Learning. Academic Press, Salt Lake City, UT USA (2020)
Olszewska, J.I., Houghtaling, M., Goncalves, P.J.S., Fabiano, N., Haidegger, T., Carbonera, J.L., Patterson, W.R., Ragavan, S.V., Fiorini, S.R., Prestes, E.: Robotic standard development life cycle in action. J. Intell. Robot. Syst. 98, 119–131 (2020)
Weng, J.: Why have we passed neural networks no not abstract well. Nat. Intell. INNS Mag. 1 (1), 13–22 (2011)
Wang, D., Wang, J., Liu, L.: Developmental network: An internal emergent object feature learning. Neural Process. Lett. 48, 1135–1159 (2018)
Wu, X., Bo, Y., Weng, J.: Information-dense actions as contexts. Neurocomputing 311, 164–175 (2018)
Weng, J.: Natural and Artificial Intelligence: Introduction to Computational Brain-Mind. BMI Press, Okemos, Michigan USA (2012)
Avery, M.C., Krichmar, J.L.: Neuromodulatory systems and their interactions: A review of models, theories, and experiments. Front. Neural Circ. 11, 1–18 (2017)
Dasgupta, S., Worgotter, F., Manoonpong, P.: Neuromodulatory adaptive combination of correlation-based learning in cerebellum and reward-based learning in basal ganglia for goal-directed behavior control. Front. Neural Circ. 8, 1–21 (2014)
Krichmar, J.L.: The neuromodulatory system: a framework for survival and adaptive behavior in a challenging world. Adapt. Behav. 16(6), 385–399 (2008)
Wang, D., Duan, Y., Weng, J.: Motivated optimal developmental learning for sequential tasks without using rigid time-discounts. IEEE Tran. Neural Netw. Learn. Syst. 29(10), 4917–4931 (2018)
Barr, R.: Transfer of learning between 2d and 3d sources during infancy: Informing theory and practice. Dev. Rev. 30(2), 128–154 (2010)
Solgi, M., Liu, T., Weng, J.: A computational developmental model for specificity and transfer in perceptual learning. J. Vis. 13(1), 1–23 (2013)
Liu, J.: Optimization of stochastic computing based deep learning systems with parallel finite state machine implementation. In: Proceedings of the 2020 4th International Conference on Algorithms, Computing and Systems, , September 26–28, 20120, pp 22–26, Berlin, German (2020)
Waterman, M.W., Frezzo, D.C., Wang, M.X.: Adaptive learning using finite state machine logic. In: Proceedings of the Seventh ACM Conference on Learning@Scale, pp 237–240, Virtual Event USA (2020)
Wang, D., Xin, J.: Emergent spatio-temporal multimodal learning using a developmental network. Appl. Intell. 49, 1306–1323 (2019)
Sanchez, J.A., Romero, V.: Computation of moments for probabilistic finite-state automata. Inform. Sci. 516, 388–400 (2020)
Acknowledgements
This research is supported by the National Natural Science Funds of China under Grants 61873245 and 61876169, Natural Science Funds of Henan Province under Grant 202300410483, and Scientific Problem Tackling of Henan Province under Grant 192102210256.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, D., Yang, K., Wang, H. et al. Behavioral Decision-Making of Mobile Robot in Unknown Environment with the Cognitive Transfer. J Intell Robot Syst 103, 7 (2021). https://doi.org/10.1007/s10846-021-01451-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10846-021-01451-w