Abstract
Detecting objects and referring to them in a dialog is a crucial requirement for robotic systems that cooperate with humans. For this, in an unrestricted natural environment the innate concepts of the robot must be extended and adapted over time. In this paper we describe an autonomous mobile robot system that performs online interactive incremental learning of objects. We argue that this combination strongly contributes to the variation of appearance, context, and labels under which visual concepts are encountered and thus overcomes limitations of existing databases and robotic systems where one or more of these aspects are missing. In the current prototype version, objects are shown to the robot in hand and are learned by a standard classifier on top of pre-trained CNN features. We evaluate the basic feasibility of the current approach on an existing database of hand-held objects, show how it performs online on the robot, and discuss extensions of the system towards life-long learning and data acquisition.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Andreopoulos, A., Hasler, S., Wersing, H., Janssen, H., Tsotsos, J.K., Körner, E.: Active 3D object localization using a humanoid robot. IEEE Trans. Robot. 27(1), 47–64 (2011)
Borji, A., Izadi, S., Itti, L.: iLab-20M: a large-scale controlled object dataset to investigate deep learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2221–2230. IEEE Computer Society (2016)
Camoriano, R., Pasquale, G., Ciliberto, C., Natale, L., Rosasco, L., Metta, G.: Incremental robot learning of new objects with fixed update time. In: IEEE International Conference on Robotics and Automation, pp. 3207–3214. IEEE (2017)
Chatfield, K., Simonyan, K., Vedaldi, A., Zisserman, A.: Return of the devil in the details: delving deep into convolutional nets. In: Valstar, M.F., French, A.P., Pridmore, T.P. (eds.) British Machine Vision Conference. BMVA Press (2014)
Deng, J., Dong, W., Socher, R., Li, L., Li, K., Li, F.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255. IEEE (2009)
Fäulhammer, T., et al.: Autonomous learning of object models on a mobile robot. IEEE Robot. Autom. Lett. 2(1), 26–33 (2017)
Fischer, L., Hasler, S., Schrom, S., Wersing, H.: Improving online learning of visual categories by deep features. In: Future of Interactive Learning Machines workshop at the Conference on Neural Information Processing Systems (2016)
Goerick, C., et al.: Interactive online multimodal association for internal concept building in humanoids. In: 9th IEEE-RAS International Conference on Humanoid Robots, pp. 411–418. IEEE (2009)
Goerick, C., Wersing, H., Mikhailova, I., Dunn, M.: Peripersonal space and object recognition for humanoids. In: 5th IEEE-RAS International Conference on Humanoid Robots, pp. 387–392. IEEE (2005)
Google: Cloud Speech-To-Text. https://cloud.google.com/speech-to-text/
Hammer, B., Villmann, T.: Generalized relevance learning vector quantization. Neural Netw. 15(8–9), 1059–1068 (2002)
Hasler, S., Wersing, H., Kirstein, S., Körner, E.: Large-scale real-time object identification based on analytic features. In: Alippi, C., Polycarpou, M., Panayiotou, C., Ellinas, G. (eds.) ICANN 2009. LNCS, vol. 5769, pp. 663–672. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04277-5_67
IBM: Watson Assistant. www.ibm.com/watson/services/conversation/
Jia, Y., et al.: Caffe: convolutional Architecture for Fast Feature Embedding. In: Hua, K.A., Rui, Y., Steinmetz, R., Hanjalic, A., Natsev, A., Zhu, W. (eds.) Proceedings of the ACM International Conference on Multimedia, pp. 675–678. ACM (2014)
Käding, C., Rodner, E., Freytag, A., Denzler, J.: Fine-tuning deep neural networks in continuous learning scenarios. In: Chen, C.-S., Lu, J., Ma, K.-K. (eds.) ACCV 2016. LNCS, vol. 10118, pp. 588–605. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-54526-4_43
Kaiser, L., Nachum, O., Roy, A., Bengio, S.: Learning to remember rare events. CoRR abs/1703.03129 (2017)
Kinovarobotics: Kinova. www.meetjaco.com/about/
Kirstein, S., Denecke, A., Hasler, S., Wersing, H., Gross, H., Körner, E.: A vision architecture for unconstrained and incremental learning of multiple categories. Memetic Comput. 1(4), 291–304 (2009)
Kirstein, S., Wersing, H., Körner, E.: Rapid online learning of objects in a biologically motivated recognition architecture. In: Kropatsch, W.G., Sablatnig, R., Hanbury, A. (eds.) DAGM 2005. LNCS, vol. 3663, pp. 301–308. Springer, Heidelberg (2005). https://doi.org/10.1007/11550518_38
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. In: Bartlett, P.L., Pereira, F.C.N., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25: Proceedings of the 26th Annual Conference on Neural Information Processing Systems, pp. 1106–1114 (2012)
Lee, J., Yoon, J., Yang, E., Hwang, S.J.: Lifelong learning with dynamically expandable networks. CoRR abs/1708.01547 (2017)
Lin, T.-Y., et al.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740–755. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10602-1_48
Losing, V., Hammer, B., Wersing, H.: KNN classifier with self adjusting memory for heterogeneous concept drift. In: Bonchi, F., Domingo-Ferrer, J., Baeza-Yates, R.A., Zhou, Z., Wu, X. (eds.) IEEE 16th International Conference on Data Mining, pp. 291–300. IEEE (2016)
Losing, V., Hammer, B., Wersing, H.: Incremental on-line learning: a review and comparison of state of the art algorithms. Neurocomputing 275, 1261–1274 (2018)
MetraLabs: Scitos G5. www.metralabs.com/en/mobile-robot-scitos-g5/
Mudrová, L., Lacerda, B., Hawes, N.: An integrated control framework for long-term autonomy in mobile service robots. In: 2015 European Conference on Mobile Robots, pp. 1–6. IEEE (2015)
Nakadai, K., Takahashi, T., Okuno, H.G., Nakajima, H., Hasegawa, Y., Tsujino, H.: Design and implementation of robot audition system ‘Hark’ - open source software for listening to three simultaneous speakers. Adv. Robot. 24(5–6), 739–761 (2010)
Nuance: Vocalizer Text-To-Speech. www.nuance.com/mobile/mobile-solutions/vocalizer-expressive.html
Oberlin, J., Meier, M., Kraska, T., Tellex, S.: Acquiring object experiences at scale. In: AAAI-RSS Special Workshop on the 50th Anniversary of Shakey: The Role of AI to Harmonize Robots and Humans (2015)
Pasquale, G., Ciliberto, C., Odone, F., Rosasco, L., Natale, L.: Real-world object recognition with off-the-shelf deep Conv Nets: how many objects can iCub learn? CoRR abs/1504.03154 (2015)
Pasquale, G., Ciliberto, C., Rosasco, L., Natale, L.: Object identification from few examples by improving the invariance of a Deep Convolutional Neural Network. In: IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4904–4911. IEEE (2016)
Peng, X., Usman, B., Kaushik, N., Hoffman, J., Wang, D., Saenko, K.: VisDA: the visual domain adaptation challenge. CoRR abs/1710.06924 (2017)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517–6525. IEEE Computer Society (2017)
ROS: Robot Operating System. www.ros.org
Sato, A., Yamada, K.: Generalized learning vector quantization. In: Touretzky, D.S., Mozer, M., Hasselmo, M.E. (eds.) Advances in Neural Information Processing Systems 8, pp. 423–429. MIT Press (1995)
Acknowledgments
We thank MetraLabs for the setup and support of the robots. We got a quick start with our robots by being able to use the software developed in the STRANDS project. For this we thank the whole project team, especially Lenka Mudrová and Nick Hawes. We also thank Manuel Mühlig for establishing and maintaining the basic robot software system at our institute.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Hasler, S., Kreger, J., Bauer-Wersing, U. (2018). Interactive Incremental Online Learning of Objects Onboard of a Cooperative Autonomous Mobile Robot. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11307. Springer, Cham. https://doi.org/10.1007/978-3-030-04239-4_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-04239-4_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04238-7
Online ISBN: 978-3-030-04239-4
eBook Packages: Computer ScienceComputer Science (R0)