Abstract
The aim of this paper is to create a model which is able to be used to accurately identify objects as well as spacial relationships in a dynamic environment. This paper proposed methods to train a deep learning model which recognizes unique objects and positions of key items in an environment. The model requires a low amount of images compared to others and also can recognize multiple objects in the same frame due to the utilization of region proposal networks. Methods are also discussed to find the position of recognized objects which can be used for picking up recognized items with a robotic arm. The system utilizes logic operations to be able to deduct how different objects relate to each other in regard to their placement from one another based off of the localization technique. The paper discusses how to create spacial relationships specifically.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255, June 2009. https://doi.org/10.1109/CVPR.2009.5206848
Feng, C., Xiao, Y., Willette, A., McGee, W., Kamat, V.: Towards autonomous robotic in-situ assembly on unstructured construction sites using monocular vision. In: Proceedings of the 31st International Symposium on Automation and Robotics in Construction, pp. 163–170 (2014)
Hossain, M.A., Mukit, M.: A real-time face to camera distance measurement algorithm using object classification. In: 2015 International Conference on Computer and Information Engineering (ICCIE), pp. 107–110, November 2015. https://doi.org/10.1109/CCIE.2015.7399293
Kennedy, M., Queen, K., Thakur, D., Daniilidis, K., Kumar, V.: Precise dispensing of liquids using visual feedback. In: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 1260–1266, September 2017. https://doi.org/10.1109/IROS.2017.8202301
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Ren, S., He, K., Girshick, R., Sun, J.: Faster R-CNN: towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems, pp. 91–99 (2015)
Stratigopoulos, H.G.: Machine learning applications in IC testing. In: 2018 IEEE 23rd European Test Symposium (ETS), pp. 1–10, May 2018. https://doi.org/10.1109/ETS.2018.8400701
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Wilson, D., Yan, F., Sinha, K., He, H. (2018). Robotic Understanding of Scene Contents and Spatial Constraints. In: Ge, S., et al. Social Robotics. ICSR 2018. Lecture Notes in Computer Science(), vol 11357. Springer, Cham. https://doi.org/10.1007/978-3-030-05204-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-05204-1_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05203-4
Online ISBN: 978-3-030-05204-1
eBook Packages: Computer ScienceComputer Science (R0)