Abstract
Recently, Deep Convolutional Neural Networks have been successfully applied to various robotics problems, such as robot vision and simultaneous localization and mapping. Among these, siamese and triplet networks have obtained great traction in intra-domain matching. However, it is impossible to directly use these networks in cross-domain problems. Thus, this paper proposes a new method to train a set of triplet networks to perform cross-domain matching and ranking focused in underwater robotics. The method is used to train a pair of networks to perform matching of acoustic and segmented aerial images aiming to support an underwater robot localization algorithm. Results show that the method is able to achieve up to 83% accuracy in matching acoustic and segmented aerial images and up to 85% recall in ranking relevant aerial images given an acoustic image.
Similar content being viewed by others
References
Bertinetto, L, Valmadre, J, Henriques, JF, Vedaldi, A, Torr, PH: Fully-convolutional siamese networks for object tracking. In: European Conference on Computer Vision. Springer, pp. 850–865 (2016)
Bromley, J., Bentz, J.W., Bottou, L., Guyon, I., LeCun, Y., Moore, C., Säckinger, E, Shah, R.: Signature verification using a “siamese” time delay neural network. Int. J. Pattern Recognit. Artif. Intell. 7(04), 669–688 (1993)
De Giacomo, G.G., dos Santos, M.M., Drews-Jr, P.L., Botelho, S.S.: Cooperative training of triplet networks for cross-domain matching. In: 2020 Latin American Robotics Symposium (LARS), 2020 Brazilian Symposium on Robotics (SBR) and, vol. 2020, pp 1–6. Workshop on Robotics in Education (WRE), IEEE (2020)
Dos Santos, M.M., De Giacomo, G.G., Drews, P., Botelho, S.S.: Satellite and underwater sonar image matching using deep learning. In: 2019 Latin American Robotics Symposium (LARS), 2019 Brazilian Symposium on Robotics (SBR) and 2019 Workshop on Robotics in Education (WRE), pp 109–114. IEEE (2019)
Dos Santos, M.M., De Giacomo, G.G., Drews, P.L., Botelho, SS: Underwater sonar and aerial images data fusion for robot localization. IEEE (2019)
Dos Santos, M.M., De Giacomo, G.G., Drews-Jr, P.L., Botelho, S.S.: Matching color aerial images and underwater sonar images using deep learning for underwater localization. IEEE Robot. Autom. Lett. 5(4), 6365–6370 (2020)
Fox, D.: Kld-sampling: Adaptive particle filters. Adv. Neural Inform. Process. Syst. 14, 713–720 (2001)
Glorot, X, Bengio, Y: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pp. 249–256 (2010)
Hoffer, E, Ailon, N: Deep metric learning using triplet network. In: International Workshop on Similarity-Based Pattern Recognition. Springer, pp. 84–92 (2015)
Liu, L, Jiang, H, He, P, Chen, W, Liu, X, Gao, J, Han, J: On the variance of the adaptive learning rate and beyond. arXiv:190803265 (2019)
Ribeiro, P.O., dos Santos, M.M., Drews, P.L., Botelho, S.S., Longaray, L.M., Giacomo, G.G., Pias, MR: Underwater place recognition in unknown environments with triplet based acoustic image retrieval. IEEE (2018)
Ronneberger, O, Fischer, P, Brox, T: U-net: Convolutional networks for biomedical image segmentation. In: International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, pp 234–241 (2015)
dos Santos, M.M., De Giacomo, G.G., Drews, P., Botelho, SS, 2019: Semantic segmentation of static and dynamic structures of marina satellite images using deep learning. IEEE
Schroff, F, Kalenichenko, D, Philbin, J: Facenet: A unified embedding for face recognition and clustering. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 815–823 (2015)
Shi, Y, Yu, X, Campbell, D, Li, H: Where am i looking at? Joint location and orientation estimation by cross-view matching. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4064–4072 (2020)
Silveira, L., Guth, F., Drews-Jr, P., Ballester, P., Machado, M., Codevilla, F., Duarte-Filho, N., Botelho, S.: An open-source bio-inspired solution to underwater slam. IFAC-PapersOnLine 48(2), 212–217 (2015)
de Souza Ribeiro, POC, dos Santos, MM, Drews, PLJ, da Costa Botelho, SS: Forward looking sonar scene matching using deep learning. In: 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), pp. 574–579. https://doi.org/10.1109/ICMLA.2017.00-99 (2017)
Steffens, C, Messias, L, Drews-Jr, P, Botelho, S: Contrast enhancement and image completion: A cnn based model to restore ill exposed images. In: 2019 IEEE 17th International Conference on Industrial Informatics (INDIN), pp. 226–232. https://doi.org/10.1109/INDIN41052.2019.8972228 (2019)
Steffens, C, Messias, L, Drews-Jr, P, Botelho, S: Cnn based image restoration: Adjusting ill-exposed srgb images in post-processing. Journal of Intelligent & Robotic Systems. https://doi.org/10.1007/s10846-019-01124-9 (2020)
Wang, P., Li, W., Wan, J., Ogunbona, P., Liu, X.: Cooperative training of deep aggregation networks for rgb-d action recognition. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Yu, F, Koltun, V: Multi-scale context aggregation by dilated convolutions. arXiv:151107122 (2015)
Zhang, M, Lucas, J, Ba, J, Hinton, GE: Lookahead optimizer: k steps forward, 1 step back. In: Advances in Neural Information Processing Systems, pp. 9597–9608 (2019)
Zhu, Z, Wang, Q, Li, B, Wu, W, Yan, J, Hu, W: Distractor-aware siamese networks for visual object tracking. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 101–117 (2018)
Funding
This study was partly supported by the National Council for Scientific and Technological Development (CNPq) and Coordenacao de Aperfeiçoamento de Pessoal de Nivel Superior - Brasil (CAPES) - Finance Code 001. This paper is also a contribution of the INCT-Mar COI funded by CNPq Grant Number 610012/2011-8.
Author information
Authors and Affiliations
Contributions
Giovanni G. De Giacomo: implementation and execution of the Deep Learning experiments; writing of the manuscript. Matheus M. dos Santos: development of the dataset and associated tools; helped writing the manuscript. Paulo L. J. Drews-Jr: theoretical support on the idea; revising the manuscript. Silvia S. C. Botelho: theoretical support on the idea; revising the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for Publication
Not applicable.
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Availability of data and materials
Python 3/TensorFlow 2.x implementation of the model and datasets are available for download at: https://github.com/giovgiac/neptune.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
De Giacomo, G.G., dos Santos, M.M., Drews-Jr, P.L.J. et al. Matching Cross-Domain Data with Cooperative Training of Triplet Networks: A Case Study on Underwater Robotics. J Intell Robot Syst 104, 55 (2022). https://doi.org/10.1007/s10846-022-01591-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10846-022-01591-7