Abstract
Convolutional neural networks (CNNs) have become a mainstream approach to keypoint matching, in addition to image recognition, object detection, and semantic segmentation. The Learned Invariant Feature Transform (LIFT) is a pioneering CNN-based method that performs keypoint detection, orientation estimation, and feature description in a single network. Among these processes, orientation estimation is what provides invariance to rotation changes. However, unlike the keypoint detector and the feature descriptor, the orientation estimator has not been regarded as important for accurate keypoint matching and has been little researched, even after LIFT was proposed. In this paper, we propose a novel coarse-to-fine orientation estimator that improves matching accuracy. First, a coarse orientation estimator predicts an orientation that keeps the rotation error small even when a large rotation exists between an image pair. Second, a fine orientation estimator refines the orientation estimated by the coarse stage to further improve matching accuracy. With the proposed two-stage CNNs, we can estimate orientations accurately, which improves matching performance. Experimental results on the HPatches benchmark show that our method achieves a better precision-recall curve than single CNN-based orientation estimators.
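The two-stage design described in the abstract can be illustrated with a minimal sketch. The code below is not the authors' implementation: the network depths, the 32x32 grayscale patch size, the (cos, sin) angle parameterization, and the de-rotation step between the coarse and fine stages are all assumptions made for illustration, written here in PyTorch.

```python
# Minimal sketch of a coarse-to-fine orientation estimator (illustrative only;
# layer sizes, patch size, and angle parameterization are assumptions).
import torch
import torch.nn as nn
import torch.nn.functional as F

def rotate_patches(patches, angles):
    """Resample (N, C, H, W) patches with a per-patch rotation about the centre.
    Angles are in radians; the sign convention follows torch's affine_grid."""
    cos, sin = torch.cos(angles), torch.sin(angles)
    zeros = torch.zeros_like(cos)
    theta = torch.stack([
        torch.stack([cos, -sin, zeros], dim=-1),
        torch.stack([sin,  cos, zeros], dim=-1),
    ], dim=1)                                            # (N, 2, 3) affine matrices
    grid = F.affine_grid(theta, patches.size(), align_corners=False)
    return F.grid_sample(patches, grid, align_corners=False)

class AngleRegressor(nn.Module):
    """Small CNN that regresses a patch orientation as a (cos, sin) vector."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.head = nn.Linear(64 * 4 * 4, 2)             # for 32x32 input patches

    def forward(self, x):
        v = self.head(self.features(x).flatten(1))
        v = F.normalize(v, dim=1)                        # unit (cos, sin) vector
        return torch.atan2(v[:, 1], v[:, 0])             # angle in radians

class CoarseToFineOrientation(nn.Module):
    """Two stages: a coarse angle, de-rotation, then a fine residual angle."""
    def __init__(self):
        super().__init__()
        self.coarse = AngleRegressor()
        self.fine = AngleRegressor()

    def forward(self, patches):
        coarse_angle = self.coarse(patches)              # stage 1: coarse estimate
        derotated = rotate_patches(patches, -coarse_angle)  # undo coarse rotation
        fine_angle = self.fine(derotated)                # stage 2: residual estimate
        return coarse_angle + fine_angle

# Usage: patches is an (N, 1, 32, 32) batch of keypoint patches;
# the model returns one orientation per patch, in radians.
```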
References
Arandjelović, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: Conference on Computer Vision and Pattern Recognition, pp. 2911–2918 (2012)
Bailey, T., Durrant-Whyte, H.: Simultaneous localization and mapping (SLAM): part II. Robot. Autom. Mag. 13(3), 108–117 (2006)
Balntas, V., Lenc, K., Vedaldi, A., Mikolajczyk, K.: HPatches: a benchmark and evaluation of handcrafted and learned local descriptors. In: Computer Vision and Pattern Recognition, pp. 5173–5182 (2017)
Balntas, V., Riba, E., Ponsa, D., Mikolajczyk, K.: Learning local feature descriptors with triplets and shallow convolutional neural networks. In: British Machine Vision Conference, p. 119 (2016)
Bay, H., Tuytelaars, T., Gool, L.V.: SURF: Speeded-up robust features. Comput. Vis. Image Underst. 110(3), 346–359 (2008)
Brown, M., Lowe, D.G.: Automatic panoramic image stitching using invariant features. Int. J. Comput. Vis. 74(1), 59–73 (2007)
Durrant-Whyte, H., Bailey, T.: Simultaneous localization and mapping: part I. Robot. Autom. Mag. 13(2), 99–110 (2006)
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Neural Information Processing Systems, pp. 2017–2025 (2015)
Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (2015)
Lowe, D.G.: Object recognition from local scale-invariant features. In: International Conference on Computer Vision (1999)
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_56
Mur-Artal, R., Montiel, J.M.M., Tardos, J.D.: ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE Trans. Robot. 31(5), 1147–1163 (2015)
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: Computer Vision and Pattern Recognition, vol. 2, pp. 2161–2168 (2006)
Ono, Y., Trulls, E., Fua, P., Yi, K.M.: LF-Net: learning local features from images. In: Neural Information Processing Systems (2018)
Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: an efficient alternative to SIFT or SURF. In: International Conference on Computer Vision, pp. 2564–2571 (2011)
Simo-Serra, E., Trulls, E., Ferraz, L., Kokkinos, I., Fua, P., Moreno-Noguer, F.: Discriminative learning of deep convolutional feature point descriptors. In: International Conference on Computer Vision, pp. 118–126 (2015)
Taylor, S., Drummond, T.: Binary histogrammed intensity patches for efficient and robust matching. Int. J. Comput. Vis. 94, 241–265 (2011)
Trzcinski, T., Christoudias, M., Lepetit, V.: Learning image descriptors with boosting. Pattern Anal. Mach. Intell. 37, 597–610 (2015)
Yi, K.M., Trulls, E., Lepetit, V., Fua, P.: LIFT: learned invariant feature transform. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016, Part VI. LNCS, vol. 9910, pp. 467–483. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46466-4_28
Yi, K.M., Verdie, Y., Fua, P., Lepetit, V.: Learning to assign orientations to feature points. In: Computer Vision and Pattern Recognition, pp. 107–116 (2016)
Zagoruyko, S., Komodakis, N.: Learning to compare image patches via convolutional neural networks. In: Computer Vision and Pattern Recognition, pp. 4353–4361 (2015)
Cite this paper
Mori, Y., Hirakawa, T., Yamashita, T., Fujiyoshi, H. (2020). Coarse-to-Fine Deep Orientation Estimator for Local Image Matching. In: Palaiahnakote, S., Sanniti di Baja, G., Wang, L., Yan, W. (eds) Pattern Recognition. ACPR 2019. Lecture Notes in Computer Science, vol. 12046. Springer, Cham. https://doi.org/10.1007/978-3-030-41404-7_27
DOI: https://doi.org/10.1007/978-3-030-41404-7_27