Abstract
A key challenge in current image matching techniques is building a robust local descriptor that is invariant to large variations in scale and rotation. To address this issue, this work localizes a polar gradient local oriented histogram pattern (PGP) on normalized cropped regions around detected interest points. A new image descriptor, the two-dimensional intensity gradient histogram (2DIGH), is then introduced using a joint histogram scheme. 2DIGH builds the feature vector by intersecting gradient and intensity information on the subregions of the PGP. The distance measured with K-nearest neighbor represents the similarity between feature vectors for image matching. Experimental results on the Graffiti, Boat, Bark and ZuBud datasets indicate that the performance of 2DIGH is at least 41% better than that of other widely applied descriptors.
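The joint histogram idea in the abstract can be illustrated with a minimal sketch. This is not the authors' 2DIGH implementation: it skips interest point detection, patch normalization and the polar PGP subregions, and simply bins each interior pixel of one grayscale patch by its intensity and its gradient magnitude together. The function names, the bin counts (`n_int`, `n_grad`), the central-difference gradient and the Euclidean distance are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def joint_histogram(patch, n_int=4, n_grad=4):
    """Flattened, normalized 2D histogram over (intensity, gradient magnitude).

    `patch` is a list of rows of grayscale values in [0, 255].
    Bin counts are illustrative assumptions, not values from the paper.
    """
    height, width = len(patch), len(patch[0])
    hist = [[0.0] * n_grad for _ in range(n_int)]
    # Largest central-difference gradient magnitude for 8-bit data.
    g_max = math.hypot(127.5, 127.5)
    # Border pixels are skipped because central differences need both neighbors.
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = (patch[y][x + 1] - patch[y][x - 1]) / 2.0
            gy = (patch[y + 1][x] - patch[y - 1][x]) / 2.0
            mag = math.hypot(gx, gy)
            i = min(int(patch[y][x] / 256.0 * n_int), n_int - 1)
            j = min(int(mag / g_max * n_grad), n_grad - 1)
            hist[i][j] += 1.0  # one vote in the joint (intensity, gradient) cell
    total = sum(sum(row) for row in hist)
    flat = [v for row in hist for v in row]
    return [v / total for v in flat] if total else flat


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_neighbors(query, candidates, k=1):
    """Indices of the k candidate descriptors closest to `query`."""
    order = sorted(range(len(candidates)),
                   key=lambda idx: euclidean(query, candidates[idx]))
    return order[:k]
```

A descriptor built this way for a query patch could be passed to `nearest_neighbors` together with the descriptors of candidate patches from a second image. A full pipeline would compute one such histogram per subregion and concatenate the results.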
Cite this article
Sadeghi, B., Jamshidi, K., Vafaei, A. et al. 2DIGH: a polar invariant local image descriptor based on joint histogram. Vis Comput 34, 1579–1595 (2018). https://doi.org/10.1007/s00371-017-1433-2
DOI: https://doi.org/10.1007/s00371-017-1433-2