Abstract
As reported by many neurophysiological researches, the receptive field is a basic and significant component in the human visual system. It has various kinds of properties such as orientation-selectivity, correlativity, etc. Motivated by these structural and functional properties, we propose in this paper a novel local image descriptor namely the Discriminative Transform of Receptive Field (DTRF). Specifically, around each sample pixel in the interest region, we define a low-level feature structure called Receptive Field Patterns (RFP) which is further divided into two components: the RFP-Center and RFP-Surround. Then, the local features are extracted based on Local Annular Discrete Cosine Transform (LADCT). At the descriptor construction stage, these features are pooled spatially to mimic the correlative property of receptive field. Image matching and classification experiments on four standard data set demonstrate that the proposed descriptor outperforms the state-of-the-art methods under various types of image transformations such as rotation and scaling changes, viewpoint changes, image blurring, JPEG compression, illumination changes, and image noise.
Similar content being viewed by others
References
Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: Application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041
Balasubramanian V, Sterling P (2009) Receptive fields and functional architecture in the retina. J Phys 587(12):2753–2767
Chen Y, Anand S, Martinez-Conde S, Macknik SL, Bereshpolova Y, Swadlow HA, Alonso JM (2009) The linearity and selectivity of neuronal responses in awake visual cortex. J Vis 9(9):12
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In Workshop on statistical learning in computer vision. ECCV 1(1-22):1–2
Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Proceedings IEEE ECCV, vol 1 (1-22), pp 1–2
Fan B, Wu F, Hu Z (2011) Aggregating gradient distributions into intensity orders: a novel local image descriptor. In: Proceedings IEEE CVPR, pp 2377–2384
Fan B, Wu F, Hu Z (2012) Rotationally invariant descriptors using intensity order pooling. IEEE Trans Pattern Anal Mach Intell 34(10):2031–2045
Fei-Fei L, Fergus R, Perona P (2007) Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70
Guo Z, Zhang D (2010) A completed modeling of local binary pattern operator for texture classification. IEEE Trans Image Process 19(6):1657–1663
Hafed ZM, Levine MD (2001) Face recognition using the discrete cosine transform. Int J Comput Vis 43(3):167–188
Heikkila M, Pietikainen M, Schmid C (2009) Description of interest regions with local binary patterns. Pattern Recognit 42:425–436
Hubel DH (1963) The visual cortex of the brain. Sci Am 209:54–62
Ke Y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings IEEE CVPR, vol 2, pp 506–513
Lazebnik S, Schmid C, Ponce J (2005) A sparse texture representation using local affine regions. IEEE Trans Pattern Anal Mach Intell 27(8):1265–1278
Liu GH, Li ZY, Zhang L, Xu Y (2011) Image retrieval based on micro-structure descriptor. Pattern Recognit 44(9):2123–2133
Zhang J, Marszaek M, Lazebnik S, Schmid C (2007) Local features and kernels for classification of texture and object categories: A comprehensive study. Int J Comput Vis 73(2):213–238
Lowe DG (2004) Distinctive image features from scale-invariant key-points. Int J Comput Vis 60(2):91–110
Malone BJ, Kumar VR, Ringach DL (2007) Dynamics of receptive field size in primary visual cortex. J Neurophysiol 97(1):407–414
McGugin RW, Gatenby JC, Gore JC, Gauthier I (2013) High-resolution imaging of expertise reveals reliable object selectivity in the fusiform face area related to perceptual performance. Proc Natl Acad Sci 109(42):17063–17068
Mikolajczyk K, Schmid C (2004) Scale and afffine invariant interest point detectors. Int J Comput Vis 60(1):63–86
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 27(10):1615–1630
Mikolajczyk K, Tuytelaars T, Schmid C, Zisserman A, Matas J, Schaffalitzky F, Kadir T, Gool LV (2004) A comparison of affine region detectors. Int J Comput Vis 65(1-2):43–72
Murala S, Maheshwari RP, Balasubramanian R (2012) Local tetra patterns: a new feature descriptor for content-based imagen retrieval. IEEE Trans Image Process 21(5):2874–2886
Nister D, Stewenius H (2006) Scalable recognition with a vocabulary tree. IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2:2161–2168
Ojala T, Pietikainen M, Harwood D (1996) A comparative study of texture measures with classification based on feature distributions. Pattern Recognit 29(1):51–59
Pillow JW, Shlens J, Paninski L, Sher A, Litke AM, Chichilnisky EJ, Simoncelli EP (2008) Spatio-temporal correlations and visual signalling in a complete neuronal population. Nature 454(7207):995–999
Saha S (2000) Image compressionfrom DCT to wavelets: a review. Crossroads 6(3):12–21
Shadlen MN, Newsome WT (1998) The variable discharge of cortical neurons: implications for connectivity, computation, and information coding. J Neurosci 18(10):3870–3896
Shu Y, Wang T, Shao G, Liu F, Feng Q (2014) DTRF: A Physiologically Motivated Method for Image Description. In: Proceedings IEEE ICIP, pp 5601–5605
Tola E, Lepetit V, Fua P (2010) Daisy: An efficient dense descriptor applied to wide-baseline stereo. IEEE Trans Pattern Anal Mach Intell 32(5):815–830
Tsao DY, Freiwald WA, Tootell RBH, Livingstone MS (2006) A cortical region consisting entirely of face-selective cells. Science 311(5761):670–674
Tuytelaars T, Van Gool L (2004) Matching Widely Separated Views Based on Affine Invariant Regions. Int J Comput Vis 59(1):61–85
Wang Z, Fan B, Wu F (2011) Local intensity order pattern for feature description. In: Proceedings IEEE ICCV, pp 603–610
Winder S, Hua G, Brown M (2009) Picking the best daisy. In: Proceedings IEEE CVPR, pp 178–185
Winder SAJ, Brown M (2007) Learning local image descriptors. In: Proceedings IEEE CVPR, pp 1–8
Zhou X, Cui N, Li Z, Liang F, Huang TS (2009) Hierarchical gaussianization for image classification. In: Proceedings IEEE ICCV, pp 1971–1977
Acknowledgment
This work is partially supported by the National Natural Science Foundation of China(Grant 61073094 and U1233119).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Shu, Y., Wang, T., Shao, G. et al. Discriminative transform of receptive field patterns for feature representation. Multimed Tools Appl 75, 7495–7517 (2016). https://doi.org/10.1007/s11042-015-2673-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2673-7