Statistics and category systems for the shape index descriptor of local 2nd order natural image structure

https://doi.org/10.1016/j.imavis.2008.08.003Get rights and content

Abstract

The shape index offers a natural and invariant description of pure 2nd order image structure. We discuss its properties and report novel natural image statistics for the shape index and the additional two parameters of curvedness and principal direction that form a complete and decoupled re-parameterisation of 2nd order structure. In a second main theme, we address three separate avenues to a categorisation of the shape index for natural images and suggest five feature categories as a natural 2nd order image structure ‘alphabet’.

Introduction

The notions of local image structure and image features are as old as the field of computer vision itself. Both topics has always attracted much attention and the literature is vast. Although much progress has been made, complete lists of local structure types or features in say natural images still seem elusive and most approaches based on features are constrained by the availability of feature detectors more or less suited for the tasks at hand. Our aim with this paper is to investigate the feature types governed by 2nd order structure of natural images. We will continue this introduction with a short review of the ideas that have dominated the way features are conceived, describe our overall approach, and give a summary of how we choose to measure quantitative image structure.

Local image structure in its simplest form can be described through the pixels in a neighbourhood around a point – typically in form of a patch of image pixels. Although pure patch based descriptions have recently had a revival [52], [8], [58], [1], a next step has been to apply a bank of filters to the image patches to further facilitate analysis – often this is realised without explicit patch extraction using convolution across the entire image. Many collections of filters have been proposed over time: Gabor Filters, Wavelets, and derivatives of Gaussians [24], [40], [57], [25]. The output of such filter banks is typically a response vector in an N-dimensional response space. Many vision algorithms use these quantitative descriptions of local image structure directly and with great success [4], [61], [11], [7]. Nevertheless, it remains an intriguing question whether local image structure can be described through a limited alphabet of qualitative distinct letters or feature types. Filter responses have been used as the basis for a number of feature detectors [41], [3], [20], [38], [38], [39] – all of these are, however, results of conscious design choices and do not bring us much closer to answering the posed question.

The main aim of this work is to investigate whether such an alphabet naturally arises from geometrical and statistical studies of a simple model of early visual processing (a filter bank) exposed to natural image stimuli. We adopt the general notion that an alphabet can be found through a partitioning [29], [26], [31], [35], [54], [60], [15], [17] of response space into categories of qualitatively similar structures. We will, however, refine this and rather discuss the partitioning of a factored response space. A description that factors out extrinsic factors such as translation, orientation, and absolute grey value and generally provides a lower dimensional and more invariant starting point than the full response space. As mentioned above, we focus on the 2nd order structure in this work and thus another aim of the paper is to describe and discuss a suitable factored response space and its properties for natural images – see Section 2.

In terms of categorising or partitioning the factored response space, we will investigate three separate approaches. One is based on the idea of categories being defined as Voronoi cells around prototypes [12]. The second approach is inspired by Koenderink [29], [26], [31] and refined through Geometric Texton Theory [15], [17]. Here the category structure is induced through qualitative similarity of canonical representatives across response space. In the third and final approach, we consider stability – a desirable quality of any category system and investigate the most stable category systems under simple image perturbations such as additive Gaussian noise. These ideas are presented in Section 3.

The output of V1 simple cells are modelled as the inner product of Receptive Field weighting functions (RFs) and the retinal irradiance [22]. Several models of V1 simple cell ensembles such as Gabor filters have been reported in the literature [6], [24].

In this work we, however, prefer the Gaussian Derivatives (DtGs) as an uncommitted model to measure local image structure. Several arguments favour the DtGs: in terms of biological relevance DtGs up to 4th or 5th order provide a well fitting set of models (a simple linear combination brings the DtGs into the ‘wave-train’ basis very similar to Gabors [59], [30]). It was recently demonstrated [14] that DtGs provide as good a fit to reported Gabors as, e.g. filters obtained from ICA [2], [53], [23] or sparse coding [44] based approaches. Furthermore, the DtGs are well studied and have many attractive properties [25], [28], [10], [32], [14]. The real strength of DtGs is that the measured quantities are much easier to interpret than corresponding Gabors. Local measurements of an image with DtGs up to some order n is also referred to as the n-jet. The n-jet can be understood as the coefficients of truncated Taylor-series of the blurred image.

The one-dimensional Gaussian kernel of is given by G(x)=12πσ2exp-x22σ2, where σ is the standard deviation and controls the scale or blur at which the jet is calculated. Due to separability the two-dimensional Gaussian kernel is easily constructed as G(x,y)=G(x)G(y). We will use Gx,Gxy, etc. to denote partial derivatives of the two-dimension kernel and Lxy to denote the inner product between an image patch L and the suitable derivative of the Gaussian, i.e. Lxy=rR2Gxy(r)L(r)dr. The n-jet is the vector of inner products up to total order of derivation n – the 2-jet is thus given by (L,Lx,Ly,Lxx,Lxy,Lyy).

Section snippets

Intrinsic 2nd order structure and the shape index

The first focus of the paper is an investigation of ‘pure’ 2nd order structure in natural images. In the following, we motivate that this is best studied through the shape index and present properties and novel statistics of this descriptor.

Shape index category systems

The shape index constitutes a natural re-parameterisation of the factored jet-space and we now address the second main topic of this paper – an investigation of category systems (or partitionings of the factored response space) that naturally arise from the geometry and natural image statistics of the shape index descriptor. As described in the introduction, three different avenues will be investigated and the results related.

Discussion

The novel contributions of this paper are:

  • Statistics for the shape index, curvedness, and principal direction for a large database of natural images yielding results clearly distinct from that of Gaussian random noise controls. Furthermore, we have re-argued Koenderink’s point that the shape index is a natural, invariant, and decoupled re-parameterisation of 2nd order (image) fundamental structure. In combination with the curvedness and principal direction, we have a very intuitive triple

Acknowledgement

EPSRC-funded project ‘Basic Image Features’ EP/D030978/1.

References (61)

  • J.G. Daugman

    Uncertainty relations for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters

    Journal of the Optical Society of America A

    (1985)
  • M. de Bruijne et al.

    Shape particle filtering for image segmentation

  • A.A. Efros, W.T. Freeman, Image quilting for texture synthesis and transfer, in: Proceedings of the 28th Annual...
  • D.J. Field

    Relations between the statistics of natural images and the response properties of cortical cells

    Journal of the Optical Society of America

    (1987)
  • L.M.J. Florack et al.

    Cartesian differential invariants in scale-space

    Journal of Mathematical Imaging and Vision

    (1993)
  • J. Folkesson et al.

    Locating articular cartilage in MR images

    Proceedings of SPIE

    (2005)
  • P. Gärdenfors

    Conceptual Spaces: The Geometry of Thought

    (2000)
  • L.D. Griffin, Deduction of feature categories from colour images, in: Spatial Vision, vol. 18(4), Brill Academic...
  • L.D. Griffin, An analogy between colour vision and local spatial vision: filters, factors and categories, Cognitive...
  • L.D. Griffin et al.

    Image features and the 1-d, 2nd order gaussian derivative jet

    Proceedings of Scale Space

    (2005)
  • L.D. Griffin

    The 2nd order local-image-structure solid

    IEEE Transactions on Pattern Analysis and Machine Intelligence

    (2007)
  • L.D. Griffin et al.

    Hypotheses for image features, icons and textons

    International Journal of Computer Vision

    (2006)
  • L.D. Griffin et al.

    Feature category systems for 2nd order local image structure induced by natural image statistics and otherwise

    Proceedings of SPIE

    (2007)
  • C. Harris, M.J. Stephens, A combined corner and edge detector, in: Alvey88, 1988, pp....
  • J. Huang, D. Mumford, Statistics of natural images and models, in: CVPR99, 1999, pp....
  • D.H. Hubel et al.

    Receptive fields and functional architecture of monkey striate cortex

    Journal of Physiology

    (1968)
  • J.P. Jones et al.

    The two-dimensional spatial structure of simple receptive fields in cat striate cortex

    Journal of Neurophysiology

    (1987)
  • J.J. Koenderink

    The structure of images

    Biological Cybernetics

    (1984)
  • J.J. Koenderink

    What is a feature?

    Journal of Intelligent Systems

    (1993)
  • J.J. Koenderink, The structure of colorimetry, in: G. Sommer ,Y.Y. Zeevi (Eds.), Alebraic Frames for Perception–Action...
  • Cited by (7)

    View all citing articles on Scopus
    View full text