Skip to main content
Log in

The effect of structure on image classification using signatures

  • Original Article
  • Published:
Biological Cybernetics Aims and scope Submit manuscript

Abstract

Humans recognize transformed images from a very small number of samples. Inspired by this idea, we evaluate a classification method that requires only one sample per class, while providing invariance to image transformations generated by a compact group. This method is based on signatures computed for images. We test and illustrate this theory through simulations that highlight the role of image structure and sampling density, as well as how the signatures are constructed. We extend the existing theory to account for variations in recognition accuracy due to image structure.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  • Anselmi F, Leibo J, Rosasco L, Mutch J, Tacchetti A, Poggio T (2014) Unsupervised learning of invariant representations with low sample complexity. CBMM Memo No. 001, March 12

  • Cucker F, Smale S (2001) On the mathematical foundations of learning. Am Math Soc 39(1):1–49

    Article  Google Scholar 

  • Dow DM, Snyder AZ, Vautin RG, Bauer R (1981) Magnification factor and receptive field size in foveal striate cortex of the monkey. Exp Brain Res 44:213–228

    Article  CAS  PubMed  Google Scholar 

  • Downing P, Jiang Y (2001) A cortical area selective for visual processing of the human body. Science 293(5539):2470–2473

    Article  CAS  PubMed  Google Scholar 

  • Foldiak P (1991) Learning invariance from transformation sequences. Neural Comput 3(2):194–200

    Article  PubMed  Google Scholar 

  • Fukushima K (1980) Neocognitron: a self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol Cybern 36:193–202

    Article  CAS  PubMed  Google Scholar 

  • Grill-Spector K, Knouf N, Kanwisher N (2004) The fusiform face area subserves face perception, not generic within-category identification. Nat Neurosci 7(5):555–562

    Article  CAS  PubMed  Google Scholar 

  • Kanwisher N (2017) The quest for the FFA and where it led. J Neurosci 37(5):1056–1061

    Article  CAS  PubMed Central  PubMed  Google Scholar 

  • Kanwisher N, McDermott J, Chun M (1997) The fusiform face area: a module in human extrastriate cortex specialized for face perception. J Neurosci 17(11):4302–4311

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  • Liao Q, Leibo JZ, Poggio T (2013) Learning invariant representations and applications to face verification. In: Proceedings of advances in neural information processing systems (NIPS 2013), Lake Tahoe, NV

  • Tsao D, Freiwald W, Tootell R, Livingstone M (2006) A cortical region consisting entirely of face-selective cells. Science 311(5761):670–674

    Article  CAS  PubMed Central  PubMed  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Florian Raudies.

Additional information

Communicated by Benjamin Lindner.

Appendix A: Some concepts from group theory

Appendix A: Some concepts from group theory

A group is a set G together with an associative binary operation on G which has an identity and such that each element has an inverse with respect to the operation. That is,

  1. i.

    For all \(a,b,c\in G\), it holds that \((a\cdot b)\cdot c=a\cdot (b\cdot c)\);

  2. ii.

    There exists an element e of G such that for all \(a\in G\), \(e\cdot a=a\cdot e=a\);

  3. iii.

    For every \(a\in G\), there exists \(b\in G\) such that \(a\cdot b=b\cdot a=e\).

Example

The integers are a group with respect to the operation of addition. The identity element is 0, and the inverse of a is \(-a\).

The action of a group G with identity e on a set X is a mapping from \(G\times X\) to X,

$$\begin{aligned} G\times X\rightarrow X:\left( {g,x} \right) \rightarrow gx, \end{aligned}$$

such that

$$\begin{aligned} ex=x,\text { for all }x\in X, \end{aligned}$$

and

$$\begin{aligned} \left( {g_1 g_2 } \right) x=g_1 \left( {g_2 x} \right) \text { for all }g_1 ,g_2 \in G, x\in X. \end{aligned}$$

Example

Let X be a two-dimensional disk centered about the origin. Define the rotation \(\phi \) as the operation which takes each point \(\left( {r,\theta } \right) \) of the disk to the point \(\left( {r,\theta +\phi } \right) \). The set of such rotations is a group which acts on X.

Suppose the group G acts on the set X. For each point \(x\in X\), the orbit of x is the set \(\left\{ {gx:g\in G} \right\} \).

Example

The orbit of the point \(\left( {r,\theta } \right) \) in the disk example above is the circle of radius r about the origin.

A computation on the elements of a set X is said to be invariant under the action of the group G if the result of the computation on an element x is the same as the result of the element gx for all \(g\in G\). In other words, if the results of the computation performed on any two elements of the orbit of x are indistinguishable.

Example

In the above disk example, consider the computation “distance from the origin.” The result of this computation is the same for all elements of any circle about the origin. That is, for all elements of the orbit of any point in the disk. Thus, this computation is invariant under the group action.

The simple examples above are given to illustrate the concepts. In the context of this paper, the set of interest will be a given set of images and the group action will be rotation of these images. The orbit of any particular image is in this case the set of all its rotations. A key idea is to find a computation (called a signature) which is invariant under this group action. This reduces the problem of distinguishing between different (potentially rotated) images to distinguishing between orbits of images, which is a much smaller set. This, in turn, drastically reduces the number of training samples required for certain image recognition tasks.

One final concept is needed. Given a group action G on a space X, the group G is said to be compact if its topology is compact. For example, if G is the set of two-dimensional rotations then G can be identified with the unit circle (if \(\phi \) is an element of G, then \(\phi \) can be identified with the element of the unit circle \(e^{i\phi })\), and the unit circle is a compact subset of the plane.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Roccaforte, R., Raudies, F. The effect of structure on image classification using signatures. Biol Cybern 112, 415–425 (2018). https://doi.org/10.1007/s00422-018-0761-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00422-018-0761-2

Keywords