Elsevier

Neural Networks

Volume 99, March 2018, Pages 114-122
Neural Networks

A border-ownership model based on computational electromagnetism

https://doi.org/10.1016/j.neunet.2017.12.004Get rights and content

Abstract

The mathematical relation between a vector electric field and its corresponding scalar potential field is useful to formulate computational problems of lower/middle-order visual processing, specifically related to the assignment of borders to the side of the object: so-called border ownership (BO). BO coding is a key process for extracting the objects from the background, allowing one to organize a cluttered scene. We propose that the problem is solvable simultaneously by application of a theorem of electromagnetism, i.e., “conservative vector fields have zero rotation, or “curl.” We hypothesize that (i) the BO signal is definable as a vector electric field with arrowheads pointing to the inner side of perceived objects, and (ii) its corresponding scalar field carries information related to perceived order in depth of occluding/occluded objects. A simple model was developed based on this computational theory. Model results qualitatively agree with object-side selectivity of BO-coding neurons, and with perceptions of object order. The model update rule can be reproduced as a plausible neural network that presents new interpretations of existing physiological results. Results of this study also suggest that T-junction detectors are unnecessary to calculate depth order.

Introduction

Humans can perceive objects and their relative order from a cluttered scene filled with objects that are mutually occlusive. In the real world, when one object overlaps another, the border one sees between the two objects is attributed to the occluding object. Assigning borders to the perceived object is believed to be a key process of perception; it facilitates distinction of objects from the background (Nakayama, Shimojo, & Silverman, 1989). This process is called border ownership (BO) coding. One can consider an ambiguous figure that can be perceived as either a vase or two faces (Fig. 1 a). In the perception of a vase, the border is assigned to the inner white area, which is perceived as closer to the viewer (Fig. 1 b). Similarly, in the perception of two faces, the border is assigned to the outer black area, which is perceived as closer to the viewer (Fig. 1 c). This example illustrates how BO coding and the perception of depth order are closely related.

Neurophysiological experiments using simple shaded stimuli on areas V1, V2, and V4 in the visual cortex of monkeys revealed the existence of neurons which respond more strongly to an edge when the “owner” of the edge is located on a specific side (Zhou, Friedman, & von der Heydt, 2000). Spatial selectivity for the owner’s location was found to emerge in neurons of area V2 in the visual cortex. Border ownership (BO) is believed to be coded by calculating the difference in response of these sets of neurons with opposite object-side preference towards a stimulus.

Various models have been proposed to code BO as neural networks in area V2 Craft et al. (2007), Li (2005), Sakai et al. (2012). The model explained by Li (2005) requires more than 20 free parameters to function well. Craft and his colleagues suggest hypothetical “grouping cells”, of which the receptive fields have an annular distribution (Craft et al., 2007). The model produced by Sakai et al. relies on randomly generated feedforward neural connections (Sakai et al., 2012). These studies reproduce the responses of BO-related neurons, but they are fundamentally flawed in their approach to elucidating visual systems from computational viewpoints. It is quite difficult to elucidate what sort of mathematical problem those models are intended to solve.

No mathematical formulation of the BO problem has been proposed to date. However, many problems of early vision can be formulated using regularization theory (Poggio, Torre, & Koch, 1985). Neural network models to solve these problems are drawn from the theory. Similarly, if the BO coding problem is expressed in a mathematically well-defined fashion, then one might be able to deduce a neural network model for the mathematical problem, and in turn understand the characteristics of the neural connections involved. Consequently, we might also, for example, reduce the number of parameters in Li’s model.

A fundamental question must be considered: what is the visual system trying to achieve through BO coding? As shown in Fig. 1, BO coding enables the visual cortex to calculate a rough reproduction of the outer world information, which includes depth order in addition to observable two-dimensional (2D) information such as surface brightness and edges. A further study of neurons in V2 revealed the existence of neurons that respond selectively to the edges of objects with actual stereoscopic depth information (Qiu & von der Heydt, 2005). Some of these neurons show selectivity to both 2D figures (owners) as well as three-dimensional (3D) figures placed on a certain side of their receptive field. This result reinforces the idea that area V2 might be involved in the process of a rough reproduction of a 3D image, and that BO coding and depth coding are interdependent processes.

As described in this paper, we attempt to overcome the flaws of current studies by formulating BO coding as a well-defined mathematical problem and subsequently solving a middle-level visual task: estimation of the depth order of objects from 2D images. We hope that formulation and solutions related to BO coding and order estimation presented in this paper will serve as the basis for a comprehensive model that can aid in the elucidation of low/middle-level visual processing. The remainder of this paper is organized as follows: Section 2 presents discussion of the key ideas underpinning our theory and proposes a neural network model for BO coding. Section 3 presents results of numerical simulations of our model. Section 4 emphasizes some important findings derived from simulation results, including the deduced neural connections. Section 5 is the conclusion.

Section snippets

Computational theory

The relation between depth order and BO must be reviewed before formulating the BO assignment problem. In an inset of Fig. 2, a light gray rectangle overlapping a dark gray rectangle, over a dark background is visible. This perception of “occluding” or “occluded” objects can be represented using a scalar field signifying depth order. We define the depth order as ϕ(x,y). A region with a larger depth order of ϕ coincides with its perception as closer to the viewer. The occluding rectangle has a

Results

First, numerical simulations were conducted for three stimuli used in neurophysiological experiments by Zhou et al. (2000). These include a simple square (Fig. 5 a), a C-shaped figure (Fig. 5 b) and overlapping rectangles (Fig. 5 c). Initial vectors for all three stimuli were assigned to the concave side of the contour, which does not necessarily point to the perceived object, as shown in the C-shaped figure (Fig. 5 b). Calculated BO signals, or the vector field after energy minimization Eendx,y

Discussion

The proposed model simultaneously generated BO signals and depth order that match human perception. As Fig. 7 shows, the resultant BO signals of the model, Eendx,y, are tolerant of edge arrangement (concavity or convexity) and occlusion, although some initial BO signals Einitx,y are not correct, as shown in the concave L-junctions. We cannot determine whether an L-junction is concave or convex, but our model distinguishes it automatically by spatial propagation of BO signals among adjacent

Conclusion

We demonstrated the importance of formulating the problem in a well-defined manner, by application of a theorem in electromagnetism to solve the problem of BO coding in the visual system. The model can reproduce human perception of depth order qualitatively for various figures. Nevertheless, the model is incapable of producing results of perceptual switching, as shown in Fig. 1, because initial vectors are given in a deterministic manner. Future work shall include further development of the

Acknowledgments

This work was partially supported by JSPS KAKENHI Grant Nos. 24500371 and 16K00204.

References (19)

  • QiuF.T. et al.

    Figure and ground in the visual cortex: V2 combines stereoscopic cues with gestalt rules

    Neuron

    (2005)
  • SakaiK. et al.

    Consistent and robust determination of border ownership based on asymmetric surrounding contrast

    Neural Networks

    (2012)
  • CoxM.A. et al.

    Receptive field focus of visual area V4 neurons determines responses to illusory surfaces

    Proceedings of the National Academy of Sciences of the United States of America

    (2013)
  • CraftE. et al.

    A neural model of figure-ground organization

    Journal of Neurophysiology

    (2007)
  • DayR.H. et al.

    Amodal completion as a basis for illusory contours

    Perception & Psychophysics

    (1983)
  • GirardP. et al.

    Feedforward and feedback connections between areas V1 and V2 of the monkey have similar rapid conduction velocities

    Journal of Neurophysiology

    (2001)
  • ItoM. et al.

    Mechanisms underlying the representation of angles embedded within contour stimuli in area V2 of macaque monkeys

    European Journal of Neuroscience

    (2011)
  • ItoM. et al.

    Representation of angles embedded within contour stimuli in area V2 of macaque monkeys

    The Journal of Neuroscience: The Official Journal of the Society for Neuroscience

    (2004)
  • KogoN. et al.

    Surface construction by a 2-D differentiation-integration process: a neurocomputational model for perceived border ownership, depth, and lightness in Kanizsa figures

    Psychological Review

    (2010)
There are more references available in the full text version of this article.

Cited by (0)

View full text