Elsevier

Neurocomputing

Volumes 44–46, June 2002, Pages 353-358
Neurocomputing

Implementation of multilayer perceptron networks by populations of spiking neurons using rate coding

https://doi.org/10.1016/S0925-2312(02)00381-8Get rights and content

Abstract

In this paper we consider the biological plausibility of perceptron networks. We identify problems that arise if one tries to model form processing in visual cortex with a perceptron network. We conclude that a perceptron's activity corresponds to a population firing rate and that serious rate problems can be avoided if the squashing function maps zero input on zero output. Typically, such squashing functions are anti-symmetric. We present a circuit with a perceptron-like behavior, which provides an elegant interpretation for the negative perceptron activities that these functions entail.

Section snippets

Motivation

Multilayer feedforward perceptron networks are widely used in modeling higher cognitive functions. One particular example is form processing in visual cortex. Visual cortex processes familiar objects so fast, that this is believed to be a feedforward process. Therefore, visual cortex is able to perform a non-trivial computation, like object recognition or classification, using cortical networks that are massively parallel and that use the same basic element, the neuron, over and over again. The

Perceptron activity as a population rate

Originally, the perceptron was introduced as a highly abstracted version of a real neuron. Its state o is described in terms of its input signals ij, input weights wj and given byo=f(Σjwjij−θ).Here f is a non-descending squashing function and θ a threshold. To identify the perceptron activity with that of a neuron, a time-averaged rate hypothesis could be considered: the perceptron activity represents the firing rate of a real neuron averaged over a short window in time. There are a number of

Biological considerations

A typical characteristic of cortical networks is that the firing rates of the neurons are low. This precludes the use of squashing functions likef(x)=11+exp(−βx).This may come as a surprise, because Eq. (3) is perhaps the most widely used squashing function in cognitive modeling, but the reason is simple: f(0)=0.5. If the perceptron's activity corresponds to a firing rate, then this implies that a population that receives no input will fire at half of its maximum firing rate, which is well

The interpretation of the squashing function

The first squashing function that comes to mind, which maps 0 on zero output isf(x)=21+e−βx−1.Indeed, with this squashing function the rate problems are readily avoided. It leaves the question, however, of how to interpret the negative branch of the squashing function. We propose the following circuit to implement this squashing function. The idea is that population P codes for the positive branch of the squashing function and N for the negative branch. The cross-inhibition ensures that only

Discussion

Clearly, there are better choices for spike response functions than the ones we used in the last section. They were picked to demonstrate the perceptron-like behavior of the circuit. A more realistic choice leads to a different kind of perceptron with interesting effects, in particular to the fact that this perceptron loses its discriminatory power for large inputs, even if there are large differences between the inputs [3]. Dale's law is respected in a natural way. The circuit is symmetric and

References (4)

There are more references available in the full text version of this article.

Cited by (2)

View full text