Neural Networks

Volume 48, December 2013, Pages 72-77

Multivariate neural network operators with sigmoidal activation functions

https://doi.org/10.1016/j.neunet.2013.07.009

Abstract

In this paper, we study pointwise and uniform convergence, as well as order of approximation, of a family of linear positive multivariate neural network (NN) operators with sigmoidal activation functions. The order of approximation is studied for functions belonging to suitable Lipschitz classes and using a moment-type approach. The special cases of NN operators, activated by logistic, hyperbolic tangent, and ramp sigmoidal functions are considered. Multivariate NNs approximation finds applications, typically, in neurocomputing processes. Our approach to NN operators allows us to extend previous convergence results and, in some cases, to improve the order of approximation. The case of multivariate quasi-interpolation operators constructed with sigmoidal functions is also considered.

Introduction

Neural networks (NNs) with one hidden layer can be represented as $N_n(\bar{x}) = \sum_{j=0}^{n} c_j\, \sigma(\bar{a}_j \cdot \bar{x} + b_j)$, $\bar{x} \in \mathbb{R}^s$, $s \in \mathbb{N}^+$, where, for $0 \le j \le n$, the $b_j \in \mathbb{R}$ are the threshold values, the $\bar{a}_j \in \mathbb{R}^s$ are the weights, and the $c_j$ are the coefficients. Here $\bar{a}_j \cdot \bar{x}$ denotes the inner product in $\mathbb{R}^s$, and $\sigma$ is the activation function of the network; see Chui and Li (1992), Costarelli and Spigler (submitted for publication), Jones (1988), Lenze (1992), Li (1996), Li and Micchelli (2000), Light (1993), Makovoz (1998), Mhaskar and Micchelli (1995), and Pinkus (1999). The activation function is usually a sigmoidal function. Neural networks are extensively used in Approximation Theory (Barron, 1993; Chen, 1993; Costarelli and Spigler, 2013b; Cybenko, 1989; Gao and Xu, 1993; Girosi and Anzellotti, 1993; Gnecco, 2012; Gnecco and Sanguineti, 2011; Hahm and Hong, 2002; Kainen and Kurková, 2009; Kurková, 2012; Lewicki and Marino, 2003; Lewicki and Marino, 2004; Mhaskar and Micchelli, 1992).
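In NumPy terms, this representation can be sketched as follows (a minimal illustration; the function and variable names are ours, and the logistic function is used only as one possible choice of $\sigma$):

```python
import numpy as np

def logistic(x):
    """Logistic sigmoidal function sigma(x) = 1 / (1 + e^{-x})."""
    return 1.0 / (1.0 + np.exp(-x))

def one_hidden_layer_nn(x, a, b, c, sigma=logistic):
    """Evaluate N_n(x) = sum_{j=0}^{n} c_j * sigma(<a_j, x> + b_j).

    x : array of shape (s,)      -- input point in R^s
    a : array of shape (n+1, s)  -- weight vectors a_j
    b : array of shape (n+1,)    -- threshold values b_j
    c : array of shape (n+1,)    -- coefficients c_j
    """
    return float(c @ sigma(a @ x + b))
```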

Constructive multivariate approximation algorithms based on sigmoidal functions are important since they play a central role in typical applications of neurocomputing processes concerning high-dimensional data. Applications of NNs with sigmoidal functions in Numerical Analysis, for instance to the numerical solution of Volterra integral and integro-differential equations by suitable collocation methods, were shown in Costarelli and Spigler (in press-a, in press-b).

Anastassiou (1997) was the first to establish NN approximations for continuous functions, providing estimates for the rate of convergence, using NN operators of the Cardaliaguet–Euvrard type. He used the modulus of continuity of the function being approximated to produce Jackson-type inequalities. Subsequently, Anastassiou studied NN operators activated by the hyperbolic tangent as well as the logistic function, in both the univariate and the multivariate case; see Anastassiou, 2011a, Anastassiou, 2011b, Anastassiou, 2011c, Anastassiou, 2011d, Anastassiou, 2012.

In this paper, we study the convergence, as well as the order of approximation, of a family of linear positive multivariate neural network operators, activated by sigmoidal functions. This work represents the extension to the multivariate case of the results established in Costarelli and Spigler (2013a) for univariate functions. We treat by a unified approach all cases studied by Anastassiou, and moreover, we can apply our theory to other useful sigmoidal functions, such as, for instance, the ramp function (Cao and Chen, 2012, Cheang, 2010), and many others (Costarelli & Spigler, submitted for publication).

We first study pointwise and uniform convergence for functions defined on bounded intervals of $\mathbb{R}^s$. In addition, we study the order of approximation by our NN operators for functions belonging to certain Lipschitz classes, following a moment-type approach. In particular, we exploit the finiteness of a few discrete absolute moments of certain density functions $\phi_\sigma$, defined by means of the sigmoidal functions $\sigma$. The approximation error is considered, in the sup-norm, in connection with both the weights and the number of neurons of the network. In this framework, a remarkable result is that the order of approximation achieved when we approximate $C^1$-functions by our operators with logistic or hyperbolic tangent sigmoidal functions is higher than that obtained in Anastassiou, 2011b, Anastassiou, 2011c.

Finally, the case of quasi-interpolation operators constructed with sigmoidal functions is also considered in this paper, aimed at approximating functions defined on the whole space $\mathbb{R}^s$; see, e.g., Anastassiou, 2011a, Anastassiou, 2011b, Anastassiou, 2011c, Anastassiou, 2011d, Anastassiou, 2012, Cao and Chen (2009), and Costarelli and Spigler (2013a).

The paper is organized as follows. In Section 2 we recall some preliminary results given in Costarelli and Spigler (2013a), and prove some new lemmas useful to establish the main results of the paper. In Section 3 the convergence and order of approximation theorems are proved. Moreover, many examples of sigmoidal functions satisfying the hypotheses of our theory are presented, and a discussion of the results obtained is given. Finally, in Section 4 some final remarks are summarized and the case of quasi-interpolation operators constructed with sigmoidal functions is analyzed.

Section snippets

Preliminary results

In this section, we establish some preliminary results that will be useful in the rest of the paper. We recall that a measurable function $\sigma: \mathbb{R} \to \mathbb{R}$ is called a sigmoidal function if and only if $\lim_{x \to -\infty} \sigma(x) = 0$ and $\lim_{x \to +\infty} \sigma(x) = 1$. In what follows, we consider non-decreasing functions $\sigma$, such that $\sigma(2) > \sigma(0)$, and satisfying all the following assumptions:

  • (Σ1)

    $g_\sigma(x) := \sigma(x) - 1/2$ is an odd function;

  • (Σ2)

    $\sigma \in C^2(\mathbb{R})$ is concave for $x \ge 0$;

  • (Σ3)

    $\sigma(x) = O(|x|^{-1-\alpha})$ as $x \to -\infty$, for some $\alpha > 0$.

The condition σ(2)>σ(0) is merely technical.
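As a numerical illustration (our own sketch, not part of the paper), one can check on a sample grid that the logistic function $\sigma_\ell(x) = (1 + e^{-x})^{-1}$ is non-decreasing, satisfies $(\Sigma 1)$, and satisfies the technical condition $\sigma(2) > \sigma(0)$:

```python
import numpy as np

def logistic(x):
    """Logistic sigmoidal function sigma(x) = 1 / (1 + e^{-x})."""
    return 1.0 / (1.0 + np.exp(-x))

# Symmetric sample grid about the origin.
xs = np.linspace(-10.0, 10.0, 1001)

# (Sigma 1): g_sigma(x) := sigma(x) - 1/2 should be odd, i.e. g(-x) = -g(x).
g = logistic(xs) - 0.5
assert np.allclose(g[::-1], -g)  # reversing the symmetric grid flips the sign

# sigma is non-decreasing on the grid, and sigma(2) > sigma(0).
assert np.all(np.diff(logistic(xs)) >= 0.0)
assert logistic(2.0) > logistic(0.0)
```

The decay condition $(\Sigma 3)$ holds trivially here, since the logistic function decays exponentially as $x \to -\infty$.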

For

The main results

In what follows, we denote by $R$ the $s$-dimensional interval $R := [a_1, b_1] \times \cdots \times [a_s, b_s] \subset \mathbb{R}^s$, and by $C^0(R)$ and $C^0(\mathbb{R}^s)$ the spaces of all continuous real-valued functions defined on $R$ and $\mathbb{R}^s$, respectively, equipped with the sup-norm $\|\cdot\|_\infty$. Let us now define the operators that will be studied in this section.

Definition 3.1

Let $f: R \to \mathbb{R}$ be a bounded function, and $n \in \mathbb{N}^+$ such that $\lceil n a_i \rceil \le \lfloor n b_i \rfloor$ for every $i = 1, \ldots, s$. The linear positive multivariate NN operators $F_n(f, \bar{x})$, activated by the sigmoidal function $\sigma$ and acting on $f$, are
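The body of Definition 3.1 is truncated in this excerpt. As a sketch only, under the assumption that the operators follow the univariate construction of Costarelli and Spigler (2013a) with density function $\phi_\sigma(x) := \frac{1}{2}[\sigma(x+1) - \sigma(x-1)]$, the univariate analogue can be coded as follows (all names are ours; the multivariate version replaces the single sum by a sum over a multi-index $\bar{k}$):

```python
import math
import numpy as np

def logistic(x):
    """Logistic sigmoidal function sigma(x) = 1 / (1 + e^{-x})."""
    return 1.0 / (1.0 + np.exp(-x))

def phi(x, sigma=logistic):
    """Density function phi_sigma(x) = (1/2) * [sigma(x+1) - sigma(x-1)]."""
    return 0.5 * (sigma(x + 1.0) - sigma(x - 1.0))

def F_n(f, x, n, a, b):
    """Univariate NN operator in the style of Costarelli & Spigler (2013a):

    F_n(f, x) = sum_{k=ceil(na)}^{floor(nb)} f(k/n) phi(nx - k)
                / sum_{k=ceil(na)}^{floor(nb)} phi(nx - k).
    """
    ks = np.arange(math.ceil(n * a), math.floor(n * b) + 1)
    w = phi(n * x - ks)                       # positive weights
    fv = np.array([f(k / n) for k in ks])     # samples of f at the nodes k/n
    return float(np.sum(fv * w) / np.sum(w))
```

Since the weights are normalized, $F_n$ reproduces constant functions exactly; for Lipschitz $f$ the sup-norm error decreases as $n$ grows, which is the behavior quantified by the moment-type estimates of Section 3.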

Final remarks and conclusions

In this paper, we study pointwise and uniform convergence, as well as the order of approximation, of a family of multivariate NN operators, activated by certain sigmoidal functions. Our approach allows us to extend some previous results. The order of approximation achieved using our operators is studied through a moment-type approach.

In the present context, we can use sigmoidal functions also to study convergence and order of approximation of a class of quasi-interpolation operators, defined

Acknowledgments

This work was supported, in part, by the GNAMPA and the GNFM of the Italian INdAM.

References (39)

  • G. Lewicki, G. Marino, Approximation of functions of finite variation by superpositions of a sigmoidal function, Applied Mathematics Letters (2004).
  • X. Li, Simultaneous approximations of multivariate functions and their derivatives by neural networks with one hidden layer, Neurocomputing (1996).
  • Y. Makovoz, Random approximants and neural networks, Journal of Approximation Theory (1996).
  • Y. Makovoz, Uniform approximation by neural networks, Journal of Approximation Theory (1998).
  • H.N. Mhaskar, C.A. Micchelli, Approximation by superposition of sigmoidal and radial basis functions, Advances in Applied Mathematics (1992).
  • H.N. Mhaskar, C.A. Micchelli, Degree of approximation by neural and translation networks with a single hidden layer, Advances in Applied Mathematics (1995).
  • G.A. Anastassiou
  • G.A. Anastassiou, Univariate sigmoidal neural network approximation, Journal of Computational Analysis and Applications (2012).
  • A.R. Barron, Universal approximation bounds for superpositions of a sigmoidal function, IEEE Transactions on Information Theory (1993).