A novel algorithm of extended neural networks for image recognition

https://doi.org/10.1016/j.engappai.2015.03.010

Abstract

As a class of important classifiers, feedforward neural networks (FNNs) have been used considerably in the study of pattern recognition. Since the inputs to FNNs are usually vectors, while many data are naturally presented in the form of matrices, the matrices have to be decomposed into vectors before FNNs can be employed. A drawback of this approach is that important information regarding correlations among elements within the original matrices is lost. Unlike traditional vector-input FNNs, a new algorithm for extended FNNs with matrix inputs, called two-dimensional back-propagation (2D-BP), is proposed in this paper to classify matrix data directly; it uses incremental gradient descent to fully train the extended FNNs. Such FNNs preserve the matrix structure of the 2D input features, which benefits image recognition. Promising experimental results on handwritten-digit and face-image classification demonstrate the effectiveness of the proposed method.

Introduction

It is well known that image recognition is a hot topic in the fields of machine learning and computer vision. In such recognition systems, many data images, such as handwritten digit images, face images, and palm images, are presented in the form of matrices. Determining how to classify these kinds of data is therefore an important topic in pattern recognition.

A traditional image recognition system consists of three steps: image pre-processing, feature extraction, and classification. At present, there are various methods for feature extraction, such as principal component analysis (PCA) (Jolliffe, 2002) and independent component analysis (ICA) (Hyvärinen and Oja, 2000), and several popular classifiers, including K-nearest neighbours (KNNs) (Shakhnarovich et al., 2008), support vector machines (SVMs) (Vapnik, 2000), feedforward neural networks (FNNs) (Hornik et al., 1989), and so on.

Nevertheless, these methods are usually based on vector inputs. Thus, when they are used in image processing, the matrix inputs must first be expanded into vector form. Such transformations often lose important information about the original matrix data and thus hurt the recognition process. Moreover, expanding matrix inputs into vectors usually leads to high dimensionality and increases the complexity of the resulting models.

To solve this problem, two-dimensional (2D) methods that operate directly on matrix data have been proposed, for example two-dimensional principal component analysis (2DPCA) (Yang et al., 2004, Zhang and Zhou, 2005) and two-dimensional linear discriminant analysis (2DLDA) (Li and Yuan, 2005, Sanguansat et al., 2006, Yang et al., 2010), which have proven useful for extracting effective information about the inner structure of matrix data while reducing the computational complexity of the extraction. This naturally raises the question: can existing vector-based classifiers, such as SVMs and FNNs, be extended to matrix inputs?
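As an illustration of how such 2D methods avoid vectorization, the following is a minimal NumPy sketch in the spirit of 2DPCA (Yang et al., 2004); the function name, the eigendecomposition details, and the batched projection are our illustrative choices, not the authors' exact procedure:

    import numpy as np

    def two_d_pca(X, q):
        """Sketch of 2DPCA: X holds N image matrices of shape (m, n);
        q is the number of projection axes to keep."""
        D = X - X.mean(axis=0)                       # centre the samples
        G = np.einsum('imn,imk->nk', D, D) / len(X)  # image covariance matrix (n, n)
        _, vecs = np.linalg.eigh(G)                  # eigenvalues in ascending order
        P = vecs[:, ::-1][:, :q]                     # top-q eigenvectors as columns
        return X @ P                                 # features Y_i = A_i P, shape (N, m, q)

The key point is that the covariance matrix G is only n-by-n, rather than mn-by-mn as in vectorized PCA, and each image keeps its row structure after projection.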

Neural networks have played an important role in pattern recognition (Bishop, 1995, Lin et al., 1997, LeCun et al., 1998, Shang et al., 2006). In order to classify matrix data directly, and to preserve the matrix or 2D feature structure effectively, Lu et al. (2014) proposed a novel classifier, the two-dimensional neural network with random weights (2D-NNRW), which achieved good performance on face recognition. It is essentially an extended two-dimensional single-hidden-layer feedforward neural network (2D-SLFN) that employs left and right projecting vectors to regress matrix inputs. The network is trained by randomization: the left and right projecting vectors and the hidden biases are set randomly, and the output weights are then determined by solving a linear system, as sketched below. The results obtained in Lu et al. (2014) show that the use of the 2D-SLFN improves face-recognition accuracy.
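To make this scheme concrete, here is a minimal sketch of a 2D-NNRW-style training step; the sigmoid hidden units, Gaussian random initialization, and least-squares solve via the pseudoinverse are assumptions about one plausible realization, not the exact procedure of Lu et al. (2014):

    import numpy as np

    def train_2d_nnrw(X, T, L, seed=0):
        """X: (N, m, n) matrix inputs; T: (N, o) targets; L hidden nodes."""
        rng = np.random.default_rng(seed)
        N, m, n = X.shape
        U = rng.standard_normal((L, m))              # random left projecting vectors u_k
        V = rng.standard_normal((L, n))              # random right projecting vectors v_k
        b = rng.standard_normal(L)                   # random hidden biases
        Z = np.einsum('km,imn,kn->ik', U, X, V) + b  # z_ik = u_k^T A_i v_k + b_k
        H = 1.0 / (1.0 + np.exp(-Z))                 # sigmoid hidden-layer outputs
        beta = np.linalg.pinv(H) @ T                 # output weights by least squares (SVD)
        return U, V, b, beta

Note that only the output weights beta are learned; the projecting vectors and biases stay at their random values, which is precisely the source of the issues listed below.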

The randomness in the NNRW algorithm can be understood more deeply by considering function approximation with Monte Carlo (MC) methods. It was shown in Igelnik and Pao (1995) that any continuous function defined on a compact set can be represented by a limit-integral of a multivariate continuous function integrated over parameter space. Although the NNRW algorithm simplifies the learning steps taken by SLFNs, the following issues remain in both the NNRW and 2D-NNRW methods:

  • The number of hidden nodes should be sufficiently large and supervised initialization is needed in order to model and compensate for the system's uncertainties.

  • Over-fitting can occur in the NNRW method because the MC-style approximation requires many additional hidden nodes.

  • There is predictive instability caused by the random assignment of the nonlinear weights, and training SLFNs with NNRW methods via singular value decomposition (SVD) usually produces linear weights of large magnitude, which makes the networks highly sensitive to new data.

To overcome these issues, we fully train the network by applying incremental gradient-based learning to the 2D-SLFN. A learning algorithm called two-dimensional back-propagation (2D-BP) is proposed, in which a momentum modification is added to improve convergence. A series of comparative studies on handwritten-digit and face-image classification was carried out. The results on the testing datasets are promising and compare favourably with the 1D-BP and 2D-NNRW methods.

The rest of this paper is organized as follows. FNN models and their corresponding training algorithms are reviewed in Section 2. A detailed description of our method is given in Section 3. Section 4 evaluates the performance of our algorithm on the handwritten-digit and face datasets employed in the experiments and presents our results, including comparisons and discussion. Conclusions are presented in the final section.

Section snippets

Single hidden-layer feedforward neural networks

Generally, an SLFN is described as follows:

    f(x) = \sum_{k=1}^{L} \beta_k \, g(w_k^{\top} x + b_k) + \alpha,

where x is the input pattern vector, w_k = [w_{k1}, w_{k2}, \ldots, w_{kd}]^{\top} and b_k are the input-layer weights and biases, respectively, and \beta_k = [\beta_{k1}, \beta_{k2}, \ldots, \beta_{ko}]^{\top} and \alpha are the output-layer weights and biases, respectively.
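For concreteness, this model can be transcribed directly into NumPy; the sigmoid choice for g and the dimensions in the example are illustrative assumptions:

    import numpy as np

    def slfn_forward(x, W, b, beta, alpha):
        """f(x) = sum_k beta_k g(w_k^T x + b_k) + alpha, with sigmoid g."""
        h = 1.0 / (1.0 + np.exp(-(W @ x + b)))  # hidden activations, shape (L,)
        return beta.T @ h + alpha               # output, shape (o,)

    # example: d = 4 inputs, L = 3 hidden nodes, o = 2 outputs
    rng = np.random.default_rng(0)
    W, b = rng.standard_normal((3, 4)), rng.standard_normal(3)
    beta, alpha = rng.standard_normal((3, 2)), rng.standard_normal(2)
    y = slfn_forward(rng.standard_normal(4), W, b, beta, alpha)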

It is true that SLFNs are universal approximators (Hornik et al., 1989, Cybenko, 1989, Barron, 1993), even when the hidden-layer weights and biases are randomly assigned. In the case that the weights and biases are randomly assigned, i.e., the

2D-BP for single hidden-layer feedforward neural networks

In order to maintain the matrix structure of the image inputs, a 2D-SLFN is used in Lu et al. (2014), where the NNRW algorithm is extended to train this network, and the results suggest that the 2D-SLFN is superior to the general vector-based SLFN (1D-SLFN) for image recognition. In this paper, we propose an incremental gradient-based learning method for the 2D-SLFN (2D-BP). Owing to the left and right projecting vectors, the 2D-SLFN has fewer free weights than a general 1D-SLFN with the same number of
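A minimal sketch of a per-sample 2D-BP update with a momentum modification is given below; the sigmoid activations, squared-error loss, and the specific momentum form are our assumptions about one plausible realization, not necessarily the paper's exact algorithm:

    import numpy as np

    def sgd_step_2d_bp(A, t, params, vel, lr=0.01, mu=0.9):
        """One incremental (per-sample) 2D-BP update with momentum.
        A: (m, n) input matrix; t: (o,) target.
        params, vel: lists [U, V, b, beta, alpha] of arrays and their velocities,
        with U (L, m), V (L, n), b (L,), beta (L, o), alpha (o,)."""
        U, V, b, beta, alpha = params
        z = np.einsum('km,mn,kn->k', U, A, V) + b    # z_k = u_k^T A v_k + b_k
        h = 1.0 / (1.0 + np.exp(-z))                 # sigmoid hidden outputs
        e = beta.T @ h + alpha - t                   # output error y - t
        delta = (beta @ e) * h * (1.0 - h)           # backpropagated hidden error
        grads = [delta[:, None] * (V @ A.T),         # dE/dU: rows delta_k (A v_k)^T
                 delta[:, None] * (U @ A),           # dE/dV: rows delta_k (A^T u_k)^T
                 delta,                              # dE/db
                 np.outer(h, e),                     # dE/dbeta
                 e]                                  # dE/dalpha
        for p, v, g in zip(params, vel, grads):
            v *= mu                                  # momentum-modified velocity
            v -= lr * g
            p += v                                   # in-place parameter update
        return 0.5 * float(e @ e)                    # sample error for monitoring

Sweeping this update over the training samples one at a time, and monitoring the returned error, gives the incremental (per-sample) training regime described above.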

Performance evaluation

In this section, we investigate the performance of 2D-BP and compare it with two other methods: 1D-BP and 2D-NNRW.

Conclusions

In the structure of classical FNNs, the inputs are usually vectors, and thus FNNs cannot classify matrix data directly. In this paper, a new FNN algorithm, called two-dimensional back-propagation (2D-BP), was proposed for classifying matrix data directly. Incremental gradient-based learning is used to fully train the weights of the extended 2D-FNNs. Since the proposed algorithm retains important information in the 2D input features, it can be used as a classifier for

References (26)

  • Battiti, R., Masulli, F., 1990. BFGS optimization for faster and automated supervised learning. In: International...
  • Bishop, C.M., 1995. Neural Networks for Pattern Recognition.
  • Cybenko, G., 1989. Approximation by superpositions of a sigmoidal function. Math. Control Signals Syst.

Supported by the National Natural Science Foundation of China (Nos. 61272023 and 91330118) and Zhejiang Provincial Natural Science Foundation of China (No. LY14A010027).
