Abstract
Artificial neural network training algorithms aim to optimize the network parameters with respect to a pre-defined cost function. Gradient-based training algorithms support iterative learning and have gained immense popularity for training different artificial neural networks end-to-end. However, training through gradient methods is time-consuming. Another family of training algorithms is based on the Moore–Penrose inverse and is much faster than many gradient-based methods. Nevertheless, most of these algorithms are non-iterative and thus do not support mini-batch learning by nature. This work extends two non-iterative Moore–Penrose inverse-based training algorithms to enable online sequential learning: a single-hidden-layer autoencoder training algorithm and a sub-network-based classifier training algorithm. We further present an approach that uses the proposed autoencoder for self-supervised dimension reduction and then uses the proposed classifier for supervised classification. The experimental results show that the proposed approach achieves satisfactory classification accuracy on many benchmark datasets with extremely low time consumption (up to 50 times faster than the support vector machine on the CIFAR-10 dataset).
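For readers unfamiliar with this family of methods, the sketch below (Python/NumPy) illustrates the general idea of Moore–Penrose inverse-based training with an online sequential update: the hidden weights are random and fixed, the output weights are solved in closed form, and new data chunks refine the solution via recursive least squares in the style of the classic online sequential extreme learning machine (OS-ELM). This is a minimal illustrative example, not the algorithm proposed in the paper; all names, dimensions, and the regularization value are assumptions.

```python
import numpy as np

# Illustrative sketch only (not the authors' proposed method): a random-feature
# single-hidden-layer network whose output weights are obtained in closed form
# via a (regularized) Moore–Penrose pseudo-inverse, then updated chunk by chunk
# with a recursive least-squares rule, as in OS-ELM-style sequential learning.

def hidden_features(X, W, b):
    """Fixed random hidden layer with a sigmoid activation."""
    return 1.0 / (1.0 + np.exp(-(X @ W + b)))

rng = np.random.default_rng(0)
d, L, C = 20, 100, 3                 # input dim, hidden nodes, classes (assumed)

W = rng.standard_normal((d, L))      # random input weights, never trained
b = rng.standard_normal(L)           # random biases, never trained

# --- non-iterative (batch) training on an initial chunk ---
X0 = rng.standard_normal((200, d))
T0 = np.eye(C)[rng.integers(0, C, 200)]            # one-hot targets
H0 = hidden_features(X0, W, b)
P = np.linalg.inv(H0.T @ H0 + 1e-3 * np.eye(L))    # regularized inverse term
beta = P @ H0.T @ T0                               # output weights in closed form

# --- online sequential update with a new chunk (recursive least squares) ---
X1 = rng.standard_normal((50, d))
T1 = np.eye(C)[rng.integers(0, C, 50)]
H1 = hidden_features(X1, W, b)
K = np.linalg.inv(np.eye(H1.shape[0]) + H1 @ P @ H1.T)
P = P - P @ H1.T @ K @ H1 @ P                      # update inverse covariance
beta = beta + P @ H1.T @ (T1 - H1 @ beta)          # update output weights

pred = hidden_features(X1, W, b) @ beta
print("accuracy on latest chunk:", (pred.argmax(1) == T1.argmax(1)).mean())
```

Because each chunk only updates the stored matrices P and beta, no previous data needs to be revisited, which is what makes such pseudo-inverse-based schemes attractive for fast sequential learning compared with iterative gradient descent.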













Data availability
All the datasets used in this work are publicly available.
Funding
Not applicable.
Ethics declarations
Conflict of interest
We declare that we have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A. Paul and P. Yan: co-first authorship.
About this article
Cite this article
Paul, A.N., Yan, P., Yang, Y. et al. Non-iterative online sequential learning strategy for autoencoder and classifier. Neural Comput & Applic 33, 16345–16361 (2021). https://doi.org/10.1007/s00521-021-06233-x