Optimization of Neural Network Training for Image Recognition Based on Trigonometric Polynomial Approximation

Programming and Computer Software

Abstract

The paper discusses the optimization of Artificial Neural Network (ANN) training using a nonlinear trigonometric polynomial function. The proposed method treats the mathematical model of an ANN as an information transmission system, a domain in which effective signal recovery techniques are well established. Viewing ANNs as data transmission systems, we use their energy characteristics to optimize training. Based on the generalized approximation theorem and the wave model, we propose a nonlinear layer in the form of a trigonometric polynomial that approximates the “syncular” function. To confirm the theoretical results, the efficiency of the proposed approach is compared with standard ANN implementations that use sigmoid and Rectified Linear Unit (ReLU) activation functions. The experimental evaluation shows that the proposed model matches the accuracy of the standard ANNs while reducing the time of the supervised training phase.
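The abstract describes replacing a fixed activation (sigmoid or ReLU) with a trainable trigonometric polynomial nonlinearity, but this excerpt gives no implementation details. The sketch below is a minimal illustration of that idea, assuming an elementwise truncated Fourier series with learnable coefficients in PyTorch; the class name TrigPolyActivation, the polynomial degree, the coefficient initialization, and the surrounding network shape are all illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class TrigPolyActivation(nn.Module):
    """Elementwise trigonometric-polynomial nonlinearity (illustrative sketch):
    f(x) = a_0 + sum_{k=1}^{K} (a_k * cos(k*x) + b_k * sin(k*x)),
    with coefficients trained jointly with the network weights."""

    def __init__(self, degree: int = 3):
        super().__init__()
        self.degree = degree
        # a_0 plus K cosine and K sine coefficients, all learnable and
        # shared across units; initialized near a smooth odd function.
        self.a0 = nn.Parameter(torch.zeros(1))
        self.a = nn.Parameter(torch.zeros(degree))
        self.b = nn.Parameter(torch.ones(degree) / degree)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Harmonic indices k = 1..K, broadcast over a trailing axis.
        k = torch.arange(1, self.degree + 1, dtype=x.dtype, device=x.device)
        kx = x.unsqueeze(-1) * k  # shape (..., K)
        return self.a0 + (self.a * torch.cos(kx) + self.b * torch.sin(kx)).sum(dim=-1)

# Hypothetical drop-in use in place of a sigmoid/ReLU layer,
# sized here for 28x28 grayscale inputs:
model = nn.Sequential(
    nn.Flatten(),
    nn.Linear(28 * 28, 256),
    TrigPolyActivation(degree=3),
    nn.Linear(256, 10),
)
```

Because the harmonics are differentiable in both the input and the coefficients, such a layer trains by ordinary backpropagation; the initialization above starts it close to a smooth sine-like nonlinearity so that early training resembles a standard saturating activation.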



ACKNOWLEDGMENTS

This work was supported in part by the Russian Science Foundation, project number 19-71-10033.

Author information

Correspondence to N. Vershkov, M. Babenko, A. Tchernykh, B. Pulido-Gaytan, J. M. Cortés-Mendoza, V. Kuchukov or N. Kuchukova.


Cite this article

Vershkov, N., Babenko, M., Tchernykh, A. et al. Optimization of Neural Network Training for Image Recognition Based on Trigonometric Polynomial Approximation. Program Comput Soft 47, 830–838 (2021). https://doi.org/10.1134/S0361768821080272
