Development of a generalized model for parallel-streaming neural element and structures for scalar product calculation devices

Tsmots, Ivan; Teslyuk, Vasyl; Kryvinska, Natalia; Skorokhoda, Oleksa; Kazymyra, Iryna

doi:10.1007/s11227-022-04838-0

Published: 30 September 2022

The Journal of Supercomputing, Volume 79, pages 4820–4846 (2023)

Ivan Tsmots¹,
Vasyl Teslyuk¹,
Natalia Kryvinska ORCID: orcid.org/0000-0003-3678-9229²,
Oleksa Skorokhoda¹ &
Iryna Kazymyra¹

Abstract

Intensive streams of fuzzy input data must now be processed in real time in many fields of science and engineering. To address this problem, this paper develops a generalized model of a parallel-streaming neural element. The proposed model minimizes hardware costs while computing the scalar product and the activation function in real time. In particular, an algorithm and a structure are developed for a parallel-streaming device (PSD) that calculates the scalar product by forming partial products directly from the analysis of a single bit slice of the multipliers, which yields the shortest pipeline (conveyor) stage. The device is based on a modified Booth's algorithm, which reduces hardware costs when processing operands of high bit width, and it also gives the lowest hardware costs for operands of low bit width. The research further shows that the main way to increase the speed of the developed PSD algorithms and structures for scalar product calculation is the preliminary formation of partial products. Estimation of the model parameters shows fewer pipeline steps, better locality of connections, and better adaptation to the intensity of incoming data. The developed algorithms and structures are proposed as a basis for building devices that perform parallel-streaming calculation of the scalar product in real time with highly efficient use of hardware. The main ways of matching the arrival time of input data and weights to the pipeline cycle of the PSD for scalar product calculation are determined. The proposed methodology for building pipelined devices for parallel-streaming calculation of the scalar product in real time, for a given intensity of input data, ensures that devices reach the required speed with minimal hardware costs.
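The two ideas named in the abstract, forming partial products from one bit slice of the multipliers at a time and recoding the multipliers with Booth's algorithm, can be illustrated in software. The following is a minimal sketch under stated assumptions, not the authors' hardware structure: the weights are taken to be two's-complement integers of `n_bits` bits, the function names are hypothetical, and the textbook radix-4 form of Booth recoding stands in for the paper's modified variant, whose details the abstract does not give.

```python
def scalar_product_bit_slices(xs, ws, n_bits):
    """Dot product that consumes one bit slice of all weights per step.

    Assumption: each weight is a two's-complement integer of n_bits bits.
    """
    if len(xs) != len(ws):
        raise ValueError("xs and ws must have the same length")
    lo, hi = -(1 << (n_bits - 1)), (1 << (n_bits - 1)) - 1
    if any(not lo <= w <= hi for w in ws):
        raise ValueError("weight does not fit in n_bits")
    mask = (1 << n_bits) - 1
    acc = 0
    for i in range(n_bits):
        # Slice i holds bit i of every weight; each set bit selects x_j
        # as a partial product, and the selected values are summed.
        slice_sum = sum(x for x, w in zip(xs, ws) if ((w & mask) >> i) & 1)
        # The most significant slice carries a negative weight in
        # two's complement.
        place = -(1 << i) if i == n_bits - 1 else (1 << i)
        acc += place * slice_sum
    return acc


def booth_radix4_digits(w, n_bits):
    """Recode w into radix-4 Booth digits in {-2, -1, 0, 1, 2}, LSB first."""
    n = n_bits + (n_bits % 2)       # sign-extend to an even width
    u = w & ((1 << n) - 1)          # n-bit two's-complement pattern
    digits, prev = [], 0
    for k in range(n // 2):
        b0 = (u >> (2 * k)) & 1
        b1 = (u >> (2 * k + 1)) & 1
        digits.append(b0 + prev - 2 * b1)
        prev = b1
    return digits


def scalar_product_booth(xs, ws, n_bits):
    """Dot product with one radix-4 digit of every weight per step."""
    if len(xs) != len(ws):
        raise ValueError("xs and ws must have the same length")
    rows = [booth_radix4_digits(w, n_bits) for w in ws]
    acc = 0
    for k in range((n_bits + 1) // 2):
        # Each digit selects 0, +-x_j or +-2*x_j as the partial product.
        acc += (4 ** k) * sum(row[k] * x for row, x in zip(rows, xs))
    return acc


xs = [3, -5, 7, 2]
ws = [-8, 7, -3, 1]
result_slices = scalar_product_bit_slices(xs, ws, n_bits=4)
result_booth = scalar_product_booth(xs, ws, n_bits=4)
```

In this sketch the bit-slice version takes `n_bits` accumulation steps, while the radix-4 version takes about half as many at the cost of a wider selection of partial products per step. That is one way to read the abstract's trade-off between low and high bit widths, though the paper's own cost estimates are not reproduced here.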

Data availability

Our manuscript has no associated data.

Author information

Authors and Affiliations

Department of Automated Control Systems, Lviv Polytechnic National University, Lviv, Ukraine
Ivan Tsmots, Vasyl Teslyuk, Oleksa Skorokhoda & Iryna Kazymyra
Department of Information Systems, Faculty of Management, Comenius University in Bratislava, Bratislava, Slovakia
Natalia Kryvinska

Corresponding author

Correspondence to Natalia Kryvinska.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest/competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Tsmots, I., Teslyuk, V., Kryvinska, N. et al. Development of a generalized model for parallel-streaming neural element and structures for scalar product calculation devices. J Supercomput 79, 4820–4846 (2023). https://doi.org/10.1007/s11227-022-04838-0

Accepted: 16 September 2022
Published: 30 September 2022
Issue Date: March 2023
DOI: https://doi.org/10.1007/s11227-022-04838-0
