Skip to main content

Advertisement

Log in

Development of a generalized model for parallel-streaming neural element and structures for scalar product calculation devices

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Nowadays, intensive streams of fuzzy input data need to be processed in real-time for different fields of science and engineering. To solve this problem, a generalized model for the parallel-streaming neural element was developed in this paper. The proposed model allows minimizing hardware costs while providing scalar product and activation function calculations in real time. In particular, an algorithm and a structure for a parallel-streaming device (PSD) were developed to calculate a scalar product with the direct formation of partial products based on the analysis of a single bit-cut of multipliers, which provides working with the shortest conveyor stage. It is based on a modified Booth’s algorithm that allows reducing equipment costs for processing operands with high bit-width. Moreover, it promotes the lowest equipment costs for the operands with a low bit-width. Besides, researches demonstrate that the main way of increasing the speed of the developed algorithms and structures of PSD for scalar product calculating is a preliminary formation of partial products. Further, the estimation of the model parameters shows reducing conveyor steps, improvement of the locality of connections, and an increase of an adaptation to the coming data intensity. It is proposed to use the developed algorithms and structures as a basis for building devices for parallel-streaming calculation of the scalar product in real time with high efficiency of equipment use. The main ways of harmonizing the time of incoming data and weights with the conveyor cycle of the PSD for calculation of the scalar product are determined. A methodology proposed for building conveyor devices for parallel-streaming calculation of the scalar product in real time for a given intensity of input data ensures the implementation of devices with the required speed and with minimal hardware costs.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

Statement—Our manuscript has no associated data.

References

  1. Chen Y-H, Krishna T, Emer JS, Sze V (2017) Eyeriss, an energy-efficient reconfigurable accelerator for deep convolutional neural networks. IEEE J Solid-State Circuits 52(1):127–138

    Article  Google Scholar 

  2. Chen YH, Krishna T, Emer JS, Sze V (2019) Eyeriss v2: a flexible accelerator for emerging deep neural networks on mobile devices. IEEE J Emerg Sel Top Circuits Syst 9(2):292–308

    Article  Google Scholar 

  3. Wu R, Guo X, Du J, Li J (2021) Accelerating neural network inference on FPGA-based platforms—A survey. Electronics 10:1025. https://doi.org/10.3390/electronics10091025

    Article  Google Scholar 

  4. Torbati N, Ayatollahi A, Kermani A (2014) An efficient neural network based method for medical image segmentation. Comput Biol Med 44:76–87

    Article  Google Scholar 

  5. Berezsky O, Pitsun O, Batryn N, Datsko T, Berezska K, Dubchak L, 2018 Modern automated microscopy systems in oncology. In: Proceedings of the 1st International Workshop on Informatics & Data-Driven Medicine, Lviv, Ukraine, 311–325

  6. Lytvyn V, Vysotska V, Mykhailyshyn V, Peleshchak I, Peleshchak R, Kohut I, (2019) Intelligent system of a smart house. In: 3rd International Conference on Advanced Information and Communications Technologies, AICT, 282–287

  7. Allam Z (2019) Achieving neuroplasticity in artificial neural networks through smart cities. Smart Cities 2:118–134

    Article  Google Scholar 

  8. Duka AV (2014) Neural network based inverse kinematics solution for trajectory tracking of a robotic arm. Procedia Technol 12:20–27

    Article  Google Scholar 

  9. Nurvitadhi E, Venkatesh G, Sim J, Marr D, Huang R, Ong Gee Hock J, Liew YT, Srivatsan K, Moss D, Subhaschandra S, et al. 2017 Can FPGAs Beat GPUs in Accelerating Next-Generation Deep Neural Networks. In: Proceedings of the 2017 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Monterey, CA, USA, ACM: NY, USA, 2017: 5–14

  10. Trimberger SM (2015) Three ages of FPGAs: a retrospective on the first thirty years of FPGA technology. Proc IEEE 103:318–331

    Article  Google Scholar 

  11. Lotricˇ U, Bulic P (2012) Applicability of approximate multipliers in hardware neural networks. Neurocomputing 96:57–65

    Article  Google Scholar 

  12. Johnston SP, Prasad G, Maguire L, Mcginnity TM (2010) An FPGA hardware/software co-design towards evolvable spiking neural networks for robotics application. Int J Neural Syst 20(6):447–461

    Article  Google Scholar 

  13. Sugiarto I, Axenie C, Conradt J (2019) FPGA-based hardware accelerator for an embedded factor graph with configurable optimization. J Circuits Syst Comput 28(02):1950031

    Article  Google Scholar 

  14. Ramakrishna BR, Fisher JA (1993) Instruction-level parallel processing: history, overview and perspective. J Supercomput 7(1):9–50

    Google Scholar 

  15. Sohi G (1990) Instruction issue logic for high-performance interruptible, multiple functional unit. Pipelined Comput IEEE Trans Comput 39(3):349–359

    Article  Google Scholar 

  16. Yarovyi A, Ilchenko R, Arseniuk I, Shmet Y, Kotyra A, Smailova S, (2018) An intelligent system of neural networking recognition of multicolor spot images of laser beam profile. In: Proceedings of SPIE 10808, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2018, 108081B. https://doi.org/10.1117/12.2501691

  17. Himavathi S, Anitha D, Himavathi S (2007) Feedforward neural network implementation in FPGA using layer multiplexing for effective resource utilization. IEEE Trans Neural Networks 18(3):880–888

    Article  Google Scholar 

  18. Kozhemyako V, Timchenko L, Yarovyy A (2008) Methodological principles of pyramidal and parallel-hierarchical image processing on the base of neural-like network systems. Adv Electr Comput Eng 8(2):54–60

    Article  Google Scholar 

  19. Tkachenko R, Izonin I, (2019) Model and Principles for the Implementation of Neural-Like Structures based on Geometric Data Transformations. In: Hu ZB, Petoukhov S (eds) Advances in Computer Science for Engineering and Education, ICCSEEA2018, Advances in Intelligent Systems and Computing. Springer, Cham, 754, 578–587

  20. Gadekallu TR, Khare N, Bhattacharya S et al (2020) Deep neural networks to predict diabetic retinopathy. J Ambient Intell Human Comput. https://doi.org/10.1007/s12652-020-01963-7

    Article  Google Scholar 

  21. Ashraf I, Zikriya YB, Hur S et al (2021) Localizing pedestrians in indoor environments using magnetic field data with term frequency paradigm and deep neural networks. Int J Mach Learn Cyber 12:3203–3219. https://doi.org/10.1007/s13042-021-01279-8

    Article  Google Scholar 

  22. Oskouei SSL, Golestani H, Kachuee M, Hashemi M, Mohammadzade H, Ghiasi S, (2015) GPU-based Acceleration of Deep Convolutional Neural Networks on Mobile Platforms. Distrib Parallel Clust Comput, https://arxiv.org/pdf/1511.07376v1.pdf

  23. Huqqani AA, Schikuta E, Ye S, Chen P (2013) Multicore and GPU parallelization of neural networks for face recognition. Procedia Comput Sci 18:349–358

    Article  Google Scholar 

  24. Gadekallu TR, Rajput DS, Reddy MPK et al (2020) A novel PCA–whale optimization-based deep neural network model for classification of tomato plant diseases using GPU. J Real-Time Image Proc. https://doi.org/10.1007/s11554-020-00987-8

    Article  Google Scholar 

  25. Geche F, Mulesa O, Buchok V (2017) Verification of realizability of boolean functions by a neural element with a threshold activation function. Eastern-Eur J Enterp Technol 1(4):30–40

    Article  Google Scholar 

  26. Pukach AI, Teslyuk VM, Tkachenko RO, Ivantsiv R-AD, (2011) Implementation of neural networks for fuzzy and semistructured data. In: Proceedings of the 11th International Conference on the Experience of Designing and Application of CAD Systems in Microelectronics, CADSM’2011, Lviv-Polyana, Ukraine, 23–25, 350–352

  27. Bodyanskiy Y, Tyshchenko O, Kopaliani D (2015) An extended neo-fuzzy neuron and its adaptive learning algorithm. Int J Intell Syst Appl 7(2):21–26

    Google Scholar 

  28. Zeng W, Guo Z, Shen Y et al (2021) Data-driven management for fuzzy sewage treatment processes using hybrid neural computing. Neural Comput Appl. https://doi.org/10.1007/s00521-020-05655-3

    Article  Google Scholar 

  29. Tsmots I, Teslyuk V, Teslyuk T, Ihnatyev I, (2018) Basic components of neuronetworks with parallel vertical group data real-time processing. In: Advances in Intelligent Systems and Computing II, Advances in Intelligent Systems and Computing 689. Springer International Publishing AG 2018: 558–576

  30. Tsmots I, Skorokhoda O, Ignatyev I, Rabyk V, (2017) Basic vertical-parallel real time neural network components. In: Proceedings of XIIth International Scientific and Technical Conference, CSIT 2017, Lviv, Ukraine, 344–347

  31. Dendaluce Jahnke M, Cosco F, Novickis R, Pérez Rastelli J, Gomez-Garay V (2019) Efficient neural network implementations on parallel embedded platforms applied to real-time torque-vectoring optimization using predictions for multi-motor electric vehicles. Electronics 8:250

    Article  Google Scholar 

  32. Tsmots I, Skorokhoda O, Rabyk V, (2018) Parallel algorithms and matrix structures for scalar product calculation. In: Proceedings of the 14th International Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering, TCSET, Lviv-Slavske, Ukraine, 144

  33. Tsmots I, Skorokhoda O, Tsymbal Y, Teslyuk T, Khavalko V, (2018) Neural-like means for data streams encryption and decryption in real time. In: Proceedings of the 2018 IEEE Second International Conference on Data Stream Mining & Processing, DSMP, Lviv, Ukraine, 438–443

  34. Tsmots I, Rabyk V, Skorokhoda O, Teslyuk T, (2019) Neural element of parallel-stream type with preliminary formation of group partial products. In: Electronics and information technologies (ELIT-2019): Proceedings of the XI-th International Scientific And Practical Conference, 154–158. https://doi.org/10.1109/ELIT.2019.8892334

  35. Tsmots I, Tsymbal Y, Skorokhoda O, Tkachenko R, (2019) Neural-like methods and hardware structures for real-time data encryption and decryption. In: 2019 IEEE 14th international conference on computer sciences and information technologies, CSIT, Lviv, Ukraine, 248–253

  36. Zhang C, Li P, Sun G, Guan Y, Xiao B, Cong J, (2015) Optimizing FPGA-based accelerator design for deep convolutional neural networks. In: Proceedings of the 2015 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Monterey, CA, USA, 22–24 February 2015, ACM: New York, NY, USA, 161–170

  37. Li H, Fan X, Jiao L, Cao W, Zhou X, Wang L, (2016) A high performance FPGA-based accelerator for large-scale convolutional neural networks. In: Proceedings of the 2016 26th International Conference on Field Programmable Logic and Applications, FPL, Lausanne, Switzerland, 1–9

  38. Suda N, Chandra V, Dasika G, Mohanty A, Ma Y, Vrudhula S, Seo J, Cao Y, (2016) Throughput-optimized OpenCL-based FPGA accelerator for large-scale convolutional neural networks, ACM Press: New York, NY, USA, 16–25

  39. Tokheim RL, (2013) Digital electronics: principles and application. 8th edition. McGraw Hill Higher Education

  40. Tsmots I, Skorokhoda O (2011) Prystrii dlia obchyslennia skaliarnogo dobutku. Patent Ukrainy №66138, (Patent of Ukraine, in Ukrainian)

  41. Tsmots I, Skorokhoda O, Teslyuk V (2013) Prystrii dlia obchyslennia skaliarnogo dobutku. Patent Ukrainy № 101922, (Patent of Ukraine, in Ukrainian)

  42. Booth AD (1951) A signed binary multiplication technique. Oxford University Press, Oxford

    Book  MATH  Google Scholar 

  43. Patterson DA, Hennessy JL, (1998) Computer organization and design: the hardware/software interface (Second ed.). San Francisco, California, USA: Morgan Kaufmann Publishers. ISBN 1-55860-428-6

  44. Tsmots I, Skorokhoda O, Medykovskyy M, (2019) Prystrii dlia obchyslennia skaliarnogo dobutku. Patent Ukrainy № 118596, (Patent of Ukraine, in Ukrainian)

  45. Brown SD, Francis RJ, Rose J, Vranesic ZG (1992) Field programmable gate arrays. Kluwer Academic Publishers, Boston, MA

    Book  MATH  Google Scholar 

  46. Marongiu A, Palazzari P (2020) Using high-level synthesis to implement the matrix-vector multiplication on FPGA. High Perform Comput 12151:251–269. https://doi.org/10.1007/978-3-030-50743-5_13

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Natalia Kryvinska.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest/competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tsmots, I., Teslyuk, V., Kryvinska, N. et al. Development of a generalized model for parallel-streaming neural element and structures for scalar product calculation devices. J Supercomput 79, 4820–4846 (2023). https://doi.org/10.1007/s11227-022-04838-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-022-04838-0

Keywords