Optimization of Industrial Neural Network Simulators for GPGPUs

Wafai, Mhd. Amer; Ahmed, Zaheer; Keller, Rainer; Holzmann, Sven; Sander, Björn; Resch, Michael

doi:10.1007/978-3-662-43454-3_3

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7697)

Abstract

This paper describes the porting of an industrial neural network simulator onto GPUs. The simulator is part of a tool-chain used to sort massive amounts of e-mails and other textual data. In contrast to previous work, all steps are executed on the GPU, achieving an overall speedup of up to 33× without using any cuBLAS functionality. All time-consuming routines have been ported to the GPU, i.e. the training, simulation and verification phases, of which training is the most time-consuming. It is planned to include these GPU kernels in the product to meet special customer demands.

Author information

Authors and Affiliations

HLRS, University of Stuttgart, Nobelstraße 19, 70569, Stuttgart, Germany
Mhd. Amer Wafai, Zaheer Ahmed, Rainer Keller & Michael Resch
HMI-Tec GmbH, Im Breitspiel 11C, 69126, Heidelberg, Germany
Sven Holzmann & Björn Sander

Editor information

Editors and Affiliations

Dickson Computer Systems, 7A Victory Avenue, 4th Floor, Homantin, Kowloon, Hong Kong
Dickson K. W. Chiu
Faculty of Education, The University of Hong Kong, Pokfulam, Hong Kong
Minhong Wang
Faculty of Automation, Computers and Electronics, University of Craiova, Boulevard Decebal 107, 200440, Craiova, Romania
Elvira Popescu
City University of Hong Kong, Hong Kong, China
Qing Li
City University of Hong Kong, 83 Tat Chee Avenue, Kowloon Tong, Hong Kong
Rynson Lau

About this paper

Cite this paper

Wafai, M.A., Ahmed, Z., Keller, R., Holzmann, S., Sander, B., Resch, M. (2014). Optimization of Industrial Neural Network Simulators for GPGPUs. In: Chiu, D.K.W., Wang, M., Popescu, E., Li, Q., Lau, R. (eds) New Horizons in Web Based Learning. ICWL 2012. Lecture Notes in Computer Science, vol 7697. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-43454-3_3

DOI: https://doi.org/10.1007/978-3-662-43454-3_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-43453-6
Online ISBN: 978-3-662-43454-3
eBook Packages: Computer Science, Computer Science (R0)
