Massive parallelization of SPICE device model evaluation on GPU-based SIMD architectures

Authors:
Amr M. Bayoumi, ACCIT-New Systems Research, Azareeta, Alexandria, Egypt
Yasser Y. Hanafy, Arab Academy for Science & Technology, Abu Qir, Alexandria, Egypt

IFMT '08: Proceedings of the 1st international forum on Next-generation multicore/manycore technologies, November 2008, Article No.: 12, Pages 1–5, https://doi.org/10.1145/1463768.1463784

Published: 24 November 2008

ABSTRACT

Device model evaluation is one of the most time-consuming tasks in analog circuit simulators such as SPICE. Graphics Processing Units (GPUs) provide SIMD hardware that can process large vectors of data in parallel. This paper presents a formulation of double-precision device model equations in a form compatible with stream computing. We present measurements that isolate typical bottlenecks, especially communication and kernel-call overheads. Our results indicate speedups of up to 20X when overheads are counted, and up to 50X when techniques to overcome these overheads are applied. In particular, we show that our techniques remain effective for small device counts, a well-known problem case for accelerated parallel computing with communication overheads.
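
To see why small device counts are hard for an accelerator, a simple fixed-overhead cost model can help. The sketch below is not taken from the paper: the function `speedup`, its parameters, and all timing constants are hypothetical, chosen only so that the curve starts below 1X for small batches and approaches an asymptote as the batch grows.

```python
def speedup(n_devices, t_cpu, t_gpu, t_overhead):
    """Speedup of offloaded device evaluation over serial CPU evaluation.

    Hypothetical cost model (an assumption, not the paper's):
      CPU time = n_devices * t_cpu
      GPU time = t_overhead + n_devices * t_gpu
    where t_overhead is a fixed per-call cost (data transfer + kernel launch).
    """
    cpu_time = n_devices * t_cpu
    gpu_time = t_overhead + n_devices * t_gpu
    return cpu_time / gpu_time


# Illustrative constants only (seconds); not measured values.
T_CPU = 1.0e-6       # CPU time per device evaluation
T_GPU = 2.0e-8       # effective GPU time per device evaluation
T_OVERHEAD = 1.0e-4  # fixed cost per kernel call

if __name__ == "__main__":
    for n in (100, 1_000, 10_000, 1_000_000):
        print(f"{n:>9} devices: {speedup(n, T_CPU, T_GPU, T_OVERHEAD):6.2f}x")
```

Under these assumed constants, the fixed overhead dominates for a few hundred devices and the offload barely breaks even, while large batches approach the ratio `T_CPU / T_GPU`. Reducing `t_overhead`, for example by batching several evaluations per kernel call, shifts the break-even point toward smaller device counts.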


Published in
IFMT '08: Proceedings of the 1st international forum on Next-generation multicore/manycore technologies
November 2008, 121 pages
ISBN: 9781605584072
DOI: 10.1145/1463768
General Chairs: Ian Watson (Manchester University, UK), Hisham El-Shishiny (IBM Center for Advanced Studies, Egypt)
Copyright © 2008 ACM
Publisher: Association for Computing Machinery, New York, NY, United States
Author Tags
BSIM
GPGPU
SIMD
SPICE
graphics processing units
manycore
parallel computing