DOI: 10.1145/3569951.3597597
PEARC '23 Conference Proceedings · Short Paper

Voyager – An Innovative Computational Resource for Artificial Intelligence & Machine Learning Applications in Science and Engineering

Published: 10 September 2023

ABSTRACT

Voyager is an innovative computational resource designed by the San Diego Supercomputer Center in collaboration with technology partners to accelerate the development and performance of artificial intelligence and machine learning applications in science and engineering. Based on Intel’s Habana Labs first-generation deep learning (Gaudi) training and (Goya) inference processors, Voyager is funded by the National Science Foundation’s Advanced Computing Systems & Services Program as a Category II system and will be operated for 5 years, starting with an initial 3-year exploratory test-bed phase that will be followed by a 2-year allocated production phase for the national research community. Its AI-focused hardware features several innovative components, including fully-programmable tensor processing cores, high-bandwidth memory, and integrated, on-chip RDMA over Converged Ethernet network interfaces. In addition, Habana’s SynapseAI software suite provides seamless integration to popular machine learning frameworks like PyTorch and TensorFlow for end users. Here, we describe the design motivation for Voyager, its system architecture, software and user environment, initial benchmarking results, and the early science use cases and applications currently being ported to and deployed on the system.
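
To illustrate the user environment described above, the following is a minimal, hypothetical sketch of how a standard PyTorch training step is typically adapted to run on a Gaudi (HPU) device through the SynapseAI PyTorch bridge. The habana_frameworks.torch module path, the "hpu" device name, and the mark_step() calls reflect Habana's documented lazy-execution workflow, but exact usage may vary across SynapseAI releases; this is a sketch, not the specific setup deployed on Voyager.

    # Minimal sketch: one PyTorch training step on a Gaudi (HPU) device.
    # Assumes a SynapseAI release with the habana_frameworks.torch bridge installed.
    import torch
    import habana_frameworks.torch.core as htcore

    device = torch.device("hpu")               # Gaudi device, analogous to "cuda"

    model = torch.nn.Linear(1024, 10).to(device)
    optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
    loss_fn = torch.nn.CrossEntropyLoss()

    # Dummy batch; in practice this would come from a standard PyTorch DataLoader.
    x = torch.randn(64, 1024, device=device)
    y = torch.randint(0, 10, (64,), device=device)

    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    htcore.mark_step()                         # flush the lazily accumulated graph
    optimizer.step()
    htcore.mark_step()
    print(f"loss: {loss.item():.4f}")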


Published in

      PEARC '23: Practice and Experience in Advanced Research Computing
      July 2023
      519 pages
      ISBN: 9781450399852
      DOI: 10.1145/3569951

      Copyright © 2023 ACM


      Publisher

      Association for Computing Machinery

      New York, NY, United States

