ABSTRACT
Voyager is an innovative computational resource designed by the San Diego Supercomputer Center in collaboration with technology partners to accelerate the development and performance of artificial intelligence and machine learning applications in science and engineering. Voyager is based on the first-generation Gaudi training and Goya inference deep learning processors from Intel’s Habana Labs. It is funded by the National Science Foundation’s Advanced Computing Systems & Services Program as a Category II system and will be operated for 5 years: an initial 3-year exploratory test-bed phase, followed by a 2-year allocated production phase for the national research community. Its AI-focused hardware features several innovative components, including fully programmable tensor processing cores, high-bandwidth memory, and integrated, on-chip RDMA over Converged Ethernet network interfaces. In addition, Habana’s SynapseAI software suite integrates with popular machine learning frameworks such as PyTorch and TensorFlow. Here, we describe the design motivation for Voyager, its system architecture, its software and user environment, initial benchmarking results, and the early science use cases and applications currently being ported to and deployed on the system.