On-Demand Learning

Advanced HPC-CI Webinar Series: Scalable Machine Learning

This webinar is an introduction to performing machine learning at scale. An overview of approaches for parallelizing R code on HPC will be provided. We will also cover the essentials of Spark and demonstrate how to use Spark for large-scale data analytics and machine learning. Demonstrations will allow participants to gain practical guidance for building and scaling machine learning workflows. Presented on January 14, 2025

Webinar Recording

AMD EPYC Advanced User Training on Expanse

This event will help users to make the most effective use of Expanse’s AMD EPYC processors. Topics include an introduction to the EPYC architecture, AMD compilers and math libraries, strategies for mapping processes and tasks to compute cores, Slurm, application tuning and profiling tools. Presented on April 21, 2021

Webinar Recording | GitHub Repository

Comet 101: Accessing and Running Jobs on Comet

This webinar covers the basics of accessing the SDSC Comet supercomputer, managing the user environment, compiling and running jobs on Comet, where to run them, and how to run batch jobs. It is assumed that you have mastered the basic skills of logging onto Comet and running basic Unix commands. The webinar will include access to training material. Presented on April 16, 2020

Interactive Video | GitHub Repository

Comet to Expanse Transition Tutorial

This tutorial is intended for all current users of Comet who intend to make the transition to Expanse. Topics will include an overview of the system, batch job submission, modules, compilation, job charging, basic optimization, interactive computing and data transfer. Presented on March 4, 2021

GitHub Repository | Interactive Video

Comet to Expanse Transition Tutorial

This tutorial is intended for all current users of Comet who intend to make the transition to Expanse. Topics will include an overview of the system, batch job submission, modules, compilation, job charging, basic optimization, interactive computing and data transfer. Presented on October 29, 2020

Interactive Video | GitHub Repository

Comet Webinar: A Quick Introduction to Machine Learning

Machine learning is an interdisciplinary field focused on the study and construction of computer systems that can learn from data without being explicitly programmed. Machine learning techniques can be used to uncover patterns in your data and gain insights into your problem. Presented on June 11, 2019

Interactive Video | Download slides

Comet Webinar: CUDA-Python and RAPIDS for blazing fast scientific computing

This webinar introduces users to Numba and RAPIDS for GPU programming in Python. Numba allows us to write just-in-time compiled CUDA code in Python, giving us easy access to the power of GPUs from a powerful high-level language. RAPIDS is a suite of tools with a Python interface for machine learning and dataframe operations. Presented on February 20, 2020

Interactive Video

Comet Webinar: Data Visualization With Python Using Jupyter Notebooks

Python is rapidly becoming the programming language of choice for scientific research, and Jupyter Notebooks provide a user-friendly way of writing and running python code and of teaching and learning how to program. Visual analytics is playing an increasingly important role in data science by allowing researchers to explore massive amounts of data for patterns which may not be obvious using other methods. Presented on December 12, 2019

Interactive Video | Download Slides | GitHub Repository

Comet Webinar: Distributed Parallel Computing with Python

This webinar provides an introduction to distributed computing with Python. We will show how to modify a standard Python script to use multiple CPU cores, first with the concurrent.futures module from the Python standard library and then with the dask package. Presented on May 14, 2019

Interactive Video | Download slides | GitHub Repository
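
As a rough illustration of the first technique named in this webinar's description, the sketch below moves a serial loop onto multiple CPU cores with concurrent.futures. It is not taken from the webinar materials: the task `sum_of_squares`, the helper names, and the `executor_cls` parameter are all hypothetical placeholders.

```python
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor


def sum_of_squares(n):
    """Hypothetical CPU-bound task: sum of i*i for 0 <= i < n."""
    return sum(i * i for i in range(n))


def run_serial(inputs):
    """Baseline: process the inputs one after another in a single process."""
    return [sum_of_squares(n) for n in inputs]


def run_parallel(inputs, max_workers=4, executor_cls=ProcessPoolExecutor):
    """Distribute the same calls across a pool of workers.

    ProcessPoolExecutor uses separate processes, so CPU-bound work can
    occupy several cores. executor_cls is an assumption added here so a
    ThreadPoolExecutor can be swapped in for I/O-bound work.
    """
    with executor_cls(max_workers=max_workers) as pool:
        # map() returns results in the same order as the inputs.
        return list(pool.map(sum_of_squares, inputs))


if __name__ == "__main__":
    inputs = [200_000, 300_000, 400_000, 500_000]
    assert run_parallel(inputs) == run_serial(inputs)
    print("serial and parallel results match")
```

The only change from the serial version is that the list comprehension becomes a `pool.map` call inside a `with` block, which is why this module is a common first step before a larger framework such as dask.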

Comet Webinar: GPU Computing and Programming

This webinar provides an introduction to massively parallel computing with graphics processing units (GPUs) on the SDSC Comet supercomputer. The use of GPUs is becoming increasingly popular across all scientific domains since GPUs can significantly accelerate time to solution for many computational tasks. In this webinar, participants will learn how to access Comet GPU nodes and how to launch GPU jobs on Comet, and will be introduced to GPU programming. The webinar will cover the essential background of GPU chip architectures and the basics of programming GPUs via the use of libraries, OpenACC compiler directives, and the CUDA programming language. The participants will thus acquire the foundation to use and develop GPU-aware applications. Presented on April 9, 2019

Download Recording | Download Slides

Comet Webinar: Introduction to Deep Learning

Deep learning has seen tremendous growth and success in the past few years. Deep learning techniques have achieved state-of-the-art performance across many domains, including image classification, speech recognition, and biomedical applications. Presented on October 17, 2019

Interactive Video | Download slides

Comet Webinar: Introduction to Expanse

The goal of this webinar is to provide an overview of Expanse, an upcoming NSF-funded HPC resource at SDSC. Expanse will have nearly double the performance of the Comet supercomputer. With innovations in cloud integration and composable systems, as well as continued support for science gateways and distributed computing via the Open Science Grid, Expanse will allow researchers to push the boundaries of computing. Presented on June 18, 2020

Interactive Video | Download Slides

Comet Webinar: Introduction to Running Jobs on Comet

This webinar covers the basics of accessing the SDSC Comet supercomputer, managing the user environment, compiling and running jobs on Comet, where to run them, and how to run batch jobs. It is assumed that you have mastered the basic skills of logging onto Comet and running basic Unix commands. The webinar will include access to training material. Presented on January 8, 2019

Interactive Video | Download Slides | GitHub Repository

Comet Webinar: Obtaining Hardware Information and Monitoring Performance

In this webinar we start by describing how to obtain hardware and system information such as CPU specifications, memory quantity, cache configuration, mounted file systems and their usage, peripheral storage devices and GPU properties. This information is useful for anyone who is interested in how hardware specs influence performance or who needs to report benchmarking data. Presented on September 19, 2019

Interactive Video | Download slides

Comet Webinar: Running Jupyter Notebooks on Comet

In this webinar, we will present SDSC’s multitiered approach to running notebooks more securely: running notebooks in the usual way using the insecure HTTP connections; hosting Jupyter services on Comet using HTTP over SSH Tunneling; and the SDSC Reverse Proxy Service (RPS) which connects the user over an HTTPS connection. When used, the RPS will launch a batch script that creates a securely hosted HTTPS access point for the user, resulting in a safer, more secure notebook environment. Presented on May 21, 2020

Interactive Video | Download slides

Comet Webinar: Using the NVIDIA Rapids Toolkit on Comet

In this webinar we will show how to use RAPIDS to accelerate your data science applications utilizing libraries like cuDF (GPU-enabled Pandas-like dataframes) and cuML (GPU-accelerated machine learning algorithms). Presented on January 23, 2020

Interactive Video | Download slides

Comet Webinar: Indispensable Security: Tips to Use SDSC's HPC Resources Securely

This webinar will highlight security-related topics that can improve the trustworthiness of your research. The topics covered include logging in to SDSC's HPC resources, file and directory permissions, and common practices that may create trouble. Presented on July 16, 2020

Interactive Video | Download Slides

COMPLECS: Batch Computing: Getting Started with Batch Job Scheduling - Slurm Edition

A brief introduction on how to schedule your batch jobs on high-performance computing systems using the Slurm Workload Manager. Presented on March 21, 2024

Recorded Webinar | Interactive Video

COMPLECS: Data Storage and File Systems

How to use the data storage and file systems you’ll find mounted on high-performance computing systems. Presented on December 12, 2024

Webinar Recording | Interactive Video

COMPLECS: Data Transfer

How to get the data you need for your research to and from high-performance computing systems. Presented on June 6, 2024

Recorded Webinar | Interactive Video

COMPLECS: High-Throughput and Many-Task Computing - Slurm Edition

How to build and run your high-throughput and many-task computing workflows on high-performance computing systems using the Slurm Workload Manager. Presented on October 17, 2024

Webinar Recording | Interactive Video

COMPLECS: HPC Hardware Overview

A brief introduction to what makes up an HPC system and how users should use this information. No programming required. Presented on November 7, 2024

Webinar Recording | Interactive Video

COMPLECS: HPC Hardware Overview

A brief introduction to what makes up an HPC system and how users should use this information. No programming required. Presented on April 4, 2024

Recorded Webinar | Interactive Video

COMPLECS: HPC Security and Getting Help

A discussion of best practices for using HPC systems and getting support. Presented on August 15, 2024

Recorded Webinar

COMPLECS: HPC Security and Getting Help

A discussion of best practices for using HPC systems and getting support. Presented on February 15, 2024

Recorded Webinar | Interactive Video

COMPLECS: Interactive Computing

Interactive high-performance computing (HPC) involves real-time user inputs that result in actions being performed on HPC compute nodes. This session presents an overview of interactive computing tools and methods. Presented on December 5, 2024

Webinar Recording | Interactive Video

COMPLECS: Interactive Computing

Interactive high-performance computing (HPC) involves real-time user inputs that result in actions being performed on HPC compute nodes. This session presents an overview of interactive computing tools and methods. Presented on April 18, 2024

Recorded Webinar | Interactive Video | GitHub Repository

COMPLECS: Intermediate Linux and Shell Scripting

A survey of intermediate Linux skills for effectively using advanced cyberinfrastructure. Presented on May 16, 2024

Interactive Video

COMPLECS: Intermediate Linux and Shell Scripting

A survey of intermediate Linux skills for effectively using advanced cyberinfrastructure. Presented on January 18, 2024

Interactive Video

COMPLECS: Linux Tools for File Processing

An overview of commonly used Linux tools for searching and manipulating text. Presented on July 18, 2024

Recorded Webinar | Interactive Video

COMPLECS: Linux Tools for File Processing

An overview of commonly used Linux tools for searching and manipulating text. Presented on February 1, 2024

Recorded Webinar | Interactive Video

COMPLECS: Parallel Computing Concepts

A brief introduction to fundamental concepts in parallel computing. No programming experience needed. Presented on January 9, 2025

Webinar Recording | Interactive Video

COMPLECS: Parallel Computing Concepts

A brief introduction to fundamental concepts in parallel computing. No programming experience needed. Presented on September 5, 2024

Webinar Recording | Interactive Video

COMPLECS: Parallel Computing Concepts

A brief introduction to fundamental concepts in parallel computing. No programming experience needed. Presented on May 2, 2024

Recorded Webinar | Interactive Video

COMPLECS: Parallel Computing Concepts

A brief introduction to fundamental concepts in parallel computing for anyone who uses HPC resources. Presented on January 4, 2024

Interactive Video

Cyberinfrastructure-Enabled Machine Learning Summer Institute

The CIML Summer Institute will involve introducing ML researchers, developers and educators to the techniques and methods needed to migrate their ML applications from smaller, locally run resources, such as laptops and workstations, to large-scale HPC systems, such as the SDSC Expanse supercomputer. Presented on June 27, 2022

Interactive Videos

Cyberinfrastructure-Enabled Machine Learning Summer Institute

The CIML Summer Institute will involve introducing ML researchers, developers and educators to the techniques and methods needed to migrate their ML applications from smaller, locally run resources, such as laptops and workstations, to large-scale HPC systems, such as the SDSC Expanse supercomputer. Presented on June 22, 2021

Interactive Video

Data Management & File Systems

Managing data efficiently on a supercomputer is important from both users' and system's perspectives. In this webinar, we will cover a few basic data management techniques and I/O best practices in the context of the Expanse system at SDSC. Presented on March 16, 2023

Interactive Video

Expanse 101: Accessing and Running Jobs on Expanse

This webinar covers the basics of accessing the SDSC Expanse supercomputer, managing the user environment, compiling and running jobs on Expanse. Presented on October 8, 2020

Interactive Video

Expanse Webinar: Accessing and Running Jobs on Expanse

This webinar covers the basics of accessing SDSC's Expanse supercomputer, managing the user environment, compiling and running jobs using Slurm, where to run them, and how to run batch jobs. We will also cover interactive computing using applications such as Jupyter Notebooks and how to run them via the command line or from the Expanse portal. It is assumed that you have mastered the basic skills of logging onto HPC systems using SSH and running basic Unix commands on these systems. Presented on February 17, 2022

Interactive Video

Expanse Webinar: Accessing and Running Jobs on Expanse

This webinar covers the basics of accessing SDSC's Expanse supercomputer, managing the user environment, compiling and running jobs using Slurm, where to run them, and how to run batch jobs. Presented on September 16, 2021

Interactive Video

Expanse Webinar: An Introduction to Singularity: Containers for Scientific and High-Performance Computing

Come learn about Singularity containers and how you might use them in your own work. Presented on January 21, 2021

Interactive Video

Expanse Webinar: Composable Systems in Expanse

This webinar will present the approach and the architecture of the composable systems component of Expanse. We will also summarize scientific case studies that demonstrate the application of this new infrastructure and its federation with Nautilus, a Kubernetes-based GPU geo-distributed cluster. Presented on April 15, 2021

Interactive Video

Expanse Webinar: Data Management & File Systems on Expanse

Managing data efficiently on a supercomputer is very important from both users' and system's perspectives. In this webinar, we will cover some of the basic data management techniques and I/O best practices in the context of the Expanse system at SDSC. Presented on October 21, 2021

Interactive Video

Expanse Webinar: Enduring Security: The Journey Continues

The first in a recurring webinar series on using Expanse and other SDSC HPC resources securely. This webinar will cover security and security-related topics relevant to researchers and the trustworthiness of their work produced on these resources. Presented on April 21, 2022

Interactive Video

Expanse Webinar: GPU Computing and Programming on Expanse

This webinar will give a brief introduction to GPU computing and programming on Expanse. We will cover the GPU architecture, programming with the NVIDIA HPC SDK via libraries, OpenACC compiler directives, CUDA, profiling and debugging, and submitting GPU-enabled jobs on Expanse. Presented on May 20, 2021

Interactive Video

Expanse Webinar: How to secure your Jupyter notebook sessions on Expanse

Come learn how to launch your Jupyter notebook sessions on Expanse in a simple, secure way. Presented on December 14, 2021

Interactive Video

Expanse Webinar: Introduction to Neural Networks, Convolution Neural Networks and Deep Learning on Expanse

This webinar will be a quick introduction and overview of neural networks, convolution networks, and deep learning on Expanse. Presented on May 19, 2022

Interactive Video

Expanse Webinar: Parallel Computing Concepts

In this webinar we cover supercomputer architectures, the differences between threads and processes, implementations of parallelism (e.g., OpenMP and MPI), strong and weak scaling, limitations on scalability (Amdahl’s and Gustafson’s Laws) and benchmarking. Presented on September 14, 2022

Interactive Video
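
For readers who want a feel for the two scalability laws named in this description before watching, here is a minimal sketch using their standard textbook forms. It is not drawn from the webinar itself, and the function names and the 95% parallel fraction are illustrative assumptions.

```python
def amdahl_speedup(parallel_fraction, workers):
    """Amdahl's Law (strong scaling): fixed problem size.

    speedup = 1 / ((1 - p) + p / N), where p is the fraction of the
    runtime that can be parallelized and N is the number of workers.
    """
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / workers)


def gustafson_speedup(parallel_fraction, workers):
    """Gustafson's Law (weak scaling): problem size grows with N.

    scaled speedup = (1 - p) + p * N, where p is the parallel fraction
    of the runtime measured on the N-worker system.
    """
    return (1.0 - parallel_fraction) + parallel_fraction * workers


if __name__ == "__main__":
    p = 0.95  # assumed parallel fraction, chosen only for illustration
    for workers in (1, 8, 64, 512):
        print(workers,
              round(amdahl_speedup(p, workers), 2),
              round(gustafson_speedup(p, workers), 2))
```

Under Amdahl's Law the speedup can never exceed 1 / (1 - p) no matter how many workers are added, whereas the Gustafson figure keeps growing because the workload is assumed to scale with the machine.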

Expanse Webinar: Parallel Computing Concepts

In this webinar we cover supercomputer architectures, the differences between threads and processes, implementations of parallelism (e.g., OpenMP and MPI), strong and weak scaling, limitations on scalability (Amdahl’s and Gustafson’s Laws) and benchmarking. Presented on January 20, 2022

Interactive Video

Expanse Webinar: Performance Tuning and Single Processor Optimization

This presentation will cover cache-level optimizations and other techniques for achieving optimal software performance. We will also cover AMD-specific compiler options, libraries and performance tools. Presented on February 18, 2021

Interactive Video

Expanse Webinar: Running Jupyter Notebooks on Expanse

In this webinar, we will present SDSC’s multitiered approach to running notebooks more securely: hosting Jupyter services on Expanse using SSH Tunneling or using the SDSC Jupyter Reverse Proxy Service (JRPS), which connects the user over an HTTPS connection. The JRPS will launch a batch script that creates a securely hosted HTTPS access point for the user, resulting in a safer, more secure notebook environment. Presented on December 10, 2020

Interactive Video

Expanse Webinar: Run your Jupyter Notebooks anywhere: Scaling up your Projects from Laptop to Expanse

In this webinar we demonstrate how to transition your Jupyter Notebooks from a local machine to the Expanse HPC system using command-line tools and the Expanse Portal. We cover creating transferable software environments, scaling up calculations to large datasets, parallel processing, and running Jupyter Notebooks in batch mode. Presented on June 16, 2022

Interactive Video

Expanse Webinar: Scientific Computing with Kubernetes

In this webinar we provide recipes for transitioning scientific workloads that currently run on traditional batch systems to Kubernetes systems. Kubernetes is batch-like in nature, but there are some differences that science users should be aware of. We will also briefly describe capabilities that are not found in traditional batch systems that can improve the effectiveness of scientific computing. Presented on October 20, 2022

Interactive Video

Expanse Webinar: Singularity – Containers for Scientific and High-Performance Computing

Come learn all about Singularity containers. In this webinar, we'll provide an overview of Singularity and how you might incorporate the use of containers in your own research. We'll also show you how to access and use some of the containerized applications that we make available to users on Expanse at SDSC. Presented on March 17, 2022

Interactive Video

Getting Started with Batch Job Scheduling: Slurm Edition

Learn how to write your first batch job script and submit it to a Slurm batch job scheduler. We discuss best practices on how to structure your batch job scripts, teach you how to leverage Slurm environment variables, and provide you with some tips on how to request resources from the scheduler to get your work done faster. We also introduce you to some advanced features like Slurm job arrays and job dependencies for more structured computational workflows. Presented on February 16, 2023

Interactive Video

GPU Computing and Programming on Expanse

This webinar gives a brief introduction to GPU computing and programming on Expanse. We will cover the GPU architecture, programming with the NVIDIA CUDA Toolkit and HPC SDK via libraries, OpenACC compiler directives, and CUDA, and submitting GPU-enabled jobs on Expanse. Presented on October 26, 2023

Interactive Video

High Performance Computing and Data Science Summer Institute

This year’s Summer Institute continues SDSC’s strategy of bringing HPC to the “long tail of science,” i.e., providing resources to a larger number of modest-sized computational research projects that represent, in aggregate, a tremendous amount of scientific research and discovery. Presented on August 2, 2021

Interactive Video | GitHub Repository

High Performance Computing and Data Science Summer Institute

The HPC and Data Science Summer Institute is a week-long workshop focusing on a broad spectrum of introductory-to-intermediate topics in High Performance Computing and Data Science. The program is aimed at researchers in academia and industry, especially in domains not traditionally engaged in supercomputing, who have problems that cannot typically be solved using local computing resources. Presented on August 1, 2022

Interactive Videos | GitHub Repository

HPC/CI Training Series

SDSC’s High Performance Computing (HPC)/Cyberinfrastructure (CI) Training Series was developed to support UC San Diego undergraduates and graduates interested in furthering their knowledge of HPC concepts and hands-on training, as well as to build a team interested in competing in the Student Cluster Competition held at the annual International Conference for High Performance Computing, Networking, Storage, and Analysis (SC). This program is available to anyone who is interested in advancing their knowledge and experience on HPC systems and concepts. Presented on January 14, 2022

GitHub Repository

HPC CI Onboarding for the UCSD Research Community

The mini-workshop is designed to provide the UCSD research community with a streamlined pathway to swiftly engage with the Expanse cluster for their scientific endeavors. Collaboratively organized by Research IT and SDSC, this workshop series offers participants the opportunity to start using the Expanse cluster through the Campus Champions allocation, while benefiting from comprehensive training resources and expert guidance provided by SDSC. Presented on October 3, 2023

Interactive Video

Implementing Research Data Management for Labs & Grants

Implement a practical and well supported data management plan for your research lab, project or grant with SeedMeLab. Presented on April 28, 2021

Webinar Recording

Interactive Computing on High Performance Computing Resources

Interactive computing includes commonly used programs, such as word processors or spreadsheet applications, running on user devices (mobile phones, laptops). Interactive high-performance computing (HPC) involves real-time user inputs that result in actions being performed on HPC compute nodes. In this session we’ll present an overview of interactive computing tools and methods. Presented on December 7, 2023

Interactive Video

Introduction to Neural Networks, Convolution Neural Networks and Deep Learning

This webinar will be a quick introduction and overview of neural networks, convolution networks, and demonstration of executing deep learning models in an HPC environment. Presented on April 20, 2023

Interactive Video

Introduction to Singularity: Creating and Running Containers for High-Performance Computing

In this webinar, Yuwu Chen from TSCC User Services will show how to build Singularity images and then run them on the SDSC supercomputer clusters such as TSCC. Yuwu will also be sharing his insider knowledge of best practices along with pitfalls to avoid while working with Singularity. Presented on October 13, 2022

Interactive Video

Introduction to TSCC 2.0

This training will cover everything users need to know about the new TSCC 2.0 system that will be launched in phases starting late spring. Topics will include changes to the TSCC system, scheduler, queues, software stack, accounting, and policies for using TSCC. Presented on April 13, 2023

Recording

Kubernetes for Science Compute

Several new scientific compute resources are becoming available only through Kubernetes, and their users will have to adapt their workloads to interface with it. This tutorial provides the basic Kubernetes notions any science user will need, paired with extensive hands-on exercises on a production-quality system to better explore the details. Presented on May 4, 2022

Repository | Video

Parallel and GPU Computing with MATLAB

In this session you will learn how to solve and accelerate computationally and data-intensive problems that are becoming common in the areas of machine learning and deep learning using multicore processors, GPUs, and computer clusters. Presented on April 27, 2022

Repository | Interactive Video

Parallel Computing Concepts

In this webinar we cover supercomputer architectures, the differences between threads and processes, implementations of parallelism (e.g., OpenMP and MPI), strong and weak scaling, limitations on scalability (Amdahl’s and Gustafson’s Laws) and benchmarking. Presented on January 19, 2023

Interactive Video

Performance Tuning and Optimization

This session is intended for attendees who do their own code development and need their calculations to finish as quickly as possible. We cover effective use of cache, loop-level optimizations, and other topics for writing and building optimal code. Presented on September 21, 2023

Interactive Video

Rich Data Sharing for HPC Users

This free webinar will introduce HPCShare, a web-based resource for users of SDSC’s high-performance computing resources, including Expanse, to easily share small- to medium-scale datasets in an efficient and organized manner. Attendees will learn about using HPCShare and SDSC’s SeedMeLab scientific data management system. Hosted by SDSC Visualization Group Lead Amit Chourasia. Presented on April 22, 2021

Interactive Video

Run your Jupyter Notebooks anywhere: Scaling up your Projects from your Laptop

In this webinar, we demonstrate how to transition your Jupyter Notebooks from a local machine to the Expanse HPC system using command-line tools and the Expanse Portal. We cover creating transferable software environments, scaling up calculations to large datasets, parallel processing, and running Jupyter Notebooks in batch mode. Presented on June 15, 2023

Interactive Video

Scientific Computing with Kubernetes

In this webinar we provide recipes for transitioning scientific workloads that currently run on traditional batch systems to Kubernetes systems. Kubernetes is batch-like in nature, but there are some differences that science users should be aware of. We will also briefly describe capabilities that are not found in traditional batch systems that can improve the effectiveness of scientific computing. Presented on July 20, 2023

Interactive Video

Singularity Containers

This webinar will briefly introduce how to build Singularity images and how to run them on the SDSC supercomputer clusters. We will also share some insider knowledge of best practices and pitfalls to avoid while working with Singularity. Presented on May 18, 2023

Interactive Video

Technology Forum: Expanse Supercomputer for Industry

SDSC's newest Supercomputer, Expanse, supports SDSC's vision of 'Computing without Boundaries' by increasing the capacity and performance for thousands of users of batch-oriented and science gateway computing, and by providing new capabilities that will enable research increasingly dependent upon heterogeneous and distributed resources composed into integrated and highly usable cyberinfrastructure. It also implements new technical capabilities such as Direct Liquid Cooling. SDSC has acquired additional capacity for Expanse specifically to support industrial research and collaborations. Presented on April 15, 2021

Webinar Recording

Triton Shared Computing Cluster (TSCC) 101 Spring Training

This training will cover everything new users need to know about using the TSCC system. Topics will include: an overview of condo/hotel program; how to apply; accounts and allocation usage monitoring; environment and software modules; overview of various queues, building PBS job scripts, job submission and monitoring; data transfers; and file systems. Presented on March 3, 2022

Recording | GitHub Content

TSCC 1.0 to 2.0 Transitional Workshop

During this workshop, we will provide an overview of TSCC 2.0 including the new authentication method, new allocation system, new filesystems, shared data transfer options from the current TSCC to TSCC 2.0, software stack, new partition characteristics, and provide examples of SLURM job scripts. Presented on November 6, 2023

Recording

TSCC 101: Accessing and Running Jobs on TSCC

During this workshop, we will provide an overview of TSCC, including authentication, allocation, filesystems, software stack, partition characteristics, and job submission, with examples of SLURM job scripts. Presented on September 12, 2024

Recorded Workshop

Using Python and Jupyter Notebooks on TSCC

This workshop will focus on providing guidelines for setting up customized Python environments, installing and managing packages using Miniconda/pip, and running secure Jupyter notebooks on the Triton Shared Computing Cluster (TSCC) HPC system. Presented on September 2, 2021

GitHub Repository | Interactive Videos

Voyager Part 1: Introduction and User Environment

This is the first of a two-part Voyager training session. Voyager is based on Intel’s Habana Labs AI processors and provides a unique opportunity to use AI-focused hardware for exploring AI in science and engineering. Voyager features Habana’s Gaudi processors optimized for training, Goya processors optimized for inference, 100 GbE all-to-all connection within Gaudi nodes, 24 x 100 GbE RDMA RoCE for scale-out across Gaudi nodes, and a Ceph file system. Presented on October 18, 2022

Recording

Voyager Part 2: Habana Architecture Deep Dive and Porting of TensorFlow and PyTorch Applications

This is the second of a two-part Voyager training session. Voyager is based on Intel’s Habana Labs AI processors and provides a unique opportunity to use AI-focused hardware for exploring AI in science and engineering. Voyager features Habana’s Gaudi processors optimized for training, Goya processors optimized for inference, 100 GbE all-to-all connection within Gaudi nodes, 24 x 100 GbE RDMA RoCE for scale-out across Gaudi nodes, and a Ceph file system. Presented on November 8, 2022

Recording
