Advanced HPC-CI Webinar Series: Scalable Machine Learning
This webinar is an introduction to performing machine learning at scale. An overview of approaches for parallelizing R code on HPC will be provided. We will also cover the essentials of Spark and demonstrate how to use Spark for large-scale data analytics and machine learning. Demonstrations will allow participants to gain practical guidance for building and scaling machine learning workflows. Presented on January 14, 2025
Webinar Recording
AMD EPYC Advanced User Training on Expanse
This event will help users to make the most effective use of Expanse’s AMD EPYC processors. Topics include an introduction to the EPYC architecture, AMD compilers and math libraries, strategies for mapping processes and tasks to compute cores, Slurm, application tuning and profiling tools. Presented on April 21, 2021
Webinar Recording | GitHub Repository
Comet 101: Accessing and Running Jobs on Comet
This webinar covers the basics of accessing the SDSC Comet supercomputer, managing the user environment, compiling and running jobs on Comet, where to run them, and how to run batch jobs. It is assumed that you have mastered the basic skills of logging onto Comet and running basic Unix commands. The webinar will include access to training material. Presented on April 16, 2020
Interactive Video | GitHub Repository
Comet to Expanse Transition Tutorial
This tutorial is intended for all current users of Comet who intend to make the transition to Expanse. Topics will include an overview of the system, batch job submission, modules, compilation, job charging, basic optimization, interactive computing and data transfer. Presented on March 4, 2021
GitHub Repository | Interactive Video
Comet to Expanse Transition Tutorial
This tutorial is intended for all current users of Comet who intend to make the transition to Expanse. Topics will include an overview of the system, batch job submission, modules, compilation, job charging, basic optimization, interactive computing and data transfer. Presented on October 29, 2020
Interactive Video | GitHub Repository
Comet Webinar: A Quick Introduction to Machine Learning
Machine learning is an interdisciplinary field focused on the study and construction of computer systems that can learn from data without being explicitly programmed. Machine learning techniques can be used to uncover patterns in your data and gain insights into your problem. Presented on June 11, 2019
Interactive Video | Download slides
Comet Webinar: CUDA-Python and RAPIDS for blazing fast scientific computing
This webinar introduces users to Numba and RAPIDS for GPU programming in Python. Numba allows us to write just-in-time compiled CUDA code in Python, giving us easy access to the power of GPUs from a powerful high-level language. RAPIDS is a suite of tools with a Python interface for machine learning and dataframe operations. Presented on February 20, 2020
Interactive Video
Comet Webinar: Data Visualization With Python Using Jupyter Notebooks
Python is rapidly becoming the programming language of choice for scientific research, and Jupyter Notebooks provide a user-friendly way of writing and running python code and of teaching and learning how to program. Visual analytics is playing an increasingly important role in data science by allowing researchers to explore massive amounts of data for patterns which may not be obvious using other methods. Presented on December 12, 2019
Interactive Video | Download Slides | GitHub Repository
Comet Webinar: Distributed Parallel Computing with Python
This webinar provides an introduction to distributed computing with Python. We will show how to modify a standard Python script to use multiple CPU cores, first with the concurrent.futures module from the Python standard library and then with the dask package. Presented on May 14, 2019
Interactive Video | Download slides | GitHub Repository
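For readers who want a feel for the first technique named in this entry, here is a minimal sketch of moving a serial loop onto multiple CPU cores with concurrent.futures. It is not taken from the webinar material: the prime-counting workload, the function names, and the worker count of 4 are all assumptions chosen only to give the cores something CPU-bound to do.

```python
import math
from concurrent.futures import ProcessPoolExecutor


def count_primes(limit):
    """Count primes below `limit` by trial division (deliberately CPU-bound)."""
    return sum(
        1
        for n in range(2, limit)
        if all(n % d for d in range(2, math.isqrt(n) + 1))
    )


def serial(limits):
    """Baseline: process each input one after another on a single core."""
    return [count_primes(limit) for limit in limits]


def parallel(limits, workers=4):
    """Same computation, with each input handed to a separate worker process."""
    # workers=4 is an arbitrary illustrative choice, not a recommendation.
    with ProcessPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so the results line up with `limits`.
        return list(pool.map(count_primes, limits))


if __name__ == "__main__":
    # Hypothetical inputs; the guard is needed so worker processes can
    # import this module safely on platforms that spawn new interpreters.
    limits = [20_000, 30_000, 40_000, 50_000]
    assert serial(limits) == parallel(limits)
    print(parallel(limits))
```

Processes rather than threads are used here because the work is CPU-bound; for I/O-bound tasks, ThreadPoolExecutor from the same module offers the same interface.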
Comet Webinar: GPU Computing and Programming
This webinar provides an introduction to massively parallel computing with graphics processing units (GPUs) on the SDSC Comet supercomputer. The use of GPUs is becoming increasingly popular across all scientific domains since GPUs can significantly accelerate time to solution for many computational tasks. In this webinar, participants will learn how to access Comet GPU nodes, how to launch GPU jobs on Comet, and get introduced to GPU programming. The webinar will cover the essential background of GPU chip architectures and the basics of programming GPUs via the use of libraries, OpenACC compiler directives, and the CUDA programming language. The participants will thus acquire the foundation to use and develop GPU-aware applications. Presented on April 9, 2019
Download Recording | Download Slides
Comet Webinar: Introduction to Deep Learning
Deep learning has seen tremendous growth and success in the past few years. Deep learning techniques have achieved state-of-the-art performance across many domains, including image classification, speech recognition, and biomedical applications. Presented on October 17, 2019
Interactive Video | Download slides
Comet Webinar: Introduction to Expanse
The goal of this webinar is to provide an overview of Expanse, an upcoming NSF funded HPC resource at SDSC. Expanse will have nearly double the performance compared to the Comet supercomputer. With innovations in cloud integration and composable systems, as well as continued support for science gateways and distributed computing via the Open Science Grid, Expanse will allow researchers to push the boundaries of computing. Presented on June 18, 2020
Interactive Video | Download Slides
Comet Webinar: Introduction to Running Jobs on Comet
This webinar covers the basics of accessing the SDSC Comet supercomputer, managing the user environment, compiling and running jobs on Comet, where to run them, and how to run batch jobs. It is assumed that you have mastered the basic skills of logging onto Comet and running basic Unix commands. The webinar will include access to training material. Presented on January 8, 2019
Interactive Video | Download Slides | GitHub Repository
Comet Webinar: Obtaining Hardware Information and Monitoring Performance
In this webinar we start by describing how to obtain hardware and system information such as CPU specifications, memory quantity, cache configuration, mounted file systems and their usage, peripheral storage devices and GPU properties. This information is useful for anyone who is interested in how hardware specs influence performance or who needs to report benchmarking data. Presented on September 19, 2019
Interactive Video | Download slides
Comet Webinar: Running Jupyter Notebooks on Comet
In this webinar, we will present SDSC’s multitiered approach to running notebooks more securely: running notebooks in the usual way over insecure HTTP connections; hosting Jupyter services on Comet using HTTP over SSH tunneling; and the SDSC Reverse Proxy Service (RPS), which connects the user over an HTTPS connection. When used, the RPS will launch a batch script that creates a securely hosted HTTPS access point for the user, resulting in a safer, more secure notebook environment. Presented on May 21, 2020
Interactive Video | Download slides
Comet Webinar: Using the NVIDIA Rapids Toolkit on Comet
In this webinar we will show how to use RAPIDS to accelerate your data science applications utilizing libraries like cuDF (GPU-enabled Pandas-like dataframes) and cuML (GPU-accelerated machine learning algorithms). Presented on January 23, 2020
Interactive Video | Download slides
Comet Webinar: Indispensable Security: Tips to Use SDSC's HPC Resources Securely
This webinar will highlight security-related topics that can improve the trustworthiness of your research. The topics covered include logging in to SDSC's HPC resources, file and directory permissions, and common practices that may create trouble. Presented on July 16, 2020
Interactive Video | Download Slides
COMPLECS: Batch Computing: Getting Started with Batch Job Scheduling - Slurm Edition
A brief introduction on how to schedule your batch jobs on high-performance computing systems using the Slurm Workload Manager. Presented on March 21, 2024
Recorded Webinar | Interactive Video
COMPLECS: Code Migration
COMPLECS: Data Storage and File Systems
COMPLECS: Data Transfer
COMPLECS: High-Throughput and Many-Task Computing - Slurm Edition
How to build and run your high-throughput and many-task computing workflows on high-performance computing systems using the Slurm Workload Manager. Presented on October 17, 2024
Webinar Recording | Interactive Video
COMPLECS: HPC Hardware Overview
A brief introduction to what makes up an HPC system and how users should use this information. No programming required. Presented on November 7, 2024
Webinar Recording | Interactive Video
COMPLECS: HPC Hardware Overview
A brief introduction to what makes up an HPC system and how users should use this information. No programming required. Presented on April 4, 2024
Recorded Webinar | Interactive Video
COMPLECS: HPC Security and Getting Help
A discussion of best practices for using HPC systems and getting support. Presented on August 15, 2024
Recorded Webinar
COMPLECS: Interactive Computing
Interactive high-performance computing (HPC) involves real-time user inputs that result in actions being performed on HPC compute nodes. This session presents an overview of interactive computing tools and methods. Presented on December 5, 2024
Webinar Recording | Interactive Video
COMPLECS: Interactive Computing
Interactive high-performance computing (HPC) involves real-time user inputs that result in actions being performed on HPC compute nodes. This session presents an overview of interactive computing tools and methods. Presented on April 18, 2024
Recorded Webinar | Interactive Video | GitHub Repository
COMPLECS: Intermediate Linux and Shell Scripting
A survey of intermediate Linux skills for effectively using advanced cyberinfrastructure. Presented on May 16, 2024
Interactive Video
COMPLECS: Intermediate Linux and Shell Scripting
A survey of intermediate Linux skills for effectively using advanced cyberinfrastructure. Presented on January 18, 2024
Interactive Video
COMPLECS: Linux Tools for File Processing
COMPLECS: Parallel Computing Concepts
A brief introduction to fundamental concepts in parallel computing for anyone who uses HPC resources. Presented on January 4, 2024
Interactive Video
Cyberinfrastructure-Enabled Machine Learning Summer Institute
The CIML Summer Institute will involve introducing ML researchers, developers and educators to the techniques and methods needed to migrate their ML applications from smaller, locally run resources, such as laptops and workstations, to large-scale HPC systems, such as the SDSC Expanse supercomputer. Presented on June 27, 2022
Interactive Videos
Cyberinfrastructure-Enabled Machine Learning Summer Institute
The CIML Summer Institute will involve introducing ML researchers, developers and educators to the techniques and methods needed to migrate their ML applications from smaller, locally run resources, such as laptops and workstations, to large-scale HPC systems, such as the SDSC Expanse supercomputer. Presented on June 22, 2021
Interactive Video
Data Management & File Systems
Managing data efficiently on a supercomputer is important from both users' and system's perspectives. In this webinar, we will cover a few basic data management techniques and I/O best practices in the context of the Expanse system at SDSC. Presented on March 16, 2023
Interactive Video
Expanse 101: Accessing and Running Jobs on Expanse
This webinar covers the basics of accessing the SDSC Expanse supercomputer, managing the user environment, compiling and running jobs on Expanse. Presented on October 8, 2020
Interactive Video
Expanse Webinar: Accessing and Running Jobs on Expanse
This webinar covers the basics of accessing SDSC's Expanse supercomputer, managing the user environment, compiling and running jobs using Slurm, where to run them, and how to run batch jobs. We will also cover interactive computing using applications such as Jupyter Notebooks and how to run them via the command line or from the Expanse portal. It is assumed that you have mastered the basic skills of logging onto HPC systems using SSH and running basic Unix commands on these systems. Presented on February 17, 2022
Interactive Video
Expanse Webinar: Accessing and Running Jobs on Expanse
This webinar covers the basics of accessing SDSC's Expanse supercomputer, managing the user environment, compiling and running jobs using Slurm, where to run them, and how to run batch jobs. Presented on September 16, 2021
Interactive Video
Expanse Webinar: An Introduction to Singularity: Containers for Scientific and High-Performance Computing
Come learn about Singularity containers and how you might use them in your own work. Presented on January 21, 2021
Interactive Video
Expanse Webinar: Composable Systems in Expanse
This webinar will present the approach and the architecture of the composable systems component of Expanse. We will also summarize scientific case studies that demonstrate the application of this new infrastructure and its federation with Nautilus, a Kubernetes-based GPU geo-distributed cluster. Presented on April 15, 2021
Interactive Video
Expanse Webinar: Data Management & File Systems on Expanse
Managing data efficiently on a supercomputer is very important from both users' and system's perspectives. In this webinar, we will cover some of the basic data management techniques and I/O best practices in the context of the Expanse system at SDSC. Presented on October 21, 2021
Interactive Video
Expanse Webinar: Enduring Security: The Journey Continues
The first in a recurring webinar series on using Expanse and other SDSC HPC resources securely. This webinar will cover security and security-related topics relevant to researchers and the trustworthiness of their work produced on these resources. Presented on April 21, 2022
Interactive Video
Expanse Webinar: GPU Computing and Programming on Expanse
This webinar will give a brief introduction to GPU computing and programming on Expanse. We will cover the GPU architecture, programming with the NVIDIA HPC SDK via libraries, OpenACC compiler directives, CUDA, profiling and debugging, and submitting GPU-enabled jobs on Expanse. Presented on May 20, 2021
Interactive Video
Expanse Webinar: How-to secure your Jupyter notebook sessions on Expanse
Come learn how to launch your Jupyter notebook sessions on Expanse in a simple, secure way. Presented on December 14, 2021
Interactive Video
Expanse Webinar: Introduction to Neural Networks, Convolution Neural Networks and Deep Learning on Expanse
This webinar will be a quick introduction and overview of neural networks, convolution networks, and deep learning on Expanse. Presented on May 19, 2022
Interactive Video
Expanse Webinar: Parallel Computing Concepts
In this webinar we cover supercomputer architectures, the differences between threads and processes, implementations of parallelism (e.g., OpenMP and MPI), strong and weak scaling, limitations on scalability (Amdahl’s and Gustafson’s Laws) and benchmarking. Presented on September 14, 2022
Interactive Video
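As a quick illustration of the two scaling laws named in this entry, the sketch below evaluates the standard textbook forms of Amdahl's and Gustafson's Laws. It is not drawn from the webinar itself; the function names and the 95% parallel fraction are assumptions chosen for illustration.

```python
def amdahl_speedup(parallel_fraction, n_workers):
    """Amdahl's Law: speedup for a fixed problem size (strong scaling).

    parallel_fraction is the share of the serial runtime that can be
    parallelized; the remaining serial share bounds the speedup.
    """
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / n_workers)


def gustafson_speedup(parallel_fraction, n_workers):
    """Gustafson's Law: scaled speedup when the problem grows with the
    number of workers (weak scaling).

    Here parallel_fraction is the share of the parallel run's time spent
    in parallel work, which is a different baseline from Amdahl's Law.
    """
    serial_fraction = 1.0 - parallel_fraction
    return serial_fraction + parallel_fraction * n_workers


if __name__ == "__main__":
    # Hypothetical code that is 95% parallelizable.
    p = 0.95
    for n in (1, 8, 64, 512):
        print(n, round(amdahl_speedup(p, n), 2), round(gustafson_speedup(p, n), 2))
```

Under these assumptions the Amdahl speedup can never exceed 1 / (1 - 0.95) = 20 regardless of worker count, while the Gustafson scaled speedup keeps growing with the number of workers.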
Expanse Webinar: Parallel Computing Concepts
In this webinar we cover supercomputer architectures, the differences between threads and processes, implementations of parallelism (e.g., OpenMP and MPI), strong and weak scaling, limitations on scalability (Amdahl’s and Gustafson’s Laws) and benchmarking. Presented on January 20, 2022
Interactive Video
Expanse Webinar: Performance Tuning and Single Processor Optimization
This presentation will cover cache-level optimizations and other techniques for achieving optimal software performance. We will also cover AMD-specific compiler options, libraries and performance tools. Presented on February 18, 2021
Interactive Video
Expanse Webinar: Running Jupyter Notebooks on Expanse
In this webinar, we will present SDSC’s multitiered approach to running notebooks more securely: hosting Jupyter services on Expanse using SSH Tunneling or using the SDSC Jupyter Reverse Proxy Service (JRPS), which connects the user over an HTTPS connection. The JRPS will launch a batch script that creates a securely hosted HTTPS access point for the user, resulting in a safer, more secure notebook environment. Presented on December 10, 2020
Interactive Video
Expanse Webinar: Run your Jupyter Notebooks anywhere: Scaling up your Projects from Laptop to Expanse
In this webinar we demonstrate how to transition your Jupyter Notebooks from a local machine to the Expanse HPC system using command-line tools and the Expanse Portal. We cover creating transferable software environments, scaling up calculations to large datasets, parallel processing, and running Jupyter Notebooks in batch mode. Presented on June 16, 2022
Interactive Video
Expanse Webinar: Scientific Computing with Kubernetes
In this webinar we provide recipes for transitioning scientific workloads that currently run on traditional batch systems to Kubernetes systems. Kubernetes is batch-like in nature, but there are some differences that science users should be aware of. We will also briefly describe capabilities that are not found in traditional batch systems that can improve the effectiveness of scientific computing. Presented on October 20, 2022
Interactive Video
Expanse Webinar: Singularity – Containers for Scientific and High-Performance Computing
Come learn all about Singularity containers. In this webinar, we'll provide an overview of Singularity and how you might incorporate the use of containers in your own research. We'll also show you how to access and use some of the containerized applications that we make available to users on Expanse at SDSC. Presented on March 17, 2022
Interactive Video
Getting Started with Batch Job Scheduling: Slurm Edition
Learn how to write your first batch job script and submit it to a Slurm batch job scheduler. We discuss best practices on how to structure your batch job scripts, teach you how to leverage Slurm environment variables, and provide you with some tips on how to request resources from the scheduler to get your work done faster. We also introduce you to some advanced features like Slurm job arrays and job dependencies for more structured computational workflows. Presented on February 16, 2023
Interactive Video
GPU Computing and Programming on Expanse
This webinar gives a brief introduction to GPU computing and programming on Expanse. We will cover the GPU architecture, programming with the NVIDIA CUDA Toolkit and HPC SDK via libraries, OpenACC compiler directives, and CUDA, and submitting GPU-enabled jobs on Expanse. Presented on October 26, 2023
Interactive Video
High Performance Computing and Data Science Summer Institute
This year’s Summer Institute continues SDSC’s strategy of bringing HPC to the “long tail of science,” i.e., providing resources to a larger number of modest-sized computational research projects that represent, in aggregate, a tremendous amount of scientific research and discovery. Presented on August 2, 2021
Interactive Video | GitHub Repository
High Performance Computing and Data Science Summer Institute
The HPC and Data Science Summer Institute is a week-long workshop focusing on a broad spectrum of introductory-to-intermediate topics in High Performance Computing and Data Science. The program is aimed at researchers in academia and industry, especially in domains not traditionally engaged in supercomputing, who have problems that cannot typically be solved using local computing resources. Presented on August 1, 2022
Interactive Videos | GitHub Repository
HPC/CI Training Series
SDSC’s High Performance Computing (HPC)/Cyberinfrastructure (CI) Training Series was developed to support UC San Diego undergraduates and graduates interested in furthering their knowledge of HPC concepts and gaining hands-on training, as well as to build a team interested in competing in the Student Cluster Competition held at the annual International Conference for High Performance Computing, Networking, Storage, and Analysis (SC). This program is available to anyone interested in advancing their knowledge and experience on HPC systems and concepts. Presented on January 14, 2022
GitHub Repository
HPC CI Onboarding for the UCSD Research Community
The mini-workshop is designed to provide the UCSD research community with a streamlined pathway to swiftly engage with the Expanse cluster for their scientific endeavors. Collaboratively organized by Research IT and SDSC, this workshop series offers participants the opportunity to start using the Expanse cluster through the Campus Champions allocation, while benefiting from comprehensive training resources and expert guidance provided by SDSC. Presented on October 3, 2023
Interactive Video
Implementing Research Data Management for Labs & Grants
Implement a practical and well supported data management plan for your research lab, project or grant with SeedMeLab. Presented on April 28, 2021
Webinar Recording
Interactive Computing on High Performance Computing Resources
Interactive computing includes commonly used programs, such as word processors or spreadsheet applications, running on user devices (mobile phones, laptops). Interactive high-performance computing (HPC) involves real-time user inputs that result in actions being performed on HPC compute nodes. In this session we’ll present an overview of interactive computing tools and methods. Presented on December 7, 2023
Interactive Video
Introduction to Neural Networks, Convolution Neural Networks and Deep Learning
This webinar will be a quick introduction and overview of neural networks, convolution networks, and demonstration of executing deep learning models in an HPC environment. Presented on April 20, 2023
Interactive Video
Introduction to Singularity: Creating and Running Containers for High-Performance Computing
In this webinar, Yuwu Chen from TSCC User Services will show how to build Singularity images and then run them on the SDSC supercomputer clusters such as TSCC. Yuwu will also be sharing his insider knowledge of best practices along with pitfalls to avoid while working with Singularity. Presented on October 13, 2022
Interactive Video
Introduction to TSCC 2.0
This training will cover everything users need to know about the new TSCC 2.0 system that will be launched in phases starting late spring. Topics will include changes to the TSCC system, scheduler, queues, software stack, accounting, and policies for using TSCC. Presented on April 13, 2023
Recording
Kubernetes for Science Compute
Several new scientific compute resources are becoming available only through Kubernetes and their users will have to adapt their workloads to interface to it. This tutorial provides the basic Kubernetes notions any science user will need, paired with extensive hands-on exercises on a production-quality system to better explore the details. Presented on May 4, 2022
Repository | Video
Parallel and GPU Computing with MATLAB
In this session you will learn how to solve and accelerate computationally and data-intensive problems that are becoming common in the areas of machine learning and deep learning using multicore processors, GPUs, and computer clusters. Presented on April 27, 2022
Repository | Interactive Video
Parallel Computing Concepts
In this webinar we cover supercomputer architectures, the differences between threads and processes, implementations of parallelism (e.g., OpenMP and MPI), strong and weak scaling, limitations on scalability (Amdahl’s and Gustafson’s Laws) and benchmarking. Presented on January 19, 2023
Interactive Video
Performance Tuning and Optimization
This session is intended for attendees who do their own code development and need their calculations to finish as quickly as possible. We cover effective use of cache, loop-level optimizations, and other topics for writing and building optimal code. Presented on September 21, 2023
Interactive Video
Rich Data Sharing for HPC Users
This free webinar will introduce HPCShare, a web-based resource for users of SDSC’s high-performance computing resources, including Expanse, to easily share small- to medium-scale datasets in an efficient and organized manner. Attendees will learn about using HPCShare and SDSC’s SeedMeLab scientific data management system. Hosted by SDSC Visualization Group Lead Amit Chourasia. Presented on April 22, 2021
Interactive Video
Run your Jupyter Notebooks anywhere: Scaling up your Projects from your Laptop
In this webinar, we demonstrate how to transition your Jupyter Notebooks from a local machine to the Expanse HPC system using command-line tools and the Expanse Portal. We cover creating transferable software environments, scaling up calculations to large datasets, parallel processing, and running Jupyter Notebooks in batch mode. Presented on June 15, 2023
Interactive Video
Scientific Computing with Kubernetes
In this webinar we provide recipes for transitioning scientific workloads that currently run on traditional batch systems to Kubernetes systems. Kubernetes is batch-like in nature, but there are some differences that science users should be aware of. We will also briefly describe capabilities that are not found in traditional batch systems that can improve the effectiveness of scientific computing. Presented on July 20, 2023
Interactive Video
Singularity Containers
This webinar will briefly introduce how to build Singularity images and how to run them on the SDSC supercomputer clusters. We will also share some insider knowledge of best practices and pitfalls to avoid while working with Singularity. Presented on May 18, 2023
Interactive Video
Technology Forum: Expanse Supercomputer for Industry
SDSC's newest supercomputer, Expanse, supports SDSC's vision of 'Computing without Boundaries' by increasing the capacity and performance for thousands of users of batch-oriented and science gateway computing, and by providing new capabilities that will enable research increasingly dependent upon heterogeneous and distributed resources composed into integrated and highly usable cyberinfrastructure. It also implements new technical capabilities such as Direct Liquid Cooling. SDSC has acquired additional capacity for Expanse specifically to support industrial research and collaborations. Presented on April 15, 2021
Webinar Recording
Triton Shared Computing Cluster (TSCC) 101 Spring Training
This training will cover everything new users need to know about using the TSCC system. Topics will include: an overview of the condo/hotel program; how to apply; accounts and allocation usage monitoring; environment and software modules; an overview of the various queues; building PBS job scripts; job submission and monitoring; data transfers; and file systems. Presented on March 3, 2022
Recording | GitHub Content
TSCC 1.0 to 2.0 Transitional Workshop
During this workshop, we will provide an overview of TSCC 2.0 including the new authentication method, new allocation system, new filesystems, shared data transfer options from the current TSCC to TSCC 2.0, software stack, new partition characteristics, and provide examples of SLURM job scripts. Presented on November 6, 2023
Recording
TSCC 101: Accessing and Running Jobs on TSCC
During this workshop, we will provide an overview of TSCC, including authentication, allocation, filesystems, software stack, partition characteristics, and job submission, with examples of SLURM job scripts. Presented on September 12, 2024
Recorded Workshop
Using Python and Jupyter Notebooks on TSCC
This workshop will focus on providing guidelines for setting up customized Python environments, installing and managing packages using Miniconda/pip, and running secure Jupyter notebooks on the Triton Shared Computing Cluster (TSCC) HPC system. Presented on September 2, 2021
GitHub Repository | Interactive Videos
Voyager Part 1: Introduction and User Environment
This is the first of a two-part Voyager training session. Voyager is based on Intel’s Habana Labs AI processors and provides a unique opportunity to use AI-focused hardware for exploring AI in science and engineering. Voyager features Habana’s Gaudi processors optimized for training, Goya processors optimized for inference, 100 GbE all-to-all connection within Gaudi nodes, 24 x 100GbE RDMA RoCE for scale-out across Gaudi nodes, and a Ceph file system. Presented on October 18, 2022
Recording
Voyager Part 2: Habana Architecture Deep Dive and Porting of TensorFlow and PyTorch Applications
This is the second of a two-part Voyager training session. Voyager is based on Intel’s Habana Labs AI processors and provides a unique opportunity to use AI-focused hardware for exploring AI in science and engineering. Voyager features Habana’s Gaudi processors optimized for training, Goya processors optimized for inference, 100 GbE all-to-all connection within Gaudi nodes, 24 x 100GbE RDMA RoCE for scale-out across Gaudi nodes, and a Ceph file system. Presented on November 8, 2022
Recording