CUDA Multi Process Service, August 17, 2021
Introduction
NVIDIA will present “CUDA Multi Process Service” on Tuesday, August 17, 2021. This event is a continuation of the CUDA Training Series and will be presented by Max Katz from NVIDIA.
CUDA allows the user to target a GPU with multiple processes. A common use case in HPC is using multiple MPI ranks to target the same GPU. This session will describe how NVIDIA GPUs work when targeted by multiple processes simultaneously, and present the CUDA multi-process service (MPS) which helps improve the scheduling efficiency of this use case. The multi-instance GPU (MIG) feature on the NVIDIA A100 GPU will also be discussed, and the difference between MPS and MIG (and when you would use one or the other) will be explained.
After the presentation, there will be a hands-on session where participants can complete example exercises meant to reinforce the presented concepts and observe their impact on application performance. Cori-GPU access will be provided for current NERSC users for the hands-on exercises. Temporary OLCF Summit access will not be available for remote participants.
Date and Time: 10 am - 12 pm (Pacific time), Tuesday, August 17, 2021
The format of this event will be online only.
Registration
Registration is required for remote participation. Please click the "Registration" drop down on this page to register.
Remote Connection Information
Registration is required for remote participation. Please click the "Remote Connection Details" drop down on this page to register.
Presentation Materials
- Slides
- Recording
- Exercises: The example exercises for this module can be found in the "exercises/hw11" folder of this GitHub repo.