Introduction to Kernel Performance Analysis with NVIDIA Nsight Compute, Aug 26, 2021
Introduction
This webinar and demo/hands-on session presented by Max Katz from Nvidia is part of the ALCF Developer Sessions, also open to NERSC users.
Date and Time: 9 am - 10:30 am (Pacific time), August 26 (Thursday)
Abstract
This session will present the use of Nsight Compute for analyzing the performance of individual GPU kernels on the NVIDIA GPUs that power ALCF's ThetaGPU and NERSC's Perlmutter. We will walk through some simple compute kernels which are compute-bound and memory bandwidth-bound and learn how to profile them with Nsight Compute, generate roofline charts, and analyze the performance of those kernels. We will then introduce a sample realistic kernel from an HPC application and discuss how comprehensive kernel analysis can be used in an iterative process to substantially speed up key application bottlenecks. The webinar will conclude with an interactive demo and hands-on of Nsight Compute. The goal is for the users to be able to determine whether the performance of a compute construct is “good enough” relative to the capabilities of the hardware and, if not, what steps should be taken to address this.
Please find more information about the speaker and register at the event page of the ALCF Developer Sessions.
Presentation Materials