Optimizing CUDA Machine Learning Codes with Nsight Profiling Tools
, NVIDIA
, NVIDIA
This lab teaches how to use NVIDIA's Nsight tools for analyzing and optimizing CUDA applications. Attendees will be using Nsight Systems to analyze the overall application structure and explore parallelization opportunities. Nsight Compute will be used to analyze and optimize CUDA kernels, using an online machine learning code for 5G.
Prerequisite(s): CUDA knowledge. Linux command line familiarity.
*IMPORTANT: DLI Training Labs are free to attend with your GTC registration, but limited capacity and first-come, first-served. You may favorite or add a training lab to your schedule, but this does not guarantee you a seat. Rooms will be accessible 15 minutes before the session begins and can be accessed by clicking the “Join Now” button in the GTC Session Catalog. If the "Join Now" button isn't visible, you may need to refresh the page. Once the lab reaches capacity, you will no longer be able to enter the room. To get the most from your hands-on learning experience, please complete these steps prior to getting started.
活动: GTC Digital November
日期: November 2021
话题: Accelerated Computing & Dev Tools - Performance Optimization