Become Faster in Writing Performant CUDA Kernels using the Source Page in Nsight Compute

, Software Engineering Manager, NVIDIA
高度评价
Optimizing the performance of CUDA kernel code is typically by itself a time-constrained effort. Learn how to make the most of the Source Page in Nsight Compute to quickly pinpoint and resolve bottlenecks in your CUDA kernels. We'll discuss best practices to efficiently navigate the source views, how to utilize code correlation to understand the behavior of the compiler-generated code, and take a detailed look at the metrics that are available per individual source line.
活动: GTC Digital Spring
日期: March 2023
话题: Accelerated Computing & Dev Tools - Profilers / Debuggers / Code Analysis
行业: 所有行业
级别: 中级技术
语言: 英语
话题: Accelerated Computing & Dev Tools
所在地: