
      Advanced Performance Optimization in CUDA

      , Developer Technology Engineer, NVIDIA
      This talk is the second part in a series on core performance optimization techniques. It is intended for developers who are already familiar with the basics covered in the first part. We'll teach advanced techniques and show how to use some of the new features introduced in Hopper. Topics include asynchronous copies and barriers, CUDA clusters, L2 persistency, CUDA graphs, memory pools, and dynamic parallelism 2.0.
      Event: GTC 24
      Date: March 2024
      Level: Advanced Technical
      Industry: All Industries
      NVIDIA Technology: CUDA
      Topic: Performance Optimization
      Language: English
      Location: