Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected
      • Quality

      Accelerating Drug Discovery: Optimizing Dynamic GPU Workflows with CUDA Graphs, Mapped Memory, C++ Coroutines, and More

      , Tech Lead - Desmond Engine, Schrodinger, Inc.
      , Senior Developer Technology Engineer, NVIDIA Corporation
      As the raw compute FLOPS become faster and memory bandwidth becomes higher for the latest GPUs, it becomes challenging for applications that launch large numbers of lightweight kernels to saturate GPU compute resources. We'll present the challenges we faced when adapting Desmond, the state-of-art code for performing molecular dynamics simulations for drug discovery, to the latest GPUs, and show how various CUDA features are utilized to overcome them.

      Topics we'll cover include:
      • Employing CUDA graphs in dynamic environments to amortize the CUDA kernel and CUDA API launch overheads;
      • Using mapped memory to speed up the data transfers between the GPU and CPU; and
      • Using coroutine to delay the GPU synchronizations to reduce the GPU idle time.
      活动: GTC 24
      日期: March 2024
      NVIDIA 技术: CUDA,Nsight Compute,Nsight Systems
      行业: HPC / 科学计算
      级别: 中级技术
      话题: Performance Optimization
      语言: 英语
      所在地: