
      Achieving Higher Performance From Your Data Center and Cloud Application

      , Senior Solution Architect, Amazon Web Services
      , Senior Director of Engineering, Developer Tools, NVIDIA
      When scaling out your application across multiple server nodes, what could go wrong? With so many servers, GPUs, network links, and process ranks to investigate, where do we start? Going one node at a time is daunting. How can we take a holistic view and triage a cluster of nodes all at once, narrowing our scope to what's pertinent and tractable? How do we transition from monitoring to profiling, connecting issues to code and higher-level algorithms? Learn how to tap into a spectrum of data sources and analyzers to find the bottlenecks and turn them into optimization opportunities. Multi-node analysis recipes will guide developers on their journey to maximize performance and utilization. We'll explore how various network, data, and software patterns affect an application's GPU utilization. Whether your cluster is on-premises, in NVIDIA's DGX Cloud, or rented from a communication service provider, we'll help you get started.
      Event: GTC 24
      Date: March 2024
      Industry: All Industries
      NVIDIA Technology: Base Command, Cloud / Data Center GPU, CUDA, CUDA-X, DALI, DGX
      NVIDIA Technology: Ethernet Networking, Grace CPU, InfiniBand Networking, NCCL, Nsight Systems, NVLink / NVSwitch
      Level: Intermediate Technical
      Topic: Profilers / Debuggers / Code Analysis
      Language: English
      Location: