Achieving Higher Performance From Your Data Center and Cloud Application
Senior Solution Architect, Amazon Web Services
Senior Director of Engineering, Developer Tools, NVIDIA
When you scale out your application across multiple server nodes, what could go wrong? With so many servers, GPUs, network links, and process ranks to investigate, where do we start? Investigating one node at a time is daunting. How can we take a holistic view and triage a cluster of nodes all at once, narrowing our scope to what's pertinent and tractable? How do we transition from monitoring to profiling, connecting issues to code and higher-level algorithms? Learn how to tap into a spectrum of data sources and analyzers to find the bottlenecks and turn them into optimization opportunities. Multi-node analysis recipes will guide developers on their journey to maximize performance and utilization. We'll explore how various network, data, and software patterns affect an application's GPU utilization. Whether your cluster is on-premises, in NVIDIA's DGX Cloud, or rented from a communication service provider, we'll help you get started.
Event: GTC 24
Date: March 2024
Industry: All Industries
NVIDIA Technology: Base Command, Cloud / Data Center GPU, CUDA, CUDA-X, DALI, DGX