Do you need to compute larger or faster than a single GPU allows? Learn how to scale your application to multiple GPUs and multiple nodes. We'll explain how to use the different available multi-GPU programming models and describe their individual advantages. All programming models, including CUDA-aware MPI, NVSHMEM and NCCL, will be introduced using same example, applying a domain decomposition strategy.