GPU Accelerated Libraries
22 items
March 2024
NVIDIA’s GPU-accelerated Math Libraries, which are part of the CUDA Toolkit and the HPC SDK, are constantly expanding, providing industry-leading performance and coverage of common compute workflows across AI, ML, and HPC. We'll do a deep dive into some of the latest advancements in the …
March 2024
NVIDIA has developed a new large-scale solver, cuDSS, for sparse linear systems that uses GPU computations for the matrix factorization and solution. This solver was integrated into the UniSim EO platform using the UniSim AXB sparse linear algebra interface, which enables sparse linear algebra …
March 2024
NVIDIA’s H100 introduced fourth-generation Tensor Cores to GPU computing, with over twice the peak performance of the previous generation. This session will build on our GTC’23 session. We'll describe how the latest version of CUTLASS leverages Hopper features for peak performance, covering major …
March 2024
Do you need to compute larger or faster than a single GPU allows? Learn how to scale your application to multiple GPUs and multiple nodes. We'll explain how to use the different available multi-GPU programming models and describe their individual advantages. All programming models, including …
Training Deep Learning Models at Scale: How NCCL Enables Best Performance on AI Data Center Networks
Discover how NCCL uses every capability of all DGX and HGX platforms to accelerate inter-GPU communication and allow deep learning training to scale further. See how Grace Hopper platforms can leverage multi-node NVLink to compute in parallel at unprecedented speeds. Compare different …
March 2024
Take a deep dive into the latest developments in NVIDIA software for high performance computing applications, including a comprehensive look at what’s new in programming models, compilers, libraries, and tools. We'll cover topics of interest to HPC developers, targeting traditional HPC …
March 2023
Do you want to write modern C++ on your GPU? Are you curious about C++ Standard Parallelism? Join NVIDIA's C++ library and standards team for a Q&A session on: C++ Standard Parallelism and NVC++, Thrust (CUDA C++'s high-productivity general-purpose library and parallel algorithms …
March 2024
Graph neural networks (GNNs) are an increasingly popular class of artificial neural networks designed to process data that can be represented as graphs. The two prominent GNN frameworks are the Deep Graph Library (DGL) and PyTorch Geometric (PyG). The RAPIDS cuGraph effort has been working on …
March 2024
Pandas is flexible, but often slow when processing gigabytes of data. Many frameworks promise higher performance, but they often support only a subset of the Pandas API, require significant code change, and struggle to interact with or accelerate third-party code that you can’t change. RAPIDS cuDF …
March 2024
Recommendation systems are integral to many online platforms, enabling personalized content and product recommendations. The transformer paradigm in particular has been leveraged for building state-of-the-art sequential recommender systems. In this session, we'll expand upon previous work …
March 2023
CV-CUDA is an open source library that enables developers to build highly efficient, GPU-accelerated pre- and post-processing pipelines in cloud-scale Artificial Intelligence (AI) imaging and computer vision (CV) workloads in mapping, generative AI, three-dimensional worlds, image understanding, …
March 2023
Both the federal community and the commercial marketplace have critical mission needs to rapidly geolocate imagery that has no associated geospatial information for a wide variety of computer vision applications, such as search and rescue, natural hazards detection, and environmental monitoring. …
March 2023
Learn about the latest optimizations in NVIDIA's image/signal processing libraries like CV-CUDA, NPP, nvJPEG, and DALI — a fast, flexible data loading and augmentation library. We'll discuss how to use various data processing solutions spanning low-level image and signal processing primitives in NPP, …
April 2021
NVIDIA’s GPU-accelerated Math Libraries, which are part of the CUDA Toolkit and the HPC SDK, are constantly expanding, providing industry-leading performance and coverage of common compute workflows across AI, ML, and HPC. We'll review the latest developments in the Math Libraries with a …
April 2021
This talk compares Thrust with the C++ Standard algorithms and highlights some of the things only possible in Thrust. Both Thrust and the C++ Standard offer an extensive selection of algorithms. Many algorithms exist in both Thrust and the C++ Standard, but there are …
April 2021
CUDA C++ is an extension of the ISO C++ language that allows you to use familiar C++ tools to write parallel programs that run on GPUs. However, one essential C++ tool has been missing from device-side CUDA C++: the C++ standard library. But not any longer! Introduced in the CUDA 10.2…
April 2021
Come join NVIDIA’s CUDA C++ Core Libraries team for a Q&A session on: • Thrust: the C++ parallel algorithms library. https://github.com/NVIDIA/thrust • CUB: cooperative primitives for CUDA C++ kernel authors. https://github.com/NVIDIA/cub • libcu++: the C++ Standard Library for your entire …
April 2021
Are you wondering how to easily access tensor cores through NVIDIA Math Libraries, such as sparse tensor cores introduced with the NVIDIA Ampere Architecture GPUs? Or have you already used our libraries and have questions or feedback? Meet the engineers who create tensor core accelerated …
April 2021
CUTLASS provides building blocks, in the form of C++ templates, to CUDA programmers who are eager to write their own CUDA kernels to perform deep learning computations. We'll focus on implementing 2-D and 3-D convolution kernels for NVIDIA's CUDA cores and Tensor Cores. We'll describe the Implicit GEMM …
April 2021
Do you need to compute larger or faster than a single GPU allows? Learn how to scale your application to multiple GPUs and multiple nodes. We'll explain how to use the different available multi-GPU programming models and describe their individual advantages. All programming models, including …
April 2021
Wondering how to scale your code to multiple GPUs in a node or cluster? Need to discuss NCCL or CUDA-aware MPI details? This is the right session for you to ask your beginner or expert questions on multi-GPU programming with CUDA, GPUDirect, NCCL, NVSHMEM, and MPI. Connect with the Experts …
April 2021
NVIDIA Mellanox Scalable Hierarchical Aggregation and Reduction Protocol (SHARP) technology improves the performance of MPI and machine learning collective operations by offloading them from the CPU or GPU to the network and eliminating the need to send data …