Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected

      GPU DiskANN and Beyond: Accelerating Microsoft Vector Search with NVIDIA cuVS

      , AI Developer Technology Engineer, NVIDIA
      , Software Engineer, Microsoft
      , Senior Software Engineer, Microsoft
      From web search to ads recommendation to RAG, vector search has become an essential operation for vast range of large-scale applications. However, these applications may have widely varying requirements. As such, no single design or algorithm fits all use cases. Nevertheless, GPUs can accelerate many scenarios, potentially giving orders of magnitude improvements in index construction time or search throughput, compared with CPU-based solutions. In this talk joint talk with Microsoft and NVIDIA, we will present a range of Microsoft use cases that require high performance vector search. We will then discuss the unique challenges of these scenarios and how we leverage NVIDIA GPUs and the cuVS library to accelerate these workloads. From fast DiskANN index construction to high-throghput filtered search, a range of algorithms, features, and optimizations are used to leverage GPUs and provide performance and cost benefits over current CPU-based solutions.
      活动: GTC 25
      日期: March 2025
      NVIDIA 技术: CUDA,RAPIDS,cuVS
      话题: Data Science - Databases
      行业: HPC / 科学计算
      级别: Technical – Intermediate
      语言: 英语
      所在地: