Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected
      • Quality

      Harnessing Grace Hopper's Capabilities to Accelerate Vector Database Search

      , Principal Developer Technology Engineer, NVIDIA
      We'll explore methods to substantially alleviate the constraints imposed by database size, a well-recognized limitation in the realm of graph-based approximate nearest neighbor search (Graph ANNS) for GPUs such as CAGRA. This will be accomplished by harnessing the extensive data transfer bandwidth between the CPU and GPU of Grace Hopper.

      Conventional solutions to large databases that cannot be accommodated in GPU memory often involve quantization or compression. However, Graph ANNS necessitates an additional graph index alongside the database, a component that cannot be downsized through compression. For oversized databases, this graph index is relegated to the host memory. This scenario severely hampers the performance of x86+H100 systems. In contrast, Grace Hopper's performance experiences only a minor decline, showcasing its ability to manage a massive database up to 5-10x larger on a single GPU, maintaining nearly consistent performance levels.
      活动: GTC 24
      日期: March 2024
      行业: Cloud Services
      NVIDIA 技术: Grace CPU,Hopper
      级别: 中级技术
      话题: Performance Optimization
      语言: 英语
      所在地: