Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected
      • Quality

      Accelerating Enterprise: Tools and Techniques for Next-Generation AI Deployment

      , Generative AI Product Manager, NVIDIA
      , Software Product Manager, NVIDIA
      高度评价
      In this session, we will delve into the dynamic realm of AI inference, examining the latest state-of-the-art tools and techniques designed to revolutionize how developers deploy generative AI models. As the AI landscape continues to rapidly evolve, the demand for increased speed and efficiency in AI inference is becoming increasingly critical. Our focus will be on the newly announced NVIDIA NIMs, a set of easy-to-use runtimes designed to accelerate the deployment of generative AI. This versatile microservice supports a wide spectrum of AI models—from open-source community models to NVIDIA AI Foundation models, as well as bespoke custom AI models.
      活动: GTC 24
      日期: March 2024
      行业: 所有行业
      级别: 初级技术
      NVIDIA 技术: BioNeMo,Metropolis,MONAI,NeMo,TensorRT,Triton
      话题: 生成式 AI 平台
      语言: 英语
      所在地: