Video Player is loading.
Current Time 0:00
Duration 0:00
Loaded: 0%
Stream Type LIVE
Remaining Time 0:00
 
1x
    • Chapters
    • descriptions off, selected
    • subtitles off, selected
      • Quality

      FastDeploy: Full-Scene, High-Performance AI Deployment Tool (Presented by Baidu Online Network Technology (Beijing) Co., Ltd.)

      , Senior Product Manager, Baidu
      FastDeploy is a full-scene, extremely efficient, easy-to-use and flexible AI deployment toolkit for cloud, mobile, and edge. It unifies Paddle and the ecological AI Deployment Engine API including Paddle Inference, Paddle Lite, TensorRT, ONNX Runtime, Poros, and other inference engines to help developers flexibly switch multiple inference engine backends with a single command. It also integrates Triton Inference Server to help developers rapidly deploy to cloud, mobile, and edge in one toolkit. Integrating AI acceleration libraries such as CV-CUDA, FastTokenier, FlyCV, and PaddleSlim automatic compression tool achieves end-to-end performance optimization of AI models. FastDeploy designs a unified deployment API for different languages, and you only need three lines of core code to achieve high-performance AI deployment. You can complete the industrial AI deployment with the 160-plus state-of-the-art models demo.
      活动: GTC Digital Spring
      日期: March 2023
      行业: 所有行业
      级别: 初级技术
      话题: Deep Learning - Inference
      语言: 英语
      话题: Deep Learning
      所在地: