Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
详情
字幕
Scaling Generative AI Features to Millions of Users Thanks to Inference Pipeline Optimizations
, Cofounder & CTO, PhotoRoom
, Machine Learning Scientist, PhotoRoom
We'll cover how PhotoRoom uses generative AI models for real-time inference. We'll start by highlighting the techniques used for architecture optimization, followed by a deep dive into TensorRT. We'll also cover how we serve models for millions of users, leveraging Triton Inference Server.