Designing VLM-Based AI Agents for Large-Scale Video Analysis
, AI Research Lead, AppsFlyer
Dive deep into innovative architectures for building vision language models (VLMs)-based AI agents for large-scale video processing pipelines. Explore strategies to maximize the benefits of VLMs and their comprehensive visual understanding capabilities, balanced with computational efficiency.
活动: GTC 25
日期: March 2025
NVIDIA 技术: Cloud / Data Center GPU,RTX GPU,NeMo,NVIDIA NIM
话题: Computer Vision / Video Analytics - Tracking / Video Analytics