Orchestrating a Multimodal Agent With Foundation Model Building Blocks

Sr. Content Developer, NVIDIA

As AI moves into mainstream use, deep learning components from a wide range of disciplines are quickly being incorporated into end-user applications, and emerging workflows that combine multiple seemingly disjoint models or input types work surprisingly well.

  • Investigate the primitives that make up everything from MNIST image generators to vision-guided language models.
  • Discuss large-model orchestration and show how large models can be combined to construct end-to-end multimodal agent systems.
Event: Siggraph
Date: August 2024
Topic: AI Inference
Industry: All Industries
Level: Beginner Technical
Language: English
Location: