How NVIDIA Accelerates Retailers on their GenAI Journey

, Senior Solutions Architect, NVIDIA
, Senior Solutions Architect, NVIDIA
, Solution Architect Manager, NVIDIA

Generative AI (GenAI) and large language models (LLMs) enable retailers to build novel and innovative solutions that empower internal employees, reduce costs, and revolutionize the customer experience. As the world’s most advanced platform for accelerated computing, NVIDIA provides hardware and software designed to accelerate development and deployment of generative AI applications. Learn how retailers can use NVIDIA solutions like NeMo and Edify to build, train, and optimize LLM/GenAI applications for production. Specifically, we'll cover: 

  • TRT-LLM and Triton Inference Server to optimize and deploy LLM models for low latency and high throughput;
  • NeMo Microservices — customizing and deploying any LLM in under five minutes with guardrails;
  • Retrieval-augmented generation workflows and use cases like the NVIDIA Retail Product Advisor AI Workflow; and
  • Combining models from AI Playground and AI Foundry into novel GenAI applications for in-painting and out-painting.

Note: As of June 6, 2025, NVIDIA Edify is no longer available as a NIM microservice preview. To explore available visual AI models, visit https://build.nvidia.com/explore/visual-design

活动: GTC 24
日期: March 2024
级别: 通用
行业: Retail / Consumer Packaged Goods
话题: Text Generation
语言: 英语
所在地: