Streamlining Investment Insights for Wealth Management with Generative AI

, Data Science Manager, Director, UBS
, Solutions Architect, NVIDIA
The collaboration between UBS and NVIDIA focuses on real-time risk assessment and monitoring of production retrieval augmented generation (RAG). Unlike holistic evaluations of RAG applications, real-time solutions must introduce minimal latency and offer a high degree of live adaptation and reasoning. To ensure the live observability of large language models (LLMs), we use "LLMs as a judge," where the reliability of the answers provided by one LLM is re-evaluated by another LLM before being returned as the system’s output. A common pitfall in "LLM as a judge" applications is that models tend to prefer their own answers, and thus undermine a system’s trustworthiness. To provide a first-of-its-class solution to this problem in the heavily-regulated banking industry, we adopt NVIDIA’s NIMs to deploy an open-source model as the LLM judge and extend it with observability tools from NeMo Guardrails and Evaluator.
活动: GTC 25
日期: March 2025
NVIDIA 技术: Cloud / Data Center GPU,TensorRT,NeMo,Triton,NVIDIA NIM
行业: 金融服务
级别: 通用
话题: Generative AI - Retrieval-Augmented Generation (RAG)
语言: 英语
所在地: