Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
详情
字幕
PDFSpeak: Unlocking Multimodal PDF Intelligence Through Speech
, Solutions Architect, Generative AI, NVIDIA
, Sr. Solution Architect , NVIDIA
, AI Solutions Architect, NVIDIA
, AI Solutions Architect, NVIDIA
Gain next-level insights from PDF documents just by speaking to your data. This hands-on session shows participants how to build and run PDFSpeak, an innovative approach to interacting with complex PDF documents using NVIDIA's cutting-edge AI technologies through speech, vision, and text. Prerequisite(s):
GitHub, Docker, Bash, familiarity with Python.
活动: GTC 25
日期: March 2025
行业: 所有行业
级别: 通用
话题: Generative AI - Retrieval-Augmented Generation (RAG)