Discover
Build Continuous Refining AI Agents with Data Flywheels
Try NowBuild a production-integrated data flywheel, with NVIDIA NeMo microservices, that continuously optimizes AI agents for latency and cost — while maintaining accuracy targets.
Featured Models
View AllThe leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.
metallama-4-maverick-17b-128e-instruct
A general purpose multimodal, multilingual 128 MoE model with 17B parameters.
qwenqwen3-235b-a22b
Advanced reasoing MOE mode excelling at reasoning, multilingual tasks, and instruction following
nvidiallama-3.3-nemotron-super-49b-v1
High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
metallama-4-scout-17b-16e-instruct
A multimodal, multilingual 16 MoE model with 17B parameters.
Customize a Blueprint
View AllGet started with workflows and code samples to build AI applications from the ground up.
nvidiaBuild an AI Agent for Enterprise Research
Build artificial general agents (AGA) powered by AGI models that continuously process and synthesize multimodal enterprise data, enabling reasoning, planning, and refinement to generate comprehensive reports.
nvidiaBuild a Video Search and Summarization (VSS) Agent
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
nvidiaBuild an Enterprise RAG pipeline
Connect AI applications to multimodal enterprise data with a scalable retrieval augmented generation (RAG) pipeline built on highly performant, industry-leading NIM microservices, for faster PDF data extraction and more accurate information retrieval.
nvidiaSafety for Agentic AI
Improve safety, security, and privacy of AI systems at build, deploy and run stages.