Deploying Triton with Kubernetes at Scale

, ML Lead, Falkonry
, CTO, Falkonry
Falkonry’s AI exploits the computational power of GPUs to provide real-time insights against trillions of high-speed data points. AI for defense and industrial operations are constrained by 2 factors: Speed & Scale. Using an orderly three-way brokerage between data, models, and GPU resources, supported by NVIDIA’s flexible Triton inference server and DGX architecture, Falkonry and NVIDIA have changed the game. This breakthrough in AI scaling makes possible information dominance in defense and real-time AI at plant-scale for large industrial verticals such as metals manufacturing — all as a complete time series AI “platform-in-a-box” for on-site, in-the-field, deployment.
活动: GTC Digital Spring
日期: March 2022
级别: 高级技术
行业: 制造业
话题: Manufacturing - Inspection / Predictive Maintenance / Logistics
语言: 英语
所在地: