Flaws replicated from Meta’s Llama Stack to Nvidia TensorRT-LLM, vLLM, SGLang, and others, exposing enterprise AI stacks to systemic risk. Cybersecurity researchers have uncovered a chain of critical ...
Security researchers have lifted the lid on a chain of high-severity vulnerabilities that could lead to remote code execution (RCE) on Nvidia's Triton Inference Server. … Wiz Research said that if the ...
A crafted inference request in Triton’s Python backend can trigger a cascading attack, giving remote attackers control over AI-serving environments, researchers say. A surprising attack chain in ...
Nvidia has set new MLPerf performance benchmarking records on its H200 Tensor Core GPU and TensorRT-LLM software. MLPerf Inference is a benchmarking suite that measures inference performance across ...
A chain of critical vulnerabilities in NVIDIA's Triton Inference Server has been discovered by researchers, just two weeks after a Container Toolkit vulnerability was identified. The Triton Inference ...
Davey Winder is a veteran cybersecurity writer, hacker and analyst. Nvidia is no longer just the company that produces the ...
Apple and NVIDIA shared details of a collaboration to improve the performance of LLMs with a new text generation technique for AI. Cupertino writes: Accelerating LLM inference is an important ML ...
SAN JOSE, Calif., March 16, 2026 (GLOBE NEWSWIRE) -- GTC -- NVIDIA today announced NVIDIA Dynamo 1.0, open source software for generative and agentic inference at scale, with widespread global adoption ...
Nvidia has released analysis showing a 4X to 10X reduction in cost per token for AI inferencing by switching to open source models. The cost discounts required combining Blackwell hardware with two ...