site:venturebeat.com - Search News

Nvidia lets its 'claws' out: NemoClaw brings security, scale to the agent platform taking over AI

That said the direction is clear. Claws are coming to the enterprise. Nvidia just made its bet on being the platform they run on — and the guardrails that keep them in bounds.

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

19m

The authorization problem that could break enterprise AI

When an AI agent needs to log into your CRM, pull records from your database, and send an email on your behalf, whose identity is it using? And what happens when no one knows the ...

Nvidia's agentic AI stack is the first major platform to ship with security at launch, but governance gaps remain

Five security vendors shipped governance for Nvidia's agentic AI stack at GTC — the first time security has launched with a major AI platform. Here's the five-layer framework, what it covers, and ...

22h

Nvidia launches enterprise AI agent platform with Adobe, Salesforce, SAP among 17 adopters at GTC 2026

At GTC 2026, Nvidia launched its Agent Toolkit, signing Adobe, Salesforce, SAP and 14 others in a major push to power ...

19h

Crash course: How new tech is changing accident claims

Technology has completely changed how cars operate, and now it’s changing what happens after a car has been in an accident. These changes can take many forms, from the devices that record a crash to ...

Langsmart Publishes Industry’s First p95 Semantic Cache Benchmarks for On-Premises AI Gateway, Challenges Market: “Show Me the p95”

Testing Confirms 10.2x Faster Response Times, Exceeding Cloud-Hosted Alternatives SAN JOSE, Calif.--(BUSINESS WIRE)--March 17, 2026-- NVIDIA GTC 202 ...

22h

Nvidia introduces Vera Rubin, a seven-chip AI platform with OpenAI, Anthropic and Meta on board

Nvidia debuts Vera Rubin, a seven-chip AI platform designed to deliver cheaper, faster AI infrastructure with support from OpenAI, Anthropic and Meta.

22h

Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap

Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage, claiming 5x token throughput and 4x energy efficiency for agentic AI ...

23h

How LinkedIn replaced five feed retrieval systems with one LLM model, at 1.3 billion-user scale

How LinkedIn replaced five feed retrieval systems with one LLM model — and what engineers building recommendation pipelines can learn from the redesign.

23h

z.ai debuts faster, cheaper GLM-5 Turbo model for agents and 'claws' — but it's not open-source

Z.ai says GLM-5-Turbo is currently closed-source, but it also says the model’s capabilities and findings will be folded into its next open-source model release ...

22h

Nvidia's DGX Station is a desktop supercomputer that runs trillion-parameter AI models without the cloud

Nvidia introduced the DGX Station at GTC 2026, a desktop supercomputer with 20 petaflops of AI performance and 748GB of coherent memory that can run trillion-parameter AI models locally without the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results