What Is GPU Memory - Search News

What GPU You Really Need for AI Workloads

GPU memory (VRAM) is the critical limiting factor that determines which AI models you can run, not GPU performance. Total VRAM requirements are typically 1.2-1.5x the model size due to weights, KV ...

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap

Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage ...

InfoQ

Pitfalls of Unified Memory Models in GPUs

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

The Next Platform

Sizing Up AWS “Blackwell” GPU Systems Against Prior GPUs And Trainiums

This week, Amazon Web Services announced the availability of its first UltraServer pre-configured supercomputers based on Nvidia’s “Grace” CG100 CPUs and its “Blackwell” B200 GPUs in what is called a ...

Tech Times

NVIDIA vs AMD 2026: Ultimate GPU Showdown for Gaming, AI, and Performance

NVIDIA vs AMD heats up in 2026 with RTX 5090 and RX 9070 XT. Compare performance, value, and AI/gaming features to find the best GPU for your needs.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results