DGX Station for Windows: 1T-Param Local AI Workstation

NVIDIA announced DGX Station for Windows at Computex 2026 on June 1, putting a GB300 Grace Blackwell Ultra chip and 748GB of memory into a desktop workstation that runs trillion-parameter models locally. Shipping Q4 2026 from ASUS, Dell, GIGABYTE, HP, MSI, and Supermicro.

What Happened

Jensen Huang unveiled DGX Station during the Taipei keynote as the first desktop-class workstation built around Grace Blackwell Ultra. The system pairs a GB300 chip with 748GB of unified memory, delivering 20 petaflops of FP4 performance, and runs full Windows 11.

NVIDIA framed it explicitly for "developers, researchers, and creators who need to run AI workloads that today require a cloud GPU pod." The system supports inference on models up to 1 trillion parameters without offloading to disk.

It ships alongside NVIDIA OpenShell, a Windows agentic runtime that accelerates llama.cpp and vLLM by 2x through tighter Grace Blackwell scheduling.

Why It Matters for Creators

Cloud GPU rates for an H200 pod now run $4-12 per hour. A creator running a daily Stable Diffusion 4 training job, weekly LoRA fine-tunes, or interactive Wan 3.0 video generation can burn through $500-2000 per month. DGX Station's local-only operation turns that into a one-time hardware purchase with zero ongoing inference cost.

The 748GB memory is the differentiator: today's high-end workstation cards (RTX 6000 Ada, 48GB) can hold 70B models comfortably but choke on 405B or above. DGX Station handles DeepSeek V4, Llama 5 405B, and the upcoming Nemotron 3 Ultra (550B) without quantization tricks.

Key Details

Chip: GB300 Grace Blackwell Ultra

Memory: 748GB unified

Performance: 20 petaflops FP4

OS: Windows 11 with NVIDIA OpenShell runtime

Max model size: 1 trillion parameters local inference

Launch: Q4 2026

Partners: ASUS, Dell, GIGABYTE, HP, MSI, Supermicro

What to Do Next

Pricing is not yet announced, but compared to the previous DGX Station (~$74K), this generation is expected to land in the $40-60K range for OEM SKUs. If your team currently spends $30K+ per year on cloud GPU time for fine-tuning, the math closes inside two years.

Three things to do before Q4: benchmark your current cloud spend to set a payback target, confirm your data residency requirements (DGX Station means your training data never leaves your office), and shortlist the two or three models you would run locally that you currently can't.

Partner SKU details will appear on NVIDIA's DGX Station page through Q3.

This story was covered by Creative AI News.

Subscribe for free to get the weekly digest every Tuesday.

DGX Station for Windows: Run 1T-Param Models Locally

What Happened

Why It Matters for Creators

Key Details

What to Do Next

Keep reading

GPT-5.6 Sol, Terra, Luna Land on Amazon Bedrock

Claude Opus 5: Anthropic's New Frontier Model, Explained

Codex Slides: Open-Source AI Deck Studio in Codex

What Happened

Why It Matters for Creators

Key Details

What to Do Next

Stay ahead of AI

Keep reading

GPT-5.6 Sol, Terra, Luna Land on Amazon Bedrock

Claude Opus 5: Anthropic's New Frontier Model, Explained

Codex Slides: Open-Source AI Deck Studio in Codex

Stay ahead of Creative AI