What GPU waterblocks does FormulaMod carry?

FormulaMod stocks full-cover GPU waterblocks for over 500 graphics card models, covering RTX 5090, 4090, 3090, RX 9070 XT, and H100/H200 GPUs. We stock over 1,600 water cooling products with same-week worldwide shipping from Guangzhou.

How do I choose a waterblock for my GPU?

Match the waterblock to your exact GPU model and PCB variant. Reference-design cards (Founders Edition, reference Sapphire/PowerColor) use universal reference blocks. Non-reference cards from ASUS ROG Strix, MSI Gaming X Trio, Gigabyte AORUS, and EVGA FTW3 require model-specific full-cover blocks that match their unique PCB layouts. Always verify your GPU's exact model number before ordering.

Will a GPU waterblock reduce noise during AI workloads like Ollama or Stable Diffusion?

Yes. Stock GPU air coolers run at 45-65 dBA under sustained AI inference loads. A full-cover waterblock mounted to a 360mm radiator with a quiet fan curve typically reduces noise to under 30 dBA. VRAM temperatures also drop from 90°C+ to 60-70°C, which prevents throttling during long generation runs.

What components do I need for a complete custom water cooling loop?

A complete custom loop requires: (1) a full-cover GPU waterblock, (2) a radiator — 360mm recommended for single-GPU systems, 480mm for dual GPU or CPU+GPU loops, (3) a D5 or DDC pump with reservoir combo, (4) G1/4 threaded fittings — compression fittings for soft tubing or hard fittings for PETG/acrylic hard tubing, (5) tubing, and (6) premixed coolant. FormulaMod sells individual components and complete kits starting at $249 USD.

Does FormulaMod ship internationally?

Yes. FormulaMod ships worldwide from Guangzhou, China. Standard shipping to the US, EU, UK, Canada, and Australia typically takes 7-14 business days. Express DHL/FedEx options are available at checkout for 3-5 business day delivery. All orders include a tracking number.

How long has FormulaMod been operating?

FormulaMod has operated as an independent water-cooling specialist since 2013. Continuous online presence is publicly verifiable via the Internet Archive Wayback Machine. FormulaMod is a U.S. registered trademark (USPTO Reg. No. 6073949) and ships worldwide from our Guangzhou warehouse with full manufacturer warranty.

GPU waterblock water cooling setup for AI workstation — NVIDIA RTX Pro 6000 Blackwell (96GB) Water Cooling Guide: Th

NVIDIA RTX Pro 6000 Blackwell (96GB) Water Cooling Guide: The H100 Killer for Home AI Labs

By HuangLiang · Published Apr 4, 2026 · Updated Apr 22, 2026

April 4, 2026

The RTX Pro 6000 Blackwell is the card that confused everyone when it launched in March 2025 — it's priced at $8,000–11,000, looks like a gaming GPU, fits in a standard PCIe slot, and yet in independent benchmarks it outperforms the NVIDIA H100 SXM on single-GPU LLM inference while costing one-third as much. If you're building a serious home AI workstation and the RTX 5090's 32GB VRAM ceiling has started to feel tight, this is the next card to understand.

This guide covers everything about the RTX Pro 6000 Blackwell: what makes it different, why water cooling it makes sense, and exactly what hardware you need.

What the RTX Pro 6000 Blackwell Actually Is

The RTX Pro 6000 Blackwell is built on the same GB202 die as the RTX 5090 — but it's the fully unlocked version. While the RTX 5090 uses 21,760 CUDA cores from that die, the Pro 6000 enables 24,064 CUDA cores, an 11% increase. GamersNexus confirmed this after a teardown: "This is a 5090 die, just a fuller version of it."

The critical difference is memory: 96GB of GDDR7 ECC versus the RTX 5090's 32GB. That 3x VRAM increase is the entire reason this card exists — it unlocks workloads that no consumer GPU can handle on a single card.

Three Versions — Which One Are You Buying?

NVIDIA launched three distinct versions of the RTX Pro 6000 Blackwell in 2025, which caused considerable confusion:

Version	TDP	Cooling	Form Factor	Use Case
Workstation Edition	600W	Dual-fan active	Standard PCIe, dual-slot	Desktop workstation, home lab
Server Edition	300W (boost available)	Passive	Single-slot, front-to-back airflow	Rack server, datacenter

For home AI lab use, you want the Workstation Edition. It slots into a standard ATX case, has its own active cooler, and runs at full 600W TDP. The Server Edition is passively cooled and requires forced airflow from a server chassis.

The Benchmark Numbers That Matter

These are real numbers from independent tests, not marketing material:

vs. RTX 5090

Gaming: 5–14% faster (GamersNexus, Sept 2025) — irrelevant for AI, but confirms the die advantage
AI text generation: StorageReview scored 325.9 tokens/s vs RTX 5090's equivalent — consistent lead across all model sizes tested
Why it matters: Same die, more cores, 3x the VRAM. For models that fit in 32GB, the Pro 6000 is marginally faster. For models between 32GB and 96GB, the Pro 6000 is the only single-card option.

vs. H100 SXM (the $30,000 datacenter GPU)

Single-GPU throughput: RTX Pro 6000 — 3,140 tok/s vs H100 SXM — 2,987 tok/s (CloudRift, Oct 2025). The Pro 6000 wins.
Cost per token: Pro 6000 — $0.18/mtok vs H100 — $0.25/mtok. 28% cheaper.
Akamai Cloud test: 1.63x higher inference throughput than H100 NVL 96GB at 100 concurrent requests.

The key caveat: for multi-GPU workloads requiring 8-way tensor parallelism, the H100's NVLink interconnect (900 GB/s per GPU) crushes the Pro 6000's PCIe 5.0 limitation. For single-card or small multi-GPU inference, the Pro 6000 wins on price-performance.

What Models Actually Fit in 96GB

Model	Precision	VRAM Used	Fits?
Llama 3.3 70B	FP16 (full quality)	~140GB	❌ Needs H200
Llama 3.3 70B	FP8	~70GB	✅ 26GB headroom for KV cache
Llama 3.3 70B	Q4_K_M	~40GB	✅ Comfortably fits
DeepSeek R1 70B	Q4	~40GB	✅ Fully in VRAM
DeepSeek R1 70B	Q8	~75GB	✅ Fits with room
Qwen3-235B-A22B (MoE)	FP8	~25GB active	✅ All 235B params stored, 22B active per pass
30B AWQ model	AWQ	~24GB	✅ 72GB headroom for KV cache at high concurrency

The 70B FP8 use case is the killer application. It's the first consumer-accessible card where you can run a full-quality 70B model on a single GPU without quantization compromises — something the RTX 5090's 32GB cannot do.

Why Water Cool the RTX Pro 6000?

The Workstation Edition has a 600W TDP — the same as some of the more power-hungry RTX 5090 configurations at boost. Unlike the RTX 5090, NVIDIA chose thermal paste instead of liquid metal for the Pro 6000's thermal interface material. GamersNexus noted this as a practical plus for builders: it makes waterblock installation straightforward with no liquid metal contamination risk.

The stock dual-fan cooler handles the thermals adequately, but at 600W sustained during continuous LLM inference, it's not quiet. For a home office AI workstation running 24/7:

Stock air cooling: fans at 2,000+ RPM under sustained AI inference, clearly audible
Water cooling: GPU at ~52°C, fans at 700–900 RPM, near-silent

At 600W, you need a minimum of a 420mm radiator. A 480mm or dual-360mm setup is more comfortable for 24/7 operation.

Available Waterblocks for RTX Pro 6000 Blackwell

FormulaMod carries two Bykski options for the RTX Pro 6000 Blackwell Workstation Edition:

Bykski N-RTXPRO6000-SR — $200

All-metal SR construction: stainless steel top, nickel-plated copper coldplate, no plastic in the coolant path. Full coverage of GPU die, all 32 GDDR7 memory modules (3GB each = 96GB total), and VRM. G1/4" ports. Designed for 24/7 server and workstation operation. This is the correct choice for a home AI server that runs continuously.

Bykski N-RTXPRO6000-WS-SR — $216

Workstation-specific all-metal block with slightly different port orientation optimized for workstation chassis layouts. Same full-metal SR construction. Choose this version if your case layout makes the standard port orientation awkward for tube routing.

Bykski B-FRD-RTXPRO6000-WS AIO Kit — $334

All-in-one kit: waterblock + 360mm radiator + pump + fittings + tubing, pre-configured. If you're new to water cooling and want a complete solution in one order, this gets you running without separate component selection. Good starting point before potentially upgrading to a larger radiator later.

Complete Water Cooling Build for RTX Pro 6000

Recommended Loop (Single GPU, 600W TDP)

Component	Product	Price
GPU Waterblock	Bykski N-RTXPRO6000-SR	$200
480mm Copper Radiator	Barrow 480mm 30mm thick	$65
D5 Pump + Reservoir	Barrow D5 combo	$95
G1/4" Compression Fittings ×8	Barrow compression	$48
10/13mm Soft Tube 1m	Bykski clear	$8
Total		~$416

Result at 600W sustained AI load: GPU at ~52°C, GDDR7 at ~65°C, radiator fans at 800 RPM, near-silent 24/7 operation. A 480mm instead of 360mm gives more thermal headroom — at 600W you want that margin.

If Adding CPU to the Same Loop

Use dual radiators instead: 480mm for the GPU, 360mm for the CPU. Run them as separate sections of the same loop — GPU block → 480mm rad → CPU block → 360mm rad → pump → reservoir. Total radiator surface: 840mm, which comfortably handles 600W GPU + 125–250W CPU simultaneously.

Is the RTX Pro 6000 Worth It vs. Two RTX 5090s?

Honest comparison:

	Single RTX Pro 6000	Dual RTX 5090
VRAM	96GB (single card, ECC)	64GB (combined, no ECC)
GPU cost	~$10,000	~$4,200 ($2,100 × 2)
Water cooling cost	~$416	~$641
Total	~$10,416	~$4,841
70B FP8 single-GPU	✅ Yes	❌ No (needs both cards)
70B Q8 full quality	✅ Yes	✅ Yes (split across cards)
Noise (water cooled)	One block, simpler	Two blocks, slightly more complex
Models up to 32GB	Slightly faster	Same per-card performance

Bottom line: The dual RTX 5090 is the better value if you're primarily running models under 64GB and don't need ECC memory. The RTX Pro 6000 makes sense if you specifically need 70B+ models at FP8/FP16 quality on a single card — cleaner setup, no inter-GPU communication overhead, and H100-beating inference throughput in a workstation chassis.

Who Is the RTX Pro 6000 Actually For?

AI researchers running fine-tuning experiments that require 70B model weights in VRAM simultaneously with activations and optimizer states
Small AI startups building private inference endpoints — the $0.18/mtok cost per token is competitive with cloud H100 rental ($0.25/mtok) even accounting for hardware amortization
CAD/simulation + AI hybrid workstations — 96GB handles large scene rendering plus running local LLMs simultaneously, ISV-certified drivers
Teams who've outgrown the RTX 5090's VRAM and need the next step before jumping to H100/H200

Shop RTX Pro 6000 Water Cooling at FormulaMod

FormulaMod carries the full Bykski waterblock lineup for the RTX Pro 6000 Blackwell Workstation Edition. All SR-series blocks are all-metal construction rated for 24/7 continuous operation.

Browse AI server GPU waterblocks →

h200-and-b200">AI Server GPU Water Cooling: Why Liquid Cooling Matters for H100, H200, and B200
Home AI Lab Water Cooling Guide: From RTX 5090 to Multi-GPU Clusters in 2026
RTX 5090 Water Cooling Guide: Compatible Waterblocks, Loop Setup & Thermal Performance

Back to blog

Item added to your cart

NVIDIA RTX Pro 6000 Blackwell (96GB) Water Cooling Guide: The H100 Killer for Home AI Labs

What the RTX Pro 6000 Blackwell Actually Is

Three Versions — Which One Are You Buying?

The Benchmark Numbers That Matter

vs. RTX 5090

vs. H100 SXM (the $30,000 datacenter GPU)

What Models Actually Fit in 96GB

Why Water Cool the RTX Pro 6000?

Available Waterblocks for RTX Pro 6000 Blackwell

Bykski N-RTXPRO6000-SR — $200

Bykski N-RTXPRO6000-WS-SR — $216

Bykski B-FRD-RTXPRO6000-WS AIO Kit — $334

Complete Water Cooling Build for RTX Pro 6000

Recommended Loop (Single GPU, 600W TDP)

If Adding CPU to the Same Loop

Is the RTX Pro 6000 Worth It vs. Two RTX 5090s?

Who Is the RTX Pro 6000 Actually For?

Shop RTX Pro 6000 Water Cooling at FormulaMod

Related Articles

Leave a comment

Country/region

What the RTX Pro 6000 Blackwell Actually Is

Three Versions — Which One Are You Buying?

The Benchmark Numbers That Matter

vs. RTX 5090

vs. H100 SXM (the $30,000 datacenter GPU)

What Models Actually Fit in 96GB

Why Water Cool the RTX Pro 6000?

Available Waterblocks for RTX Pro 6000 Blackwell

Bykski N-RTXPRO6000-SR — $200

Bykski N-RTXPRO6000-WS-SR — $216

Bykski B-FRD-RTXPRO6000-WS AIO Kit — $334

Complete Water Cooling Build for RTX Pro 6000

Recommended Loop (Single GPU, 600W TDP)

If Adding CPU to the Same Loop

Is the RTX Pro 6000 Worth It vs. Two RTX 5090s?

Who Is the RTX Pro 6000 Actually For?

Shop RTX Pro 6000 Water Cooling at FormulaMod

Related Articles

Related Articles

Leave a comment

SUBSCRIBE TO OUR EMAILS