What GPU waterblocks does FormulaMod carry?

FormulaMod stocks full-cover GPU waterblocks for over 500 graphics card models, covering RTX 5090, 4090, 3090, RX 9070 XT, and H100/H200 GPUs. We stock over 1,600 water cooling products with same-week worldwide shipping from Guangzhou.

How do I choose a waterblock for my GPU?

Match the waterblock to your exact GPU model and PCB variant. Reference-design cards (Founders Edition, reference Sapphire/PowerColor) use universal reference blocks. Non-reference cards from ASUS ROG Strix, MSI Gaming X Trio, Gigabyte AORUS, and EVGA FTW3 require model-specific full-cover blocks that match their unique PCB layouts. Always verify your GPU's exact model number before ordering.

Will a GPU waterblock reduce noise during AI workloads like Ollama or Stable Diffusion?

Yes. Stock GPU air coolers run at 45-65 dBA under sustained AI inference loads. A full-cover waterblock mounted to a 360mm radiator with a quiet fan curve typically reduces noise to under 30 dBA. VRAM temperatures also drop from 90°C+ to 60-70°C, which prevents throttling during long generation runs.

What components do I need for a complete custom water cooling loop?

A complete custom loop requires: (1) a full-cover GPU waterblock, (2) a radiator — 360mm recommended for single-GPU systems, 480mm for dual GPU or CPU+GPU loops, (3) a D5 or DDC pump with reservoir combo, (4) G1/4 threaded fittings — compression fittings for soft tubing or hard fittings for PETG/acrylic hard tubing, (5) tubing, and (6) premixed coolant. FormulaMod sells individual components and complete kits starting at $249 USD.

Does FormulaMod ship internationally?

Yes. FormulaMod ships worldwide from Guangzhou, China. Standard shipping to the US, EU, UK, Canada, and Australia typically takes 7-14 business days. Express DHL/FedEx options are available at checkout for 3-5 business day delivery. All orders include a tracking number.

How long has FormulaMod been operating?

FormulaMod has operated as an independent water-cooling specialist since 2013. Continuous online presence is publicly verifiable via the Internet Archive Wayback Machine. FormulaMod is a U.S. registered trademark (USPTO Reg. No. 6073949) and ships worldwide from our Guangzhou warehouse with full manufacturer warranty.

GPU waterblock water cooling setup for home AI lab

Home AI Lab Water Cooling Guide: From RTX 5090 to Multi-GPU Clusters in 2026

By HuangLiang · Published Apr 4, 2026 · Updated Apr 22, 2026

April 4, 2026

Running local AI in 2026 is no longer experimental — it's practical. With DeepSeek R1, Qwen3, and Llama 3.3 available in quantized formats, a single RTX 5090 at $2,000 runs inference comparable to a $30,000 H100 for many workloads. The hardware question has been answered. What hasn't been solved for most home builders is heat and noise.

This guide covers water cooling specifically for home AI server builds — with real token-per-second data, real build costs, and honest tradeoffs.

Why Air Cooling Fails for AI Workloads

GPU manufacturers design air coolers for gaming: bursty loads with natural cooling periods. An LLM inference loop is different. When you run Qwen3-coder-30B continuously for coding assistance, or leave DeepSeek R1 processing batch requests overnight, your GPU is at 80–100% load indefinitely.

Under these conditions on an RTX 5090:

Stock air cooling: 80–85°C GPU core, 88–92°C GDDR7 memory, fans at 2,200+ RPM
Noise level: 45–55 dBA — audible from another room
After 2–4 hours: power throttling begins as memory thermal protection kicks in

Water cooling eliminates all three issues simultaneously.

The Four Home AI Build Tiers — Cooling Requirements for Each

Tier 1: Single RTX 5090 ($2,000–2,500 GPU)

What it runs well: DeepSeek R1 32B at Q6 (~35 tok/s), Qwen3-coder-30B at Q5 (~40 tok/s), Llama 3.3 70B at Q3 with RAM offload (~18 tok/s)

Cooling requirement: 360mm minimum. The RTX 5090 at 575W puts out roughly the same heat as three 120mm radiator segments worth of dissipation capacity at comfortable fan speeds. A single 360mm copper radiator keeps GPU under 55°C with fans at 800 RPM — genuinely quiet.

Ideal waterblock: Bykski for your specific card model. Full-coverage with backplate. The backplate matters — RTX 5090 backside component temps hit 80°C+ without it on some models (ASUS ROG Astral data from FTC Water Cooling).

Tier 2: RTX 5090 + RTX 4090 — 56GB VRAM

What it runs: A community member on Overclock.net is running this exact combination: "5090 + 3090 here... GPT-oss 120B and GLM 4.5 Air (Q4_K_S) are newer and better and both fit cleanly inside 56GB VRAM." The 5090 handles compute, the 4090/3090 provides VRAM extension.

Cooling requirement: Both GPUs need blocks if you're running them at sustained load. Total TDP: 575W (5090) + 450W (4090) = 1,025W. Minimum: dual 360mm radiators. Recommended: 420mm + 360mm.

Pro tip: Run the GPUs in series in the loop, not parallel. Series is simpler to plumb and the 5–8°C temperature rise across GPUs is completely acceptable for AI inference.

Tier 3: Dual RTX 5090 — 64GB VRAM

What it unlocks: Real-world benchmark from Hostkey (April 2025): dual 5090 running Llama 3.3 70B at 32K context = 26 tokens/second. DeepSeek R1 70B at 110 tokens/second. Models that don't fit in 48GB suddenly become viable at full quality.

The build that works: The Overclock.net thread on this build recommended Lian Li O11D Evo XL specifically — it's one of the few cases that fits dual 420mm radiators with enough clearance for two full-size GPU blocks. CPU on separate AIO or air cooling; don't mix the 1,100W GPU loop with CPU.

PSU requirement: 1,600W minimum. The Hostkey dual 5090 test system was pulling 1,500W from the wall under load.

Waterblocks needed: Two GPU blocks, both matched to their specific card models. If using two different 5090 card models (e.g., Gigabyte Master + FE), each needs its own dedicated block.

Tier 4: a16z-Style 8× GPU Server

Andreessen Horowitz published their own GPU server build guide — eight RTX 4090/5090s in a single rack-mount chassis, running at full PCIe 5.0 x16 lanes. This is beyond home use, but the a16z engineers made an important point: "For less of the cost of a single H100, you could stack multiple RTX 4090s or 5090s and still achieve serious throughput."

At 8× RTX 5090, you're looking at 4,600W of GPU heat alone. This requires industrial-scale liquid cooling (CDUs, dedicated cooling loops) rather than standard PC water cooling.

Verdict on Tier 4: At this scale, you're building infrastructure. FormulaMod carries Bykski's server-grade multi-block kits and high-flow pumps for this use case — but it's a different project from a home workstation.

Token Speed vs. Cooling: The Real Performance Impact

This is the number most people don't talk about. GDDR7 memory bandwidth throttles when temperatures exceed ~85°C. On an air-cooled RTX 5090 under sustained AI load:

First 10 minutes: full bandwidth, ~50 tok/s on DeepSeek R1 32B
After 30 minutes continuous: memory at 88–90°C, bandwidth throttling begins, drops to ~43–46 tok/s
Water-cooled equivalent: memory stays at 60–65°C, full bandwidth maintained indefinitely

The ~10–15% performance difference isn't in benchmarks (which run short tests) — it shows up in real usage over hours.

Bill of Materials: Complete Water Cooling Loop Costs

Single RTX 5090 Home AI Build

Component	Model	Price
GPU Waterblock (ASUS ROG Astral RTX 5090)	Bykski N-AS5090ASTRAL-X	~$185
360mm Copper Radiator (30mm thick)	Barrow 360mm	$45
D5 Pump + Reservoir Combo	Barrow D5 combo	$95
G1/4" Compression Fittings ×8	Barrow compression 10/13mm	$48
10/13mm Soft Tube 1m	Bykski clear EPDM	$8
Thermal Paste	Included with block	$0
Total (water cooling hardware)		~$381

Result: GPU at 48°C, GDDR7 at 62°C, fans at 700 RPM (near-silent), sustained full bandwidth indefinitely.

Dual RTX 5090 Build (GPU cooling only)

Component	Model	Price
GPU Waterblock ×2 (matched to your cards)	Bykski full-coverage	~$370
420mm Copper Radiator (front)	Barrow 420mm 30mm	$58
360mm Copper Radiator (top)	Barrow 360mm 30mm	$45
High-Flow Server Pump	Barrow SPB17-TP (960L/h)	$78
Fittings ×14 + Tube	Barrow G1/4" compression	$90
Total		~$641

Result: Both GPUs at ~52°C under dual-GPU inference load. Fans at 900 RPM. 1,100W heat output handled silently.

One Mistake to Avoid

Don't mix your GPU cooling loop with CPU in a dual-RTX-5090 build. With 1,100W of GPU heat going through the loop, the coolant temperature rise is significant — adding CPU heat makes the system harder to balance and pushes CPU temps higher than necessary. Run GPUs in one loop (with the high-flow pump), CPU on a separate AIO. This is the consensus recommendation from the Overclock.net thread on this exact build.

Browse RTX 5090 waterblocks → | Browse AI server GPU blocks →

Shop Water Cooling Components

Which NVIDIA GPU for Local AI in 2026? RTX 3090 vs 4060 Ti vs 4070 Ti Super vs 4090 vs 5090
Building a 2-GPU Local AI Server: Dual RTX 3090 vs Dual 4090 for Running 70B Models
RTX 5090 Water Cooling Guide: Compatible Waterblocks, Loop Setup & Thermal Performance

Back to blog

Item added to your cart

Home AI Lab Water Cooling Guide: From RTX 5090 to Multi-GPU Clusters in 2026

Why Air Cooling Fails for AI Workloads

The Four Home AI Build Tiers — Cooling Requirements for Each

Tier 1: Single RTX 5090 ($2,000–2,500 GPU)

Tier 2: RTX 5090 + RTX 4090 — 56GB VRAM

Tier 3: Dual RTX 5090 — 64GB VRAM

Tier 4: a16z-Style 8× GPU Server

Token Speed vs. Cooling: The Real Performance Impact

Bill of Materials: Complete Water Cooling Loop Costs

Single RTX 5090 Home AI Build

Dual RTX 5090 Build (GPU cooling only)

One Mistake to Avoid

Shop Water Cooling Components

Related Articles

Leave a comment

Country/region

Language

Why Air Cooling Fails for AI Workloads

The Four Home AI Build Tiers — Cooling Requirements for Each

Tier 1: Single RTX 5090 ($2,000–2,500 GPU)

Tier 2: RTX 5090 + RTX 4090 — 56GB VRAM

Tier 3: Dual RTX 5090 — 64GB VRAM

Tier 4: a16z-Style 8× GPU Server

Token Speed vs. Cooling: The Real Performance Impact

Bill of Materials: Complete Water Cooling Loop Costs

Single RTX 5090 Home AI Build

Dual RTX 5090 Build (GPU cooling only)

One Mistake to Avoid

Shop Water Cooling Components

Related Articles

Related Articles

Leave a comment

SUBSCRIBE TO OUR EMAILS