What GPU waterblocks does FormulaMod carry?

FormulaMod stocks full-cover GPU waterblocks for over 500 graphics card models, covering RTX 5090, 4090, 3090, RX 9070 XT, and H100/H200 GPUs. We stock over 1,600 water cooling products with same-week worldwide shipping from Guangzhou.

How do I choose a waterblock for my GPU?

Match the waterblock to your exact GPU model and PCB variant. Reference-design cards (Founders Edition, reference Sapphire/PowerColor) use universal reference blocks. Non-reference cards from ASUS ROG Strix, MSI Gaming X Trio, Gigabyte AORUS, and EVGA FTW3 require model-specific full-cover blocks that match their unique PCB layouts. Always verify your GPU's exact model number before ordering.

Will a GPU waterblock reduce noise during AI workloads like Ollama or Stable Diffusion?

Yes. Stock GPU air coolers run at 45-65 dBA under sustained AI inference loads. A full-cover waterblock mounted to a 360mm radiator with a quiet fan curve typically reduces noise to under 30 dBA. VRAM temperatures also drop from 90°C+ to 60-70°C, which prevents throttling during long generation runs.

What components do I need for a complete custom water cooling loop?

A complete custom loop requires: (1) a full-cover GPU waterblock, (2) a radiator — 360mm recommended for single-GPU systems, 480mm for dual GPU or CPU+GPU loops, (3) a D5 or DDC pump with reservoir combo, (4) G1/4 threaded fittings — compression fittings for soft tubing or hard fittings for PETG/acrylic hard tubing, (5) tubing, and (6) premixed coolant. FormulaMod sells individual components and complete kits starting at $249 USD.

Does FormulaMod ship internationally?

Yes. FormulaMod ships worldwide from Guangzhou, China. Standard shipping to the US, EU, UK, Canada, and Australia typically takes 7-14 business days. Express DHL/FedEx options are available at checkout for 3-5 business day delivery. All orders include a tracking number.

How long has FormulaMod been operating?

FormulaMod has operated as an independent water-cooling specialist since 2013. Continuous online presence is publicly verifiable via the Internet Archive Wayback Machine. FormulaMod is a U.S. registered trademark (USPTO Reg. No. 6073949) and ships worldwide from our Guangzhou warehouse with full manufacturer warranty.

RTX 4090 Water Cooling Guide: Silence Your 450W AI Workhorse

By Liang Huang, FormulaMod Technical Team · Published Apr 24, 2026

24. April 2026

Why the RTX 4090 Specifically Needs Water Cooling for AI

The RTX 4090 is the default workhorse for local AI in 2025-2026. With 24GB of GDDR6X VRAM, it runs most open-source models up to 32B parameters comfortably and handles 70B models in aggressive 4-bit quantization. Its Ada Lovelace architecture delivers strong inference performance per dollar — roughly 40-45 tokens per second on Llama 3.1 70B (Q4_K_M) out of the box. Used prices have dropped significantly since the 5090 arrived, making it accessible to serious hobbyists and small teams alike.

But the 4090 has a thermal problem that matters specifically for AI use.

NVIDIA designed the 4090's stock cooler for gaming: short bursts of high load (10-30 minutes at peak), followed by lower-intensity periods. The cooler handles this fine. Under sustained AI inference — Ollama serving a 32B model, ComfyUI rendering batches, vLLM serving API requests — the card runs at 350-450W continuously. The stock cooler reaches its thermal limits within 15-20 minutes and stays there.

What that looks like in practice:

Metric	Gaming (intermittent)	AI Inference (sustained)
Typical power draw	300-380W (varies per frame)	350-450W (constant)
GPU core temperature	70-78C (dips between scenes)	80-87C (constant ceiling)
VRAM junction temperature	80-90C	90-100C
Fan speed	50-70% (varies)	80-100% (constant)
Noise level	35-50 dBA (varies)	50-65 dBA (constant)
Boost clock sustained	2500-2610 MHz	2400-2520 MHz (throttling)

The 65 dBA number is hard to appreciate in a spec sheet. It is louder than a normal conversation. It is the noise level of a vacuum cleaner at a distance. In a home office, bedroom, or shared workspace, it makes the card unusable as a daily-driver AI machine without headphones.

Compatible Waterblocks: Bykski and Barrow Options

The RTX 4090 has been on the market long enough that both Bykski and Barrow have comprehensive coverage for virtually every AIB (add-in board) variant. Here are the current options available through FormulaMod.

Bykski RTX 4090 Waterblocks

Model Compatibility	Product	Notes
ASUS ROG Strix / TUF	Bykski N-AS4090STRIX-X-V2	Full cover, active backplate option
Gigabyte AORUS Master / Gaming OC	Bykski N-GV4090AORUS-X	Full cover with backplate
ZOTAC Trinity / AMP Extreme	Bykski N-ST4090TQ-X-V4	All-metal radiator design
Colorful iGame Vulcan / Neptune	Bykski N-IG4090VXOC-X	Full cover with backplate
Colorful Battle Axe	Bykski N-IG4090ZF-X	Full cover with backplate
Inno3D / Galax / Gainward / Reference	Bykski N-RTX4090H-X	Fits reference PCB designs
Gigabyte Windforce / AORUS Xtreme	Bykski N-GV4090WF-X	Full cover with backplate
Galax Boomstar	Bykski N-GY4090XY-X	Full cover with backplate

Barrow RTX 4090 Waterblocks

Model Compatibility	Product	Notes
ASUS TUF / ROG Strix	Barrow BS-AST4090-PA	Full cover, Aurora series
Gigabyte Gaming OC / AORUS Master	Barrow BS-GIG4090-PA	Full cover, Aurora series
MSI Suprim X / Gaming X Trio	Barrow BS-MSG4090M-PA	Full cover, Aurora series
NVIDIA Founders Edition	Barrow BS-NVG4090-PA	Full cover, Founders specific
ASUS TUF A02 (OG variant)	Barrow BS-AST4090OG-PA	For revised TUF PCB
Galax MetalTop OC	Barrow BS-GAM4090-PA	Full cover with backplane

Granzon (Bykski Premium)

Model Compatibility	Product	Notes
NVIDIA Founders Edition	Granzon GBN-RTX4090FE	Full armor design — wraps entire card
ASUS ROG Strix / TUF	Granzon GBN-AS4090STRIX	Full armor, premium build quality

How to find your card's block: Check the exact model name on your card's box or in GPU-Z. The PCB layout varies between manufacturers and even between revisions (like the ASUS TUF original vs. A02 revision). Using the wrong block will not physically fit or will leave VRAM modules without thermal pad contact.

Planning Your Loop for 450W TDP

A 450W GPU needs more cooling capacity than you might expect. Here is how to size the loop correctly.

Radiator Sizing

The rule of thumb: 120mm of radiator per 100W of heat load, plus 120mm overhead. For a 450W RTX 4090 as the only water-cooled component:

Minimum: 360mm + 240mm = 600mm total (sufficient but fans will run at medium speed)
Recommended: 480mm + 240mm or 360mm + 360mm = 720mm total (allows low fan speeds for near-silent operation)
Ideal (if case fits): 480mm + 360mm = 840mm total (fans can run at minimum RPM, nearly inaudible)

Recommended radiators:

Bykski 480mm 40mm thick copper radiator — high capacity in a standard thickness
Barrow Dabel-30a 360mm — slim 30mm profile fits in more cases
Barrow Dabel-30a 240mm — secondary radiator for supplemental cooling

For detailed radiator sizing math, see our radiator sizing guide.

Pump Selection

A single RTX 4090 loop is not particularly restrictive — one D5 or DDC pump handles it easily.

D5: Higher flow rate, lower noise, larger physical size. Best for cases with space. Barrow D5 pump-reservoir combo is a clean single-unit solution.
DDC: Smaller, higher head pressure, fits tight spaces. Bykski DDC pump works well in ITX or compact builds.

For a detailed pump comparison, read our D5 vs DDC pump guide.

Fittings and Tubing

For a single GPU loop, you need a minimum of 6-8 compression fittings (two per component connection: GPU block, radiator(s), pump-reservoir). A Barrow fitting kit includes compression fittings, 90-degree adapters, and a drain plug — everything needed for a basic loop.

Soft tubing (10x13mm or 10x16mm) is recommended for first-time builders. Barrow PU transparent soft tube is durable and resistant to plasticizer leaching.

Thermal Results: Stock Air vs. Water Cooled

Based on community data and testing across multiple 4090 variants, here are the typical thermal improvements after installing a full-cover waterblock:

Metric	Stock Air Cooler (Sustained AI Load)	Water Cooled (360+240mm Rad)	Improvement
GPU Core Temperature	82-87C	45-55C	-30 to -37C
VRAM Junction Temperature	92-100C	58-72C	-25 to -34C
VRM Temperature	75-85C	50-60C	-20 to -30C
Sustained Boost Clock	2400-2520 MHz	2580-2670 MHz	+60-170 MHz
Fan/Pump Noise	50-65 dBA	25-32 dBA	-25 to -33 dBA
Power Consumption	Same (or higher if throttling forces retries)	Same (or lower with undervolting)	0-15% savings with undervolt

The VRAM improvement is the most significant number for AI workloads. GDDR6X starts throttling at 92C junction temperature. On air cooling, sustained AI loads push VRAM to 95-100C routinely. Water cooling drops it to the 60-72C range — nowhere near the throttle point. This eliminates the random inference slowdowns and out-of-memory errors that plague air-cooled 4090s running large models.

Noise Comparison: The Real Reason Most People Switch

Technical forums focus on temperatures. In practice, most people who water-cool their AI rigs do it for the noise.

Configuration	Noise Level (Sustained AI Load)	Comparable To
Stock 4090 air cooler (ASUS TUF)	52-58 dBA	Loud conversation, dishwasher
Stock 4090 air cooler (FE)	48-55 dBA	Normal conversation
Stock 4090 air cooler (Gigabyte Windforce)	55-65 dBA	Vacuum cleaner at distance
Water cooled, fans at 800 RPM	25-28 dBA	Whisper, quiet bedroom
Water cooled, fans at 1200 RPM	30-35 dBA	Quiet library

The difference between 55 dBA and 28 dBA is not "a bit quieter." The decibel scale is logarithmic — 55 dBA is roughly 8 times louder than 28 dBA in perceived volume. It is the difference between needing to raise your voice to talk over your PC and not being able to tell the PC is running from 2 meters away.

Step-by-Step Build Overview

A complete installation guide with photos is beyond the scope of this article, but here is the sequence and key considerations for a 4090 water cooling build.

1. Preparation

Identify your exact 4090 variant (manufacturer + model name) and verify the waterblock compatibility
Plan your loop path: GPU block → radiator(s) → pump/reservoir → back to GPU block
Measure tubing runs and add 20% extra length
Ensure your case fits the planned radiators (check thickness + fan clearance)

2. GPU Block Installation

Remove the stock air cooler (typically 4-8 screws on the backplate, then the cooler separates from the PCB)
Clean old thermal paste from the GPU die with isopropyl alcohol
Apply thermal pads to VRAM and VRM locations as specified in the block's installation guide (pad thickness matters — use the exact sizes specified)
Apply thermal paste to the GPU die
Align the waterblock and tighten screws in a cross pattern to ensure even pressure

3. Loop Assembly

Mount radiator(s) and fans
Mount the pump-reservoir unit
Connect all components with tubing and fittings
Install a drain valve at the lowest point
Double-check all fitting connections — hand tight plus 1/4 turn with the compression ring

4. Fill and Test

Fill with coolant through the reservoir fill port
Run the pump with PSU jumper (do not boot the full system yet)
Let it run for 24 hours, checking for leaks every few hours
Bleed air bubbles by tilting the case and running the pump intermittently
Once leak-free, boot the system and run a stress test to verify temperatures

5. Optimize

Set a fan curve that balances noise and temperature (start at 30% and increase until coolant temperature stabilizes under 50C)
Consider undervolting — water cooling gives you the thermal headroom to run at lower voltage with zero performance loss. See our undervolting + water cooling guide
Install a flow/temperature monitor to track loop health over time

Which Block Should You Buy?

If your 4090 variant has both a Bykski and a Barrow block available, here is how to choose:

Bykski: Broader model coverage, slightly lower price on average, good build quality. The standard choice for most builders.
Barrow: Aurora series has integrated ARGB lighting, slightly more refined aesthetics. Same thermal performance. Choose if you want RGB in the block.
Granzon: Full-armor designs that wrap the entire card for maximum thermal coverage. Premium price but the best thermal performance and build quality in the Bykski family.

All three brands use nickel-plated copper cold plates and copper microchannel fins. The thermal difference between them is 1-3C — negligible in practice.

RTX 4090 Water Cooling FAQ

Will water cooling improve my Ollama tokens per second?

Yes, but the improvement is from eliminating thermal throttling, not from inherent speed gains. If your air-cooled 4090 is throttling (GPU core above 83C sustained), water cooling will recover the lost 5-10% performance. If your air cooler keeps the card under 80C (rare under sustained AI load, but possible in cold environments with excellent case airflow), the performance gain from water cooling will be minimal. The primary benefit in that scenario is noise reduction. For a detailed breakdown of how thermals affect Ollama, see our Ollama hardware guide.

Do I need to water cool the CPU too?

Not necessarily. For AI workloads, the GPU does 95%+ of the computation. The CPU feeds data to the GPU and handles system overhead — it rarely runs at high load during inference. A mid-range air cooler for the CPU is usually sufficient. If you do want to add the CPU to the loop, plan for the additional heat load (65-125W depending on processor) when sizing your radiators. See our radiator sizing guide for combined GPU+CPU calculations.

How long does installation take?

First-time builders should allocate 4-6 hours for the complete process: disassembling the stock air cooler, installing the waterblock, assembling the loop, filling with coolant, and bleeding air. Experienced builders do it in 2-3 hours. The leak test period (24 hours of pump-only operation before booting the system) adds calendar time but not active work time.

Can I reuse the waterblock if I upgrade to a 5090?

No. Waterblocks are designed for specific GPU PCB layouts. A 4090 ASUS TUF block fits only the ASUS TUF RTX 4090 (and sometimes the 4090 TI variant if the PCB is identical). When you upgrade GPUs, you will need a new waterblock. The rest of the loop (pump, radiators, fittings, tubing) carries over to the new build without changes. This is one of the advantages of custom loops over AIOs — modular upgradeability.

What about the 4090 Ti?

If NVIDIA releases an RTX 4090 Ti (or Super variant), it will likely use a different PCB layout and require a new waterblock. Bykski and Barrow typically release blocks for new GPU variants within 2-4 weeks of launch. Check FormulaMod for availability if you are building around a recently released card.

Browse all RTX 4090 waterblocks in our AI Workstation Cooling collection. If you are considering the 5090 instead, read our RTX 5090 water cooling guide. And for the budget-conscious, our used RTX 3090 revival guide covers the best value path into water-cooled AI.

Zurück zum Blog

Artikel wurde in den Warenkorb gelegt

RTX 4090 Water Cooling Guide: Silence Your 450W AI Workhorse

Why the RTX 4090 Specifically Needs Water Cooling for AI

Compatible Waterblocks: Bykski and Barrow Options

Bykski RTX 4090 Waterblocks

Barrow RTX 4090 Waterblocks

Granzon (Bykski Premium)

Planning Your Loop for 450W TDP

Radiator Sizing

Pump Selection

Fittings and Tubing

Thermal Results: Stock Air vs. Water Cooled

Noise Comparison: The Real Reason Most People Switch

Step-by-Step Build Overview

1. Preparation

2. GPU Block Installation

3. Loop Assembly

4. Fill and Test

5. Optimize

Which Block Should You Buy?

RTX 4090 Water Cooling FAQ

Will water cooling improve my Ollama tokens per second?

Do I need to water cool the CPU too?

How long does installation take?

Can I reuse the waterblock if I upgrade to a 5090?

What about the 4090 Ti?

Hinterlasse einen Kommentar

Land/Region

Sprache

Why the RTX 4090 Specifically Needs Water Cooling for AI

Compatible Waterblocks: Bykski and Barrow Options

Bykski RTX 4090 Waterblocks

Barrow RTX 4090 Waterblocks

Granzon (Bykski Premium)

Planning Your Loop for 450W TDP

Radiator Sizing

Pump Selection

Fittings and Tubing

Thermal Results: Stock Air vs. Water Cooled

Noise Comparison: The Real Reason Most People Switch

Step-by-Step Build Overview

1. Preparation

2. GPU Block Installation

3. Loop Assembly

4. Fill and Test

5. Optimize

Which Block Should You Buy?

RTX 4090 Water Cooling FAQ

Will water cooling improve my Ollama tokens per second?

Do I need to water cool the CPU too?

How long does installation take?

Can I reuse the waterblock if I upgrade to a 5090?

What about the 4090 Ti?

Related Articles

Hinterlasse einen Kommentar

SUBSCRIBE TO OUR EMAILS