Stop wasting electricity

Stop wasting electricity

Run on my rtx4090

llama.cpp params:

llama-server -m ~/Projects/llm/models/Qwen3.6-27B-UD-Q4_K_XL.gguf --flash-attn on -ngl all -ctk q4_0 -ctv q4_0 -t 32 -c 262144 

Power limit was set using sudo nvidia-smi -pl N

On my observation, GPU constantly hitting power limit, so its safe to say that it actual consumption. You can cut power consumption to 40% without losing performance(and also reduce noise, heat from pc, and extend lifespan of gpu).

submitted by /u/OkFly3388
[link] [comments]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top