397B running in 14GB of RAM via PAGED MoE on a 64GB Mac Studio — here’s the engine
https://reddit.com/link/1t6b9bi/video/fhnou7160qzg1/player hellooo r/LocalLLaMA Qwen3.5-397B-A17B is 209GB on disk. The MoE has 512 experts, top-10 routing per token. The naive load won't open on a M1 64GB Mac. What I (claude) did: keep only K=20 …