Qwen will release another 27B with high probability
They are waiting for the exact roadmap submitted by /u/serige [link] [comments]
They are waiting for the exact roadmap submitted by /u/serige [link] [comments]
This shit is cool, I have a demo script where it compares over 1,300 phrases for similarity to a live webcam image, and it can process one image every 10 seconds or so. I've been waiting fruitlessly for someone to get the model working on thi…
Posted my project here a while back and got some solid feedback via DMs. The main ask was a converter so people don't lose their existing chats when switching – that's in now. https://preview.redd.it/mfn5i99d6c2h1.png?width=1400&forma…
I have mac at work that I want to use local model for prototyping and basic prompts that needs to stay on device. What sort of model I can run that I can fit at least 64k context ? Any setups sbare or guides welcome. I need to have firefox open with on…
Long story short, I am running Qwen3.5-35B-A3B (GGUF format) and other models on MacOS and getting around 1500 tokens/sec for prompt processing and around 35-50 tokens per second for prompt processing. I'm using the latest version of llama.cpp on M…
improved MTP performance submitted by /u/jacek2023 [link] [comments]
I have a framework desktop 128GB and a 3080 12GB running qwen 7b I want to move to a proper server rack + switch but not sure how to move from desktop PC to server rack. Any advice on what GPU/Server to get under 5k? Or at that price just stick to work…
I don’t think this thing is going to work out, if anyone wants a 4u gpu server complete with half a terabyte of ram hit me up. (/s) submitted by /u/Simple_Library_2700 [link] [comments]
Mods please be kind. This was not “low effort”. It took me several minutes to find just the right waiting room gif to capture the sentiment of all us folks patiently waiting for our brothers and sisters in the east to hopefully drop some amazing …
Hey r/LocalLLaMA, We’ve released our ByteShape Qwen 3.6 35B GGUF quantizations in two families: standard NTP (Next Token Prediction or non-MTP) and MTP. Blog / Download NTP Models / Download MTP Models TL;DR For NTP, “pick the largest quant that…