LocalLLaMA

Extracted MTP tensor GGUFs – smaller donor models for grafting.

The script to graft MTP tensors requires a full GGUF model file. I felt that was a bit hefty, so I asked local Gemma to write something to just extract what's required. The results are two faux GGUFs weighing in at just 900MB (35A3B) and 450MB (27B…