| Every tool (LM Studio, Ollama, llama.cpp) downloads models to its own directory. Same 8GB model × 3 tools = 24GB wasted. lmm uses HF Cache as a single store and symlinks models to each tool. Download once, use everywhere. https://reddit.com/link/1t934vi/video/zpx3dakzca0h1/player
GitHub: https://github.com/holotherapper/lmm Built in Rust, Apple Silicon only. Feedback welcome. [link] [comments] |