coding agents are everywhere right now but i'm more interested in models that actually take actions autonomously.
we built a small vlm for desktop gui automation. i mostly use it for moving data between apps that don't have apis, saves me a lot of copy pasting. still kinda janky on complex UIs though.
would be cool to see more people sharing non-coding use cases for local models
[link] [comments]