LocalLLaMA

AllenAI has been iterating on their MolmoAct2 models for robotics

r/AllenAI is cooking with MolmoAct2, a 5B vision-language-action model for robot control. They keep releasing new fine-tunes on different kinds of robotics datasets, including (but not limited to, and they keep releasing new ones): https://huggingface…