AllenAI has been iterating on their MolmoAct2 models for robotics

r/AllenAI is cooking with MolmoAct2, a 5B vision-language-action model for robot control. They keep releasing new fine-tunes on different kinds of robotics datasets, including (but not limited to, and they keep releasing new ones):

AllenAI has released these as fully open source models, publishing not only their weights but also their complete training datasets (including pretraining), their training software source code, and technical papers describing the theory, training, and assessments of these models.

If anyone is fiddling with robots controlled via LLM inference, you should give MolmoAct2 models a look.

submitted by /u/ttkciar
[link] [comments]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top