Where are small models like Qwen3 0.6B and Qwen3.5 0.8B used? Hugging Face shows 2.88 million downloads this month. [D]

I can see 2.88 million downloads per month for the small Qwen3.5 model. I tried using the earlier 0.6B model in a deep research workflow, and it was very difficult to get anything done with it.

  • First, they have a very surface-level understanding of concepts. Poor semantic understanding means they can get confused about the topic or the task.
  • JSON outputs are often broken. Adding a layer of checks on top took up much of my time while working with these models.
  • Slow response. This one depends on a lot of factors and can actually be improved, but a slow response is still a buzzkill most of the time.
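
For anyone hitting the same JSON problem, here is a minimal sketch of the kind of check layer described above. It is not the exact code from my workflow: `generate` is a hypothetical callable that wraps whatever model call you use, and the `required` key/type mapping is an assumed, simplified stand-in for a real schema.

```python
import json
import re


def extract_json(raw):
    """Return the first JSON object found in raw model output, or None."""
    # Drop Markdown code-fence markers that models often wrap around JSON.
    text = re.sub(r"```(?:json)?", "", raw).strip()
    decoder = json.JSONDecoder()
    # Try decoding from every '{' so any leading chatter is skipped.
    for match in re.finditer(r"\{", text):
        try:
            obj, _ = decoder.raw_decode(text, match.start())
        except json.JSONDecodeError:
            continue
        if isinstance(obj, dict):
            return obj
    return None


def validate(obj, required):
    """Check that obj is a dict whose required keys have the expected types."""
    if not isinstance(obj, dict):
        return False
    return all(isinstance(obj.get(key), typ) for key, typ in required.items())


def generate_checked(generate, prompt, required, max_retries=3):
    """Call the (hypothetical) generate(prompt) until its output validates.

    Returns the parsed dict, or None if every attempt fails.
    """
    for _ in range(max_retries):
        obj = extract_json(generate(prompt))
        if validate(obj, required):
            return obj
    return None
```

Retrying costs extra latency, so with an already slow model it may be worth keeping `max_retries` low, or using constrained decoding if your inference stack supports it.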

I am very curious how the community is using these models.

submitted by /u/adssidhu86