DeepSeek v4 – Subjective vibes

I must say Iam kinda torn what to think about those models. At one hand they "ace" some questions on other sometime they behave genuinely weird.

For example the big model appears to be "stubborn" like "3" era Claude used to be. It has some oppinion eg about historic figure and even if you present facts it will keep insisting on its version.

The lite model confidently lied to me, but when found out it became honest and very friendly... . Also the small model must have been trained on western models, because other chinese models (qwen, Kimi) tend to prefer chinese culture in certain question I ask them. But lite model was obsesed with "diversity" in all forms to the point of telling lies.

Then again in coding or even creative intelligence those models are really strong...

Also the large model has impresive memory, it knows things in superb detail.

The large model also in its thinking traces shows that it analyzes in length "user" state of mind and respond in strategic way.

Something is "off" with this DeepSeek, maybe undertrained.

submitted by /u/Single_Ring4886
[link] [comments]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top