"Actually wait" … the current thinking SOTA open source

I'm trying GLM 5.1, but is it just me, or does the thing really just work by cranking its thinking up to almost ridiculous heights?

For the last 20 minutes it has been writing novellas about what it is going to do, with all the "Uhm", "Actually wait", "but no...", and I really just asked it to write an owner-draw CButton with different colors.

Now don't get me wrong, in the end it seems to get there, but I'm just having my own "Actually wait" thinking moment:

Is this the way they made it so smart?

Meanwhile, other models like Claude (the $20 plan is now just a total test-mode ripoff: the tokens get spent in 15 minutes, then you wait for hours) or ChatGPT (lately I prefer Codex over CC, honestly it feels just as smart) simply give you the answer almost right away for such simple things.

Edit: 30 minutes and > 100k tokens in, and now it starts writing CThemedButtonCtrl

Edit 2: the code had errors (not horrible, basic mistakes, like accessing protected members directly, but still, errors)

Edit 3: It also means that while you can get "x" times more tokens for the price they offer, you are easily going to use "x" times more tokens this way. Right now I'm at 150k for simple stuff with GLM 5.1. Now, I'm not trying to upsell CC or Codex, I don't care, but we need some perspective. 150k tokens in 30 min vs 15k-20k tokens in 2 min is a real difference and might not be "price smart". Of course, ultimately we "can" run GLM 5.1 at home (well, I can't) but we can't run GPT or Claude... so yeah, but...
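
To put rough numbers on Edit 3, here is a back-of-the-envelope sketch in Python. It only uses the figures from this post (150k tokens and 30 min vs 15k-20k tokens and 2 min); the function name is made up for illustration, and it assumes cost scales linearly with tokens, which ignores input vs output pricing and caching.

```python
def break_even_price_ratio(verbose_tokens: int, concise_tokens: int) -> float:
    """How many times cheaper per token the verbose model must be
    for the same task to cost the same (hypothetical helper)."""
    return verbose_tokens / concise_tokens


# Figures taken from the post; everything else is an assumption.
glm_tokens = 150_000
other_tokens_low, other_tokens_high = 15_000, 20_000

ratio_low = break_even_price_ratio(glm_tokens, other_tokens_high)
ratio_high = break_even_price_ratio(glm_tokens, other_tokens_low)
time_ratio = 30 / 2  # minutes spent on the same task

print(f"Token ratio: {ratio_low}x to {ratio_high}x")
print(f"Wall-clock ratio: {time_ratio}x")
```

So on these numbers the per-token price would have to be somewhere around 7.5x to 10x lower just to break even on this one task, and that is before counting the extra waiting time.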

Edit 4: the code is ok-ish, but requires more of my input to fix stuff. Thinking of teeth and gift horses right now...

Edit 5: LOL: "Actually, I just realized I'm overcomplicating this..."

submitted by /u/FPham