This test has always been so stupid since models work at the token level. Claude...

dannyw · 2025-02-24T20:32:26 1740429146

The problem also comes to LLMs being confidently wrong when it’s wrong.

elicksaur · 2025-02-25T02:28:18 1740450498

“Already 5xs”

Even AI marketing doesn’t claim this. Totally baseless claim given how many people report negative experiences trying to use AI.

ssijak · 2025-02-25T08:51:17 1740473477

Some people report some negative experiences for any tool ever brought into existence.

bufferoverflow · 2025-02-24T21:47:57 1740433677

This test isn't stupid. If it can't count the number of letters in a text, can you rely on it with more important calculations?

stnmtn · 2025-02-24T22:23:50 1740435830

You can rely on it for anything that you can validate quickly. And it turns out, there are a lot of problems which are trivial to validate the solution to, but difficult to build the solution.

101008 · 2025-02-24T23:00:36 1740438036

Coding is not one of those cases or edge cases wouldn't exists

TeMPOraL · 2025-02-25T01:03:53 1740445433

Not on calculations that involve counting at a sub-token level. Otherwise, it depends.