Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

ModerateImprovement@sh.itjust.works · 5 months ago

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

zbyte64@awful.systems · 5 months ago

Humans predict things by assigning meaning to events and things, because in nature, we’re constantly trying to guess what other creatures are planning. An LLM does not hypothesize what your plans are when you communicate to it, it’s just trying to predict the next set of tokens with the greatest reward value. Even if you were to use literal human neurons to build your LLM, you would still have a stochastic parrot.

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless – The Markup