Because the phrase 'language model' (or rather 'large language model', LLM) is n...

UniverseHacker · on March 15, 2023

You are missing what is so surprising about deep learning in general- it seems that training a sufficiently complex model on a sufficiently hard problem causes it to spontaneously develop intelligence, that generalizes to other problems. It doesn't really matter what problem you choose- image models, language models, etc. This is the "AI Scaling Hypothesis," basically the idea that advanced AI doesn't need much beyond greater scale. The GPT-4 paper itself is strong evidence of this.

In a similar manner to humans developing intelligence while being optimized just for reproductive success.

danparsonson · on March 16, 2023

I'm not missing that, I'm explicitly disagreeing that GPT shows evidence of this, and pointing out that human observers are mistakenly ascribing generalised intelligence to it because of some undeniably impressive, but explainable, results. The paper itself even opens with "GPT-4 is a Transformer-based model pre-trained to predict the next token in a document". I don't see any evidence of spontaneous development of intelligence, although I do think this work helps us get towards a deeper understand of the nature of intelligence itself, since a lot of what appears to be intelligent about GPT's behaviour is actually just the combination of a statistical model and an abundance of data, and perhaps that applies to humans too.

Also I would point out that emergent general intelligence would actually be quite an unsurprising result of deep learning for many people, given what we know about the human brain plus some hand-waving about emergent systems - I think many people actually expect something like that to happen, and that's exactly why so many people are jumping to that conclusion about GPT. It's confirmation bias.

But please enlighten me - where is the evidence that GPT-4 has generalised intelligence?

UniverseHacker · on March 17, 2023

> perhaps that applies to humans too

Like another commenter pointed out, you are redefining intelligence out of existence for both humans and GPT-4, and I don't see what that point of that is. GPT-4 can solve subtle and complex problems it has never seen before: explaining the point of jokes, guessing correctly about the subtle hidden motives of characters in a story, playing games, writing code, analyzing logical arguments, etc. These responses are not in the training data in any meaningful way. The ability to do tasks like this - things that require prediction, mental modeling of subtle details, etc. is what I feel makes both humans and GPT-4 generally intelligent.

FrojoS · on March 15, 2023

Humans evolved to spread their genes, but as a side effect they became good and producing music.

leventov · on March 15, 2023

Ok, then what's the point of defining 'intelligence' out of existence, if neither LMs nor humans are 'intelligent'?

danparsonson · on March 15, 2023

I'm sorry, I don't understand what you mean. Intelligence is quite an ill-defined concept already, is it not?