ChatGPT et al, the creators don't know how it works?

I doubt the ‘we don’t really understand how it works’ pitch is the one he presents to the backers when he’s asking for another billion dollars.

It's just good theater for a PR session.

Sounds a lot like “we need to pass this bill so we can know what’s in it”.

“Give us teh moniez so we can figure out what this AI can do.”

‘Push the button and see what happens’

No, it’s a feature that’s pretty much inherent to neural networks.

I think the CGP Gray video you linked above pretty clearly explains the difference.

No, it's not. It would be difficult to size a sigmoid or program back propagation if you didn't know how it works.
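
For concreteness, here's roughly what "sizing a sigmoid and programming back propagation" involves, sketched in NumPy with made-up numbers (a toy illustration, not anyone's actual code):

```python
import numpy as np

def sigmoid(x):
    # Squashes any real input into (0, 1).
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(y):
    # Derivative of the sigmoid, written in terms of its output y = sigmoid(x).
    return y * (1.0 - y)

# One back propagation step for a single sigmoid neuron (illustrative values only).
x = np.array([0.5, -1.2, 3.0])   # inputs
w = np.array([0.1, 0.4, -0.2])   # weights: engineers choose how these are initialized
b = 0.0
target = 1.0
lr = 0.1                         # learning rate

y = sigmoid(np.dot(w, x) + b)         # forward pass
err = y - target                      # gradient of squared error w.r.t. y (up to a constant)
w -= lr * err * sigmoid_grad(y) * x   # chain rule: nudge each weight downhill
b -= lr * err * sigmoid_grad(y)
```

That much is perfectly well understood; the dispute is about what happens when you wire up billions of these and let training set the weights.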

I’ll refer you back to this post by @engineer_comp_geek that explains what you are missing:

But I suppose I was not specific enough:

It's inherent to sufficiently large neural networks, not neural networks in principle.

I’m not missing anything. The brilliant engineers who designed GPT controlled the parameters of every node in the system. They specified the activation window and the back propagation of every node in the net. It was not anything as ethereal as the simulation of neurons. They are not related.

It's an engineering problem with a solution. The 'gosh, we don't know how it works' line is just good theater.

This is the key. Of course we know how it was built since we built it. We know how it was trained since we trained it. But there is not remotely any practical way to know how or why ChatGPT produced any specific response. There is no way to predict ahead of time how well it will work. Perhaps most intriguingly, there is no way to predict the spontaneous emergence of new problem-solving skills on which it had never been explicitly trained, which can appear quite suddenly purely as a function of scale.

Then why did they invest a billion dollars to train it? Just to see what it would do?

In a nutshell, that’s pretty close to the truth. I mean, this is a research project, and that’s pretty much the nature of research. The justification for research funding is generally in the form of some evidence that it will be productive, but there are never guarantees. The Large Hadron Collider cost billions, and the justification was basically that cool things will likely happen at unprecedented levels of particle energy, and thus new discoveries will be made. The justification for the OpenAI research is that intelligent machines capable of natural language conversation and reasoning have many commercial applications.

Yes, and they had to convince somebody that they had a reasonable probability of success. Since this is a commercial product, I don’t believe it was funded by the American Altruists Association.

This is a strawman argument. You’re not responding at all to what @wolfpup is saying.

To demolish your strawman, OpenAI had done work on previous models (ChatGPT 2, for example) and showed that increasing the size of the neural network and amount of training time it received resulted in corresponding increases to the performance of the language model. Graphing the performance vs resources available to the model, they were able to show that the largest models they could produce were still improving rapidly with size, with no sign of diminishing returns.

They used this as evidence that increasing the size of the model further would result in even better results, and thus secured funding for ChatGPT 3.
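
Schematically, that "graph performance vs resources and extrapolate" step is just a power-law fit in log-log space. A toy sketch with invented numbers, not OpenAI's actual data:

```python
import numpy as np

# Invented (model size, validation loss) pairs standing in for results from smaller models.
sizes = np.array([1e8, 3e8, 1e9, 3e9, 1e10])
losses = np.array([4.2, 3.8, 3.3, 2.9, 2.5])

# Scaling curves look roughly linear in log-log space: log(loss) ~ slope * log(size) + intercept.
slope, intercept = np.polyfit(np.log(sizes), np.log(losses), 1)

def predicted_loss(n_params):
    return np.exp(slope * np.log(n_params) + intercept)

# Extrapolate to a much larger model to argue the curve hasn't flattened out yet.
print(predicted_loss(175e9))
```

A falling fitted curve tells you a bigger model will probably score better on the training objective; it tells you nothing about which new capabilities, if any, will show up at that scale.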

Thanks for making my argument. They knew what they were doing.

My understanding is that the probability of success was largely premised on smaller, simpler models that showed the language model concept to be promising. But if you’re trying to suggest that the researchers or funding providers knew exactly how ChatGPT would perform in a massively scaled-up version, that’s absolutely not the case. Even today the research continues and new surprises continue to appear.

That’s not remotely the same thing as knowing how the assembled and trained system would perform, or even why it responds the way it does. For all anyone knew, it could well have been a total failure, proving only that scale alone doesn’t significantly improve cognitive skill. Instead it proved the opposite.

Well yes, that’s the nature of development. I’m not sure what the argument is here. You are familiar with R&D. I am familiar with R&D funding. We both know that there is a large degree of risk in development and total success is a pleasant surprise. We also know that there is an element of theater in presenting to the media.

Sometimes that happens, but sensationalism is usually the creation of the media itself, not the researchers. I haven’t seen anything like that coming out of OpenAI. That there are mysteries and surprises in ChatGPT like the emergence of novel skills isn’t “theater”, it’s fact.

That… is not the point I was arguing against, whatsoever.

It’s like the difference between understanding the mechanics of evolution, and understanding what individual genes do, versus being able to program with DNA the way we do with computer code. Or having an understanding of both neurons and psychology versus understanding the complete mechanics of the human brain.

We understand the mechanics of neural networks, at an individual neuron level. We also have some ideas of how they evolve over time when trained, in a general sense.

That doesn't mean that we "understand how they work", any more than we understand how consciousness arises in the brain or how to read DNA like a book.

Absolutely right - the individual nodes are very easy to understand. The thing that's difficult to understand is what they're actually doing together, when 'together' means a network of 175 billion parameters all interacting with each other.
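
If it helps to see it: a single node is just a dot product and a squashing function, a few transparent lines of code. The opacity only appears when you compose enormous numbers of them. A toy sketch (sizes invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

def node(inputs, weights, bias):
    # One node: a dot product plus a squashing function. Completely transparent.
    return np.tanh(np.dot(weights, inputs) + bias)

# A toy network: 12 layers of 64 nodes each, with random weights.
layers = [rng.standard_normal((64, 64)) * 0.1 for _ in range(12)]
x = rng.standard_normal(64)

# Any single node is trivial to inspect...
first_node_output = node(x, layers[0][0], 0.0)

# ...but the network's behaviour is the composition of all of them, and nothing
# in the individual multiply-adds tells you *why* the final output is what it is.
for W in layers:
    x = np.tanh(W @ x)
```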

The engineers at OpenAI had strong reason to believe that interesting new capabilities would emerge when they went from ChatGPT 2 to ChatGPT 3, and from 3 to 3.5, and from 3.5 to 4, because new capabilities had emerged every previous time when they had made the network or training dataset larger. But they didn’t even know what those new capabilities would be. And they certainly didn’t know how those capabilities would work.

This is quite simply factually incorrect, and I don't know why you keep saying it. There are far too many parameters for any human to have specified every one of them. The system set its own values for those parameters in the process of digesting the training data set.
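
Concretely, "set its own values" just means the weights fall out of a gradient-descent loop over the training data rather than out of anyone's spec. A deliberately tiny, hypothetical example:

```python
import numpy as np

rng = np.random.default_rng(42)

# Toy stand-in for a training corpus: noisy samples of y = 2x + 1.
X = rng.uniform(-1, 1, size=200)
Y = 2 * X + 1 + 0.05 * rng.standard_normal(200)

# The engineers choose the shape of the model and the learning rate...
w, b = 0.0, 0.0
lr = 0.1

# ...but the values of w and b are set by the data, one gradient step at a time.
for _ in range(500):
    err = (w * X + b) - Y
    w -= lr * np.mean(err * X)   # dLoss/dw for mean squared error (up to a constant)
    b -= lr * np.mean(err)       # dLoss/db

print(w, b)   # ends up near 2 and 1: values nobody typed in
```

Scale that up from two parameters to 175 billion and you have the situation @wolfpup is describing: the procedure is fully specified, but the resulting values and behaviour are not.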

Read what you quoted. Are you not familiar with software programming techniques?