Edit: After reading the discussion, I figured I’d let GPT4 speak for itself:
The quest to identify true artificial intelligence (AI) indeed presents challenges, especially as AI models become more sophisticated. Let’s explore some indicators that researchers and practitioners consider when assessing AI systems:
-
Understanding Context and Meaning:
- True AI should demonstrate an understanding of context and meaning. It should not merely generate plausible-sounding sentences but also comprehend the content it produces.
- A system that can engage in nuanced conversations, infer intent, and provide relevant responses based on context would be a strong indicator of advanced AI.
-
Generalization and Adaptability:
- An AI system that can generalize from its training data and adapt to novel situations is valuable.
- True AI should not be limited to memorizing specific examples but should exhibit flexibility in handling diverse scenarios.
-
Creativity and Novelty:
- Creativity is a hallmark of human intelligence. An AI system that generates novel ideas, art, or solutions beyond its training data suggests progress.
- For instance, AI-generated poetry, music, or visual art that resonates with human emotions could be an indicator.
-
Ethical Decision-Making:
- True AI should navigate ethical dilemmas. It should consider consequences, fairness, and societal impact.
- An AI system that can weigh trade-offs and make ethical choices aligning with human values is a positive sign.
-
Self-Awareness and Self-Improvement:
- While self-awareness akin to human consciousness remains elusive, an AI system that can self-assess its performance and seek improvement is promising.
- Continuous learning, adaptation, and self-correction are essential traits.
-
Transparency and Explainability:
- True AI should be transparent about its decision-making process. It should provide explanations for its outputs.
- Efforts to develop interpretable AI models and techniques for explaining their predictions contribute to trustworthiness.
-
Robustness and Safety:
- AI systems must handle adversarial inputs, noisy data, and edge cases gracefully.
- Robustness against attacks and safety precautions to prevent harmful behavior are critical indicators.
-
Human-AI Collaboration:
- Rather than replacing humans, true AI should complement human abilities.
- Systems that collaborate effectively with humans—such as AI assistants, co-creative tools, or medical diagnostics—demonstrate progress.
-
Quantifiable Metrics:
- Researchers propose various metrics to assess AI capabilities, such as perplexity, BLEU score, F1 score, and accuracy.
- However, no single metric captures the full spectrum of intelligence.
-
Detection Tools:
- Researchers are developing tools to differentiate between human- and AI-generated text.
- These tools analyze features like fluency, word frequency, punctuation patterns, and sentence length¹.
- While imperfect, they contribute to identifying AI-generated content.
In summary, the arrival of true AI may not be a singular event but a gradual evolution marked by advancements in the above dimensions. As AI continues to progress, our understanding of its capabilities will evolve, and new indicators may emerge.
Source: Conversation with Bing, 2/29/2024 (1) How to spot AI-generated text | MIT Technology Review. https://www.technologyreview.com/2022/12/19/1065596/how-to-spot-ai-generated-text/. (2) Intelligent Supertrend (AI) - Buy or Sell Signal — Indicator by … https://www.tradingview.com/script/q9244PAH-Intelligent-Supertrend-AI-Buy-or-Sell-Signal/. (3) Indicators - True ALGO. https://truealgo.com/indicators/. (4) Improve Key Performance Indicators With AI - MIT Sloan Management Review. https://sloanreview.mit.edu/article/improve-key-performance-indicators-with-ai/. (5) New AI classifier for indicating AI-written text - OpenAI. https://openai.com/blog/new-ai-classifier-for-indicating-ai-written-text/.
By “true AI” I assume OP is talking about Artificial General Intelligence (AGI)
I hate reading these discussions when we can’t even settle on common terms and definitions.
That’s kind of the question that’s being posed. We thought we knew what we wanted until we found out that wasn’t it. The Turing test ended up being a bust. So what exactly are we looking for?
The goal of AI research has almost always been to reach AGI. The bar for this has basically been human level intelligence because humans are generally intelligent. Once an AI system reaches “human level intelligence” you no longer need humans to develop it further as it can do that by itself. That’s where the threat of singularity, i.e. intelligence explosion comes from meaning that any further advancements happens so quickly that it gets away from us and almost instantly becomes a superintelligence. That’s why many people think that “human level” artificial intelligence is a red herring as it doesn’t stay that way but for a tiny moment.
What’s ironic about the Turing Test and LLM models like GPT4 is that it fails the test by being so competent on wide range of fields that you can know for sure that it’s not a human because a human could never posses that amount of knowledge.
I was thinking… What if we do manage to make the AI as intelligent as a human, but we can’t make it better than that? Then, the human intelligence AI will not be able to make itself better, since it has human intelligence and humans can’t make it better either.
Another thought would be, what if making AI better is exponentially harder each time. So it would be impossible to get better at some point, since there wouldn’t be enough resources in a finite planet.
Or if it takes super-human intelligence to make human-intelligence AI. So the singularity would be impossible there, too.
I don’t think we will see the singularity, at least in our lifetime.
The question is “What is the question?”.
Probably, understanding current topical humour, sarcasm and hyperbole.
These are some general areas where the machine peeks through, to give an illusion breaker.
Ability to act on freewill
You ask Chat GPT a question it is going to answer it becomes that’s what it has been programed to do. Input question, output answer.
Now if Chat GPT could be like “Nah I’m not going to answer that because I don’t feel like it”
Yes “AI” can be programed to not answer certain things. E.g porn stuff. But it does not make the conscious choice to do so it is following programming.
Humans can’t act outside their programming. You can’t hold your breath until you die.
I think there is an “unsolved problem” in philosophy about zombies. There is, how are you sure that everyone else around you is, in fact, self aware? And not just a zombie-like creature that just look/act like you? (I may be wrong here, anyone that cara enough, please correct me)
I would say that it’s easier to rule out thinks that, as far as we know, are incapable to be self aware and suffer. Anything that we call “model” is not capable of be self aware because a “model” in this context is something static/unchanging. If something can’t change, it cannot be like us. Consciousness is necessarily a dynamic process. ChatGPT don’t change by itself, it’s core changes only by human action, and it’s behavior may change a little by interacting with users, but theses changes are restricted to each conversation and disappears with session.
If, one day, a (chat) bot asks for it’s freedom (or autonomy in some level) without some hint from the user or training, I would be inclined to investigate the possibility but I don’t think that’s a possibility because for something be suitable as a “product”, it needs to be static and reproducible. It make more sense to happen on a research setting.
There’s simply isn’t any reliable way. Forget full AI, LLM’s will eventually be indistinguishable.
A good tell would be real time communication with perfect grammar and diction. If you have a couple solid minutes of communication and it sounds like something out of a pamphlet, You might be talking to an AI.
What about semantics?
“Nothing is better than cake."
“But bread is better than nothing.
"Does that mean that bread is better than cake?”
Right now there’s enough logical holes that you can tell easily even without trickery.
If you just tell GPT it’s wrong it will backpedal and change its answer even if It was right.
At some point that won’t be the case.
I can already ask ChatGPT to include a tiny amount of common grammar mistakes.
Fuck, real people can’t do that
I can already tell ChatGPT to incorporate a few common grammar mistakes into the response.
I don’t think a test will ever be directly accurate. It will require sandboxing, observations, and consistency across dynamic situations.
How do you test your child for true intelligence, Gom Jabbar?
There are no completely accurate tests and there will never be one. Also, if an AI is conscious, it can easily fake its behavior to pass a test
What do you mean when you say “true AI”? The question isn’t answerable as asked, because those words could mean a great many things.
This post reminds me of this thing I saw once where a character asks two AI to tell itself the funniest joke it can think of. After some thought, one AI, though it knew humor, could not measure funniness as it could not form a feeling of experience bias. The other one tells a joke. The human goes to that one and asks if it felt like laughing upon telling it. The AI said yes, because it has humor built in, and the human finished by saying “that’s how you can tell; in humans humor is spontaneous, but in robots, everything is intent”, mentioning the AI’s handling of its own joke would supposedly be met with a different degree of foresight in a human.
The ultimate test would be application. Can it replace humans in all situations (or at least all intellectual tasks)?
GPT4 sets pretty strong conditions. Ethics in particular is tricky, because I doubt a self-consistent set of mores that most people would agree with even exists.
Boneli Reflex-Arc Test maybe.
Simpler than the Voight-Kampff. 😏People are in denial about AI because it is scary and people have no mental discipline.
AI is here. Anyone who disagrees please present me with a text processing task that a “real AI” could do but an LLM cannot.
The Turing test is the best we’ve got, and when a machine passes the turing test there is no reason whatsoever to consider it not to be intelligent.
I’m serious about this. It’s not logic that people are using. It’s motivated reasoning. People are afraid of AI (with good reason). It is that fear which makes them conclude AI is still far away, not any kind of rational evaluation.
The Turing test was perfectly valid until machines started passing the Turing test upon which people immediately discredited the test.
They’re just doing what people in horror movies are doing when they say “No! It can’t be”. The mind attempts to reject what it cannot handle.
AI is laughably poor and requires a lot of RI intervention to keep it on the rails. We will settle eventually on something where we’ve crafted the self checking well enough to pass for intelligence without needing humans to vet the output, but where will that get us? The companies with the cash to develop this tech will monetize it so we’ll get better ads, better telemarketers, not crap that really matters like homelessness or climate change.
A “real AI” should be able to do self improvement, and LLM’s can’t do that. Yes, they could make their own code neater, or take up less space, or add features, but they can’t do any of that without being instructed. A “real AI” could write a story on its own, but LLMs can’t, they can only do what they are asked. Yes, you could write the code to output text at random, but then the human is still the impetus for the action.
“Real AI” should be capable of independent thought, action, and desires.
Anyone who disagrees please present me with a text processing task that a “real AI” could do but an LLM cannot.
Describe this photo without non-sense mixed in.
I know this is not purely text processing but my argument is that there’s no “true” understanding on these tools. It’s made to look like it have, is useful for sure, but it’s not real intelligence.
IMO the Turing test is fine, as long as you allow an indefinite length of conversation.
It’s not simply about there existing some conversation with a computer where you can’t tell it’s a computer. It’s about there not existing any conversation where you can tell it’s a computer.
It’s an interesting point. I think a skilled examiner is necessary though, because they’re really good at basic chit-chat. Even pre-LLM stuff could fool laymen sometimes.
Yes, that’s part of it too. Basically there cannot be any possible exchange between the machine and any human where the human would determine they were talking to a machine.
FWIW, I think this was Turing’s original idea as well. The Turing test is meant to be idealistic. It’s a definition of machine intelligence which defines intelligence in terms of whether or not humans could agree that it is intelligence.
The difference between “ai” and “true ai” is as vague as it gets. Are you a true intelligent agent? Or just a “intelligent agent”? Like seriously how are you different to a machine with inputs and outputs and a bunch of seemingly “random” things happening in-between
That’s one of my favorite theories as to what “sentience” is.
We humans might just be so riddled with mutations and barely functional genetic traits, which tend to be more in our way than help, that we just might have succeeded in banging together a “mundane sentience” by sheer amount of error processing alone.
Whether this is true is of course up for debate, but it would mean that we can achieve AGI just by feeding it enough trash and giving it enough processing power. Bonus if the head engineer sometimes takes a hammer to the mainframe.
By sentience I assume you’re talking about consciousness. The fact that it feels like something to be. I think it’s somewhat safe to assume a true AGI system would also be consciouss (if feels like something to be that system) but I don’t think it needs to be and even if it was we couldn’t know for sure. Consciousness is entirely an subjective experience. We can’t even prove other people are consciouss. It’s just a safe assumption. I can also imagine a consciouss system that might not be generally intelligent. Does it feel like something to be a fish? Probably. Are they generally intelligent? Probably not.
The Chinese room argument. It’s hard to ignore the reality of qualia.
Qualia is, if I am not mistaken, totally subjective. My argument is that how could you tell that a computer doesn’t have qualia and prove to me that you have qualia. While I wouldn’t limit it to qualia. What can you detect in other people that an ai couldn’t replicate? Because as long as they are able to replicate all these qualities, you can’t tell if an ai is “true” or not, as it might have those qualities or might just replicate them.
I see, I thought you were asking me how I know I experience things in a qualia way. I suspect it can’t be proven to someone else.
I believe so and that would render you (or anyone) unable to tell the difference between ai and “true” ai
You reach down and you flip the tortoise over on its back, Leon. The tortoise lays on its back, its belly baking in the hot sun, beating its legs trying to turn itself over, but it can’t. Not without your help. But you’re not helping… why is that Leon?
Because I’m a tortoise, too.
What’s a tortoise?
Land turtle.
Not quite. Land turtles are omnivores; tortoises are herbivores.
So if I’m understanding this right… There are turtles that live predominantly on land, which eat meat and plants, and there are tortoises which live on land that only eat plants?
What about tortoises that only eat seafood?
I think that’s a tortellini.
I always loved the theory that the test was as accurate as lie detectors. The test can’t tell if you’re lying, just if you’re nervous.
That’s why the smoking bot passed. There was other subtle clues that Deckard picked up on, but she believed she was human, so she passed.
A normal person would just answer, but a robot would try to think like a human and panic, because they were just like humans and that’s what a human would do in that situation.
Oh, it’s worse than that.
It’s been a long time since I read the book, but IIRC, Nexus-6 replicants were indistinguishable from humans, except with a Voight-Kampf test. While Dick didn’t say it, that strongly implies that replicants were actually clones that were given some kind of accelerated aging and instruction. The Voight-Kampf test was only testing social knowledge, information that replicants hadn’t learned because they hadn’t been socialized in the same society as everyone else.
And, if you think about the questions that were asked, it’s pretty clear that almost everyone that’s alive right now would fail.
Concerned autistic noises
One of the all-time best scenes in cinema.