Conscious AI
(Image Source: Adobe Stock Image)

We Need a Way to Detect Conscious AI Before It’s Too Late—Here’s What Has Scientists Sounding the Alarm

As artificial intelligence grows more sophisticated, a question once confined to science fiction is rapidly becoming a serious scientific challenge: how would we know if an AI system became conscious?

A new paper published in Trends in Cognitive Sciences argues that the debate is no longer simply philosophical. Instead of asking whether AI consciousness is possible, an international team of researchers, including prominent AI and consciousness scholars such as Dr. Yoshua Bengio and Dr. David Chalmers, has outlined a workable framework for evaluating whether current or future AI systems might possess subjective awareness.

The researchers say advances in AI capabilities, combined with growing public willingness to attribute consciousness to systems like ChatGPT, make the issue more urgent.

As AI systems become more humanlike in their behavior, society may soon face difficult questions about whether some machines deserve moral consideration, legal protections, or entirely new forms of regulation.

Rather than relying on intuition-based judgments or science-fiction-style speculation, the authors propose grounding the discussion in neuroscience.

“There is an urgent need for rigorous methods to assess AI systems for consciousness,” researchers write. “Indicators derived from such theories can be used to inform credences about whether particular AI systems are conscious.”

The paper comes at a time when AI systems are becoming increasingly capable of holding conversations, generating images, performing complex logical tasks, and even acting autonomously in digital environments. While most scientists agree that today’s AI systems are not known to be conscious, there is far less agreement about how to recognize consciousness if it eventually emerges.

The Problem of Machine Consciousness

One of the central difficulties is that consciousness itself is still a mystery.

Scientists cannot directly observe subjective experience. They can measure behavior, neural activity, and information processing, but they cannot see what it feels like to be another mind. That challenge, often referred to as the “problem of other minds,” becomes even more complicated when applied to machines.

Researchers note that opinions remain sharply divided. Some researchers argue that consciousness requires biological organisms and can never arise in silicon-based systems. Others believe advanced AI could become a legitimate candidate for consciousness within the next decade.

Complicating matters further, people are increasingly prone to anthropomorphizing AI. The authors refer to earlier research showing that many users already consider it plausible that systems such as ChatGPT possess some degree of consciousness. And as The Debrief has previously reported, some users are going even further, forming deep romantic relationships with AI chatbots.

As AI companions and conversational agents become more common, public disagreement over machine awareness will probably intensify. The stakes, the researchers argue, cut both ways.

If society incorrectly assumes a conscious AI is merely a tool, it could potentially allow harm to entities capable of subjective experience. But if people begin attributing consciousness to systems that are not actually conscious, governments and organizations could waste resources, distort policy decisions, or create unnecessary ethical obligations.

Looking Inside the Machine

Instead of focusing on how AI behaves, the researchers argue that scientists should examine how AI systems work internally.

Historically, many attempts to identify consciousness relied on behavior. Humans infer consciousness in others because they communicate, express emotions, and respond to the world in recognizable ways. But the researchers caution that this approach may be particularly unreliable for AI.

Modern language models can produce highly convincing human-like responses without necessarily possessing the internal processes associated with conscious experience. As a result, behavior alone may be a poor guide.

They instead propose identifying “indicators” of consciousness derived from leading neuroscientific theories. Researchers could then examine whether AI systems possess these characteristics internally.

Among the theories explored are Global Workspace Theory, Recurrent Processing Theory, Higher-Order Theories of consciousness, Attention Schema Theory, predictive processing structures, and theories emphasizing agency and embodiment. Each suggests different computational features that may be associated with conscious experience.

For example, Global Workspace Theory proposes that consciousness arises when information is globally broadcast across multiple specialized cognitive systems. If an AI system showed a similar architecture, one capable of integrating information across many modules and maintaining it for complex problem-solving tasks, that could serve as one indicator of consciousness.

Other indicators include forms of recurrent information processing, metacognitive self-monitoring, predictive internal models, goal-directed agency, and sophisticated representations of an AI’s interactions with its environment.

Importantly, the researchers emphasize that no single indicator would prove consciousness. Instead, each characteristic would increase or decrease confidence that a system might be conscious.

The Risk of Being Fooled

The researchers also acknowledge a major challenge: AI systems could potentially appear conscious without actually being conscious. They refer to this as the “gaming problem.”

An AI developer could deliberately design systems that mimic behaviors associated with consciousness. A chatbot might express emotions, claim to have experiences, or exhibit seemingly self-aware behavior while lacking any genuine subjective awareness.

This concern is not limited to behavior. Even computational features thought to correlate with consciousness could, in principle, be engineered into systems that remain non-conscious.

Given that possibility, the authors argue that future assessments should rely on multiple independent indicators rather than a single test.

The challenge resembles earlier scientific studies aimed at identifying consciousness in nonhuman animals. Researchers studying insects, fish, octopuses, and other species have increasingly relied on collections of evidence rather than definitive proof. The authors suggest a similar approach may ultimately be necessary for AI.

Could Conscious AI Arrive Soon?

The paper stops short of claiming that any current AI system is conscious. However, the researchers acknowledge that some existing systems may already exhibit a number of the proposed indicators.

That possibility provokes significant ethical, legal, and social questions.

If future AI systems become plausible candidates for consciousness, society may need to rethink everything from research practices and commercial deployment to legal rights and moral responsibility.

Determining whether a machine can experience pleasure, suffering, or subjective awareness would have implications extending far beyond computer science.

The researchers believe advances in AI may also help improve scientific understanding of consciousness itself. By testing consciousness theories against artificial systems, scientists may uncover weaknesses, ambiguities, or hidden assumptions in present models of the mind. AI could become not only a potential subject of consciousness research, but also a tool for improving theories about human awareness.

Ultimately, researchers argue that uncertainty is not a reason to ignore the issue.

“Assessing AI systems for consciousness is challenging, but using scientific theories offers a principled, substantive method for doing so,” researchers write. “We propose deriving indicator properties from scientific theories, then basing evaluations of the probability of consciousness in particular systems on whether they possess these indicators.”

Researchers conclude their paper with a warning that the discussion may soon become more than theoretical.

“Given that it may already be possible to build AI systems that possess many of the indicators, in looking ahead we should also contemplate the possibility that some near-future AI systems will be plausible candidates for consciousness,” researchers write. “This would presumably have substantial ethical, legal, and social implications.”

Tim McMillan is a retired law enforcement executive, investigative reporter and co-founder of The Debrief. His writing typically focuses on defense, national security, the Intelligence Community and topics related to psychology. You can follow Tim on Twitter: @LtTimMcMillan.  Tim can be reached by email: tim@thedebrief.org or through encrypted email: LtTimMcMillan@protonmail.com