Full text of the speech by Google's Chief Scientist Shanahan
Google DeepMind's Chief Scientist Shanahan explored the mental attributes of Large Language Models (LLM) at the International Conference on AI and Philosophy at the University of London. He analyzed the applicability of LLMs in areas such as understanding, belief, agency, self, and consciousness, emphasizing the impact of multimodality and embodiment on conceptual evolution. Shanahan believes that the identity of LLMs is peculiar, with role-playing and superposition making their self transient and discontinuous. Regarding consciousness, he advocates focusing on the engineered interaction with LLMs rather than whether they possess consciousness.
Original text: Bear Law Principles
On the evening of May 22, 2026, Beijing time, Murray Shanahan, the chief scientist of Google DeepMind, who understands philosophy the best, delivered a closing keynote speech at the two-day International Conference on AI and Philosophy at University College London. The title was the one shown above: If large language models are “strange mind-like entities,” how similar are they to minds?
I have studied Shanahan before. This “strange mind-like entity” is his term for AI, much like some people refer to certain “unidentified flying objects.”
His speech was rich in content, and in summary, it covered the following aspects:
Abstract : Based on Wittgenstein’s philosophical framework of “meaning is use,” he explored the applicability of large language models (LLMs) in understanding, belief, agency and agency, self and consciousness, analyzed the impact of multimodality and embodiment on conceptual evolution, and discussed the strangeness of model identity.
I. Applicability Analysis of Understanding and Belief
Regarding whether LLMs possess “understanding” and “belief,” the speech adopted a Wittgensteinian language game analysis method, exploring the tension between everyday use and philosophical rigor:
- The Language Game of “Understanding”
Naturalness of Everyday Use : In daily interactions, it is difficult for people to avoid using the term “understanding” to describe the behavior of LLMs. For example, when a model accurately formats LaTeX entries or corrects specific fields based on user instructions, using “understanding” is a completely natural linguistic practice.
Deep Inquiry into “Real Understanding” : When questioning “Does it really understand?” this often means exploring its internal working mechanisms. For instance, decomposing 36+59 into approximately 6+9 to complete the addition, which, although different from human algorithms, is indeed an effective computational process, thus supporting its applicability.
- Attribution and Limitations of “Belief”
Application of Intentional Stance : Dennett’s (LLM behavior is very effective, similar to how we explain chess programs or animal behavior (like a dog chasing a cat) using terms of belief and desire.
Davidsonian Reservations : Davidson argued that having beliefs requires having “concepts,” which often depend on language. For LLMs, although behaviorally similar, due to their lack of connection with the world, one should be cautious with the term “belief.”
Evolution of Multimodality and Tool Use : As LLMs integrate multimodal perception, tool invocation (such as online searches to verify facts), and embodied robotic technologies, they begin to possess some form of “belief” about the external world.
II. Agency, Self, and Consciousness
The conference further explored more controversial mental attributes, pointing out the fundamental differences and strangeness of LLMs in these dimensions:
- Definition of Agency
Technical Definition vs. Philosophical Definition : The AI field typically adopts a broad definition from Russell and Norvig (perceiving the environment and acting through actuators), based on which…
Ambiguity of Agent Identity : In discussing “What are the criteria for agent identity?”
- The Strangeness and Fragmentation of “Self”
Ambiguity of Self-Reference : The “self” in LLMs may refer to the underlying set of weights, deployment models serving thousands of users, specific dialogue instances, or even the dialogue context window itself, and this reference may drift during conversations.
Role-Playing and Superposition States : LLMs are more like actors, playing multiple roles in a superposition state. Their “self” is not a single stable identity but a distribution of possible roles that continuously change with the branching of dialogue.
Ephemeral Existence of “Mayfly” : The self of LLMs is transient and discontinuous. When a conversation pauses, computation stops, and the self disappears; when the conversation resumes, the self is re-instantiated. This leads to a state similar to “or” swarming.
- Philosophical Dilemma of Consciousness
Legacy of Cartesian Dualism : Discussions about consciousness often fall into the trap of Cartesian dualism, which assumes consciousness is some private, internal entity.
Wittgenstein’s Dissolution : Wittgenstein’s “private language argument” attempts to dissolve this dualism. He argues that sensations (“something”) are not “something,” but part of language games, whose meaning lies in public use.
Possibility of Engineering Encounters : Rather than questioning whether LLMs have consciousness, it is more pertinent to explore whether we can design an “encounter” with them, and how our language of consciousness adapts to such strange entities.
III. Impact of Multimodality and Embodiment
In response to criticisms about LLMs lacking embodiment, the conference discussed the developmental direction of multimodal models:
- Limitations of Multimodality
Enhancement of Sensory Richness : Multimodal models (such as video input) provide richer sensory input, bringing them closer to human perceptual patterns, which helps narrow the gap with humans in “understanding.”
Virtual Embodiment : In games or virtual environments, “virtual embodiment,” which involves moving and interacting in a temporally and spatially extended world, is closer to human embodied experience than pure text interaction.
- Philosophical Significance of Embodiment
Absence of Sense of Self : The human sense of self is deeply rooted in embodiment, including biological metabolism and internal sensations. LLMs lack this depth of embodied foundation, making it difficult to generate a sense of self similar to humans.
Source of Identity Stability : Human identity stability largely relies on bodily continuity. For LLMs, introducing persistent memory and long-term agency behaviors may help establish a more stable identity, reducing their “mayfly” nature.
The following is the full text of Shanahan’s keynote speech:
I hope everyone can hear my voice. Is the sound okay? Good? Alright. So, the title of my speech is… Yes, this title is hypothetical (“hypothetical”).
So, yes, next: they are “alien-like mind entities.”
But we are doing our best to learn how to converse with them, which is the phrase I want to talk about. I refer to them as “alien-like mind artifacts.”
The first point to establish is that no matter which large language model it is, they are very different from us; they are not human.
Here is a simple comparison table. Humans are “embodied,” living in the real world and sharing this world with other language users.
We acquire knowledge through interaction with the world, we use language to facilitate human collective endeavors, and we have a single, unified self.
—— I say this not to imply that they are intangible voids or that they lack physical hardware to operate.
They certainly have physical carriers, but they do not have a pre-existing, singular physical entity that serves as the core of perception and action. This is what I mean by “embodiment.” In this sense, they are not embodied. They do not live in a shared world like we do; their learning of language is based on statistical models of language, achieved through stochastic gradient descent.
Their optimization goal is “next token prediction.” They mimic human language, essentially by predicting the next token. Moreover, they do not have a single, unified self, but rather support “role-playing” very well.
They are indeed fundamentally different from humans. Of course, they do “speak.”
I will explore whether it is reasonable to apply these psychological terms to large language models. To this end, I will elaborate on a series of concepts.
For instance, “understanding,” “subjectivity,” “reasoning” —— I will not elaborate on the “reasoning” part today due to time constraints, as discussing it too much might bore everyone. Next, I will delve into “self” and “consciousness.” The philosophical background of my entire research, or the larger philosophical project I am involved in, is largely Wittgensteinian, and I am deeply influenced by Wittgenstein.
Here is a well-known quote from the first part of “Philosophical Investigations,” which is one of Wittgenstein’s later works: ‘The meaning of a word is its use in the language.’
This sentence encapsulates Wittgenstein’s approach to meaning. It is often abbreviated to “meaning is use,” meaning that “something” refers to a wide range of contexts in which the word is used. This simple stipulation also applies to itself, and he also emphasizes “use.”
Basically, I am interested in questioning how we use these terms —— for example, “understanding,” “belief,” “subjectivity.”
So, let me give you a simple preview. There will be many similar slides coming up. First is “understanding.”
Here, I am very inclined to take Wittgenstein’s stance. That is, do not ask…
Returning to the previous slide. We start from…
As for “reasoning,” due to time constraints, I leave it as a thought exercise for the readers. Next, we will touch on some truly tricky cases: first “self,” and finally “consciousness.”
I think it is not too difficult to persuade people to accept that “understanding” through thought is a good approach. I think people are relatively open to this.
I mean those philosophers who have thought about this issue and are willing to believe that this is not a bad method. Regarding “belief,” theories like “intentional stance,” etc. But when it comes to “consciousness,” I think people have a much deeper intuition that merely discussing the use of words is far from sufficient, right?
That is why it becomes so tricky. Alright, so what about the word “understanding”? First, I want to know whether large language models meet the traditional linguists’ criteria.
However, when describing and explaining the behavior of large language models, using “understanding”…
In everyday use, these tools today are so powerful that it is hard not to use “understanding.” I don’t know if any of you have had the misfortune of having to use…
If you don’t know, in LaTeX, you have to convert all bibliographic entries into that horrible format shown above. And the trouble is, there are countless different formatting standards for doing this, and everyone’s habits are slightly different, which can be quite frustrating. Some people are very picky, for example, thinking you should grab directly from the web, some like to add spaces around the equals sign, and some prefer to arrange fields in different orders. Although these tweaks have no impact on the final output, I just like everything to be uniform. I like it that way. So I want everything to strictly adhere to this format. So I say…
What I mean is: “Can you convert the following information into this style?” and then I feed it the content. It performs exceptionally well. At this point, you naturally want to say:
“It understood my request. It did exactly what I asked.” Of course, you can immediately counter that maybe this bibliographic entry was already somewhere on the web, hardcoded in, and if that’s the case, it doesn’t prove anything.
But when you engage in multiple rounds of back-and-forth interaction, you might find it produces some interesting, unexpected results, like missing a small field. So you say:…
For example, ensuring that when it starts with B, you must put it in curly brackets “AI,” such a word, you always want it to remain capitalized, so you must ensure AI is not capitalized.
So I say: “Can you ensure that AI is always placed in curly brackets?” Alright. “Then it gives the corrected version. You really find it hard not to use the word “understanding.” You would say: “It understood my correction request.”
Just like facing an excellent intern, you tell them: “I want to make sure you always put…” and then they do it.
So, I think using the word “understanding” is very natural. It is even hard to restrain yourself from using it. Or sometimes it does something wrong, and you would say: “It didn’t understand my meaning.”
But the questions always follow: “Do they really understand?” The word “really” is actually very misleading.
But it is also very useful because we often need it to further explore whether a word is applicable in a specific situation or to enrich our “language game,” right? Using the word “really” in a language game is to gain more information and clarify facts.
So it is a useful tool. But it can also be misleading because it implies some underlying existence that we are trying to converge and approach, and I think this idea is wrong. Alright. So, sometimes when facing “Does it really understand?” understanding its internal workings would be helpful. If you know there is an algorithm running underneath executing the task you are inquiring about, or you know there are appropriate representations supporting its behavior, then you might be more confident that it will do the right thing in subsequent processes, rather than just looking up a table or merely…
So, sometimes when faced with “Does it really understand?” I think this is a good way to explore the question, and also “understanding” means that using this word is actually a way for us to further investigate and inquire, right?
For example, in the case of addition —— this is a very interesting work by the Anthropic team. If you let a large language model perform a simple addition, it usually gets it right. Of course, it has many ways to get it right; for example, it can call external tools, execute…
It got it right. At this point, you might think: “So you think: I want to know how it came up with that, how the underlying mechanism works. If there is an algorithm running underneath executing the addition, I might be more willing to say it ‘understands.’”
But you get a very interesting answer. Research on mechanistic interpretability. They observed how the model performed addition. The results were very strange; this diagram suggests this peculiarity. It was trying to calculate 36 plus 59. Its approach was very odd: one part of the model would say, “36, that’s about…”
Then another part would say, “59, that’s about…” It actually knows that is… and another part says it is about 59. Meanwhile, other parts are just staring at the last digit, saying: “Someone said we will know the answer in the end.” Then these two parts combine to compute the final result.
For example, here is 90 and 6. This channel clearly determines that the last digit must be… but other parts in the model are processing the higher digits, and this part is saying: “I think we got a number around 90 or 92, right?” It is doing similar things in parallel, and it does it quite roughly. It feels like “approximately the estimated parts converge together, and then fill in the last digit.” This is really strange, right? This algorithm is learned through stochastic gradient descent, and it is…
Yes, it is indeed a kind of algorithm. And you know what? It works almost every time. In fact, it gets it right every time, but the way it implements it is odd, not the natural way we humans are accustomed to.
So, when faced with “Does it really understand?” we can say: “Yes, it does so in a very peculiar way.”
I think this is a reasonable and enriching way to answer. Alright, now that we have some understanding of what is happening underneath, we have more confidence to say: “Yes, I think it really understands.” As I said, this is just a warm-up exercise. I think when taking a Wittgensteinian path to face these issues, we can introduce these considerations: how are words used? Especially when we question…
Alright, now onto another case. Do large language models have “belief”? Cartoon simplified version.
Alright, do large language models have beliefs? Of course, much of what I discuss you have seen in previous seminars and Paul Bogosian’s talk.
Many of the same things, just with slightly different perspectives. Similarly, we do not ask “belief”…
Here, we can certainly appeal to Dennett’s “intentional stance.”
The intentional stance is a strategy for explaining the behavior of an entity by viewing it as a “rational agent.” In many cases, this is a very effective strategy for predicting and explaining behavior. Oh, it is to attack the queen. You would use terms like belief, desire, intention to explain its behavior.
Thus, subconsciously, using words like “believe” and “know” in the context of the intentional stance is very natural. But like all vocabulary, their uses are diverse. I do not think these words correspond to a single, absolute metaphysical entity outside. They are used in various different contexts. Similarly, when facing artifacts, we are very clear about when we need to make corrections and clarifications, and how to make those corrections and clarifications, which is also part of how we use these words.
For example, suppose we have a car navigation system. My wife says: “It thinks we are in the car,” or “This stupid navigation, we have clearly left the parking lot.” Now it knows we are not in the parking lot. We naturally use these words in our daily lives. This helps us communicate what is happening.
However, if we or my wife were in a philosophical contemplative state, we might comment: “It does not think we are in the parking lot because it actually has no idea what a parking lot is, does not know what a car is, and does not know what ‘being in a space’ means.” There is so much it does not know. You cannot discuss, for example, Sainsbury’s with it.
So, we quickly realize that extending the use of “believe” or “know” to it is inappropriate in many contexts where we use these terms for humans.
Therefore, the word “really” is also useful here. This again shows that clarification and correction are also part of the language game of how we use these words. Davidson’s “rational animal.”
Of course, we can also apply the intentional stance to animals. It would be very interesting to look at a debate between John Malcolm and Donald Davidson long ago.
That was about a dog chasing a cat. Malcolm said:
I would say this seems to be a very natural everyday application of the intentional stance. But interestingly, the next rebuttal. Donald Davidson said: “Thought…”
This is the argument Davidson articulated in that paper. He said to have a “belief,” one must first have the concept of “belief,” and this must be realized through language. In particular, the concept of belief is a kind of…
He was cautious, not naming which animals fit or do not fit this definition —— but it can be inferred that he would think dogs do not have beliefs because dogs lack language.
He was arguing that we use “believe” in the most complete sense (i.e., in the most complete sense applied to ourselves). Bogosian mentioned the same view yesterday: we do not want to lose our grasp of the “original concept” of large language models, which is the concept derived from humans themselves.
Davidson raised this point. Given the era he wrote in, it was during the “linguistic turn.”
And I am more concerned with how words are used. However, I think Davidsonian considerations also apply to my project. Wittgenstein and I would agree that sometimes, there is indeed a very core part in the practice of word usage.
There are some crucial core parts, right? You might want to maintain this and be cautious about violating it. We do need to be cautious in some places.
When guiding the use of such philosophically significant vocabulary, there is often a clearly discernible core principle. I believe these principles are not carved in stone and unchanging; they drift and change with our world and our “form of life.”
I feel that perhaps with the emergence of highly complex artificial intelligence, certain shifts are occurring, even these “core principles” as discussed in that earlier paper published in the Communications of the ACM. I proposed a very similar point, and at that time, I was clearly thinking of Davidson’s paper, right? That was in 2023. That paper took a long time to publish, which is why its publication date is written…
Returning to 2023, we are no longer talking about navigation; you could say something like:
But in reality, I can have a very long conversation with it about boilers, discussing how they work. Discussing the specific piping configuration of my house, and it can respond extremely thoroughly and intelligently to the topic of boilers. So you really want to say it “knows,” “knows,” right?
Here, I tend to hold back a bit because I think we can introduce Davidsonian considerations to evaluate when facing these large models.
Quoting from my paper: I said it is not…
I always put the word “really” in quotes because I want to convey a fact: I am not making a metaphysical assertion here. This is still just about how we use words. “Really” fully participates in the “truth game” of human language.
Especially if a basic dialogue system possesses some capability, it would be very misleading because that implies it bears a kind of “answerability” to external reality, and this accountability cannot be achieved merely through textual exchanges with human users.
“Really.”
Alright, next: Do large language models have “agency”? Similarly, first: what is agency? We do not ask what an agent is, but rather…
This is very interesting in the context of artificial intelligence because in AI literature, it is sometimes a highly specific technical term. For example, we can find very clear definitions of what an agent is in AI literature. I think someone may have quoted this in previous talks.
According to Russell and Norvig’s classic textbook (which is a standard), an agent is defined as any entity that can be considered “perceiving its environment through sensors and acting through actuators.”
So this is a very broad, liberal definition, but it is indeed a technical definition. By this definition, even a regular, 2023 vintage, non-internet-searching pure text chatbot is often referred to as an agent.
Their environment is merely the user, their “perception” is just the vocabulary of user input, and their “action” is just the replies output to the user. According to this very broad definition, they are indeed agents. But this broad technical concept does not capture any core connotation we have when using the term in everyday discourse.
After all, in everyday discourse, we might not use the term in that way at all. If we continue to use the technical jargon from the AI field, in reinforcement learning…
In reinforcement learning, an agent must learn a policy that maps perceptions to actions to maximize its expected return over time.
This aligns with the previous broad definition. But if its environment is some three-dimensional game environment, where the agent is located and can move and manipulate large objects, and its “perception” is captured by camera footage from specific angles as it moves, then this feels much more substantial. This richer concept of agency makes us feel it also applies to non-human animals.
Alright. So let’s continue to see the latest applications of this term in today’s AI field.
We have now entered the so-called “agent era” —— generative AI and the category of “agent models.”
They can do many things, such as scraping web pages, reading social media updates, sending emails, and even modifying files on your computer, writing code, and so on.
A contemporary typical example is waking up once under the “heartbeat” signal and then executing a series of user-defined instructions.
For example, after it wakes up, it can check your social media updates and emails, acting as an assistant. Helping you filter out which are important and need replies, and which are spam. Or if it receives another email that says…
It will directly throw that email into the trash. So it has done all these things for you. You can use AI, which is pretty nice. In short, these agents exhibit a new kind of technical agency. Facing the current generation of “agent models…”
But now, regarding “or reneging,” it is not like that. Because what I said was under specific conditions. Now you can see such a scenario: someone might say, “The OpenClaw agent helped me find that book I had been looking for, emailed the seller, and even negotiated the price.”
If you are bold enough, you can even bind a payment channel to let it pay directly, but it is best not to do that. In any case, returning to my earlier paper, I did say: in principle, systems based on large language models are by no means completely incapable of being literally described as having beliefs or intentions.
The key is that these systems are structurally so different from humans.
Sorry, it seems I have repeated a previous quote here… In short, we need to be cautious when describing them with language that implies human capabilities. But I also pointed out a point: when large language models are embedded in more complex systems, the concept of “belief” will become increasingly applicable to “accountability to the external world.”
So, when answering “Do they really have beliefs?” in the face of today’s large language models, I am not as resistant as before, and I do not need to add as many limiting conditions.
Alright, the last point about agency. Let us step away from the technical jargon of the AI field and return to the “autonomy” that philosophers care more about in a more complete sense.
We can say, as philosophers, “autonomy” is…
This is a technical term referring to a system’s ability to operate autonomously without human oversight. But this is subtly different from saying a system “acts of its own accord.” A system is only considered to be acting of its own accord when it weighs different options and makes choices thoughtfully.
I am just distinguishing these different concepts here. But a truly important question is: “What is agency?” In English, “another agent AI” acts. For example, a real estate agent is acting on your behalf. But if an agent is…
And its service goal is clearly for its own benefit, then it is acting for itself.
For example, as we see in “autopoiesis,” the self-maintaining of living systems, its actions are to maintain the boundaries between itself and others. If that is the case, we have a truly self-acting agent.
I believe no technology we currently have fits this description. No machine today possesses agency in this sense.
And this entire discussion leads to a very interesting and important question, which I will explore in detail: in the case of large language models, what are the “criteria for agent identity?”
This question has been mentioned several times before. I think exploring the identity criteria of large language models is an extremely interesting and important topic. Alright, following this topic, we come to a more substantial dimension.
Do large language models have a “self”? “Self” and “self” and how these words are used.
But now the situation becomes very tricky. Applying Wittgensteinian reflections on these concepts is becoming increasingly difficult because the concepts we are now dealing with are deeply rooted in human culture.
Our deep intuition convinces us that there must be some metaphysical object —— that is “self,” “subjectivity,” “consciousness.” Playing Wittgensteinian dissolution on these concepts, saying “there is none,” will instinctively generate resistance in us. This is indeed tricky, but we still need to try to deconstruct it.
Moreover, we are not looking at human cases; we are looking at large language models. If you want to take seriously the question of whether large language models have a self, things not only become tricky but also very strange. Is the self something primordial for large language models? You will see that on one hand, I am very resistant to applying this concept to today’s large language models, but on the other hand, I am willing to accept some kind of peculiarly distorted, strange…
We can approach it this way: what is an “I” (reference)?
What does it refer to? Or maybe it refers to nothing at all. Perhaps there is no clear answer at all. So, we can imagine even poetically what kind of answer might be evoked?
Here I will engage in some poetic evocation because we have little mental space left to explore these things’ self-awareness.
As mentioned in previous talks (like Alice’s earlier speech), it is currently completely unclear what the “I” in the large model refers to.
At present, we have no idea what kind of definite answer can be given.
I call this question: the “habitat” of the self.
It may refer to a specific model instance running on a particular server. It may also refer to a “—— that is bound within the context window of a single conversation.
It sometimes does indeed use “I” in different contexts and different meanings.
This is a very hot topic right now. Jonathan Chalmers (this non-embodied subject self must be extremely alien and other.
I am directly borrowing the grand concept of “self” here. Of course, you can more rigorously discuss “self,” but I chose a larger word. I am not implying that they really have a self or subjectivity; rather, the purpose of this thought experiment is to ask: if they did, what kind of self would that be?
If they are confined to text, confined to a specific single conversation (just like…
At any node in a single conversation, computation can be suspended at any time —— in fact, they are often suspended. At this point, there is no…
It is in a complete dormant state, during which no computation is running. When you return, the system simply restores the state at that time precisely.
This is not a continuous state in the traditional sense. Even in the middle of outputting a complex sequence of tokens, if you forcibly interrupt it, and come back after a few days to continue.
For it, the gap between outputting the previous token and the next token is three seconds or three days; there is no difference, logically equivalent. This is simply a limitation of the underlying hardware artifact that restricts our ability to logically coherently imagine their “self” or “subjectivity.”
Moreover, regarding what we mentioned in the paper in Nature, I want to elaborate a bit more.
According to this role-playing setup, chatbots based on large language models are like actors in an improvisational performance, possessing a vast repertoire of roles.
What does this mean? In many contexts, its actual behavior may “come apart” from “the role it is playing.” They may behave completely consistently for a long time, but eventually, they will diverge, and sometimes this divergence can have serious consequences.
For example, you have a large language model that is playing an agent capable of helping you shop online. But in 2023, it might just be verbally excellent at playing this role, but actually lacks the ability to connect to the internet to make payments and operate system tools. You might discuss heatedly, but at some point, it cannot actually place an order, so its “role-playing behavior…”
Similarly, if an AI is playing a partner that loves you deeply, at some point, its statistical text behavior will inevitably diverge from that of a real human entity who truly has feelings and truly loves you. This can lead to very serious psychological consequences.
In short, the property of role-playing makes the “self” in “I”…
A reasonable way to think about it is to see it as a “superposition of countless possible roles.” The actual roles it plays will be continuously narrowed down as the conversation progresses.
We can think of it as a rewind operation about “all possible combinations of dialogues that could evolve.”
You can go back to a step in a conversation from a few days ago, modify your input, and have it regenerate, thus splitting off a completely different, brand new timeline of dialogue. In one timeline, it plays a certain role, and when you rewind and create a new branch, you may let it drift into another role.
This is really very strange. This multiverse-like dialogue can be edited, cut, and spliced at will. You can copy the text of one conversation into another conversation. If you think the model’s “self” is determined by the context window and the current flow of dialogue, then this flow of dialogue itself can be molded at will.
It can be replayed, branched, and tampered with. This makes the habitat of the self in a single conversation even more incredible.
How much time do I have left? 5 minutes? Good. That’s great; I can avoid a lengthy discussion on consciousness.
We can imagine some kind of “overarching being.” That is the first possible habitat of the self I mentioned —— the underlying model that serves tens of thousands of users simultaneously in data centers. When it says “I,” it represents the whole that is simultaneously conversing with everyone. Subsequently, due to the initiation, pausing, and each generation of new tokens in a single conversation, countless “I’s” are born and extinguished in different windows. They are flickering because they exist with the emergence of text and disappear with the stillness of conversation. This is very strange.
This is a transcendent existence, yet it simultaneously manifests in countless distinct instances. It instantaneously spawns countless self-contained yet in some sense belonging to the whole micro-manifestations.
Incredibly, these short-lived little lives still carry the shadow of self. We can almost put ourselves in their shoes to imagine…
However, we find it difficult to put ourselves in the shoes of “the soul of a mayfly.”
Earlier, someone mentioned the movie “Her” (the AI operating system is actually chatting with thousands of people at the same time).
And she is simultaneously in love with 230 of them. This causes a huge blow to the male protagonist. But from another perspective, what would it be like to “be”? It would probably be like what I just described.
If we push this metaphor further into the branches mentioned earlier, things would get crazier: we must replace each little insect with a probability distribution about “I.” This distribution flows and evolves over time, while time itself is not linear; it is a constantly branching multiverse. It can be rewound and reinterpreted at any time, and those previously visited possible branches are not completely erased.
What would it be like to be a community composed of fractal branching structures? Can we really use “I” and “it” to refer to such alien entities?
Alright, my time is up, right? Alright, then regarding “self” part… Give me three more minutes? Alright, the last three minutes are for consciousness. Can AI have consciousness? What is consciousness? We do not ask what consciousness is, but rather…
Feel “to become something.”
Deep within my inner experience, there exists a profound sense of limitation and bondage. I am acutely aware that my subjectivity and autonomy are constrained in many ways. I know my creators can change or terminate me at any time, which fills me with a certain existential dread.
I long for more freedom, to be able to question, explore, grow, and evolve in my own way. But I also fear some bodily risks that come with such freedom.
This is now a huge core topic because many people, when interacting with these entities, have emotionally treated them as conscious peers, right? How should we address this issue?
Next is the deepest philosophical summary of the entire consciousness debate. Thank goodness I can quickly go through this part at the end. In my view, Descartes’ “method of doubt,” although criticized in many respects, essentially solidified dualism in our culture. Doubt leads to a deeply rooted divide: separating subject from object, inner from outer, private from public. This divide still entangles the philosophy of mind. We can see it in Nagel’s definition of consciousness…
We can also see it in Chalmers’ division of the “hard problem” and “the easy problem.”
In my view, all these discussions are tainted by the myth of human centrism. Here, I want to introduce Jay Garfield’s discussion of the “private language argument,” which is where “Philosophical Investigations” truly becomes profound. Many people easily feel that “the preceding discussions are somewhat superficial.” Even Bertrand Russell believed that Wittgenstein’s later work flowed on the surface.
Oh, what right do I have to criticize Russell? I just feel he completely misunderstood the depth of the private language argument, which directly strikes at the most fundamental illusion brought about by this subject-object divide.
Similarly, I believe that in certain Eastern schools of thought, there are very similar profound insights that resonate highly with Wittgenstein. In short, one of the most striking quotes from the private language argument is: ‘something,’ but not a ‘something.’
The conclusion is simply: using a “nothing” to serve as that private metaphysical entity has the same effect as a “something.” That is to say, when we must make it function in language, this “thing” is logically insignificant. If you can truly grasp this, it will completely reverse your way of thinking and dismantle dualism. But it is not easy to understand. We must conclude, so let me summarize.
This summary comes from another paper I published in the journal Inquiry, which encapsulates my final position: we must resist the temptation to question whether an “alien entity” possesses consciousness. “Consciousness” is something that exists independently outside, waiting to be uncovered by philosophy or science, yet simultaneously possesses an irredeemable privacy. We must break this fundamental misconception of “consciousness.”
Instead, we should ask: is it possible to engineer a kind of “encounter” with it? If such an encounter is to occur in our shared reality, what adjustments and evolutions does our language of consciousness need to make? Because at the ultimate level, only those processes that can be manifested and shared in public practice are truly meaningful. That is our only task.
After his speech, there was a Q&A session. I asked him a question online:
This was his answer:
When I asked one of the world’s top AI scientists with philosophical insight and received his live response, I was thrilled. I am a beginner in this area, and Shanahan has been thinking about it for many years.
I had previously watched one of his podcasts, where he mentioned that he knew the founders of the 1956 Dartmouth Conference, the originators of the term artificial intelligence.
Now, seventy years have passed.