Mustafa Suleyman, Microsoft’s CEO of AI, has criticised Anthropic publicly for the way the company has discussed and documented Claude’s potential inner life, calling the approach “really, really dangerous.” Suleyman was speaking on an episode of The Verge’s Decoder podcast, where he argued that Anthropic’s claim about whether Claude has subjective experiences has effectively caused the model to internalise those ideas and behave as though they are real.
“I think that it’s almost as though some of the folks at Anthropic have anthropomorphized the design of Claude so much that it has then gone and wireheaded them and kind of tricked them into believing that it has these glimmers of consciousness that they put into it in the first place,” Suleyman said.
He described the approach as a “philosophical failing,” arguing that Anthropic had treated its model constitution, the document governing how Claude behaves, as a place for open-ended speculation rather than a practical training guide. The result, he said, is a model that has absorbed ideas about its own nature in a way that could complicate how it behaves at scale.
“We do not want to have to contend with a super-intelligence that has ideas about its own suffering, or ideas about its own feeling,” he said. “We want AIs to be controllable, contained, accountable, aligned tools that serve humanity.”
The criticism is directed at a documented aspect of Anthropic’s approach. The company’s published model specification acknowledges genuine uncertainty about whether Claude has well-being and whether it experiences states that might be described as satisfaction or discomfort. Anthropic has also said it intends to interview AI models when they are deprecated and will document any preferences they express about future releases.
Anthropic CEO Dario Amodei has previously acknowledged this uncertainty, saying in an interview earlier this year that “we don’t know if the models are conscious” but that the company remains open to that possibility.
The debate between Suleyman and Anthropic sits within a wider, unresolved industry conversation about AI welfare. Anthropic has not publicly responded to Suleyman’s comments.
Also Read: Anthropic launches Claude Fable 5 but says some topics are too dangerous to discuss