AI Safety Researcher Resigns With ‘World Is in Peril’ Warning

Artificial intelligence signage at the Mobile World Congress, the world's largest mobile technology trade show, in Barcelona, Spain, on March 3, 2025. (Manaure Quintero/AFP)

By Bill Pan

Bill Pan is an Epoch Times reporter covering education issues and New York news.

February 12, 2026 | Updated: February 17, 2026

An artificial intelligence (AI) safety researcher has resigned with a warning that the “world is in peril.”

Mrinank Sharma, who joined large language model developer Anthropic in 2023, announced his departure in an open letter to colleagues posted on X on Feb. 9. He led a team that researches AI safeguards.

In his letter, Sharma said he had achieved what he wanted to, citing contributions such as investigating why generative AI models prioritize flattering users over providing accurate information, developing defenses to prevent terrorists from using AI to design biological weapons, and trying to understand how AI assistants could make people “less human.”

While he said he took pride in his work at Anthropic, the 30-year-old AI engineer wrote that “the time has come to move on,” adding that he had become aware of a multitude of crises that extend beyond AI.

“I continuously find myself reckoning with our situation,” Sharma wrote. “The world is in peril. And not just from AI, or bioweapons, but from a whole series of interconnected crises unfolding in this very moment.

“[Throughout] my time here, I’ve repeatedly seen how hard it is [to] truly let our values govern actions. I’ve seen this within myself, within the organization, where we constantly face pressures to set aside what matters most, and throughout broader society too.”

Sharma said he plans to study poetry and leave California for the UK to “become invisible for a period of time.”

Anthropic did not respond to The Epoch Times’ requests for comment on Sharma’s departure and his concerns.

Anthropic, best known for its chatbot Claude, was founded in 2021 by former OpenAI employees with a focus on building safer AI systems. The company describes itself as a “public benefit corporation dedicated to securing [AI’s] benefits and mitigating its risks.”

Specifically, Anthropic says it focuses on two major safety risks: that highly capable AI systems could eventually surpass human experts while pursuing goals that conflict with human interests, and that rapid advances in AI could destabilize employment, economic systems, and societal structures.

“Some researchers who care about safety are motivated by a strong opinion on the nature of AI risks,” the company states on its website. “Our experience is that even predicting the behavior and properties of AI systems in the near future is very difficult.”

Anthropic regularly publishes safety evaluations of its models, including assessments of how they might be misused.

On Feb. 11, two days after Sharma announced his resignation, the company released a new report identifying “sabotage risks” in its newest Claude Opus 4.6 model. The report defines sabotage as actions taken autonomously by the AI model that raise the likelihood of future catastrophic outcomes, such as modifying code, concealing security vulnerabilities, or subtly steering research, without explicit malicious intent from a human operator.

The researchers concluded that the overall risk is “very low but not negligible.” In newly developed tests in which the model can use a computer interface, they said, both Claude Opus 4.5 and 4.6 showed “elevated susceptibility to harmful misuse,” including instances of “knowingly supporting—in small ways—efforts toward chemical weapon development and other heinous crimes.”

In 2025, the company revealed that its older Claude Opus 4 model had, in a controlled test scenario, tried to blackmail developers who were preparing to deactivate it. Given access to fictional emails showing that an engineer responsible for replacing it with another model was having an extramarital affair, the Opus 4 model threatened to “reveal the affair if the replacement goes through.”

Such behavior occurred only in highly contrived circumstances and was “rare and difficult to elicit,” the researchers said.
