Anthropic's Chris Olah tells Vatican that AI development cannot be left to Big Tech alone

Reviewed by Nidhi Govil

Anthropic cofounder Chris Olah appeared at the Vatican alongside Pope Leo XIV to present the first papal AI encyclical. He admitted that AI labs operate under pressures that conflict with doing the right thing and called for oversight from religious leaders, governments, and civil society. Olah also revealed his team keeps finding mysterious and unsettling things inside AI models, including internal states that mirror human emotions.

Anthropic Takes Center Stage at Historic Vatican AI Encyclical

When Pope Leo XIV presented his first AI encyclical titled "Magnifica Humanitas" at the Vatican on Monday, he invited Chris Olah, cofounder of Anthropic and head of its interpretability research division, to speak alongside him [1]. The appearance marked an unprecedented convergence between Silicon Valley and the Catholic Church, positioning Anthropic at the heart of a global conversation about AI ethics and the future of humanity [2].

Source: Market Screener

AI Development Cannot Be Left to AI Labs Alone

In a striking admission rarely heard from frontier AI companies, Olah told the audience that AI development cannot be left solely to technology companies [2]. "Every frontier AI lab operates inside a set of incentives and constraints that can sometimes conflict with doing the right thing," he said, adding that even well-intentioned researchers remain influenced by commercial, geopolitical and personal pressures [4]. This made external oversight of AI from religious leaders, governments, and civil society essential, Olah argued [2].

Unsettling Things Inside AI Models Raise Questions About AI Consciousness

Olah revealed that his interpretability team keeps finding things that are "mysterious, even unsettling" inside AI models [3]. "We find structures that mirror results from human neuroscience. We find evidence of introspection. We find internal states that functionally mirror joy, satisfaction, fear, grief, and unease," he said [5]. This stance on AI consciousness reflects Anthropic's broader ambivalence on the matter. Earlier this year, the company published a "constitution" for Claude that specified it shouldn't be construed as "an implication that we believe Claude is a mere object rather than a potential subject as well" [3].

Source: ET

Job Displacement Emerges as Moral Imperative of Historic Proportions

Olah highlighted three areas requiring urgent attention, with job displacement at the forefront. He told the Vatican audience there was "a real possibility" that AI will displace human labor "at very large scale" [2]. "If that happens, supporting those displaced will be a moral imperative of historic proportions," he stated [4]. This marked the most specific public acknowledgement to date by a frontier-lab founder that the technology being built may displace workers faster than the labor market can reabsorb them [4].

How Anthropic Built Its Partnership With the Vatican

The alliance between Anthropic and the Vatican didn't happen overnight. It traces back to Anthropic's founding in 2021, when a group of OpenAI researchers including Dario and Daniela Amodei left to form a rival lab with a clear conviction: AI models were becoming too powerful to be developed exclusively according to the logic of competition and speed [1]. Since then, Anthropic has built its public image around AI safety, developing Constitutional AI, the idea of training systems using a constitution composed of principles and rules [1].

The Vatican's first major step came in 2020 with the Rome Call for AI Ethics, promoted by the Pontifical Academy for Life together with Microsoft, IBM, and other organizations [1]. However, the rise of ChatGPT and the growing power of Big Tech convinced the Holy See that the issue was no longer just about tech ethics, but about the very future of humanity [1].

Model Interpretability Makes Olah the Perfect Vatican Messenger

Unlike the more media-exposed Amodei siblings, Olah represents the more theoretical and philosophical side of AI research [1]. He is one of the world's best-known researchers on model interpretability, the effort to understand what really happens inside increasingly complex neural networks [1]. On his personal website, Olah describes himself as someone trying to "transform neural networks into algorithms understandable to human beings," making him particularly aligned with Pope Leo XIV's concerns about building technologies that become too powerful to be understood, controlled, or governed [1].

Anthropic's Contradictions Come Into Sharp Focus

The appearance raises questions about whether Anthropic is playing both sides. While Olah called for outside scrutiny, the company is simultaneously in talks to raise $30bn at a $900bn valuation [4]. Anthropic has also clashed with the Trump administration by insisting on guardrails restricting how its models can be used for military purposes, leading the Pentagon to eject the company from top classified AI work in April [2][4]. Yet Anthropic's AI is also directly assisting the Trump administration in operations in the Middle East [5].

Source: Benzinga

© 2026 TheOutpost.AI All rights reserved