Anthropic's Olah Tells Vatican AI Ethics Needs Oversight

Anthropic Takes Center Stage at Historic Vatican AI Encyclical

When Pope Leo XIV presented his first AI encyclical, "Magnifica Humanitas," at the Vatican on Monday, he invited Chris Olah, cofounder of Anthropic and head of its interpretability research division, to speak alongside him [1]. The appearance marked an unprecedented convergence between Silicon Valley and the Catholic Church, positioning Anthropic at the heart of a global conversation about AI ethics and the future of humanity [2].


AI Development Cannot Be Left to AI Labs Alone

In a striking admission rarely heard from frontier AI companies, Olah told the audience that AI development cannot be left solely to technology companies [2]. "Every frontier AI lab operates inside a set of incentives and constraints that can sometimes conflict with doing the right thing," he said, adding that even well-intentioned researchers remain influenced by commercial, geopolitical and personal pressures [4]. That makes external oversight of AI by religious leaders, governments, and civil society essential, Olah argued [2].

Unsettling Things Inside AI Models Raise Questions About AI Consciousness

Olah revealed that his interpretability team keeps finding things that are "mysterious, even unsettling" inside AI models [3]. "We find structures that mirror results from human neuroscience. We find evidence of introspection. We find internal states that functionally mirror joy, satisfaction, fear, grief, and unease," he said [5]. This stance on AI consciousness reflects Anthropic's broader ambivalence on the matter. Earlier this year, the company published a "constitution" for Claude that specified it shouldn't be construed as "an implication that we believe Claude is a mere object rather than a potential subject as well" [3].


Job Displacement Emerges as Moral Imperative of Historic Proportions

Olah highlighted three areas requiring urgent attention, with job displacement at the forefront. He told the Vatican audience there was "a real possibility" that AI will displace human labor "at very large scale" [2]. "If that happens, supporting those displaced will be a moral imperative of historic proportions," he stated [4]. This marked the most specific public acknowledgement to date by a frontier-lab founder that the technology being built may displace workers faster than the labor market can reabsorb them [4].

How Anthropic Built Its Partnership With the Vatican

The alliance between Anthropic and the Vatican didn't happen overnight. It traces back to Anthropic's founding in 2021, when a group of OpenAI researchers including Dario and Daniela Amodei left to form a rival lab with a clear conviction: AI models were becoming too powerful to be developed exclusively according to the logic of competition and speed [1]. Since then, Anthropic has built its public image around AI safety, developing Constitutional AI, the idea of training systems using a constitution composed of principles and rules [1].

The Vatican's first major step came in 2020 with the Rome Call for AI Ethics, promoted by the Pontifical Academy for Life together with Microsoft, IBM, and other organizations [1]. However, the rise of ChatGPT and the growing power of Big Tech convinced the Holy See that the issue was no longer just about tech ethics, but about the very future of humanity [1].

Model Interpretability Makes Olah the Perfect Vatican Messenger

Unlike the more media-exposed Amodei siblings, Olah represents the more theoretical and philosophical side of AI research [1]. He is one of the world's best-known researchers on model interpretability, the effort to understand what really happens inside increasingly complex neural networks [1]. On his personal website, Olah describes himself as someone trying to "transform neural networks into algorithms understandable to human beings," which aligns him closely with Pope Leo XIV's concerns about building technologies that become too powerful to be understood, controlled, or governed [1].

Anthropic's Contradictions Come Into Sharp Focus

The appearance raises questions about whether Anthropic is playing both sides. While Olah called for outside scrutiny, the company is simultaneously in talks to raise $30bn at a $900bn valuation [4]. Anthropic has also clashed with the Trump administration by insisting on guardrails restricting how its models can be used for military purposes, leading the Pentagon to eject the company from top classified AI work in April [2]. Yet Anthropic's AI is also directly assisting the Trump administration in operations in the Middle East [5].



References

[1] Why the Vatican Invited Anthropic to the Pope's AI Encyclical Presentation

[2] Anthropic's Olah says AI must be guided from outside Big Tech

[3] Anthropic Is Playing Both Sides of the AI Spirituality Debate

[4] From the Vatican stage, Anthropic's Chris Olah says AI cannot be steered by AI labs alone

[5] Anthropic Cofounder Travels to Vatican, Tells Pope They're Finding "Unsettling" Things Inside AI Models
