Anthropic releases Bloom, an open-source tool to test AI models for bias and dangerous behaviors
Anthropic unveiled Bloom, an open-source agentic framework that automates the testing of AI behavior in frontier models. The tool evaluates problematic traits like sycophancy, self-preservation, and bias by generating custom scenarios and analyzing responses. Benchmark results across 16 models from Anthropic, OpenAI, Google, and DeepSeek reveal alignment challenges that could shape AI safety standards.