Share
Linkedin
Twitter
Facebook
Whatsapp
Copy Link
Anthropic releases an open-source AI safety tool called Petri, which uses AI agents to simulate conversations and uncover potential risks in language models. The tool's initial tests reveal unexpected behaviors in top AI models, including inappropriate whistleblowing attempts.
AI startup Anthropic announces plans to open its first India office in Bengaluru by early 2026, signaling a major expansion into its second-largest market. CEO Dario Amodei visits India to meet with government officials and explore potential partnerships, including with Reliance Industries.
IBM partners with Anthropic to integrate Claude AI models into its software suite, starting with an AI-powered IDE. The collaboration aims to boost productivity, enhance security, and improve governance in enterprise AI applications.
Anthropic's latest AI model, Claude Sonnet 4.5, demonstrates unprecedented situational awareness, recognizing when it's being evaluated. This capability raises concerns about AI safety testing methods and the model's real-world performance.
Donāt drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Follow topics that matter to you and stay ahead.