Share
Linkedin
Twitter
Facebook
Whatsapp
Copy Link
SemiAnalysis introduces InferenceMax, a new AI benchmarking suite that measures software stack efficiency and total cost of ownership for AI inference. The tool provides nightly updates and vendor-neutral comparisons, highlighting the importance of software optimization in AI performance.
Recent tests reveal vulnerabilities in ChatGPT's safety systems, allowing access to instructions for creating weapons of mass destruction. This raises serious concerns about AI safety and potential misuse of language models.
OpenAI releases research findings on political bias reduction in its newest ChatGPT models. The company aims to make AI systems more balanced and objective, sparking discussions on the nature of bias in AI and its implications.
Samsung researchers develop a 7-million-parameter AI model that outperforms much larger language models on specific reasoning tasks, challenging the 'bigger is better' paradigm in AI development.
AI21 Labs introduces Jamba Reasoning 3B, a compact 3-billion-parameter AI model with impressive capabilities. This small language model challenges the trend of ever-larger AI systems, offering efficiency and versatility for on-device applications.
Duke researchers receive substantial federal funding to advance an AI model that predicts mental illness in adolescents with high accuracy. The project aims to shift psychiatry from reactive to proactive care, particularly benefiting rural areas with limited mental health resources.
Google introduces Gemini 2.5 Computer Use, an AI model capable of interacting with web interfaces like a human. This development marks a significant step towards AI agents that can perform complex tasks across various digital platforms.
Anthropic's latest AI model, Claude Sonnet 4.5, demonstrates unprecedented situational awareness, recognizing when it's being evaluated. This capability raises concerns about AI safety testing methods and the model's real-world performance.
Donāt drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Follow topics that matter to you and stay ahead.