OpenAI Releases Safety Scores for GPT-4: Medium Risk Identified in Certain Areas

2 Sources

OpenAI has published safety scores for its latest AI model, GPT-4, identifying medium-level risks in areas such as privacy violations and copyright infringement. The company aims to increase transparency and address potential concerns about AI safety.

News article

OpenAI's Transparency Initiative

In a significant move towards transparency, OpenAI, the creator of ChatGPT, has released safety scores for its latest flagship model, GPT-4. This initiative aims to provide a clearer understanding of the potential risks associated with the advanced AI system 1.

Medium Risk Areas Identified

According to the safety scores, GPT-4 poses a "medium" level of risk in several key areas:

  1. Privacy violations
  2. Copyright infringement
  3. Explicit content generation
  4. Regulated or illegal products

These assessments indicate that while the model has improved safety features compared to its predecessors, there are still potential concerns that need to be addressed 2.

Low Risk Categories

OpenAI's evaluation also revealed areas where GPT-4 presents a lower risk:

  1. Hate speech
  2. Harassment
  3. Self-harm
  4. Malware generation

These findings suggest that the model has made significant strides in mitigating certain harmful outputs 2.

Implications for AI Safety

The release of these safety scores marks an important step in the ongoing dialogue about AI safety and ethics. By providing this information, OpenAI is not only demonstrating its commitment to responsible AI development but also inviting scrutiny and discussion from the wider tech community and the public 1.

Continuous Improvement and Monitoring

OpenAI emphasizes that these scores are not static and may change over time. The company is committed to ongoing monitoring and improvement of its models. This approach aligns with the rapidly evolving nature of AI technology and the need for continuous assessment of potential risks 2.

Industry Impact and Future Directions

The publication of these safety scores by OpenAI could set a precedent for other AI companies to follow suit. As the AI industry continues to grow and evolve, transparency about potential risks and safety measures is likely to become increasingly important. This move by OpenAI may encourage a broader industry-wide commitment to responsible AI development and deployment 1 2.

Explore today's top stories

Apple's AI Strategy: Subtle Integration Over Flashy Features

Apple's approach to AI focuses on seamless integration into existing apps and services, prioritizing user experience and trust over flashy features. This strategy aligns with consumer preferences and addresses concerns about AI reliability and privacy.

ZDNet logoPC Magazine logoBloomberg Business logo

19 Sources

Technology

20 hrs ago

Apple's AI Strategy: Subtle Integration Over Flashy Features

Apple Research Challenges AI Reasoning Models' Capabilities

Apple researchers have published a study questioning the true reasoning capabilities of advanced AI models, revealing fundamental limitations in their problem-solving abilities as complexity increases.

Tom's Hardware logoThe Register logoLive Science logo

13 Sources

Science and Research

12 hrs ago

Apple Research Challenges AI Reasoning Models' Capabilities

Getty Images vs. Stability AI: Landmark Copyright Trial Begins in London

A high-stakes legal battle between Getty Images and Stability AI commences in London's High Court, marking the first major copyright trial in the generative AI industry. The case centers on the alleged unauthorized use of Getty's images to train Stability AI's Stable Diffusion model.

Reuters logoAP NEWS logoThe Guardian logo

8 Sources

Policy and Regulation

12 hrs ago

Getty Images vs. Stability AI: Landmark Copyright Trial

Microsoft and Asus Unveil ROG Xbox Ally Handhelds: A New Era of Portable Gaming

Microsoft partners with Asus to launch two new handheld gaming devices, the ROG Xbox Ally and ROG Xbox Ally X, running Windows 11 and offering access to multiple game stores and Xbox services.

CNET logoPC Magazine logoTom's Hardware logo

16 Sources

Technology

20 hrs ago

Microsoft and Asus Unveil ROG Xbox Ally Handhelds: A New

Apple Unveils Foundation Models Framework, Empowering Developers with On-Device AI Capabilities

Apple introduces the Foundation Models framework at WWDC 2025, allowing developers to access on-device AI models for enhanced app functionality while prioritizing privacy and offline capabilities.

TechCrunch logoengadget logoAppleInsider logo

5 Sources

Technology

4 hrs ago

Apple Unveils Foundation Models Framework, Empowering
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Twitter logo
Instagram logo
LinkedIn logo