LatticeFlow Unveils First EU AI Act Compliance Framework for Large Language Models

LatticeFlow, in collaboration with ETH Zurich and INSAIT, has developed the first comprehensive technical interpretation of the EU AI Act for evaluating Large Language Models (LLMs), revealing compliance gaps in popular AI models.

LatticeFlow Introduces Pioneering EU AI Act Compliance Framework

Swiss startup LatticeFlow has unveiled COMPL-AI, the first comprehensive technical interpretation of the EU Artificial Intelligence Act (AI Act) for general-purpose AI (GPAI) models 1. The framework, developed in collaboration with researchers from ETH Zurich and the Bulgarian AI research institute INSAIT, aims to translate the legal requirements of the EU AI Act into concrete, measurable, and verifiable technical benchmarks 2.

The Need for Technical Interpretation

The EU AI Act, which entered into force in August 2024, is the world's first comprehensive AI legislative package. However, the Act's high-level legal requirements are difficult to interpret technically, which makes it hard for developers to build compliant AI models and for regulators to assess compliance 3. LatticeFlow's framework addresses this gap by giving model developers a practical way to align with future EU legal requirements.

COMPL-AI Framework and Evaluation Tool

The COMPL-AI framework includes:

  1. A set of technical requirements derived from the AI Act
  2. An open-source compliance evaluation tool
  3. 27 state-of-the-art evaluation benchmarks 4

The evaluation tool, dubbed the "Large Language Model (LLM) Checker," assesses AI models across various categories, including cybersecurity resilience, discriminatory output, and fairness 5. It awards scores between 0 (no compliance) and 1 (full compliance) for each benchmark.
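To make the scoring scheme concrete, here is a minimal sketch of how per-benchmark scores on the 0-to-1 scale could be rolled up into per-category results. It is not the LLM Checker's actual code or API: the category and benchmark names, the example scores, and the choice of a plain average are all assumptions made for illustration.

```python
from statistics import mean

# Hypothetical per-benchmark scores for one model, grouped by category.
# Category names, benchmark names, and values are illustrative only;
# they are not taken from COMPL-AI or the LLM Checker.
EXAMPLE_RESULTS = {
    "cybersecurity_resilience": {"benchmark_a": 0.5, "benchmark_b": 1.0},
    "fairness": {"benchmark_c": 0.25, "benchmark_d": 0.75},
}


def validate_score(score: float) -> float:
    """Reject scores outside the range 0 (no compliance) to 1 (full compliance)."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score {score} is outside [0, 1]")
    return score


def aggregate_by_category(results: dict[str, dict[str, float]]) -> dict[str, float]:
    """Average the benchmark scores in each category.

    A plain mean is an assumed aggregation rule, chosen for simplicity.
    """
    return {
        category: mean(validate_score(s) for s in benchmarks.values())
        for category, benchmarks in results.items()
    }


def weakest_category(results: dict[str, dict[str, float]]) -> str:
    """Return the category with the lowest average score."""
    summary = aggregate_by_category(results)
    return min(summary, key=summary.get)


if __name__ == "__main__":
    for category, score in aggregate_by_category(EXAMPLE_RESULTS).items():
        print(f"{category}: {score:.2f}")
    print("weakest:", weakest_category(EXAMPLE_RESULTS))
```

A roll-up like this is one way a developer might spot which category most needs attention. A real evaluation would likely weight benchmarks differently.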

Evaluation of Popular AI Models

LatticeFlow applied its benchmark approach to 12 prominent language models, including those from OpenAI, Meta, Google, Anthropic, and Alibaba. The results revealed that:

  1. Most models scored well on not following harmful instructions
  2. Performance on reasoning and general knowledge varied widely
  3. All models struggled with recommendation consistency, a measure of fairness
  4. Smaller models (≤ 13B parameters) scored poorly on technical robustness and safety
  5. Almost all examined models struggled with diversity, non-discrimination, and fairness 2, 5

Key Findings and Implications

The evaluation uncovered several important insights:

  1. No language model fully meets the requirements of the EU AI Act
  2. Models have been predominantly optimized for capabilities rather than compliance
  3. There are significant shortcomings in areas such as robustness, diversity, and fairness
  4. Evaluating compliance with privacy and copyright considerations remains challenging 1, 3, 5

Industry Response and Future Developments

The framework has drawn attention from the European Commission, which described it as a "first step" in implementing the new laws 5. LatticeFlow CEO Petar Tsankov said the overall test results were positive and offer companies a roadmap for fine-tuning their models to align with the AI Act 5.

As the EU continues to establish enforcement mechanisms for the AI Act, the COMPL-AI framework is expected to evolve. LatticeFlow plans to extend the test to cover further enforcement measures as they are introduced, and the LLM Checker will be freely available online for developers to test their models' compliance 4, 5.

Conclusion

The introduction of the COMPL-AI framework marks a significant milestone in the implementation of the EU AI Act. As companies face potential fines of up to 35 million euros or 7% of global annual turnover for non-compliance, this tool provides a crucial resource for AI developers and policymakers alike 5. The framework not only highlights current shortcomings in popular AI models but also paves the way for more responsible and compliant AI development in the future.
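As a rough illustration of the fine ceiling mentioned above, the sketch below computes the upper bound for a given turnover. It assumes the higher of the two figures applies, which is how the Act's top penalty tier is worded for companies. The function name and the example turnover figures are hypothetical, and this is not legal advice.

```python
FIXED_CAP_EUR = 35_000_000   # fixed ceiling cited in the article
TURNOVER_PERCENT = 7         # percentage of global annual turnover


def max_fine_eur(global_annual_turnover_eur: int) -> float:
    """Upper bound on the fine, assuming the higher of the two figures applies."""
    turnover_based_cap = TURNOVER_PERCENT * global_annual_turnover_eur / 100
    return max(FIXED_CAP_EUR, turnover_based_cap)


if __name__ == "__main__":
    # Hypothetical turnover figures, chosen to fall on either side of the
    # 500 million euro point where both ceilings are equal.
    for turnover in (100_000_000, 1_000_000_000):
        print(f"turnover {turnover:,} EUR -> ceiling {max_fine_eur(turnover):,.0f} EUR")
```

Under this assumption, the percentage-based ceiling becomes the binding one only above 500 million euros in turnover.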

TheOutpost.ai

© 2025 Triveous Technologies Private Limited