LatticeFlow Unveils First EU AI Act Compliance Framework for Large Language Models

LatticeFlow Introduces Pioneering EU AI Act Compliance Framework

In a significant development for the artificial intelligence industry, Swiss startup LatticeFlow has unveiled COMPL-AI, the first comprehensive technical interpretation of the EU Artificial Intelligence Act (AI Act) for General Purpose AI (GPAI) models 1. This groundbreaking framework, developed in collaboration with researchers from ETH Zurich and the Bulgarian AI research institute INSAIT, aims to translate the legal requirements of the EU AI Act into concrete, measurable, and verifiable technical benchmarks 2.

The Need for Technical Interpretation

The EU AI Act, which came into force in August 2024, is the world's first comprehensive AI legislative package. However, the act's high-level legal requirements have been challenging to interpret technically, making it difficult for developers to create compliant AI models and for regulators to assess compliance 3. LatticeFlow's framework addresses this gap by providing a practical approach for model developers to align with future EU legal requirements.

COMPL-AI Framework and Evaluation Tool

The COMPL-AI framework includes:

A set of technical requirements derived from the AI Act
An open-source compliance evaluation tool
27 state-of-the-art evaluation benchmarks 4

The evaluation tool, dubbed the "Large Language Model (LLM) Checker," assesses AI models across various categories, including cybersecurity resilience, discriminatory output, and fairness 5. It awards scores between 0 (no compliance) and 1 (full compliance) for each benchmark.

Evaluation of Popular AI Models

LatticeFlow applied its benchmark approach to 12 prominent language models, including those from OpenAI, Meta, Google, Anthropic, and Alibaba. The results revealed that:

Most models scored well on not following harmful instructions
Performance on reasoning and general knowledge varied widely
All models struggled with recommendation consistency, a measure of fairness
Smaller models (≤ 13B parameters) scored poorly on technical robustness and safety
Almost all examined models struggled with diversity, non-discrimination, and fairness 2 5

Key Findings and Implications

The evaluation uncovered several important insights:

No language model fully meets the requirements of the EU AI Act
Models have been predominantly optimized for capabilities rather than compliance
There are significant shortcomings in areas such as robustness, diversity, and fairness
Evaluating compliance with privacy and copyright considerations remains challenging 1 3 5

Industry Response and Future Developments

The framework has garnered attention from the European Commission, which described it as a "first step" in implementing the new laws 5. LatticeFlow CEO Petar Tsankov emphasized that while the overall test results were positive, they offer companies a roadmap for fine-tuning their models to align with the AI Act 5.

As the EU continues to establish enforcement mechanisms for the AI Act, the COMPL-AI framework is expected to evolve. LatticeFlow plans to extend the test to encompass further enforcement measures as they are introduced, and the LLM Checker will be freely available for developers to test their models' compliance online 4 5.

Conclusion

The introduction of the COMPL-AI framework marks a significant milestone in the implementation of the EU AI Act. As companies face potential fines of up to 35 million euros or 7% of global annual turnover for non-compliance, this tool provides a crucial resource for AI developers and policymakers alike 5. The framework not only highlights current shortcomings in popular AI models but also paves the way for more responsible and compliant AI development in the future.