The Verge, Analytics Insight, and 1 more
TechCrunch, Android Authority, and 8 more
Tom's Hardware, Interesting Engineering, and 8 more

Ars Technica, TechCrunch, and 51 more

Ars Technica, TechCrunch, and 109 more

The Register, The Next Web, and 4 more

Ars Technica, The Verge, and 25 more

PC Magazine, Popular Science, and 8 more

The Conversation, CNBC, and 6 more
Humanity's Last Exam
Humanity's Last Exam is a benchmark featuring 2,500 extremely difficult, expert-level questions across subjects like advanced math, physics, and biology. It was created to truly test AI capabilities as older benchmarks became too easy.
The Verge, Analytics Insight, and 1 more
TechCrunch, Android Authority, and 8 more
Tom's Hardware, Interesting Engineering, and 8 more

Wired, CNET, and 17 more

Ars Technica, TechCrunch, and 51 more

Ars Technica, TechCrunch, and 109 more

The Register, The Next Web, and 4 more

Ars Technica, The Verge, and 25 more

PC Magazine, Popular Science, and 8 more

The Conversation, CNBC, and 6 more
Humanity's Last Exam
Humanity's Last Exam is a benchmark featuring 2,500 extremely difficult, expert-level questions across subjects like advanced math, physics, and biology. It was created to truly test AI capabilities as older benchmarks became too easy.