xAI Launches Grok 4 Fast: A More Efficient and Cost-Effective AI Model

Reviewed byNidhi Govil

3 Sources

Share

xAI unveils Grok 4 Fast, a new AI model that offers similar performance to its predecessor while using 40% fewer tokens and reducing costs by 98%. The model features a unified architecture for both reasoning and non-reasoning tasks, making it highly flexible for various applications.

xAI Unveils Grok 4 Fast: A More Efficient and Cost-Effective AI Model

xAI, the artificial intelligence company founded by Elon Musk, has announced the release of Grok 4 Fast, a new version of its flagship AI model that promises improved efficiency and cost-effectiveness. This latest iteration comes just months after the release of Grok 4 and aims to address the growing demand for more accessible and powerful AI solutions

1

.

Source: engadget

Source: engadget

Performance and Efficiency Gains

Grok 4 Fast boasts similar performance to its predecessor while using 40% fewer "thinking tokens" on average. This reduction in token usage translates to a significant 98% decrease in price to achieve comparable performance on frontier benchmarks

1

2

.

The model has demonstrated impressive results on various benchmarks:

  • AIME 2025 math: 92% (compared to Grok 4's 91.7%)
  • GPQA Diamond: 85.7% (compared to Grok 4's 87.5%)
  • X Bench Deepsearch: 74% (up from Grok 4's 66%)
  • SimpleQA: 95%
  • HMMT 2025: 93.3%

    2

    3

Unified Architecture and Flexibility

One of the key innovations in Grok 4 Fast is its unified architecture, which combines non-reasoning and reasoning abilities into a single system. This approach eliminates the need for separate frameworks and allows for seamless transitions between handling complex requests and providing quick responses

1

2

.

Pricing and Availability

xAI has made Grok 4 Fast available through multiple channels, including web, iOS, and Android platforms, as well as via API access. The company offers two main SKUs for the model:

  1. "grok-4-fast-reasoning": $0.20 input / $0.60 output per million tokens
  2. "grok-4-fast-non-reasoning": $0.10 input / $0.30 output per million tokens

Both versions support a 2 million-token context window, which is significantly larger than most commercial models. This pricing structure undercuts other high-performance models and allows for more cost-effective deployment of heavy workloads such as legal analysis, software engineering, and customer support

2

.

Source: Analytics Insight

Source: Analytics Insight

Implications for the AI Landscape

The release of Grok 4 Fast signals a new frontier in the cost-performance ratio of AI models. Independent evaluators, including Artificial Analysis and Professor Ethan Mollick of the University of Pennsylvania's Wharton School of Business, have placed Grok 4 Fast at the top of efficiency charts

2

.

As the AI industry continues to evolve rapidly, xAI's latest offering presents a compelling option for enterprises looking to leverage powerful AI capabilities while managing costs. However, the competitive landscape remains dynamic, with other major players like Google and Anthropic expected to release updates to their respective models in the near future

1

2

.

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo