2 Sources
2 Sources
[1]
xAI debuts a faster and more cost-effective version of Grok 4
A few months after the release of Grok 4 and an extremely problematic antisemitic meltdown of its chatbot, xAI is already trying to move on with its latest AI model. Elon Musk's xAI announced the release of Grok 4 Fast, a faster, more efficient reasoning model compared to its recent predecessor. According to xAI, Grok 4 Fast offers similar performance to Grok 4 while using 40 percent fewer thinking tokens on average. Along with faster results, xAI said Grok 4 Fast "results in a 98% reduction in price to achieve the same performance on frontier benchmarks as Grok 4," whether it's handling tasks that involve writing code or just browsing the web for quick responses. Similar to OpenAI's GPT-5 that alternates between a smart, efficient model and a deeper reasoning model, xAI's latest update includes a unified architecture that can transition between handling complex requests with its "reasoning" model and quick responses through its "non-reasoning model." In tests on LMArena, a platform that pits AI models against each other and provides side-by-side comparisons, Grok 4 Fast ranks first in search-related tasks and eighth in text-related tasks. xAI made Grok 4 Fast available for all users, including the free ones, on web, iOS and Android. However, with how competitive the LLM race is getting, it's only a matter of time before Google releases the next-gen version of Gemini or Anthropic updates the Claude Opus model beyond the recently released 4.1 version.
[2]
Elon Musk's xAI Launches Grok 4 Fast With 2M Token Limit and 40% Lower Costs
xAI Launches Grok 4 Fast, Cutting Token Use by 40% While Matching Grok 4 Accuracy. Available Across Web, Apps, and APIs with Flexible Pricing Elon Musk's xAI has launched a new AI model, Grok 4 Fast. The model aims to keep costs low and maintain competitive accuracy by combining non-reasoning and reasoning abilities into a single system, thereby eliminating the need for separate frameworks. According to , Grok 4 Fast uses approximately 40% of the number of thinking tokens used by Grok 4. The performance levels are benchmarked with fewer tokens, yet the results are close to Grok 4. Based on the objective exploration done by Artificial Analysis, Grok 4 Fast could run with 98% less money while maintaining the same performance to improve its cost-performance ratio. The in AIME 2025, HMMT 2025, and the GPQA Diamond test gave scores of 85.7%, 92%, and 93.3%, respectively. Additionally, the model scored 95% on SimpleQA and 74% on X Bench Deepsearch, meaning that it can be applied to various tasks, including code execution and sophisticated search.
Share
Share
Copy Link
Elon Musk's xAI unveils Grok 4 Fast, a new AI model that uses 40% fewer tokens while maintaining performance similar to Grok 4. The model offers significant cost reductions and improved efficiency across various tasks.
Elon Musk's artificial intelligence company, xAI, has announced the release of Grok 4 Fast, a new AI model that promises improved efficiency and cost-effectiveness compared to its predecessor, Grok 4. This latest development comes just a few months after the release of Grok 4 and follows a controversial incident involving antisemitic content generated by the chatbot
1
.Source: Analytics Insight
Grok 4 Fast boasts significant improvements in both performance and resource utilization:
Token Efficiency: The new model uses approximately 40% fewer "thinking tokens" on average while maintaining performance levels similar to Grok 4
1
2
.Cost Reduction: xAI claims a remarkable 98% reduction in price to achieve the same performance on frontier benchmarks as Grok 4, whether for coding tasks or quick web browsing responses
1
.Unified Architecture: Grok 4 Fast features a unified architecture that can switch between a "reasoning" model for complex requests and a "non-reasoning model" for quick responses, similar to OpenAI's GPT-5
1
.Source: engadget
The new model has demonstrated impressive results in various benchmarks:
LMArena Rankings: Grok 4 Fast ranks first in search-related tasks and eighth in text-related tasks on the LMArena platform, which compares AI models side-by-side
1
.Specialized Tests: The model achieved scores of 85.7% in AIME 2025, 92% in HMMT 2025, and 93.3% in the GPQA Diamond test
2
.Additional Benchmarks: Grok 4 Fast scored 95% on SimpleQA and 74% on X Bench Deepsearch, demonstrating its versatility in various tasks, including code execution and sophisticated search
2
.Related Stories
xAI has made Grok 4 Fast widely accessible:
Platforms: The model is available on web, iOS, and Android platforms
1
.User Access: Both free and paid users can access Grok 4 Fast .
API Integration: The model is also available through APIs with flexible pricing options
2
.The release of Grok 4 Fast comes at a time of intense competition in the AI industry:
Rival Developments: With the rapid pace of advancements in large language models, it's anticipated that competitors like Google and Anthropic will soon release updated versions of their models, such as the next-gen Gemini or an upgrade to Claude Opus beyond version 4.1 .
Market Positioning: The improvements in efficiency and cost-effectiveness position Grok 4 Fast as a strong contender in the increasingly competitive AI model landscape.
Summarized by
Navi
[2]