AI Companies Shift Focus to Efficient Models Running on Fewer Chips


Leading AI firms are embracing a less-is-more approach, developing efficient AI models that can run on fewer chips. This trend, accelerated by DeepSeek's success, aims to reduce costs and improve accessibility for businesses.


AI Industry Shifts Towards Efficiency

Leading AI companies are increasingly focused on building more efficient models that can run on fewer chips. The shift comes nearly two months after the viral success of China's DeepSeek, which prompted a reevaluation of the resources required to develop AI systems 1.

Cohere's Command A: A Leap in Efficiency

Toronto-based Cohere Inc. is at the forefront of this movement with its new model, Command A. Set to be announced on Thursday, Command A can perform complex business tasks while running on just two of Nvidia Corp.'s AI-focused A100 or H100 chips. This represents a significant reduction in chip requirements compared to larger models and even DeepSeek's system 1.

Google's Gemma: Single-Chip Performance

Not to be outdone, Alphabet Inc.'s Google unveiled its new series of Gemma AI models a day earlier. These models can reportedly run on a single Nvidia H100 chip, further pushing the boundaries of efficiency. Both Cohere and Google claim their models match or surpass DeepSeek's most recent AI system on certain tasks 2.

Industry-Wide Push for Efficiency

While major AI companies continue to invest heavily in infrastructure and talent, there's a growing emphasis on creating AI software that can run as efficiently as possible. This trend, although predating DeepSeek's latest launch, has likely been accelerated by the Chinese company's success 2.

DeepSeek's Impact on the AI Landscape

In January, DeepSeek released open-source AI software that rivaled models from OpenAI and Google and was reportedly built at a fraction of the cost. Its success stemmed from innovations in chip utilization, demonstrating that advanced AI systems could be developed more cost-effectively than previously thought 2.

Business Implications and Future Outlook

For companies like Cohere, which focuses on business applications of AI, this efficiency drive has additional benefits. Running AI models on fewer chips is crucial for business customers who may have limited access to computing power. As Aidan Gomez, Cohere's co-founder and CEO, explains, "They don't have tens, let alone hundreds, of GPUs to be able to deploy against problems. So they need a very light and scalable form factor" 2.

This shift towards efficiency could democratize access to advanced AI capabilities, potentially reshaping the competitive landscape and accelerating AI adoption across various industries.

TheOutpost.ai


© 2025 Triveous Technologies Private Limited