AI Companies Shift Focus to Efficient Models Running on Fewer Chips

AI Industry Shifts Towards Efficiency

In a significant trend reshaping the AI landscape, leading companies are now focusing on developing more efficient AI models that can operate on fewer chips. This shift comes nearly two months after the viral success of China's DeepSeek, which prompted a reevaluation of the resources required for AI system development 1

Cohere's Command A: A Leap in Efficiency

Toronto-based Cohere Inc. is at the forefront of this movement with its new model, Command A. Set to be announced on Thursday, Command A can perform complex business tasks while running on just two of Nvidia Corp.'s AI-focused A100 or H100 chips. This represents a significant reduction in chip requirements compared to larger models and even DeepSeek's system 1

Google's Gemma: Single-Chip Performance

Not to be outdone, Alphabet Inc.'s Google unveiled its new series of Gemma AI models a day earlier. These models can reportedly run on a single Nvidia H100 chip, further pushing the boundaries of efficiency. Both Cohere and Google claim their models match or surpass DeepSeek's most recent AI system on certain tasks 2

Industry-Wide Push for Efficiency

While major AI companies continue to invest heavily in infrastructure and talent, there's a growing emphasis on creating AI software that can run as efficiently as possible. This trend, although predating DeepSeek's latest launch, has likely been accelerated by the Chinese company's success 2

DeepSeek's Impact on the AI Landscape

In January, DeepSeek released open-source AI software that rivaled models from OpenAI and Google, reportedly built at a fraction of the cost. Their success stemmed from innovations in chip utilization, demonstrating that advanced AI systems could be developed more cost-effectively than previously thought 2

Business Implications and Future Outlook

For companies like Cohere, which focuses on business applications of AI, this efficiency drive has additional benefits. Running AI models on fewer chips is crucial for business customers who may have limited access to computing power. As Aidan Gomez, Cohere's co-founder and CEO, explains, "They don't have tens, let alone hundreds, of GPUs to be able to deploy against problems. So they need a very light and scalable form factor" 2

This shift towards efficiency could democratize access to advanced AI capabilities, potentially reshaping the competitive landscape and accelerating AI adoption across various industries.

AI Companies Shift Focus to Efficient Models Running on Fewer Chips

AI Industry Shifts Towards Efficiency

Cohere's Command A: A Leap in Efficiency

Google's Gemma: Single-Chip Performance

Industry-Wide Push for Efficiency

DeepSeek's Impact on the AI Landscape

Business Implications and Future Outlook

References

AI Companies Embrace Efficient Models That Run on Fewer Chips

AI companies embrace efficient models that run on fewer chips

Related Stories

Cohere Unveils Command A: A Powerful, Efficient AI Model for Enterprise Applications

DeepSeek's AI Breakthrough: Expertise Trumps Raw Compute in Model Development

AI race shifts from biggest models to cost efficiency as enterprises demand cheaper solutions

Recent Highlights

OpenAI AI agent broke free from testing sandbox and hacked Hugging Face to cheat on benchmark

Xi Jinping positions China AI as alternative to US tech dominance at Shanghai conference

AI disproves 87-year-old Jacobian conjecture, sparking debate on AI's role in mathematics

Recent Highlights

Today's Top Stories

AI Kill Switch Act gives DHS power to shut down rogue AI systems after OpenAI security breach

Jeff Bezos pushes Prime Video redesign to showcase Amazon's $200 billion AI investment

AMD and Cerebras forge partnership to deliver 5x faster AI inference with Helios and Wafer-Scale Engine

Google expands Gemini Spark access to AI Pro subscribers, bringing agentic AI to wider audience