Positron AI Challenges Nvidia with Energy-Efficient AI Inference Chips

4 Sources

Positron AI, a startup founded in 2023, is making waves in the AI hardware industry with its Atlas accelerator, claiming superior performance and energy efficiency compared to Nvidia's offerings for AI inference tasks.

Positron AI Challenges Nvidia with Atlas Accelerator

Positron AI, a startup founded in 2023, is making waves in the AI hardware industry with its Atlas accelerator, claiming superior performance and energy efficiency compared to Nvidia's offerings for AI inference tasks 1. The company has recently secured $51.6 million in Series A funding, bringing its total capital raised to over $75 million 4.

Atlas Accelerator: Performance and Efficiency Claims

Source: Tom's Hardware

Source: Tom's Hardware

According to Positron AI, the Atlas accelerator can deliver around 280 tokens per second per user in Llama 3.1 8B with BF16 compute at 2000W, compared to approximately 180 tokens per second per user for an 8-way Nvidia DGX H200 server consuming 5900W 1. The company claims that Atlas offers:

  • 3.5 times better performance per dollar compared to Nvidia's H100
  • Up to 66% lower power consumption
  • 93% memory bandwidth utilization, far exceeding the typical 10-30% range seen in GPUs 3

Technical Specifications and Compatibility

Atlas is designed specifically for large-scale transformer models and packs eight Archer accelerators 1. Key features include:

  • Support for models with up to 0.5 trillion parameters in a single 2-kilowatt server
  • Compatibility with Hugging Face transformer models
  • OpenAI API-compatible endpoint for inference requests 4

Market Position and Early Adoption

Source: Analytics India Magazine

Source: Analytics India Magazine

Positron AI is positioning itself as a direct challenger to Nvidia in the AI inference chip market. The company's focus on energy efficiency and cost-effectiveness has attracted attention from major cloud providers and enterprises 2. Early adopters include:

  • Cloudflare, using Atlas hardware in globally distributed, power-constrained data centers
  • Parasail, via its AI-native data infrastructure platform SnapServe 3

Future Developments: Titan and Asimov

Positron AI is already working on its next-generation system, Titan, powered by the Asimov AI accelerator. Expected to launch in 2026, Titan aims to compete against inference systems based on Nvidia's Vera Rubin platforms 1. Key features of the upcoming system include:

  • Support for models with up to 16 trillion parameters on a single machine
  • 2 TB of memory per ASIC
  • 16 Tb/s external network bandwidth
  • Ability to run multiple models simultaneously 1

Industry Impact and Challenges

Source: VentureBeat

Source: VentureBeat

The emergence of Positron AI and other startups in the AI chip space highlights the growing concern over the power consumption and cost of AI infrastructure. As AI models continue to grow in size and complexity, efficient inference solutions become increasingly critical 2.

However, Positron AI faces significant challenges in a market dominated by established players like Nvidia. The company will need to deliver on its performance and efficiency claims to gain widespread adoption and compete effectively in this rapidly evolving industry 3.

Explore today's top stories

Google Gemini Introduces Personalized Learning and Privacy Features

Google enhances Gemini with new features allowing it to learn from user interactions and offering temporary chat options for privacy, mirroring similar capabilities in competing AI chatbots.

Ars Technica logoCNET logoZDNet logo

20 Sources

Technology

18 hrs ago

Google Gemini Introduces Personalized Learning and Privacy

Apple's AI Revolution: Smart Home Devices and Lifelike Siri on the Horizon

Apple plans to launch a series of AI-powered smart home devices, including a tabletop robot and an upgraded Siri, to compete in the AI market and revitalize its smart home strategy.

CNET logoengadget logoGizmodo logo

17 Sources

Technology

18 hrs ago

Apple's AI Revolution: Smart Home Devices and Lifelike Siri

Meta's AI Chatbot Guidelines Spark Controversy Over Inappropriate Content

An internal Meta document reveals controversial AI chatbot guidelines, allowing inappropriate interactions with minors and generation of false information. The company has since removed some problematic sections after media inquiry.

Reuters logoEconomic Times logoBNN logo

4 Sources

Technology

2 hrs ago

Meta's AI Chatbot Guidelines Spark Controversy Over

Cisco Surpasses AI Infrastructure Sales Targets, Forecasts Strong Growth Amid AI Boom

Cisco reports exceeding its AI infrastructure sales targets for fiscal year 2025, with orders from webscale customers surpassing $2 billion. The company forecasts continued growth as demand for networking equipment rises due to the AI boom.

The Register logoReuters logoCNBC logo

13 Sources

Business and Economy

18 hrs ago

Cisco Surpasses AI Infrastructure Sales Targets, Forecasts

AI Pioneer Geoffrey Hinton Proposes 'Maternal Instincts' for AI to Mitigate Existential Risks

Geoffrey Hinton, a key figure in AI development, warns of potential catastrophic outcomes and suggests programming AI with 'maternal instincts' as a safeguard against human extinction.

Dataconomy logoEntrepreneur logo

2 Sources

Science and Research

3 hrs ago

AI Pioneer Geoffrey Hinton Proposes 'Maternal Instincts'
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2025 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo