Curated by THEOUTPOST
On Tue, 20 Aug, 4:05 PM UTC
3 Sources
[1]
AI startup Recogni unveils new computing method to slash costs, power requirements
Recogni introduced a novel computing system named Pareto, which enhances AI chip efficiency by converting complex operations into simpler ones, reducing power consumption. Tested successfully on AI models from major platforms, Recogni aims to collaborate with hardware companies for broader application. This innovation could significantly impact AI technology by making it more efficient and cost-effective.AI chip and software startup Recogni unveiled a novel computing method on Tuesday that could make its chips used to train and run artificial intelligence systems smaller, faster and less expensive to operate. Backed by BMW, Bosch and venture capital firm Mayfield, Recogni develops specialized chips and software to enable AI inferencing - the process of trained AI models making predictions or decisions on new, unseen data. The company said the new patented system, called Pareto, utilizes a logarithmic approach that outperforms existing methods when running large AI models. "It is a huge leap in all of the KPIs (key performance indicators) that influence silicon hardware system design when it comes to AI computing," Recogni's co-founder and VP of AI, Gilles Backhus told Reuters. Current AI models, such as OpenAI's GPT-4 and Google's Gemini, require hundreds of thousands of power-hungry mathematical operations for the simple of prompts on chatbots like ChatGPT. Recogni said that its new system converts these multiplication operations into additions, significantly reducing power consumption while maintaining accuracy. The startup said it has already tested Pareto on AI models developed by Meta Platforms, Stability AI, and others. Recogni, whose first chip was designed, manufactured Taiwan Semiconductor Manufacturing Co's seven nanometer process, said it was working with an unnamed partner to make Pareto more widely available and will announce the partnership in the coming months. "We are speaking to companies that are putting hardware in data centers and offering it to the world to whoever wants to basically rent it ... that's definitely one of the deployment routes that we're considering," Backhus added.
[2]
AI startup Recogni unveils new computing method to slash costs, power requirements
(Reuters) - AI chip and software startup Recogni unveiled a novel computing method on Tuesday that could make its chips used to train and run artificial intelligence systems smaller, faster and less expensive to operate. Backed by BMW, Bosch and venture capital firm Mayfield, Recogni develops specialized chips and software to enable AI inferencing - the process of trained AI models making predictions or decisions on new, unseen data. The company said the new patented system, called Pareto, utilizes a logarithmic approach that outperforms existing methods when running large AI models. "It is a huge leap in all of the KPIs (key performance indicators) that influence silicon hardware system design when it comes to AI computing," Recogni's co-founder and VP of AI, Gilles Backhus told Reuters. Current AI models, such as OpenAI's GPT-4 and Google's Gemini, require hundreds of thousands of power-hungry mathematical operations for the simple of prompts on chatbots like ChatGPT. Recogni said that its new system converts these multiplication operations into additions, significantly reducing power consumption while maintaining accuracy. The startup said it has already tested Pareto on AI models developed by Meta Platforms, Stability AI, and others. Recogni, whose first chip was designed, manufactured Taiwan Semiconductor Manufacturing Co's seven nanometer process, said it was working with an unnamed partner to make Pareto more widely available and will announce the partnership in the coming months. "We are speaking to companies that are putting hardware in data centers and offering it to the world to whoever wants to basically rent it ... that's definitely one of the deployment routes that we're considering," Backhus added. (Reporting by Akash Sriram in Bengaluru; Editing by Tasim Zahid)
[3]
Recogni's new Pareto system optimizes AI compute with minimal accuracy loss - SiliconANGLE
Recogni's new Pareto system optimizes AI compute with minimal accuracy loss Generative artificial intelligence company Recogni Inc. today announced a logarithmic number system for AI that delivers benefits that it says include low power, high computer density and low latency without compromising accuracy. Called Pareto, the number system provides benefits for all AI chip design criteria by simplifying AI compute. It does show by turning multiplications into additions, resulting in AI chips that are smaller, faster and less energy-hungry. The problem the system is seeking to address is one where the latest generative AI models demand multiplications and additions on the order of petaFLOPS, or quadrillions of operations per second, posing challenges in power consumption and computational speed. Pareto addresses the challenge by converting multiplications into additions, significantly reducing power usage and execution time without compromising accuracy. Recogni says it's the first to market with a logarithmic system that outperforms other quantized number systems for generative AI inference. Pareto's efficiency enables a more compact chip design and hence allows for significantly increased compute in data centers while reducing costs. The numbering system also reduces power consumption and outperforms traditional FP8 and FP16 formats -- formats that define the precision levels for floating-point numbers in computing. AI models using Pareto are also said to experience minimal accuracy loss, with less than 0.1% drop in 16-bit precision and under 1% in eight-bit precision, all without requiring retraining. "By turning multiplications into additions, Pareto significantly reduces power consumption, latency and chip size, making it the optimal choice for modern AI chip design," said Chief Executive Marc Bolitho. "Organizations running gen AI inference can now keep operating costs lower than any other technology and ensure uncompromised AI model quality for the widest variety of multimodal gen AI Inference applications and use cases." Pareto has undergone extensive testing on various AI models with some highly impressive results. Testing on Mixtral-8x22B, Llama3-70B, Falcon-180B, Stable Diffusion XL and Llama3.1-405B shows that Pareto achieves a relative accuracy of over 99.9% compared to the trained high-precision baseline model, while consuming significantly less power. "With Pareto we came up with a number system that allows businesses to instantly deploy their models at high power efficiency with virtually no loss across all key performance and accuracy metrics," said Gilles Backhus, founder and vice president of AI at Recogni. "While companies using standard math are spending considerable time converting models to lower precision to reduce the power and operational expenses, Pareto allows companies to bring new models to production faster and cheaper while maintaining high accuracy." Pareto is now available through a seven-nanometer chip manufactured by Taiwan Semiconductor Manufacturing Co. Ltd. Recogni also plans to announce a technology partnership that will make Pareto more widely available in the coming months. The company was previously in the news in February, when it raised $102 million in additional venture capital funding. Investors include Celesta Capital, GreatPoint Ventures Management, Mayfield Fund, DNS Capital, BMW i Ventures GmbH and Tasaru Mobility Investments.
Share
Share
Copy Link
AI startup Recogni has introduced a groundbreaking computing method called Pareto that significantly reduces costs and power requirements for AI inference. This innovation promises to make AI more accessible and efficient across various industries.
Recogni, a prominent AI startup, has unveiled a revolutionary computing method named Pareto, which promises to dramatically reduce costs and power consumption in AI inference processes. This innovation comes at a crucial time when the AI industry is grappling with the challenges of escalating computational demands and energy consumption 1.
The Pareto method, developed by Recogni, is designed to optimize AI computations by focusing on the most critical data points. This approach allows for significant reductions in computational requirements without compromising accuracy. According to the company, Pareto can slash inference costs by up to 90% while maintaining 99% accuracy compared to traditional methods 2.
One of the most notable benefits of the Pareto method is its ability to reduce power consumption. By optimizing computations, Recogni's technology can potentially decrease the energy requirements for AI processes by up to 1,000 times. This breakthrough has significant implications for various industries, particularly those relying heavily on AI for their operations 1.
The Pareto system is expected to have wide-ranging applications across multiple sectors. Recogni has highlighted its potential in autonomous driving, where efficient AI processing is crucial for real-time decision-making. The technology could also be applied in data centers, edge computing, and other AI-intensive fields, potentially revolutionizing how these industries approach AI implementation 3.
The unveiling of the Pareto method has generated significant interest in the tech industry. Experts believe that this innovation could address some of the most pressing challenges in AI deployment, such as high costs and energy consumption. As AI continues to evolve and integrate into various aspects of business and daily life, technologies like Pareto are expected to play a crucial role in making AI more accessible and sustainable 2.
Recogni's introduction of the Pareto method positions the company as a potential leader in the next generation of AI computing solutions. The startup's focus on efficiency and cost-effectiveness aligns with the growing demand for more sustainable AI technologies. As the AI landscape continues to evolve, Recogni's innovation could significantly influence the direction of future developments in the field 3.
Reference
[1]
[2]
Sageance, a Silicon Valley startup, is developing analog AI chips that could significantly reduce power consumption for large language models, potentially revolutionizing the AI hardware landscape.
2 Sources
Researchers at BitEnergy AI have developed a new algorithm called Linear-Complexity Multiplication (L-Mul) that could potentially reduce AI energy consumption by up to 95% without significant performance loss. This breakthrough could address growing concerns about AI's increasing energy demands.
5 Sources
As Nvidia dominates the AI training chip market with GPUs, competitors are focusing on developing specialized AI inference chips to meet the growing demand for efficient AI deployment and reduce computing costs.
6 Sources
Startup Untether has introduced a new AI chip designed for edge computing and inference tasks, aiming to revolutionize AI applications in various sectors including automotive and agriculture.
2 Sources
SIMA.ai, a leading edge AI company, has introduced MLSoC Modalix, a new product family designed to enhance generative AI capabilities at the edge. This expansion of their One Platform for Edge AI aims to bring multimodal generative AI to various devices and applications.
3 Sources
The Outpost is a comprehensive collection of curated artificial intelligence software tools that cater to the needs of small business owners, bloggers, artists, musicians, entrepreneurs, marketers, writers, and researchers.
© 2024 TheOutpost.AI All rights reserved