2 Sources
[1]
AI21 debuts Maestro AI planning and orchestration system - SiliconANGLE
AI21 Labs Ltd. today introduced Maestro, a software system that promises to significantly boost the output quality of large language models. Israel-based AI21 is an artificial intelligence startup backed by $336 million in funding from Nvidia Corp., Google LLC and other investors. It provides a series of enterprise-focused LLMs called Jamba. The models can process prompts with up to 256,000 tokens and support RAG, a machine learning technique that allows an AI to analyze information not included in its training dataset. Before enterprises deploy an LLM in production, they take steps to reduce the risk of output quality issues. The process often involves creating a software workflow that automatically checks prompt responses for errors. Such workflows can significantly reduce the risk of hallucinations, but they're difficult to create and maintain. AI21's newly debuted Maestro platform is designed to address this challenge. The platform, which is described as an AI planning and orchestration system, reduces the amount of work involved in mitigating LLM output errors. It also promises to ease several related tasks. To use Maestro, workers have to provide a prompt along with a set of requirements that should be met while the prompt is being processed. For example, a user could specify that the cost of generating an LLM response shouldn't exceed a certain threshold. AI21 says that Maestro automatically applies those customer-provided requirements and thereby reduces the need for manual coding. When it receives a complex prompt, Maestro breaks down the task into substeps. Simplifying tasks in this manner has been shown to improve the quality of LLM responses. After completing the process, Maestro runs simulations to identify the most efficient way of entering the request into an LLM and delivering an accurate answer. AI21 says that the platform considers multiple processing approaches and picks the one with the highest likelihood of delivering a correct LLM response.
If necessary, Maestro can also scale inference-time compute. This is a method of improving reasoning-optimized LLMs' accuracy by increasing the amount of time and infrastructure they spend on a task. After a prompt response is generated, Maestro checks it for errors. The system also creates a log that displays every step of the process through which the prompt response was generated. Workers can check this log to validate the accuracy of LLM output. In a series of internal tests, AI21 applied Maestro to several popular LLMs. It determined that the system boosts the accuracy of AI models by up to 50% in some cases. According to AI21, that means reasoning-optimized LLMs such as o3-mini can answer more than 95% of prompts correctly when they're connected to Maestro. The company envisions customers applying the system to a range of use cases. It says that Maestro can make LLMs better at analyzing complex documents and answering user questions. Additionally, the system lends itself to automating repetitive business chores such as data entry. "Mass adoption of AI by enterprises is the key to the next industrial revolution," said AI21 co-Chief Executive Officer Ori Goshen. "AI21's Maestro is the first step toward that future - moving beyond the unpredictability of available solutions to deliver AI that is reliable at scale." Maestro is currently in early access. AI21 plans to make the platform generally available later this year.
[2]
AI21 Introduces Maestro, the World's First AI Planning and Orchestration System Built for the Enterprise
AI21 is leading the shift from LLMs and Reasoning models to planning AI systems. Maestro increases the accuracy of GPT-4o and Claude Sonnet 3.5 by up to 50% on complex, multi-requirement tasks, transforming AI from an unpredictable tool to a trustworthy system. LAS VEGAS, March 10, 2025 /PRNewswire/ -- AI21, a pioneer in frontier models and AI systems, today unveiled Maestro, the world's first AI Planning and Orchestration System designed to deliver trustworthy AI at scale for organizations. Introduced at the HumanX 2025 conference, Maestro marks a significant advancement in enterprise AI, boosting the instruction-following accuracy of paired Large Language Models (LLMs) by up to 50% and ensuring guaranteed quality, reliability, and observability. This technology transcends the limitations of traditional LLMs and Large Reasoning Models (LRMs), setting a new benchmark for AI capabilities. Maestro delivers a substantial improvement in LLM performance on complex tasks. It elevates the accuracy of models like GPT-4o and Claude Sonnet 3.5 by up to 50% and empowers reasoning models, such as o3-mini, to surpass 95% accuracy. Notably, Maestro bridges the performance gap between non-reasoning and reasoning models, aligning the accuracy of Claude Sonnet 3.5 with advanced reasoning models like o3-mini. While enterprises are eager to integrate AI into their operations, large-scale generative AI deployments often falter. According to the Amazon Web Services (AWS) CDO Agenda 2024, only 6% of organizations have a generative AI application in deployment, highlighting the fundamental limitations of current AI solutions for mission-critical tasks. The prevailing approaches -- "Prompt and Pray" and hard-coded chains -- present significant challenges.
The "Prompt and Pray" method, which relies on LLMs and LRMs to execute open-ended tasks, lacks control and reliability due to the probabilistic nature of these models. Hard-coded chains, while more predictable, are rigid, labor-intensive, and prone to failure under changing conditions. Reasoning models, designed to solve complex tasks through thinking tokens, have not alleviated these issues. They exhibit inconsistent performance, struggle to adhere to instructions, and fail to reliably utilize tools. Consequently, none of these approaches delivers the accuracy, reliability, and adaptability essential for widespread enterprise adoption. "Mass adoption of AI by enterprises is the key to the next industrial revolution," said Ori Goshen, Co-CEO of AI21. "AI21's Maestro is the first step toward that future - moving beyond the unpredictability of available solutions to deliver AI that is reliable at scale. Delivering complex decision-making with built-in quality control, it enables businesses to harness AI with confidence. This is how we bridge the gap between AI potential and real-world solutions." "Wix is leading the charge in LLM adoption, powering hundreds of AI applications," said Avishai Abrahami, CEO of WIX. "Maestro ushers in a new era of agentic AI - striking a necessary balance between quality, control, and trust that could be a key factor in our ability to develop trustworthy AI applications at scale." "The potential of enterprise AI lies in balancing innovation with reliability," said Elad Tsur, Chief AI Officer at Applied Systems. "AI21 Maestro is a promising step toward making AI more controllable and useful for business applications, bridging the gap between powerful AI models and real-world enterprise needs." Maestro, powered by the AI Planning and Orchestration System (AIPOS), delivers reliable, system-level AI by integrating LLMs or LRMs into a framework that analyzes actions, plans solutions, and validates results. 
This framework learns the enterprise environment to ensure accuracy and efficiency, allowing builders to define requirements and obtain results that meet their criteria within seconds. By eliminating the need for prompt engineering and rigid workflows, Maestro delivers on the promise of truly trustworthy AI. Request early access to Maestro API by visiting http://ai21.com/maestro. About AI21 AI21 is a pioneer in Foundation Models and AI Systems designed for enterprises. AI21's mission is to create trustworthy artificial intelligence that powers humanity towards superproductivity. Founded in 2017 by AI visionaries Prof. Amnon Shashua, Prof. Yoav Shoham, and Ori Goshen, AI21 has secured $336 million in funding from industry leaders, including NVIDIA, Google, and Intel, reinforcing its commitment to advancing AI innovation. View original content: https://www.prnewswire.com/news-releases/ai21-introduces-maestro-the-worlds-first-ai-planning-and-orchestration-system-built-for-the-enterprise-302397075.html SOURCE AI21 Labs
AI21 Labs introduces Maestro, an innovative AI planning and orchestration system designed to enhance the accuracy and reliability of large language models for enterprise use, promising up to 50% improvement in output quality.
AI21 Labs, an Israel-based artificial intelligence startup, has unveiled Maestro, an AI planning and orchestration system aimed at enterprise-scale AI deployment [1][2]. Introduced on March 10, 2025, at the HumanX 2025 conference in Las Vegas, Maestro targets the difficulty of running large language models (LLMs) reliably in production environments [2].
AI21 says Maestro boosts LLM accuracy by up to 50% in some cases, particularly on complex, multi-requirement tasks [2]. The figure comes from the company's internal tests, in which Maestro was paired with several popular models [1]. AI21 reports gains of up to 50% in the instruction-following accuracy of models such as GPT-4o and Claude Sonnet 3.5 [2], and says reasoning-optimized LLMs like o3-mini can answer more than 95% of prompts correctly when connected to Maestro [1].
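The platform operates by breaking down complex prompts into manageable substeps, an approach that has been shown to improve LLM response quality. Maestro then runs simulations across multiple processing approaches and picks the one most likely to deliver a correct answer [1]. Key features include user-defined requirements (for example, a ceiling on the cost of generating a response), optional scaling of inference-time compute, automatic error checking of each response, and a log of every step in the process that workers can use to validate the output [1].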
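To make the described flow concrete, here is a minimal Python sketch of a plan-select-validate loop of the kind the sources describe: split a task into substeps, try several processing approaches, pick the one that best meets the stated requirements within a cost ceiling, check the result, and log each step. It is not Maestro's API, which the sources do not document. Every name here (`orchestrate`, `fake_model`, `decompose`, the `cheap` and `thorough` strategies, and the cost figures) is a hypothetical stand-in, and the "model" is a stub that only changes letter case.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Sequence

# A requirement is any predicate over an answer string (hypothetical design).
Requirement = Callable[[str], bool]


@dataclass
class Candidate:
    strategy: str
    answer: str
    cost: float


@dataclass
class Result:
    answers: List[str] = field(default_factory=list)
    cost: float = 0.0
    ok: bool = True
    trace: List[str] = field(default_factory=list)  # step-by-step log


def decompose(task: str) -> List[str]:
    """Naive stand-in for planning: one substep per sentence."""
    return [part.strip() for part in task.split(".") if part.strip()]


def fake_model(substep: str, strategy: str) -> Candidate:
    """Stub in place of a real LLM call; costs are made-up units."""
    if strategy == "thorough":
        return Candidate(strategy, substep.upper(), cost=3.0)
    return Candidate(strategy, substep.lower(), cost=1.0)


def orchestrate(
    task: str,
    requirements: Sequence[Requirement],
    max_cost: float,
    strategies: Sequence[str] = ("cheap", "thorough"),
) -> Result:
    result = Result()
    for substep in decompose(task):
        result.trace.append(f"substep: {substep}")
        candidates = [fake_model(substep, s) for s in strategies]
        # Enforce the cost ceiling before comparing candidates.
        affordable = [c for c in candidates if result.cost + c.cost <= max_cost]
        if not affordable:
            result.ok = False
            result.trace.append("budget exhausted")
            break
        # Prefer the candidate meeting the most requirements; ties go to the cheaper one.
        best = max(
            affordable,
            key=lambda c: (sum(req(c.answer) for req in requirements), -c.cost),
        )
        # Validate the chosen answer against every requirement.
        passed = all(req(best.answer) for req in requirements)
        result.ok = result.ok and passed
        result.cost += best.cost
        result.answers.append(best.answer)
        result.trace.append(f"chose {best.strategy} (cost {best.cost}, valid={passed})")
    return result
```

Calling `orchestrate("Summarize the report. List the risks.", [str.isupper], max_cost=10.0)` selects the `thorough` stub for both substeps. Lowering `max_cost` to `4.0` forces the second substep onto the `cheap` stub, so validation fails and `ok` becomes `False`, with the reason visible in `trace`.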
Maestro aims to solve critical issues hindering widespread AI adoption in enterprises. According to the AWS CDO Agenda 2024, only 6% of organizations have a generative AI application in deployment, which AI21 cites as evidence of the limitations of current solutions [2]. Maestro is pitched as an alternative to the "Prompt and Pray" method, which lacks control because of the probabilistic nature of the models, and to hard-coded chains, which are more predictable but rigid and labor-intensive [2].
The introduction of Maestro has drawn comment from industry leaders. Avishai Abrahami, CEO of Wix, said the system strikes a balance between quality, control, and trust in AI applications [2]. Elad Tsur, Chief AI Officer at Applied Systems, called Maestro a promising step toward making AI more controllable and useful for business applications [2].
AI21 envisions Maestro being applied to a range of use cases, from analyzing complex documents and answering user questions to automating repetitive business tasks such as data entry [1]. The system is currently in early access, with general availability planned for later in 2025 [1]. Interested parties can request early access to the Maestro API through AI21's website [2].
As enterprises continue to explore AI integration, Maestro represents a significant step towards reliable, large-scale AI deployment. By addressing key challenges in LLM implementation, AI21 is positioning itself at the forefront of the next wave of AI innovation in the business world.