Databricks Unveils Synthetic Data API to Streamline AI Agent Evaluation

2 Sources

Databricks introduces a new API for generating synthetic datasets, aimed at simplifying and accelerating the evaluation process for AI agents. This tool is integrated into their Mosaic AI Agent Evaluation platform, offering developers a more efficient way to create high-quality artificial datasets.

News article

Databricks Introduces Synthetic Data API for AI Agent Evaluation

Databricks, a leader in the data ecosystem, has unveiled a new Application Programming Interface (API) designed to generate synthetic datasets for machine learning projects 1. This innovative tool is integrated into the company's Mosaic AI Agent Evaluation platform, which is part of their flagship data lakehouse offering 12.

The Need for Synthetic Data in AI Development

The introduction of this API addresses a significant challenge in AI development: the time-consuming and complex process of evaluating AI agent performance. By enabling the creation of high-quality artificial datasets, Databricks aims to streamline the development workflow, reducing the need for constant consultation with subject matter experts (SMEs) and accelerating the path to production for AI agents 2.

How the Synthetic Data API Works

The process of creating a dataset with the new API involves three main steps:

  1. Uploading a frame or file collection containing relevant business information.
  2. Specifying the number of questions and answers to be generated.
  3. Optionally providing additional instructions to customize the API's output 1.

The API is designed to generate question and answer collections, which are particularly useful for developing applications powered by large language models 1. Importantly, the synthetic answers produced are sets of facts required to answer the questions, rather than complete responses written by the language model. This approach facilitates faster review and editing by SMEs 1.

Integration with Mosaic AI Agent Evaluation

The synthetic data capabilities are tightly integrated with Databricks' Mosaic AI Agent Evaluation platform. This integration allows developers to generate high-quality evaluation datasets for preliminary assessment quickly, reducing the workload on SMEs to final validation and accelerating the iterative development process 2.

Performance Improvements and Future Enhancements

Internal tests conducted by Databricks have shown significant improvements in agent performance across various metrics when using the synthetic data for evaluation and improvement. For instance, they observed a nearly 2X increase in the agent's ability to find relevant documents and improvements in the overall correctness of responses 2.

Looking ahead, Databricks plans to release several enhancements to the API in early 2024, including:

  1. A new graphical interface for faster error checking of question-answer pairs.
  2. Tools for tracking changes in synthetic datasets over time 1.

Competitive Advantage

While there are other tools available for generating synthetic datasets, Databricks' offering stands out due to its seamless integration with the Mosaic AI Agent Evaluation platform. This integration eliminates the need for developers to leave their workflows, streamlining the entire process from data generation to agent evaluation 2.

As enterprises increasingly adopt compound AI agents capable of reasoning and handling diverse tasks across different domains, Databricks' synthetic data API represents a significant step forward in simplifying the development and evaluation of these sophisticated AI systems.

Explore today's top stories

Taiwan Adds Huawei and SMIC to Export Control List, Impacting AI Chip Development

Taiwan has added Chinese tech giants Huawei and SMIC to its export control list, requiring government approval for any tech exports to these companies. This move significantly impacts China's AI chip development efforts and aligns with US restrictions.

Bloomberg Business logoReuters logoEconomic Times logo

4 Sources

Technology

6 hrs ago

Taiwan Adds Huawei and SMIC to Export Control List,

AI Reshaping Talent Acquisition: ManpowerGroup Insights on the Future of Work

ManpowerGroup's Chief Innovation Officer discusses how AI is transforming recruitment and the skills employers will seek in the future, highlighting the need for soft skills and potential over traditional credentials.

Phys.org logoEconomic Times logo

2 Sources

Business and Economy

22 hrs ago

AI Reshaping Talent Acquisition: ManpowerGroup Insights on

Tech Giants Race to Create the Ultimate AI Device, Led by OpenAI and Jony Ive Collaboration

OpenAI partners with former Apple design chief Jony Ive to develop a revolutionary AI gadget, while other tech companies explore new interfaces for AI interaction.

France 24 logoEconomic Times logo

2 Sources

Technology

6 hrs ago

Tech Giants Race to Create the Ultimate AI Device, Led by

AI and Space Lasers Revolutionize Forest Carbon Mapping for Climate Science

A groundbreaking study combines satellite data, space-based LiDAR, and AI algorithms to rapidly and accurately map forest carbon, potentially transforming climate change research and forest management.

ScienceDaily logoPhys.org logo

2 Sources

Science and Research

6 hrs ago

AI and Space Lasers Revolutionize Forest Carbon Mapping for

Amazon to Invest $13 Billion in Australia's Data Center Infrastructure, Boosting AI Capabilities

Amazon announces a significant $13 billion investment in Australia's data center infrastructure from 2025 to 2029, aimed at expanding AI capabilities and supporting generative AI workloads.

Reuters logoEconomic Times logoMarket Screener logo

3 Sources

Business and Economy

14 hrs ago

Amazon to Invest $13 Billion in Australia's Data Center
TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

Β© 2025 Triveous Technologies Private Limited
Twitter logo
Instagram logo
LinkedIn logo