2 Sources
[1]
OpenAI is pushing for industry-specific AI benchmarks - why that matters
Benchmark performance results typically accompany the launch of every new AI model to showcase how well the models can perform on various tasks. However, these tasks are not catered to individual industries but are more general, such as grade school mathematics (GSM8K) or graduate-level reasoning (GPQA). To fill that gap, OpenAI launched the OpenAI Pioneers Program, intended to advance AI model development for specific industries and real-world use cases. The program is a two-pronged effort in which companies will collaborate with OpenAI researchers to develop more domain-specific evaluations and fine-tuned models. In the blog post, OpenAI shared that "industries like legal, finance, insurance, healthcare, accounting, and many others are missing a unified source of truth for model benchmarking." As a result, OpenAI will now work with multiple companies across each industry to develop those evaluations, which are aimed not only at developing models but also at building better trust between the public and these systems. Research has highlighted this void of benchmarks as a major gap in AI for enterprise use cases. For example, Silvio Savarese, head of Salesforce AI Research, released a blog post on Enterprise General Intelligence (EGI), a concept he is pioneering that refers to more advanced AI solutions tailored to businesses' domain-specific needs. In a conversation with ZDNET, he shared that one of the major steps needed to reach EGI is benchmarks that look at evaluating domain-specific functions. Beyond evaluations, OpenAI will also collaborate with the team to refine existing models for three industry-specific use cases using a technique known as reinforcement fine-tuning (RFT).
The OpenAI team will help guide the companies on how to use RFT, and then the companies can decide how to deploy the models, which should be ready for large-scale deployment, according to OpenAI. The first cohort will consist of a handful of startups working on use cases that can "drive real-world impact." If your company fits these criteria, you can apply by filling out the form with basic information about the company on the OpenAI Pioneers Program webpage.
[2]
OpenAI Wants to Partner With Startups on AI -- Here's How to Apply
OpenAI has announced the OpenAI Pioneers Program, a new initiative in which the company will work with startups to devise new methods for grading an AI's performance in specific use cases, and develop new, industry-specific AI models. Businesses can apply to the program now. More specifically, the program will see OpenAI collaborating with businesses to develop customized AI solutions and benchmarks across a wide variety of sectors and "real world use cases." In a blog post announcing the program, OpenAI wrote that as AI becomes more integrated across industries, it's vitally important that businesses can accurately gauge the performance of their AI solutions. One of the best ways to do this, according to the company, is by creating "domain-specific evals," which are benchmarks specific to an industry or use case. For example, a law firm might want to test a model's legal accuracy, or a manufacturing firm might want to test a model's knowledge of materials. Another is by creating customized AI models, which have been fine-tuned for specific industries. Now, OpenAI wants help developing those domain-specific evals, and is calling on businesses for assistance. The ChatGPT creator says it "will be working with companies building new products in high-impact verticals to expand their product capabilities through individualized efforts with our research teams."
OpenAI introduces the Pioneers Program, collaborating with startups to create domain-specific AI evaluations and fine-tuned models for various industries, addressing the lack of specialized benchmarks in AI development.
OpenAI has launched the OpenAI Pioneers Program, an initiative aimed at advancing AI model development for specific industries and real-world use cases [1]. The program addresses a gap in the AI landscape: the lack of industry-specific benchmarks for evaluating AI models.
AI model launches are typically accompanied by benchmark results that showcase the model's capabilities on various tasks. However, these benchmarks tend to be general, such as grade school mathematics (GSM8K) or graduate-level reasoning (GPQA), rather than tailored to specific industries [1].
OpenAI says that industries like legal, finance, insurance, healthcare, and accounting are missing a unified source of truth for model benchmarking [1]. Research has highlighted this void as a major gap in AI for enterprise use cases.
The OpenAI Pioneers Program is designed as a two-pronged effort:
Developing Domain-Specific Evaluations: OpenAI will collaborate with multiple companies across various industries to create specialized benchmarks. These evaluations aim not only to improve model development but also to build better trust between the public and AI systems [1].
Fine-Tuning Models for Industry-Specific Use Cases: Using a technique known as reinforcement fine-tuning (RFT), OpenAI will work with partners to refine existing models for three industry-specific use cases. The OpenAI team will guide companies on how to use RFT, and the companies can then decide how to deploy the resulting models, which OpenAI says should be ready for large-scale deployment [1].
The move toward industry-specific AI solutions aligns with emerging concepts like Enterprise General Intelligence (EGI), pioneered by Silvio Savarese, head of Salesforce AI Research. EGI refers to advanced AI solutions tailored to businesses' domain-specific needs, and Savarese has named domain-specific benchmarks as one of the major steps needed to reach it [1].
The first cohort of the OpenAI Pioneers Program will consist of a handful of startups working on use cases that can "drive real-world impact" [1]. Interested companies can apply by filling out a form on the OpenAI Pioneers Program webpage, providing basic information about their organization [2].
This initiative could accelerate the development of AI solutions across various industries. By creating domain-specific evaluations and fine-tuned models, the program aims to let businesses gauge and improve the accuracy of AI in specialized fields such as law, manufacturing, and healthcare [2].
As AI becomes more integrated across industries, the ability to accurately gauge the performance of AI solutions becomes increasingly important. The OpenAI Pioneers Program is a step toward more tailored and trustworthy AI systems for specific business needs.
Summarized by Navi