Microsoft Copilot now uses GPT and Claude AI models together to fact-check research outputs

Reviewed byNidhi Govil

12 Sources

Share

Microsoft unveiled a multi-model approach for its Copilot Researcher tool that leverages both OpenAI's GPT and Anthropic's Claude simultaneously. The new Critique feature has GPT draft responses while Claude reviews for accuracy, achieving a 13.8% improvement on industry benchmarks. The company also rolled out Copilot Cowork to early-access customers as competition intensifies in enterprise AI.

Microsoft Copilot Combines Multiple AI Models in New Research Tool

Microsoft unveiled significant upgrades to Microsoft 365 Copilot on Monday, introducing a multi-model approach that allows users to utilize multiple AI models simultaneously within the same workflow

1

. The centerpiece of this update is the Critique feature, which pairs OpenAI's GPT with Anthropic's Claude to deliver more accurate research outputs through the Copilot Researcher tool

3

.

Source: ET

Source: ET

In this new workflow, GPT generates the initial response while Claude reviews the output for accuracy, completeness, and citation integrity before presenting it to the user

4

. Microsoft expects to make this process bi-directional in the future, allowing GPT to review Claude's drafts as well

1

. "Having various different models from different vendors in Copilot is highly attractive - but we're taking this to the next level, where customers actually get the benefits of the models working together," said Nicole Herskowitz, corporate vice president of Microsoft 365 and Copilot, in an interview with Reuters

1

.

Source: Engadget

Source: Engadget

Measurable Improvements in Research Accuracy

The multi-model approach has delivered tangible results. Microsoft reports that the Critique feature enabled Researcher to score 13.8% higher on the DRACO benchmark, an industry standard for deep research quality that measures accuracy, completeness, and objectivity

2

5

. This improvement positions Microsoft ahead of standalone deep-research tools from OpenAI, Google, Perplexity, and Anthropic

4

.

The strategy addresses a critical challenge in AI: hallucinations, where systems generate false information. By having Claude fact-check GPT's work, Microsoft creates a feedback loop similar to what occurs in academic and professional research settings

3

. This approach helps keep AI hallucinations in check while speeding up user workflow and producing more reliable outputs

1

.

Model Council Offers Side-by-Side Comparisons

Alongside Critique, Microsoft introduced Model Council, a feature that allows users to compare responses from different AI models side-by-side

1

. This tool shows where models agree and disagree, giving users more autonomy in assembling the best workflow for their needs

2

. However, this approach comes with trade-offs: Model Council costs roughly 2.5 times as much as using a single model, while the Critique approach costs about 20% more

5

. These costs aren't directly passed to users given Microsoft Copilot operates on a subscription model, but they inform where Microsoft deploys multiple models versus single algorithms

5

.

Copilot Cowork Expands to Early Access Program

Microsoft also made Copilot Cowork more widely available to members in its Frontier program, which provides customers with early access to the latest AI features

1

. Built on technology from Anthropic's viral Claude Cowork product, this agentic AI tool goes beyond simple chatbot interactions to become a digital assistant that handles long-running, multi-step tasks inside Microsoft 365

2

4

.

Source: TechRadar

Source: TechRadar

Strategic Positioning Amid Intense Competition

These updates arrive as Microsoft races to improve research accuracy and drive better adoption amid intense competition from rivals including Google's Gemini and autonomous agents such as Claude Cowork

1

. The company reported 15 million paid Copilot seats in January, representing roughly 3.3% of its 450 million commercial Microsoft 365 users

4

. The multi-model system also helps Microsoft demonstrate it isn't overly reliant on OpenAI, a strategic consideration as leading frontier labs frequently leapfrog one another

5

.

"It's becoming very clear to us that there will be many models," Microsoft executive VP Charles Lamanna told Axios. "Come summertime there will be many more models than just these two inside of Copilot"

5

. Lamanna noted that businesses are interested in AI tools that can easily change which models run under the hood, and Microsoft is building more homegrown models that might first appear working alongside outside models rather than as full replacements

5

. Both the Critique and Model Council features are currently available through Microsoft 365 Copilot's Frontier program

3

.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo