Apple Intelligence relies on Google and Nvidia tech while downplaying Gemini's contributions

Reviewed byNidhi Govil

4 Sources

Share

At WWDC 2026, Apple revealed its Apple Intelligence architecture combines on-device and cloud-based AI models, with Google's Gemini used for distillation and Nvidia GPUs handling complex cloud processing. The company emphasized its privacy-first approach through Private Cloud Compute while clarifying it uses none of Google's customer-facing Gemini models or infrastructure.

Apple Unveils Complex AI Architecture at WWDC 2026

Apple lifted the curtain on its Apple Intelligence strategy at the Worldwide Developers Conference in Cupertino, revealing a sophisticated system that relies on partnerships with Google and Nvidia while maintaining its privacy-focused approach

1

. The announcement showcased a redesigned Siri capable of conversational interactions, checking concert dates, setting reminders, and coordinating directions—a significant upgrade from previous versions

1

. However, the technical details revealed a more nuanced story about Apple's AI advancements and its reliance on industry partners.

Source: Wccftech

Source: Wccftech

Apple Foundation Models Span Five Distinct AI Systems

Apple's new AI architecture centers on a family of Apple Foundation Models distributed across on-device and cloud-based AI models. AI VP Amar Subramanya outlined the structure: AFM Core, a next-generation dense architecture model, and AFM Core Advanced, which uses a sparse architecture with 20 billion parameters and runs natively on the A19 Pro chip

2

4

. The on-device AI models enable features including invitation handling and expressive voices without cloud requests

2

.

Source: MacRumors

Source: MacRumors

On the server side, AFM Cloud handles latency-optimized Private Cloud Compute requests, while AFM Cloud Image powers image generation and editing features

2

. The fifth model, AFM Cloud Pro, handles agentic tool use and complex reasoning tasks with quality similar to Gemini frontier models

2

. This model runs on Nvidia GPUs hosted in Google Cloud, marking Apple's first extension of its private infrastructure beyond its own data centers

1

.

Google Partnership Limited to Model Distillation

Apple executives took pains to clarify the scope of the Google partnership during a post-keynote tech talk. Craig Federighi, Apple's SVP of Software Engineering, stated bluntly: "The amount of the Google Assistant we use is none"

2

. Apple uses none of the Gemini models deployed to Google's customers, none of Google's client-side code, and no Google Search infrastructure

2

.

Instead, Google's contribution centers on distillation. Subramanya explained that Apple's models were "trained using proprietary data with reinforcement learning and refined using outputs from Gemini frontier models"

2

. Apple licensed a 1.2-trillion-parameter Gemini model from Google for distillation purposes, then conducted its own pre-training and post-training operations on the AFM Cloud

4

.

Nvidia Chips Enable Privacy-Preserving Cloud Processing

The collaboration with Nvidia addresses Apple's need for advanced processing power while maintaining privacy and security guarantees. Software VP Sebastien Marineau-Mes explained that Apple wanted to use Nvidia's latest chips but required them configured so they couldn't read the contents of Apple's servers

2

. Nvidia's "ambiguous confidential compute" technology provided the solution, enabling Apple to extend Private Cloud Compute to third-party infrastructure

2

.

Source: MacRumors

Source: MacRumors

Apple's Private Cloud Compute on Google Cloud maintains the same core requirements: stateless computation, enforceable guarantees, no privileged runtime access, non-targetability, and verifiable transparency

3

. The implementation uses Nvidia Confidential Computing with Nvidia GPUs, Intel CPUs with TDX, and Google's Titan chip

3

. Apple maintains a cryptographically verifiable ledger of all Google Cloud hardware in the PCC fleet to mitigate supply chain attack risks

3

.

System Orchestrator Routes Requests for Data Protection

Federighi described the System Orchestrator as "key to the privacy architecture of our entire system"

2

. This software routes queries to the appropriate model—on-device or cloud—based on request complexity and personal context required. When users submit requests through Siri, a localized orchestrator calls required tools, collects data, and generates structured prompts for the AFM Cloud

4

. Critically, raw data stays on the device; only structured prompts reach the cloud

4

.

For current events queries, responses come through Apple's own World Knowledge Service, which Federighi said the company has been building for several years

2

. Apple maintains that all Private Cloud Compute infrastructure, including extended Nvidia GPU capacity in Google Cloud, can be independently verified by third-party researchers

2

. PCC on Google Cloud binaries will be available for public inspection, with research tooling and access to live PCC nodes offered through Apple's Security Bounty Program

3

.

Today's Top Stories

© 2026 TheOutpost.AI All rights reserved