Tiiny AI unveils world's smallest supercomputer that can run 120B models in your pocket

Reviewed by Nidhi Govil

4 Sources

US startup Tiiny AI has introduced the Pocket Lab, verified by Guinness World Records as the world's smallest personal AI supercomputer. Despite measuring just 14.2 × 8 × 2.53 cm and weighing 300 grams, the device can run large language models with up to 120 billion parameters entirely on-device. The company aims to reduce cloud dependency and enhance privacy by bringing server-grade AI capabilities to a portable device that fits in your hand.

Tiiny AI challenges cloud dependency with pocket-sized innovation

US deep-tech startup Tiiny AI has unveiled the Pocket Lab, officially verified by Guinness World Records as the world's smallest personal AI supercomputer.

1

The device measures just 14.2 × 8 × 2.53 cm and weighs only 300 grams, making it comparable in size to a power bank, yet it promises to run large language models with up to 120 billion parameters entirely on-device.

2

This marks a shift in how advanced AI computing could reach individual users, moving capabilities that typically require expensive server racks or professional GPUs into something that fits in your pocket.

Source: Wccftech

Tiiny AI argues that the real bottleneck in AI today isn't computing power but our reliance on cloud infrastructure. GTM director Samar Bhoj states, "intelligence shouldn't belong to data centers, but to people."

1

By running AI models locally, the Pocket Lab aims to reduce cloud dependency and enhance privacy by keeping computations and user data on-device rather than transmitting them to remote servers.

2

This approach addresses growing concerns about data vulnerability and the sustainability challenges associated with centralized cloud AI systems.

3

Technical specifications enable server-grade performance

The AI supercomputer is built on the ARM v9.2 architecture with a 12-core CPU and supports popular open-source models including GPT-OSS, Llama, Qwen, DeepSeek, Mistral, and Phi.

1

At its core sits a discrete Neural Processing Unit (NPU) capable of delivering 190 TOPS, paired with 80 gigabytes of LPDDR5X memory.

2

This memory capacity, combined with aggressive quantization that compresses model weights to lower precision, is what allows 120B-parameter models to run locally without sacrificing essential computational accuracy.

4
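The arithmetic behind that claim can be sketched in a few lines. This is a back-of-the-envelope estimate, not Tiiny AI's published sizing: it counts weight storage only (ignoring KV cache, activations, and runtime overhead), treats 1 GB as 10^9 bytes, and the bit widths are illustrative assumptions, since the article does not say which quantization format the Pocket Lab uses.

```python
# Rough weight-memory estimate for a 120B-parameter model.
# Assumptions (not from the article): weights only, 1 GB = 1e9 bytes,
# and the listed bit widths are illustrative choices.

PARAMS = 120e9          # 120 billion parameters (from the article)
DEVICE_MEMORY_GB = 80   # LPDDR5X capacity (from the article)


def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Return the storage needed for the weights alone, in gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    for label, bits in [("FP16", 16), ("INT8", 8), ("4-bit", 4)]:
        size = weight_memory_gb(PARAMS, bits)
        verdict = "fits" if size <= DEVICE_MEMORY_GB else "does not fit"
        print(f"{label:>5}: {size:6.1f} GB -> {verdict} in {DEVICE_MEMORY_GB} GB")
```

Under these assumptions, 16-bit weights need about 240 GB and 8-bit weights about 120 GB, while 4-bit weights come to roughly 60 GB. Only the last fits within 80 GB, which suggests why low-precision quantization is a prerequisite for the 120B claim rather than an optional optimization.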

Source: TweakTown

The device operates within a 65W power envelope, delivering performance at a fraction of the energy and carbon footprint of traditional GPU-based systems.

3

For context, competing compact devices like NVIDIA's Project Digits cost approximately $3,000, while the DGX Spark comes in at $4,000, putting them out of reach for most everyday users.

1

Tiiny AI positions the Pocket Lab as a more accessible alternative that democratizes access to Edge AI capabilities.

Proprietary technologies optimize inference efficiency

Tiiny AI has integrated two proprietary technologies that make running massive models practical on such compact hardware. TurboSparse employs neuron-level sparse activation, selectively deactivating less critical neural pathways during inference to increase efficiency without reducing model intelligence. PowerInfer, an open-source heterogeneous inference engine with over 8,000 GitHub stars, dynamically distributes AI workloads across the CPU and NPU.

4

This split-processing approach delivers server-grade performance while keeping power consumption low enough for portable operation.
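The general idea of neuron-level sparse activation can be illustrated with a toy feed-forward layer. The sketch below is hypothetical and is not TurboSparse or PowerInfer code: the function names are invented for illustration, the layer uses a plain ReLU, and the active set is found by computing every pre-activation first, whereas a production engine would need to predict the active neurons ahead of time to save work on the up-projection as well.

```python
# Toy illustration of neuron-level sparse activation in a ReLU feed-forward
# layer. Hypothetical sketch: NOT TurboSparse or PowerInfer code.
import math
import random


def relu_ffn_dense(x, w_up, w_down):
    """Dense FFN: every hidden neuron takes part in the down-projection."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w_up]
    d_out = len(w_down[0])
    return [sum(hidden[j] * w_down[j][k] for j in range(len(hidden)))
            for k in range(d_out)]


def relu_ffn_sparse(x, w_up, w_down):
    """Sparse FFN: skip the down-projection for neurons that ReLU zeroed out."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in w_up]
    # Neurons with zero activation contribute nothing, so they can be skipped.
    active = [j for j, h in enumerate(hidden) if h > 0.0]
    d_out = len(w_down[0])
    out = [sum(hidden[j] * w_down[j][k] for j in active) for k in range(d_out)]
    return out, active


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the demo is repeatable
    d_model, d_hidden = 8, 32
    x = [rng.uniform(-1, 1) for _ in range(d_model)]
    w_up = [[rng.uniform(-1, 1) for _ in range(d_model)] for _ in range(d_hidden)]
    w_down = [[rng.uniform(-1, 1) for _ in range(d_model)] for _ in range(d_hidden)]

    dense = relu_ffn_dense(x, w_up, w_down)
    sparse, active = relu_ffn_sparse(x, w_up, w_down)
    print(f"active neurons: {len(active)} of {d_hidden}")
    print("outputs match:",
          all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(dense, sparse)))
```

Because ReLU sets inactive neurons to exactly zero, skipping them leaves the layer's output unchanged in this toy case, which is the sense in which sparsity can cut computation without altering the result. How closely this mirrors Tiiny AI's implementation is not something the article specifies.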

The combination of these technologies enables capabilities previously reserved for professional GPUs costing thousands of dollars.

4

The device supports multi-step reasoning, deep context understanding, agent workflows, content generation, and secure processing of sensitive information, even without internet access.

3

According to Tiiny AI, the device also provides long-term personal memory by storing user data, preferences, and documents locally with bank-level encryption, a form of persistence the company says cloud-based AI systems cannot match.

Market positioning and future outlook

Tiiny AI plans to showcase the Pocket Lab at CES 2026, though pricing and release details remain undisclosed.

1

The device targets developers, researchers, creators, professionals, and students who need powerful AI capabilities without cloud infrastructure dependence.

3

Industry observers will watch closely to see whether the Pocket Lab can deliver on its promises when it reaches real users, particularly regarding thermal management and sustained performance in such a compact form factor. The success of this personal AI supercomputer could signal a broader shift toward decentralized AI computing, where individuals gain direct control over their intelligence tools rather than relying on data center infrastructure.

TheOutpost.ai

© 2025 Triveous Technologies Private Limited