2 Sources
[1]
Scaled Cognition raises $100M led by Khosla
Scaled Cognition has raised $100M led by Khosla Ventures to build AI that does not hallucinate. The startup says its model will not give a wrong answer, a bold claim in a field built on probability. Scaled Cognition, a Mountain View AI lab, has raised $100m in a Series A round led by Khosla Ventures. The company chases one prize: reliability. The Wall Street Journal reported the round values the startup at about $750m. The pitch is simple to state and hard to deliver. Today's AI is capable but unreliable. It hallucinates, and those mistakes block its use in workflows where a wrong answer costs real money. Scaled Cognition wants to fix that at the root. The startup is already in production with Fortune 500 firms across financial services, healthcare, telecom and insurance. These are industries where errors carry consequences, from a wrong bank balance to a botched insurance claim. An architecture problem, not an effort problem The company's core claim cuts against the grain. You cannot bolt reliability on, it argues. You have to design it in. That sets it apart from rivals who wrap a safety layer around an existing frontier model. "We spent years trying to apply AI to business applications and found it was essentially impossible to make these systems reliable," said Dan Roth, the chief executive and co-founder. "You could have an interaction that was spectacular, think the singularity is here, and then look at the data and discover the system was making grievous errors." For Roth, the fix was not more compute or more effort. "The problem isn't resources or effort, it's architecture," he said. The company spent years on the rebuild. That framing echoes a wider debate. The hardest failures are not the obvious ones. They are the answers that look completely correct and are quietly wrong, the kind a human reviewer waves through. The model: APT Scaled Cognition calls its flagship model APT, short for Agentic Pretrained Transformer. The company brands its output "Super-Reliable Intelligence." The promises are large. APT is meant to match the conversational quality of leading models while eliminating hallucinations and sticking to policy. It is also smaller, faster, cheaper and, the company says, more accurate than frontier models. The deployment angle matters too. APT runs in a private cloud or fully self-hosted, so enterprises own their AI rather than renting it from a third party. For regulated industries wary of sending data to outside model providers, that control is a selling point. "Reliability is engineered into the architecture of our models, not bolted on after the fact," said Dan Klein, the chief technology officer and co-founder. "If you want AI to take real actions on behalf of customers, that's the problem you have to solve." The founders and the backer The team has form. Klein is a UC Berkeley professor of AI and a veteran natural-language researcher. Roth and Klein previously built and sold one of the first agentic AI companies to Microsoft. The lead investor reinforces the point. Vinod Khosla, founding partner of Khosla Ventures, framed the bet as a hard road most avoid. He has made similar wagers before, backing startups like Pramaana Labs that try to make AI verifiable. "The way to quickly get into the market is to take a frontier model and put a layer on top," Khosla said. "Most people are too lazy to do that. The result is Super-Reliable Intelligence: a model that will not give you a wrong answer. In any industry where an agent takes a real action, nothing else counts." Customers and scale The early customers give the claim weight. Genesys, a cloud contact-centre giant serving more than 8,000 organisations in over 100 countries, uses APT for agentic virtual agents inside its platform. Genesys has also invested in the startup. The numbers are ambitious. Over the next twelve months, companies using Scaled Cognition's models are on track to automate more than one billion customer support interactions. Roth ties reliability to the bottom line. "When a system makes mistakes 30% of the time, complex issues go unresolved and customers don't come back," he said. He claims its models resolve most issues, saving "hundreds of millions" in operational costs. Beyond the model, the company sells a full platform. It includes agentic tooling, simulation and evaluation frameworks, and live monitoring of agents in production. The bigger prize Customer experience is only the first step. The real target is the $600bn business process outsourcing market, the sprawling world of outsourced customer service, IT support, HR and finance. The thesis is a reversal. For years, enterprises shipped these jobs to third-party providers. Now some want to insource them again, swapping managed services for AI workforces they own and control. Scaled Cognition wants to supply the engine. Timing helps the pitch. Enterprises have run AI pilots for two years but stalled on wider rollout, often because a single bad answer can trigger a complaint, a fine or a lawsuit. Scaled Cognition aims its entire message at that hesitation. It is a vast market, and a crowded one. Every contact-centre platform is bolting AI onto its tools, and frontier labs are pushing into agents that act, not just chat. To win, Scaled Cognition must prove its reliability claim survives contact with messy, real customers. The bold claim The risk lies in the promise itself. "A model that will not give you a wrong answer" is a striking line in a field where models work, by design, on probability. Few in AI would make it so flatly. Scaled Cognition is betting the architecture backs the boast. If APT really does cut the quiet, confident errors that block enterprise use, the reward is enormous. If it merely lowers them, it joins a long line of tools that promised reliability and delivered improvement. Either way, the pitch is sharp. The company is selling trust, not raw capability, and trust is the thing enterprises keep saying they lack. Whether one model can guarantee it, at the scale of a billion interactions, is the question this round leaves open.
[2]
Scaled Cognition Raises $100 Million to Address AI Hallucinations | PYMNTS.com
The company's Series A round, announced Thursday (June 9), values Scaled Cognition at $750 million and will help it expand its research team and speed product development. In an interview with the Wall Street Journal (WSJ) about the new funding, Scaled Cognition Co-Founder and CEO Dan Roth said the company works to address problems caused by artificial intelligence (AI) frontier models, describing them as "amazing," but also "sort of like schizophrenic geniuses." "They can create incredible answers, and then you can ask them the same question a second time and get a completely different answer that ... might not even be correct," he said. "We really believe that for these systems to really be useful, you have to be able to trust them. And in order for you to trust them, they have to be provably reliable." Just one error can have horrific repercussions, the CEO added, using the example of a healthcare AI hallucinating a single digit in a prescription and giving a patient incorrect medication. Roth and Co-Founder/Chief Technology Officer Dan Klein, the report said, wanted to develop alternative AI architecture that provides reliably accurate results, which led to APT (Agentic Pretrained Transformer), their company's flagship model. In addition to APT, Scaled Cognition has also constructed a platform for enterprise AI deployment that features agentic tooling, live agent monitoring and simulation and evaluation frameworks, the WSJ added. As covered here last year, hallucinations have become "a headline risk." A federal judge in Wyoming threatened to sanction attorneys who submitted AI-generated briefs peppered with phony cases, while Butler Snow, a law major firm, admitted last spring that its lawyers relied on hallucinated citations. "What might look like quirky tech failures in consumer chat apps quickly turns into reputational and regulatory landmines when applied to banking, payments or compliance," PYMNTS wrote at the time. "The industry once dismissed hallucinations as teething errors. Today, they are seen as structural." A WSJ report last year said that leading developers had begun training AI to say "I don't know" rather than improvise, realizing that probabilistic models will never be free from errors. In the payments world, Lloyds Bank and Coinbase increased confidence in their hallucination protections after deploying safer generative AI systems. And in the insurance sector, companies have begun offering policies to cover AI-related errors, including hallucinated outputs, "highlighting how serious the risk has become," PYMNTS added.
Share
Copy Link
Scaled Cognition, a Mountain View AI lab, has secured $100 million in Series A funding led by Khosla Ventures at a $750 million valuation. The startup claims its Agentic Pretrained Transformer model eliminates hallucinations by engineering reliability into the architecture, not bolting it on afterward. Fortune 500 companies in finance, healthcare, and insurance are already using the system.

Scaled Cognition has closed a $100 million Series A funding round led by Khosla Ventures, valuing the Mountain View startup at approximately $750 million
1
. The company targets a problem that has plagued AI adoption in high-stakes environments: AI hallucinations that produce confidently wrong answers. Rather than wrapping safety layers around existing frontier models, Scaled Cognition built its architecture from scratch to engineer reliability at the core1
.The startup is already working with Fortune 500 firms across financial services, healthcare, telecom, and insurance—regulated industries where a single error can trigger financial losses or compliance violations
1
. Co-founder and CEO Dan Roth told the Wall Street Journal that frontier models are "amazing" but also "sort of like schizophrenic geniuses" that can produce incredible answers one moment and completely incorrect ones the next2
.The company calls its flagship system Agentic Pretrained Transformer, or APT, and brands its output as Super-Reliable Intelligence
1
. APT is designed to match the conversational quality of leading models while sticking to policy and avoiding the probabilistic guesswork that causes hallucinations. Chief Technology Officer Dan Klein explained that "reliability is engineered into the architecture of our models, not bolted on after the fact"1
.Dan Roth argues the fix required years of rebuilding, not just more compute or effort. "The problem isn't resources or effort, it's architecture," he said
1
. The hardest failures, he noted, are not obvious errors but answers that look completely correct yet are quietly wrong—the kind a human reviewer waves through. In healthcare, for example, a hallucinated single digit in a prescription could lead to incorrect medication2
.Beyond the model itself, Scaled Cognition offers a full enterprise AI deployment platform that includes agentic tooling, simulation and evaluation frameworks, and live monitoring of agents in production
1
2
. APT runs in a private cloud or fully self-hosted, allowing enterprises to own their AI rather than renting it from third-party providers—a critical factor for regulated industries wary of sending sensitive data outside their control1
.Genesys, a cloud contact-center giant serving more than 8,000 organizations in over 100 countries, has deployed APT for agentic virtual agents and invested in the startup
1
. Over the next twelve months, companies using Scaled Cognition's models are on track to automate more than one billion customer support interactions1
. Roth claims the system resolves most issues and saves "hundreds of millions" in operational costs, noting that when a system makes mistakes 30% of the time, complex issues go unresolved and customers don't return1
.The company's ambitions extend beyond automating customer support interactions. Scaled Cognition is targeting the $600 billion business process outsourcing market, which includes outsourced customer service, IT support, HR, and finance
1
. The thesis represents a reversal: for years, enterprises shipped these jobs to third-party providers, but now some want to insource AI-driven workflows again, swapping managed services for AI workforces they own and control1
.Vinod Khosla, founding partner of Khosla Ventures, framed the investment as a hard road most avoid. "The way to quickly get into the market is to take a frontier model and put a layer on top," he said. "Most people are too lazy to do that. The result is Super-Reliable Intelligence: a model that will not give you a wrong answer. In any industry where an agent takes a real action, nothing else counts"
1
. Khosla has made similar bets before, backing startups like Pramaana Labs that work to make AI verifiable1
.AI hallucinations have become a headline risk across sectors. A federal judge in Wyoming threatened to sanction attorneys who submitted AI-generated briefs filled with phony cases, while major law firm Butler Snow admitted its lawyers relied on hallucinated citations
2
. What might look like quirky tech failures in consumer chat apps quickly turns into reputational and regulatory landmines when applied to banking, payments, or compliance2
.Enterprises have run AI pilots for two years but many have stalled on wider rollout, often because a single error can derail trust
1
. Leading developers have begun training AI to say "I don't know" rather than improvise, recognizing that probabilistic models will never be entirely free from errors2
. In the insurance sector, companies have started offering policies to cover AI-related errors, including hallucinated outputs, highlighting how serious the risk has become2
.The founding team brings credibility to the challenge. Dan Klein is a UC Berkeley professor of AI and veteran natural-language researcher, while Dan Roth and Klein previously built and sold one of the first agentic AI companies to Microsoft
1
. The Series A funding will help expand the research team and accelerate product development2
. As enterprises look to move from pilots to production, AI reliability and the ability to trust systems that take real actions will determine which models gain traction in sectors where mistakes carry consequences.Summarized by
Navi
[1]
1
Technology

2
Policy and Regulation

3
Technology
