2 Sources
[1]
Why Financial Institutions Are Converging on Transaction Foundation Models to Build Their Own Intelligence
Financial institutions have spent years building AI: fraud models, credit models, recommendation engines and risk systems. While this sprawl of task-specific models has been effective, it's also constrained by siloed systems. Siloed systems prevent institutions from developing a unified understanding of consumers' financial behavior. As enterprise datasets keep growing, so does the gap between what institutions know and what their AI can reason over -- creating a major opportunity for the industry to build intelligence using proprietary data. NVIDIA's 2026 State of AI in Financial Services report shows 65% of institutions now use AI, with nearly 90% deploying or assessing it and almost all maintaining or increasing spend. But as AI scales, so does complexity, and fragmented model architectures become the limiting factor. Leading firms are tackling this challenge by rethinking the architecture itself. Where the industry once relied on statistical and machine learning algorithms purpose-built for each line of business, transformer-based transaction foundation models now make it possible to learn a single, unified representation of consumer behavior trained entirely on proprietary data. Transaction foundation models are large-scale AI systems trained on billions of financial events -- such as payments, transfers, product interactions and behavioral signals -- that transform raw data into intelligence, helping firms better serve their customers. The shift is structural. A traditional fraud model evaluates isolated signals. A foundation model interprets behavior in context where timing, device, location and prior activity shape meaning. More importantly, it brings the power of transformer architectures to tabular data, extracting signals previously invisible to traditional algorithms. A payment at midnight means something different when it's the fourth in 10 minutes, on an unfamiliar device, in a city the customer's never transacted from before. That contextual depth improves performance across tasks, not just within them. In collaboration with NVIDIA, Revolut built PRAGMA -- a family of transformer-based foundation models trained on 24 billion events across 26 million user records spanning over 100 countries. Powered by NVIDIA's full AI stack -- including NVIDIA Hopper GPUs, the NVIDIA cuDF library and NVIDIA Nemotron open models -- running on Nebius cloud, a single foundation model outperforms strong task-specific models across domains like credit scoring, fraud detection and product recommendations while reducing reliance on handcrafted features. "We move from weeks, or even in some cases months, in feature engineering to no time required for it at all," said Tadas Kriščiūnas, head of group credit data science at Revolut. Any institution can now adopt this approach using NVIDIA's new Build Your Own Transaction Foundation Model developer example, which enables teams to start building transformer embeddings on tabular transaction data -- integrating into existing pipelines without rebuilding from scratch. The Cost of Fragmentation The problem isn't today's models, it's the trajectory. Every new use case adds another model. Every new market needs retraining. Models that can't share context leave value on the table. Mastercard is developing a proprietary large tabular foundation model for payments, trained on billions of anonymized transactions today and designed to scale to hundreds of billions across additional datasets including fraud, authorization, chargeback, merchant location and loyalty data. Built with capabilities from NVIDIA, AWS and Databricks -- including the NVIDIA NeMo AutoModel open library, part of NVIDIA NeMo framework, and accelerated computing -- the model is intended to reduce reliance on a multitude of AI models across markets, customers and use cases. Early testing shows it outperforming standard machine learning techniques, with promising applications in cybersecurity, fraud detection, loyalty, personalization, portfolio optimization and analytics. Adyen has also deployed transaction foundation models at scale, processing $1 trillion in payments. Using reinforcement learning, Adyen maximizes conversion and minimizes risk for merchants. "Even fractional improvements like a 0.1% uplift in authorization can translate to massive incremental gross merchandise value and substantial cost reductions," said Dhruv Ghulati, principal AI product manager at Adyen. Semantic Layer for Agentic Commerce Forty-two percent of financial firms are already using or assessing agentic AI. As these systems begin to execute transactions -- like managing subscriptions, routing payments and making purchases -- the nature of financial behavior is changing. Stripe is using the NVIDIA and AWS platform to build foundation models that understand the full context of transactional behavior rather than reacting to individual signals -- blocking close to $112 billion in fraud last year and delivering an average 38% reduction in fraud rates. Transaction data is the proprietary history that competitors can't replicate. The data already exists. The architecture is proven. The infrastructure is ready. Scaling Through Ecosystem Partners The Build Your Own Transaction Foundation Model developer example is available for customers to run on Amazon Web Services (AWS), deployed with Amazon SageMaker HyperPod, as well as Nebius AI Cloud -- powered by NVIDIA accelerated computing. Nebius AI Cloud supports the full transaction foundation model lifecycle -- from deployment of the developer example through multi-node training to managed inference on Token Factory -- powered by NVIDIA accelerated computing. Financial services firms can also work with services partners EXL, Infosys, GFT IT Consulting and Thoughtworks to apply the developer example to their specific use cases. EXL is integrating transaction foundation models into its EXLerate.ai platform to unify siloed financial data into a scalable, enterprise intelligence layer powered by proprietary transaction data. In collaboration with NVIDIA, EXL is using this architecture to help financial institutions accelerate model development, enhance contextual decisioning and operationalize agentic AI at scale. Thoughtworks is helping financial institutions operationalize transaction foundation models within complex banking environments, integrating them into payment, servicing and risk while establishing the necessary governance and AI operating models. The company will be showcasing a demo and presentation on transaction foundation models at the upcoming AWS Summit in New York City on Wednesday, June 17. GFT IT Consulting is integrating transaction foundation models into its flagship solutions: Wynxx, an agentic AI platform used by over 100 financial institutions for secure AI adoption in areas like credit risk, and Smaragd, a compliance engine that reduces false positives by up to 75% for major banks. Join NVIDIA at Money20/20 Europe from June 2-4 to learn how transaction foundation models are powering the next generation of AI in financial services. Explore the Build Your Own Transaction Foundation Model developer example on build.nvidia.com.
[2]
Banks and FinTechs Are Sitting on the Most Powerful AI Dataset in Finance | PYMNTS.com
The architecture mirrors how large language models learn from text, except the training material is financial behavior. The timing is pointed: Nvidia's 2026 State of AI in Financial Services report found 65% of financial institutions already use AI, with nearly 90% deploying or assessing it. The bigger obstacle, it found, isn't adoption, but the sprawl those efforts created. Most banks have built too many disconnected AI systems, and the fragmentation is now what's slowing them down. From Hundreds of Models to One Instead of building a new AI system every time a new problem arises, banks train one model on all their transaction data and apply it across problems. That shared history changes what the system can see: a payment at midnight looks different when it's the fourth in ten minutes, from an unfamiliar device, in a city the customer has never bought from before. Revolut showed what that looks like in practice. In April, the neobank published results from PRAGMA, a model trained on 40 billion transactions across 25 million customers in 111 countries. One system now handles credit decisions, fraud detection and product recommendations that previously required separate ones. Revolut's head of group credit data science, Tadas Kriščiūnas, said in a statement that the initiative cut the time needed to set up a new use case from weeks or months to "no time required for it at all." Mastercard is building toward the same outcome at larger scale. PYMNTS reported in March that the payments network is developing a model trained on billions of anonymized card transactions, including fraud, chargebacks, merchant data and loyalty activity. Personal identifiers are stripped before training begins, so the system works from spending patterns rather than individual records. Stripe is also rolling out its own AI strategy. According to a May report from TechCrunch, Stripe launched its payments foundation model trained on tens of billions of transactions, raising detection rates for one common type of payment fraud on large businesses from 59% to 97%. Using the Nvidia and AWS platform, Stripe blocked close to $112 billion in fraud last year and cut average fraud rates by 38%, according to a Nvidia blog post on Monday (June 1). The Business Case Against Fragmentation The argument for consolidation isn't just operational. It's competitive. Every bank that maintains separate AI models for separate problems pays a compounding cost: each new market requires retraining from scratch, each new use case adds another system to maintain, and none of those systems can use what the others have learned. Adyen, which processes $1 trillion in payments annually, said in the Nvidia blog post that even a 0.1% improvement in the rate at which payments successfully clear translates to significant incremental revenue for merchants. Shared intelligence makes those gains possible across the board, not just in one product line. A bank that consolidates its AI into one model trained on its full transaction history can move faster, make better decisions and extend those gains to new problems without starting over. An Industry Starting Point Not every institution has Revolut's data or Stripe's engineering capacity. Nvidia's blueprint is designed to give smaller institutions a starting point, letting teams build on their own transaction data without rebuilding systems from scratch. Services firms including Infosys, EXL and Thoughtworks are helping banks integrate the approach into existing credit, servicing and compliance environments. What Revolut, Stripe, Mastercard and Adyen share is an asset competitors can't replicate: years of their own customers' transaction history. For all PYMNTS AI coverage, subscribe to the daily AI Newsletter.
Share
Copy Link
Financial institutions are replacing fragmented AI systems with transaction foundation models trained on billions of proprietary events. Revolut's PRAGMA handles credit decisions, fraud detection, and recommendations through one model trained on 40 billion transactions. Mastercard and Stripe are developing similar systems, with Stripe blocking $112 billion in fraud last year using its payments foundation model.
Financial institutions are dismantling years of siloed AI infrastructure in favor of transaction foundation models that learn from billions of proprietary financial data points. According to NVIDIA's 2026 State of AI in Financial Services report, 65% of institutions now use AI, with nearly 90% deploying or assessing it
1
. But adoption isn't the challenge anymore—fragmentation is. Banks have built too many disconnected AI models for fraud detection, credit decisions, and product recommendations, creating systems that can't share context or scale efficiently2
.The shift to transaction foundation models represents a structural change in how financial institutions approach AI capabilities in finance. Instead of training separate models for each use case, banks now build transformer-based systems trained on complete transaction data histories—payments, transfers, product interactions, and behavioral signals—that extract patterns invisible to traditional algorithms
1
.
Source: NVIDIA
Revolut demonstrated the power of this consolidated AI strategy with PRAGMA, a family of transformer-based foundation models trained on 24 billion events across 26 million user records spanning over 100 countries. Built in collaboration with NVIDIA using Hopper GPUs, the cuDF library, and Nemotron open models running on Nebius cloud, PRAGMA outperforms task-specific models across credit scoring, fraud detection, and product recommendations while eliminating handcrafted features entirely
1
."We move from weeks, or even in some cases months, in feature engineering to no time required for it at all," said Tadas Kriščiūnas, head of group credit data science at Revolut
1
. The neobank's approach mirrors large language model architecture, except the training material is financial behavior rather than text. A payment at midnight means something different when it's the fourth in 10 minutes, on an unfamiliar device, in a city the customer's never transacted from before—contextual depth that traditional fraud detection systems miss2
.Mastercard is developing a proprietary large tabular foundation model for payments, trained on billions of anonymized transactions today and designed to scale to hundreds of billions across fraud, authorization, chargeback, merchant location, and loyalty data. Built with capabilities from NVIDIA, AWS, and Databricks—including the NeMo AutoModel open library and accelerated computing—the model aims to reduce reliance on multiple AI models across markets, customers, and use cases
1
. Early testing shows it outperforming standard machine learning techniques, with promising applications in cybersecurity, portfolio optimization, and analytics2
.Stripe launched its payments foundation model trained on tens of billions of transactions, raising detection rates for one common type of payment fraud on large businesses from 59% to 97%. Using the NVIDIA and AWS platform, Stripe blocked close to $112 billion in fraud last year and delivered an average fraud rate reduction of 38%
1
2
.
Source: PYMNTS
Adyen has deployed transaction foundation models at scale, processing $1 trillion in payments annually. Using reinforcement learning, the FinTech maximizes conversion and minimizes risk for merchants. "Even fractional improvements like a 0.1% uplift in authorization can translate to massive incremental gross merchandise value and substantial cost reductions," said Dhruv Ghulati, principal AI product manager at Adyen
1
.Related Stories
The argument for consolidation extends beyond operational efficiency to competitive advantage. Every bank maintaining separate AI models pays a compounding cost: each new market requires retraining from scratch, each new use case adds another system to maintain, and none of those systems can use what the others have learned
2
. What Revolut, Stripe, Mastercard, and Adyen share is an asset competitors can't replicate—years of their own customers' transaction history.NVIDIA released a Build Your Own Transaction Foundation Model developer example, enabling teams to start building transformer embeddings on tabular transaction data and integrate into existing pipelines without rebuilding from scratch
1
. Services firms including Infosys, EXL, and Thoughtworks are helping banks integrate the approach into existing credit, servicing, and compliance environments2
.Forty-two percent of financial firms are already using or assessing agentic AI. As these systems begin to execute transactions—managing subscriptions, routing payments, and making purchases—the nature of financial behavior is changing
1
. Transaction foundation models provide the semantic layer these agents need to understand the full context of transactional behavior rather than reacting to individual signals, positioning financial institutions to move faster and make better decisions without starting over for each new problem.🟡 centrifugal force and more effective problem-solvingSummarized by
Navi
1
Technology

2
Policy and Regulation

3
Technology
