AI in Chemistry breakthrough: Two systems accelerate molecule design and drug discovery by 10x

Reviewed byNidhi Govil

2 Sources

Share

Researchers unveiled two AI systems that transform chemical synthesis and molecule design. Yale's MOSAIC successfully helped synthesize 35 new compounds for pharmaceuticals and materials, while NYU's PropMolFlow generates molecular candidates ten times faster than previous tools. Both systems address critical bottlenecks in drug discovery and could accelerate the path from laboratory concept to real-world products.

AI in Chemistry Removes Critical Bottleneck in Drug Discovery

Two groundbreaking AI systems are reshaping how scientists approach chemical synthesis and molecule design, potentially cutting years from the drug discovery process. Yale University researchers developed MOSAIC, an AI system for chemical synthesis that successfully helped create 35 new compounds with potential applications in pharmaceuticals, agrochemicals, and cosmetics

1

. Meanwhile, teams at New York University and the University of Florida built PropMolFlow, which generates new compounds approximately ten times faster than earlier tools

2

.

Source: Earth.com

Source: Earth.com

"The synthesis of small molecules is the slow step in drug discovery and a number of other important areas," says Timothy Newhouse, a chemist at Yale University and study co-author

1

. MOSAIC addresses this bottleneck by drafting complete laboratory instructions detailed enough for chemists to follow when creating molecules that have not previously existed.

MOSAIC Uses Expert Models for AI-Assisted Chemistry

The MOSAIC system represents a conceptual shift in how AI in chemistry operates. Rather than deploying a single massive large language model, researchers trained Meta's partially open-source Llama LLM to create 2,498 separate expert models

1

. Each specialized model focuses on one combination of chemical transformation starting from a specific molecule type, drawn from 2,285 subsets clustered from a database of around one million reactions extracted from patents.

Martin Seifrid, a materials scientist at North Carolina State University, notes that MOSAIC avoids "throwing the largest possible model at a problem, instead choosing to focus on a carefully designed system of much smaller 'expert' models." Each specialized model proves more accurate within its domain

1

. This modular approach can run on local computers because it uses fewer parameters than major large language model systems.

When researchers tested MOSAIC's recommendations in the laboratory, they successfully produced 35 out of 52 suggested new substances. The system also accurately predicted the color and form of the compounds. More remarkably, MOSAIC suggested reaction methodologies absent from the millions of reactions used in training, proposing an entirely new way to make azaindoles that proved successful when tested

1

.

Source: Nature

Source: Nature

PropMolFlow Accelerates Chemical Synthesis Through Property-First Design

While MOSAIC optimizes existing chemical knowledge, PropMolFlow tackles molecule design from a different angle. Led by Stefano Martiniani at New York University, the system works backward from target properties to molecular structure rather than tweaking existing chemistry

2

. This property-first approach lets researchers specify desired traits and receive candidate molecular blueprints on demand.

PropMolFlow achieves its speed advantage through computational efficiency. The system starts from random noise and refines it step by step until atoms and bonds form a stable pattern, requiring about 100 computational steps to reach a valid structure where other approaches often need around 1,000

2

. Fewer steps translate directly to reduced waiting time, enabling early screening cycles to move much more quickly.

Chemical Validity Remains Critical Challenge

Speed means little if generated molecules violate basic chemistry rules or cannot be synthesized in actual laboratories. PropMolFlow addresses this fundamental challenge by producing structures with correct bonding patterns and realistic shapes more than 90 percent of the time

2

. "This matters because many earlier approaches produced structures that looked superficially plausible but violated basic chemical rules," Martiniani explains.

The team also tackled another critical issue in AI-assisted chemistry: the risk of neural networks grading their own work. "If a neural network generates a molecule and another neural network predicts its properties, both systems may share similar blind spots because they are drawing from the same reservoir of information; AI is then grading its own homework," Martiniani observes

2

. To avoid this trap, the team used density functional theory, a quantum method that calculates properties from electrons rather than relying on another neural network.

Industry Adoption and Open-Source Availability

The Yale team developed MOSAIC in collaboration with researchers at Boehringer Ingelheim's Connecticut site, who are already using the system. "They are interested in designing new synthetic pathways," says Victor Batista, a theoretical and computational chemist at Yale University and study co-author. "If they reduce the number of steps, they save a lot of money." MOSAIC is available as open-source code for other groups to use

1

.

Kuangbiao Liao, a chemist at Guangzhou National Laboratory, calls the expert framework "an important conceptual advance in AI-assisted chemistry" that moves AI from prediction to action. The framework "preserves competing chemical objectives instead of collapsing them into one averaged model, which better reflects how chemists actually reason at the bench"

1

.

Implications for Discovery of New Medicines and Materials

These systems enable iterative design cycles that could fundamentally change how laboratories operate. "With the ability to generate thousands of chemically valid, property-targeted candidates in minutes rather than hours, researchers can iterate faster: generate candidates, filter computationally, validate the best ones with physics or experiments, and feed results back to improve the next round," Martiniani explains

2

.

Newhouse suggests that integrating MOSAIC's step-by-step instructions into automated systems would be a "natural next step"

1

. This integration could create fully automated pipelines from computational chemistry design to physical synthesis, though chemists will still determine which candidates deserve experimental validation.

Both systems currently face limitations when scaling to larger, more complex molecules typical of modern pharmaceuticals. Bigger structures introduce more failure modes, awkward shapes, unstable charges, and synthesis challenges. Researchers acknowledge the need to adapt these models for larger molecular systems where new atoms and bonds create far more ways for designs to fail

2

. However, by cutting computational time and improving hit rates in early screening, these AI systems promise to push more promising ideas toward serious testing, potentially shortening the timeline from laboratory concept to market-ready product.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo