Google DeepMind's AlphaGenome AI reads 1 million DNA letters to decode the genome's dark matter

Reviewed byNidhi Govil

7 Sources

Share

Google DeepMind unveiled AlphaGenome, an AI model that analyzes up to 1 million DNA letters at once to predict how mutations affect gene expression. The tool tackles the human genome's non-coding regions—98% of our DNA once dismissed as junk—and could accelerate research into genetic diseases, cancer, and gene therapies. Nearly 3,000 scientists across 160 countries have already used it.

News article

AlphaGenome tackles the genome's mysterious non-coding regions

Google DeepMind has introduced AlphaGenome, an AI model designed to decode the vast stretches of DNA that have long puzzled scientists

1

. The tool analyzes up to 1 million base pairs of DNA sequence at once, predicting how single-letter mutations in those stretches affect gene expression and other biological functions

2

. This marks a significant leap from previous models like Borzoi, which could handle only 500,000 DNA letters

2

. The human genome contains roughly 3 billion letters of genetic code, but only about 2% consists of genes that encode proteins

4

. The remaining 98%—non-coding DNA once dismissed as "junk"—plays a crucial role in regulating gene activity and harbors many mutations linked to disease

3

. AlphaGenome targets this dark matter of the genome, offering researchers a way to predict the molecular impact of variations across entire DNA sequences

3

.

How the deep learning model works and what it predicts

Described in Nature, AlphaGenome is a deep learning model trained on human and mouse genome data

3

. The AI model takes a DNA sequence as input and delivers predictions on 11 distinct biological processes, including gene expression, RNA splicing, DNA accessibility to proteins, and chromatin structure

2

. It can predict nearly 6,000 human genetic signals tied to specific functions simultaneously

5

. "The genome is like the recipe of life, and really understanding 'What is the effect of changing any part of the recipe?' is what AlphaGenome sort of looks at," said Pushmeet Kohli, Google DeepMind's vice president for science

1

. The model matched or outperformed state-of-the-art tools in 25 out of 26 tasks predicting the effects of genetic variations

3

. It identified gene activity changes in certain cell types 14.7% better than Borzoi2

2

. A key advance is AlphaGenome's ability to pinpoint biologically important spots down to single-base resolution, far higher than Borzoi's 32 base-pair bins

2

.

Practical applications for genetic diseases and cancer research

AlphaGenome functions as a research tool with potential applications in diagnosing rare conditions, identifying cancer-driving mutations, and designing gene therapies

1

. The model has already been used to simulate known DNA mutations responsible for a type of leukaemia, predicting the same results observed in lab experiments

3

. Dr. Gareth Hawkes from the University of Exeter is using AlphaGenome to explore how mutations alter risk for obesity and diabetes

4

. Studies sequencing tens of thousands of people have identified variants linked to these conditions, often in the dark genome, and AlphaGenome helps researchers rapidly predict what those variants do so they can be tested in the lab

4

. In cancer research, the tool can predict which mutations fuel tumor growth and serve as potential drug targets, distinguishing them from incidental mutations

4

. The model could also accelerate understanding of diseases through genome-wide association studies and help craft better treatments for genetic diseases

1

.

Building on AlphaFold's Nobel Prize-winning legacy

AlphaGenome follows Google DeepMind's AlphaFold, an AI model that predicts protein structure from amino acid sequences—work that earned researchers the Nobel Prize in Chemistry in 2024

1

. The company also released AlphaMissense in 2023, which predicts how mutations in protein-coding regions affect gene function

1

. AlphaGenome extends this work to the entire genome, unifying multiple genomics tasks under one roof

3

. "Previously, the field required separate models for separate tasks," says Žiga Avsec, a research scientist leading DeepMind's genomics initiative

3

. Robert Goldstone, head of genomics at the Francis Crick Institute, called AlphaGenome "a foundational, high-quality tool that turns the static code of the genome into a decipherable language for discovery"

3

.

Early adoption and acknowledged limitations

Google DeepMind released a preview of AlphaGenome for non-commercial research in June last year, and nearly 3,000 scientists in 160 countries have since used it, submitting around 1 million requests daily

3

. Despite its capabilities, the model has limitations. It struggles to predict the influence of genetic alterations more than 100,000 base pairs apart and can only make predictions about DNA sequences from cell types used in training—primarily human and mouse

3

. Unpublished data indicates the model has difficulty predicting how gene activity changes in individuals

2

. "Even when the model makes accurate predictions, it does not always directly inform us of the underlying biological processes," notes Jian Zhou, a genomics machine learning researcher at the University of Chicago

3

. Ben Lehner from the Wellcome Sanger Institute tested AlphaGenome using over half a million experiments and confirmed it performs very well, but cautioned that "AlphaGenome is far from perfect and there is still a lot of work to do"

5

. Researchers suggest the next major advance will come from generating new types of data for the model to analyze

2

.

Today's Top Stories

TheOutpost.ai

Your Daily Dose of Curated AI News

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Spend less time searching for the latest in AI and get straight to action.

© 2026 Triveous Technologies Private Limited
Instagram logo
LinkedIn logo