Âé¶¹ÒùÔº

March 3, 2025

Predicting SARS-CoV-2 variant infectivity with biophysical principles

Link between fitness and thermodynamics of protein folding and binding. High fitness variants may exhibit improved stability in the folded state or in the ACE2-bound state to facilitate cellular entry or have the capacity to destabilize bound-to-antibody states, thereby enabling evasion. Credit: Dianzhuo Wang
× close
Link between fitness and thermodynamics of protein folding and binding. High fitness variants may exhibit improved stability in the folded state or in the ACE2-bound state to facilitate cellular entry or have the capacity to destabilize bound-to-antibody states, thereby enabling evasion. Credit: Dianzhuo Wang

When the COVID-19 pandemic first began, we saw how quickly the SARS-CoV-2 virus evolved. New variants emerged with mutations that increased transmissibility or helped the virus evade our immune systems. But predicting which mutations would be most successful—and why—has remained a challenge.

In our study, in the Proceedings of the National Academy of Sciences, we take a different approach. Rather than relying purely on genomic surveillance, we apply biophysical principles to understand viral evolution. Specifically, we developed a statistical mechanics model that connects the thermodynamic properties of the virus's spike protein to its ability to spread.

A biophysical map of viral fitness

The key player in SARS-CoV-2's infection process is the receptor binding domain (RBD) of the spike protein. The RBD determines how well the virus binds to ACE2, the human cellular receptor, and how effectively it avoids neutralization by antibodies.

We found that binding affinities—how tightly the viral RBD binds to ACE2 and neutralizing antibodies—are powerful predictors of how a will behave in a population. Stronger ACE2 binding can make a variant more infectious, while weaker antibody binding allows it to escape immune defenses.

Our biophysical model integrates these molecular measurements to generate a fitness landscape—a map that predicts which viral mutations will be favored by evolution, which we validated with real world sequencing data.

Predicting fitness using experimental and machine learning predictions

To make this model practical, we combined experimental binding affinity data with machine learning predictions. Some RBD mutations have already been studied in the lab, but many have not. To fill in the gaps, we trained a transformer to estimate the dissociation constants (KD) for unseen mutations.

These predicted values were then fed into our biophysical model, allowing us to forecast the fitness of variants before they appear in large numbers in the population.

Predicted fitness compared with actual fitness for variants observed in 2020, 2021 and 2022. Each point is a variant of SARS-CoV-2 whose binding affinities are acquired by our trained machine learning model. Credit: Dianzhuo Wang
× close
Predicted fitness compared with actual fitness for variants observed in 2020, 2021 and 2022. Each point is a variant of SARS-CoV-2 whose binding affinities are acquired by our trained machine learning model. Credit: Dianzhuo Wang

Get free science updates with Science X Daily and Weekly Newsletters — to customize your preferences!

Why some mutations disappear

Our model also helps explain puzzling evolutionary reversals. For example, the Q493R mutation initially helped the virus evade antibodies, but later variants like BA.4 and BA.5 dropped this mutation. Why? Because fitness is a trade-off—while Q493R helped with immune escape, it also weakened ACE2 binding. As other mutations accumulated, the cost of keeping Q493R became too high, and natural selection reversed it.

This is an example of epistasis—the way mutations interact with each other. Instead of acting independently, mutations can amplify or cancel each other's effects. Our model captures these interactions, making it more realistic than previous approaches that assumed all act in isolation.

Beyond COVID-19: A universal approach to viral evolution

Although we developed this model for SARS-CoV-2, the principles behind it apply to many other viruses. Any virus that relies on protein binding for infection and immune escape—such as influenza or HIV—could be analyzed using similar methods.

By linking molecular properties to real-world epidemiology, our approach offers a powerful tool for pandemic preparedness. Instead of waiting for new variants to spread, we could predict their fitness in advance, guiding public health decisions, vaccine updates, and treatment strategies. By grounding in biophysical reality, we move beyond trial-and-error surveillance toward a predictive science of pandemic evolution.

This story is part of , where researchers can report findings from their published research articles. for information about Science X Dialog and how to participate.

More information: Dianzhuo Wang et al, Biophysical principles predict fitness of SARS-CoV-2 variants, Proceedings of the National Academy of Sciences (2024).

Journal information: Proceedings of the National Academy of Sciences

Load comments (0)

This article has been reviewed according to Science X's and . have highlighted the following attributes while ensuring the content's credibility:

fact-checked
peer-reviewed publication
trusted source
written by researcher(s)
proofread

Get Instant Summarized Text (GIST)

The study applies biophysical principles to predict SARS-CoV-2 variant infectivity, focusing on the spike protein's receptor binding domain (RBD). Binding affinities to ACE2 and antibodies are key predictors of variant behavior. A statistical mechanics model integrates these affinities to create a fitness landscape, validated with sequencing data. Machine learning estimates dissociation constants for unstudied mutations, enhancing prediction accuracy. The model explains evolutionary reversals and epistasis, offering a universal approach applicable to other viruses, aiding in pandemic preparedness by predicting variant fitness in advance.

This summary was automatically generated using LLM.