Âé¶¹ÒùÔº

July 23, 2025

Flabby and flexible: How machine learning helps to build new proteins

From noise to date: artist's impression of the Geometric Algebra Flow Matching (GAFL). Credit: HITS
× close
From noise to date: artist's impression of the Geometric Algebra Flow Matching (GAFL). Credit: HITS

The natural protein universe is vast, and yet, going beyond and designing new proteins not observed in nature can yield new functions and can solve problems in medicine or materials science. The past few years have marked the golden age of de novo protein design: machine learning methods have led to an unprecedented level of modeling accuracy. This progress enables researchers to design protein structures with specific functional properties never observed before. This is of particular interest for biotechnological applications, therapeutics development and sustainability problems, such as plastic degradation.

One of the key features of functional proteins—large biomolecules with complex structures—is their inherent structural flexibility: They wiggle, jiggle and change shape. But current designs largely lack this important feature.

For a team of researchers from the Heidelberg Institute for Theoretical Studies (HITS) and the Max Planck Institute for Polymer Research (MPIP) this was the starting point to deliberate about whether one could design proteins with a custom flexibility from scratch. They the results of their work at the International Conference on Machine Learning () in Vancouver, Canada.

Matching the flow: A model for de novo proteins

"We wanted to build a model that learns how to generate proteins such that their structures are flexible to a given extent at a given position," says first author Vsevolod Viliuga (MPIP).

To that end, the team introduced a framework for generating flexible protein structures. This framework is based both on a trained to predict the flexibilities of backbones and a generative model for .

"Natural proteins are so excellent in fulfilling their tasks because they are flexible wherever needed," says co-author Leif Seute (HITS). "We can now design novel proteins that mimic this key property."

HITS group leader Jan Stühmer adds, "It is an extension of the Geometric Algebra Flow Matching model, in short: GAFL, that we developed last year."

GAFL is three times faster than comparable models and not only achieves high designability, but also resembles the natural proteins better in various aspects.

In the end, the team showed that the model can generate proteins with the desired flexibility patterns, even for patterns that are uncommon in natural proteins. Frauke Gräter (MPIP), one of the team leaders, resumes: "This work is a step forward to design new proteins for applications where flexibility is required, such as enzyme catalysts."

More information: Flexibility-conditioned protein structure design with flow matching:

Load comments (0)

This article has been reviewed according to Science X's and . have highlighted the following attributes while ensuring the content's credibility:

fact-checked
trusted source
proofread

Get Instant Summarized Text (GIST)

Machine learning models now enable the design of de novo proteins with customizable structural flexibility, a key property for biological function. A neural network-based framework predicts and generates protein backbones with specified flexibility patterns, outperforming previous models in speed and designability. This approach expands possibilities for biotechnological and therapeutic applications.

This summary was automatically generated using LLM.