Âé¶¹ÒùÔº

July 8, 2024

Improving safety of AI research for engineering biology

Proactively exploring data hazards in synthetic biology research. Credit: Thomas Gorochowski/University of Bristol
× close
Proactively exploring data hazards in synthetic biology research. Credit: Thomas Gorochowski/University of Bristol

Hazards posed by using data-centric methods to engineer biology have been identified by experts at the University of Bristol with the aim of making future research safer.

The potential misuse of data-centric approaches in synthetic biology poses significant risk. The ease of access to tools may enable nefarious actors to develop harmful biological agents for purposes such as bioterrorism or to disrupt intentionally.

, published in Synthetic Biology, suggest additional Data Hazard labels that describe data related risks in area of synthetic biology.

The work is the result of a collaboration between researchers from across the Bristol Center for Engineering Biology (BrisEngBio) and the Jean Golding Institute for Data Intensive Research.

Get free science updates with Science X Daily and Weekly Newsletters — to customize your preferences!

Kieren Sharma, co-author and Ph.D. student working in AI for cellular modeling in the School of Engineering Mathematics and Technology said, "We're entering a transformative era where artificial intelligence and converge to revolutionize , accelerating the discovery of novel compounds, from life-saving pharmaceuticals to sustainable biofuels.

"Our study has uncovered potential risks associated with the specific types of data being used to train the latest systems biology models. For instance, inconsistencies in measurements from complex and dynamic living organisms and privacy concerns that could compromise the safety of next-generation models trained on human genome data."

The project extends the work of the , which aims to create a clear vocabulary of the potential hazards of data science research.

Co-author and co-lead of the Data Hazards project, Dr. Nina Di Cara from the School of Psychological Science, explained, "Having a clear vocabulary of hazards makes it easier for researchers to think proactively about what the risks of their work are and to help put mitigating actions in place. It also makes communication easier for people working across fields who sometimes use different language to talk about the same issues."

To achieve these clear vocabularies, interdisciplinary collaboration is essential.

Dr. Daniel Lawson, Director of the Jean Golding Institute and Associate Professor in Data Science in the School of Mathematics noted that "As datasets grow in magnitude and ambition, increasingly sophisticated algorithms are developed to gain new insights. This complexity makes an un-siloed collaborative approach to identifying and preventing downstream harms essential."

Dr. Thomas Gorochowski, senior author and Associate Professor of Biological Engineering in the School of Biological Sciences, added, "Data science is set to revolutionize how we engineer biology to harness its unique capabilities to tackle global challenges covering the sustainable production of materials and fuels the development of innovative therapeutics.

"The extensions developed by our team will help bioengineers consider and discuss risks around data-centric approaches to their research and help ensure the huge benefits of bio-based solutions are realized in a safe way."

More information: Natalie R Zelenka et al, Data hazards in synthetic biology, Synthetic Biology (2024).

Provided by University of Bristol

Load comments (0)

This article has been reviewed according to Science X's and . have highlighted the following attributes while ensuring the content's credibility:

fact-checked
trusted source
proofread

Get Instant Summarized Text (GIST)

This summary was automatically generated using LLM.