麻豆淫院


Computational tool helps to identify long non-coding RNAs in plants

New computational tool helps to identify long non-coding RNAs
Jian-Feng Mao led the international team that developed the new long non-coding RNA analysis tool. Credit: Mattias Pettersson

An international research team, led by Jian-Feng Mao, has developed PlantLncBoost, a new computational tool that helps to identify long non-coding RNAs in plants. These RNAs are crucial for numerous biological processes but differ a lot between different plant species. PlantLncBoost addresses this challenge with very high accuracy, offering new possibilities for genomic studies in plants. These findings were recently in the journal New Phytologist.

Long non-coding RNAs, called lncRNAs, are transcribed from DNA as other RNAs but they do not carry instructions for proteins. Instead, they help control genes, guide plant development and are involved in plant responses to stress like drought or heat. Identifying these lncRNAs has been difficult because their genetic sequences vary greatly between different plant species.

The team around Mao tackled the problem using machine learning, a type of artificial intelligence that is trained on large amounts of data to find patterns. They analyzed more than 1,600 different features of lncRNAs and identified just three key features that could effectively distinguish lncRNAs from RNAs containing the code for a protein.

Identification of sequence patterns using mathematical parameters

What makes PlantLncBoost particularly innovative is its use of mathematical parameters to capture intrinsic sequence properties beyond traditional biological features. The research team used so-called Fourier transformation-based approaches. That allowed them to detect patterns in the RNA sequences that are consistent across diverse plant species despite the high variability in the genetic sequences.

"Through systematic evaluation of multiple machine learning algorithms and rigorous parameter optimization, we have developed a tool that achieves both high accuracy and strong generalization capabilities," explains Mao, Associate professor at Ume氓 University, who established his lab at the Ume氓 Plant Science Center in 2023.

To make sure their new tool worked, the team tested PlantLncBoost on datasets from 20 different plant species. It correctly identified lncRNAs with more than 96% accuracy, significantly outperforming existing tools. The tool even recognized nearly all 358 long lncRNAs that had been experimentally validated before, including those from 12 species that were not included in the training set used to develop the tool.

New possibilities to analyze long non-coding RNAs across species

"Developing PlantLncBoost was an exciting opportunity to apply machine learning to solve a complex biological problem," says first author Xue-Chan Tian, who completed this work as part of her Ph.D. thesis at Beijing Forestry University. "My doctoral program focused on combining advanced computational methods with plant genomics to extract meaningful biological insights from complex sequence data."

The project brought together experts in genomics, bioinformatics and computer science from around the world, including researchers from Sweden, China and Brazil. The tool is now freely available to the scientific community and has been integrated in a larger analysis workflow that was developed earlier by Mao's group. It allows not only the identification but also the characterization of lncRNAs in plants.

By implementing PlantLncBoost in this workflow, researchers can now identify long non-coding RNAs from different plant much more accurately, making it easier to compare and analyze them.

More information: Xue鈥怌han Tian et al, PlantLncBoost: key features for plant lncRNA identification and significant improvement in accuracy and generalization, New Phytologist (2025).

Journal information: New Phytologist

Provided by Umea University

Citation: Computational tool helps to identify long non-coding RNAs in plants (2025, June 4) retrieved 30 July 2025 from /news/2025-06-tool-coding-rnas.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

A new tool for plant long non-coding RNA identification

0 shares

Feedback to editors