麻豆淫院


Big data approach makes plant predictions more accurate

New data approach makes plant predictions more accurate
Prediction abilities for yield of across-series training sets sharing breeding-related characteristics. Credit: Plant Biotechnology Journal (2025). DOI: 10.1111/pbi.70095

Large amounts of data ("big data") offer enormous potential for improving the accuracy of genome-wide predictions in plant breeding. Encouraged by successful results with wheat hybrids, researchers at the IPK Leibniz Institute have now extended this approach to so-called inbred lines.

For the first time, they combined phenotypic and genotypic data from four commercial wheat breeding programs. The study results were in the Plant Biotechnology Journal.

Deep learning methods have become increasingly crucial in genomic prediction in recent years. In contrast to conventional methods, deep learning approaches work with flexible, non-linear transformations of the input data. The aim is to recognize patterns in the data and link these to observable characteristics such as yield or plant height.

The parameters required for this are optimized based on extensive training data. Such methods promise particular advantages when plant characteristics are strongly influenced by that are insufficiently considered in conventional models.

In this context, a research team at the IPK has taken on the role of academic data trustee and merged the data from four wheat breeding programs with trial data from earlier public-private partnerships.

"We needed data from many genotypes that had already been tested in different environments, i.e., at different locations," explains Prof. Dr. Jochen Reif, head of the department "Breeding Research" at the IPK.

The new data set covered 12 years of trial activity in 168 environments and formed a training set for genomic predictions with up to 9,500 genotypes鈥攊ncluding grain yield, plant height and heading date. One main challenge was merging the different data and ultimately making it comparable.

"Despite the heterogeneous phenotypic and genotypic information, we were able to break down the companies' data silos and thus obtain linkable data through meticulous data preparation, including the imputation of missing SNPs," says Prof. Dr. Reif.

The team used this data to compare classic genomic prediction methods with deep learning approaches based on . With the help of neural networks, it was possible to recognize patterns in structured data.

"Our analyses showed that different test series can be flexibly combined for genomic predictions and that the prediction accuracy continuously improves as the size of the training set increases鈥攁t least up to around 4,000 genotypes," explains Moritz Lell, first author of the study. If the training set is increased further, the prediction values increase only slightly.

"However, we assume that this plateau can be overcome if we include significantly more environments in the data set," emphasizes Prof. Dr. Reif. "This would make it possible to utilize the potential of in breeding research even better."

More information: Moritz Lell et al, Breaking down data silos across companies to train genome鈥恮ide predictions: A feasibility study in wheat, Plant Biotechnology Journal (2025).

Journal information: Plant Biotechnology Journal

Provided by Leibniz Institute of Plant Genetics and Crop Plant Research

Citation: Big data approach makes plant predictions more accurate (2025, May 13) retrieved 24 May 2025 from /news/2025-05-big-approach-accurate.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Big data: IPK researchers double accuracy in predicting wheat yields

0 shares

Feedback to editors