麻豆淫院

April 26, 2024

New multi-task deep learning framework integrates large-scale single-cell proteomics and transcriptomics data

Integration of COVID-19 cell atlas. Credit: Advanced Science (2024). DOI: 10.1002/advs.202307835
× close
Integration of COVID-19 cell atlas. Credit: Advanced Science (2024). DOI: 10.1002/advs.202307835

The exponential progress in single-cell multi-omics technologies has led to the accumulation of large and diverse multi-omics datasets. However, the integration of single-cell proteomics and transcriptomics (or epigenomics) data poses a significant challenge to existing methods. Several transformer-based models, such as Geneformer, have significantly changed the paradigm of single-cell transcriptome analysis. However, these methods place significant demands on computational resources.

To address these , researchers at the Wuhan Botanical Garden of the Chinese Academy of Sciences have developed a Transformer-based method, called scmFormer, to integrate large-scale single-cell proteomics and transcriptomics data using a multi-task transformer. The study titled "scmFormer Integrates Large鈥怱cale Single鈥怌ell Proteomics and Transcriptomics Data by Multi鈥怲ask Transformer" was in Advanced Science.

The researchers presented a comprehensive evaluation and made case studies of this method, the results showed that scmFormer exhibited remarkable proficiency in harmonizing large-scale single-cell omics plus proteomics datasets at both the cell type and finer-scale cell level with limited computer resources.

In addition, scmFormer possesses the ability to integrate multiple single-cell paired multimodal datasets, leading to the dual benefit of reduced high cost and improved biological insights.

Moreover, scmFormer shows an outstanding ability to eliminate technical differences between different omics modalities while preserving the underlying biological information inherent in the data, spanning both and experimental conditions.

The application of scmFormer for the integration of two COVID-19 datasets with 1.48 million cells further demonstrated the distinct advantage of scmFormer for handling large datasets on regular laptops.

More information: Jing Xu et al, scmFormer Integrates Large鈥怱cale Single鈥怌ell Proteomics and Transcriptomics Data by Multi鈥怲ask Transformer, Advanced Science (2024).

Journal information: Advanced Science

Load comments (0)

This article has been reviewed according to Science X's and . have highlighted the following attributes while ensuring the content's credibility:

fact-checked
peer-reviewed publication
trusted source
proofread

Get Instant Summarized Text (GIST)

This summary was automatically generated using LLM.