Graph attention autoencoder model with dual decoder for clustering single-cell RNA sequencing data

Shudong Wang, Yu Zhang, Yuanyuan Zhang, Yulin Zhang*, Shanchen Pang, Jionglong Su, Yingye Liu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Single-cell ribonucleic acid sequencing (scRNA-seq) allows researchers to study cell heterogeneity and diversity at the individual cell level. Cell clustering is an essential component of scRNA-seq data processing. However, the high dimensionality and high noise characteristics of scRNA-seq data may pose problems during data processing. Although many methods are available for scRNA-seq clustering analysis, most of them ignore the topological relationships of scRNA-seq data and do not fully utilize the potential associations between cells. In this study, we present scGAD, a graph attention autoencoder model with a dual decoder structure for clustering scRNA-seq data. We utilize a graph attention autoencoder with two decoders to learn feature representations of cells in latent space. To ensure that the learned latent feature representation maintains node properties and graph structure, we use an inner product decoder and a learnable graph attention decoder to reconstruct graph structure and node properties, respectively. On the 12 real scRNA-seq datasets, the average NMI and ARI scores of scGAD are 0.762 and 0.695, respectively, outperforming state-of-the-art single-cell clustering approaches. Biological analysis shows that the cell labels predicted by scGAD can assist in the downstream analysis of scRNA-seq data.

Original languageEnglish
Pages (from-to)5136-5146
Number of pages11
JournalApplied Intelligence
Volume54
Issue number6
DOIs
Publication statusPublished - Mar 2024

Keywords

  • Bioinformatics
  • Graph attention network
  • Graph autoencoder
  • scRNA-seq data
  • Spectral clustering

Fingerprint

Dive into the research topics of 'Graph attention autoencoder model with dual decoder for clustering single-cell RNA sequencing data'. Together they form a unique fingerprint.

Cite this