TY - JOUR
T1 - scSID
T2 - A lightweight algorithm for identifying rare cell types by capturing differential expression from single-cell sequencing data
AU - Wang, Shudong
AU - Li, Hengxiao
AU - Zhang, Kuijie
AU - Wu, Hao
AU - Pang, Shanchen
AU - Wu, Wenhao
AU - Ye, Lan
AU - Su, Jionglong
AU - Zhang, Yulin
N1 - Publisher Copyright:
© 2023 The Author(s)
PY - 2024/12
Y1 - 2024/12
N2 - Single-cell RNA sequencing (scRNA-seq) is currently an important technology for identifying cell types and studying diseases at the genetic level. Identifying rare cell types is biologically important as one of the downstream data analyses of single-cell RNA sequencing. Although rare cell identification methods have been developed, most of these suffer from insufficient mining of intercellular similarities, low scalability, and being time-consuming. In this paper, we propose a single-cell similarity division algorithm (scSID) for identifying rare cells. It takes cell-to-cell similarity into consideration by analyzing both inter-cluster and intra-cluster similarities, and discovers rare cell types based on the similarity differences. We show that scSID outperforms other existing methods by benchmarking it on different experimental datasets. Application of scSID to multiple datasets, including 68K PBMC and intestine, highlights its exceptional scalability and remarkable ability to identify rare cell populations.
AB - Single-cell RNA sequencing (scRNA-seq) is currently an important technology for identifying cell types and studying diseases at the genetic level. Identifying rare cell types is biologically important as one of the downstream data analyses of single-cell RNA sequencing. Although rare cell identification methods have been developed, most of these suffer from insufficient mining of intercellular similarities, low scalability, and being time-consuming. In this paper, we propose a single-cell similarity division algorithm (scSID) for identifying rare cells. It takes cell-to-cell similarity into consideration by analyzing both inter-cluster and intra-cluster similarities, and discovers rare cell types based on the similarity differences. We show that scSID outperforms other existing methods by benchmarking it on different experimental datasets. Application of scSID to multiple datasets, including 68K PBMC and intestine, highlights its exceptional scalability and remarkable ability to identify rare cell populations.
KW - Rare cell types
KW - Scalability
KW - Similarity analysis
KW - Single-cell RNA sequencing
UR - http://www.scopus.com/inward/record.url?scp=85182878362&partnerID=8YFLogxK
U2 - 10.1016/j.csbj.2023.12.043
DO - 10.1016/j.csbj.2023.12.043
M3 - Article
AN - SCOPUS:85182878362
SN - 2001-0370
VL - 23
SP - 589
EP - 600
JO - Computational and Structural Biotechnology Journal
JF - Computational and Structural Biotechnology Journal
ER -