TY - JOUR
T1 - NanoMUD
T2 - Profiling of pseudouridine and N1-methylpseudouridine using Oxford Nanopore direct RNA sequencing
AU - Zhang, Yuxin
AU - Yan, Huayuan
AU - Wei, Zhen
AU - Hong, Haifeng
AU - Huang, Daiyun
AU - Liu, Guopeng
AU - Qin, Qianshan
AU - Rong, Rong
AU - Gao, Peng
AU - Meng, Jia
AU - Ying, Bo
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/6
Y1 - 2024/6
N2 - Nanopore direct RNA sequencing provided a promising solution for unraveling the landscapes of modifications on single RNA molecules. Here, we proposed NanoMUD, a computational framework for predicting the RNA pseudouridine modification (Ψ) and its methylated analog N1-methylpseudouridine (m1Ψ), which have critical application in mRNA vaccination, at single-base and single-molecule resolution from direct RNA sequencing data. Electric signal features were fed into a bidirectional LSTM neural network to achieve improved accuracy and predictive capabilities. Motif-specific models (NNUNN, N = A, C, U or G) were trained based on features extracted from designed dataset and achieved superior performance on molecule-level modification prediction (Ψ models: min AUC = 0.86, max AUC = 0.99; m1Ψ models: min AUC = 0.87, max AUC = 0.99). We then aggregated read-level predictions for site stoichiometry estimation. Given the observed sequence-dependent bias in model performance, we trained regression models based on the distribution of modification probabilities for sites with known stoichiometry. The distribution-based site stoichiometry estimation method allows unbiased comparison between different contexts. To demonstrate the feasibility of our work, three case studies on both in vitro and in vivo transcribed RNAs were presented. NanoMUD will make a powerful tool to facilitate the research on modified therapeutic IVT RNAs and provides useful insight to the landscape and stoichiometry of pseudouridine and N1-pseudouridine on in vivo transcribed RNA species.
AB - Nanopore direct RNA sequencing provided a promising solution for unraveling the landscapes of modifications on single RNA molecules. Here, we proposed NanoMUD, a computational framework for predicting the RNA pseudouridine modification (Ψ) and its methylated analog N1-methylpseudouridine (m1Ψ), which have critical application in mRNA vaccination, at single-base and single-molecule resolution from direct RNA sequencing data. Electric signal features were fed into a bidirectional LSTM neural network to achieve improved accuracy and predictive capabilities. Motif-specific models (NNUNN, N = A, C, U or G) were trained based on features extracted from designed dataset and achieved superior performance on molecule-level modification prediction (Ψ models: min AUC = 0.86, max AUC = 0.99; m1Ψ models: min AUC = 0.87, max AUC = 0.99). We then aggregated read-level predictions for site stoichiometry estimation. Given the observed sequence-dependent bias in model performance, we trained regression models based on the distribution of modification probabilities for sites with known stoichiometry. The distribution-based site stoichiometry estimation method allows unbiased comparison between different contexts. To demonstrate the feasibility of our work, three case studies on both in vitro and in vivo transcribed RNAs were presented. NanoMUD will make a powerful tool to facilitate the research on modified therapeutic IVT RNAs and provides useful insight to the landscape and stoichiometry of pseudouridine and N1-pseudouridine on in vivo transcribed RNA species.
KW - Deep learning
KW - Epi-transcriptome
KW - N1-methylpseudouridine
KW - Nanopore direct RNA sequencing
KW - Pseudouridine
KW - mRNA vaccines
UR - http://www.scopus.com/inward/record.url?scp=85193255720&partnerID=8YFLogxK
U2 - 10.1016/j.ijbiomac.2024.132433
DO - 10.1016/j.ijbiomac.2024.132433
M3 - Article
C2 - 38759861
AN - SCOPUS:85193255720
SN - 0141-8130
VL - 270
JO - International Journal of Biological Macromolecules
JF - International Journal of Biological Macromolecules
M1 - 132433
ER -