TY - JOUR
T1 - Prediction and Motif Analysis of 2’-O-methylation Using a Hybrid Deep Learning Model from RNA Primary Sequence and Nanopore Signals
AU - Pan, Shiyang
AU - Zhang, Yuxin
AU - Wei, Zhen
AU - Meng, Jia
AU - Huang, Daiyun
N1 - Publisher Copyright:
© 2022 Bentham Science Publishers.
PY - 2022/11
Y1 - 2022/11
N2 - Background: 2’-O-Methylation (2’-O-Me) is a post-transcriptional RNA modification that occurs in the ribose sugar moiety of all four nucleotides and is abundant in both coding and non-coding RNAs. Accurate prediction of each subtype of 2’-O-Me (Am, Cm, Gm, Um) helps understand their role in RNA metabolism and function. Objective: This study aims to build models that can predict each subtype of 2’-O-Me from RNA sequence and nanopore signals and exploit the model interpretability for sequence motif mining. Methods: We first propose a novel deep learning model DeepNm to better capture the sequence features of each subtype with a multi-scale framework. Based on DeepNm, we continue to propose HybridNm, which combines sequences and nanopore signals through a dual-path framework. The nanopore signal-derived features are first passed through a convolutional layer and then merged with sequence features extracted from different scales for final classification. Results: A 5-fold cross-validation process on Nm-seq data shows that DeepNm outperforms two state-of-the-art 2’-O-Me predictors. After incorporating nanopore signal-derived features, HybridNm further achieved significant improvements. Through model interpretation, we identified not only subtype-specific motifs but also revealed shared motifs between subtypes. In addition, Cm, Gm, and Um shared motifs with the well-studied m6A RNA methylation, suggesting a potential interplay among different RNA modifications and the complex nature of epitranscriptome regulation. Conclusion: The proposed frameworks can be useful tools to predict 2’-O-Me subtypes accurately and reveal specific sequence patterns.
AB - Background: 2’-O-Methylation (2’-O-Me) is a post-transcriptional RNA modification that occurs in the ribose sugar moiety of all four nucleotides and is abundant in both coding and non-coding RNAs. Accurate prediction of each subtype of 2’-O-Me (Am, Cm, Gm, Um) helps understand their role in RNA metabolism and function. Objective: This study aims to build models that can predict each subtype of 2’-O-Me from RNA sequence and nanopore signals and exploit the model interpretability for sequence motif mining. Methods: We first propose a novel deep learning model DeepNm to better capture the sequence features of each subtype with a multi-scale framework. Based on DeepNm, we continue to propose HybridNm, which combines sequences and nanopore signals through a dual-path framework. The nanopore signal-derived features are first passed through a convolutional layer and then merged with sequence features extracted from different scales for final classification. Results: A 5-fold cross-validation process on Nm-seq data shows that DeepNm outperforms two state-of-the-art 2’-O-Me predictors. After incorporating nanopore signal-derived features, HybridNm further achieved significant improvements. Through model interpretation, we identified not only subtype-specific motifs but also revealed shared motifs between subtypes. In addition, Cm, Gm, and Um shared motifs with the well-studied m6A RNA methylation, suggesting a potential interplay among different RNA modifications and the complex nature of epitranscriptome regulation. Conclusion: The proposed frameworks can be useful tools to predict 2’-O-Me subtypes accurately and reveal specific sequence patterns.
KW - 2’-O-Methylation
KW - Deep Learning
KW - RNA methylation
KW - epitranscriptome
KW - nanopore RNA sequencing
KW - site prediction
UR - http://www.scopus.com/inward/record.url?scp=85142602795&partnerID=8YFLogxK
U2 - 10.2174/1574893617666220815153653
DO - 10.2174/1574893617666220815153653
M3 - Article
AN - SCOPUS:85142602795
SN - 1574-8936
VL - 17
SP - 873
EP - 882
JO - Current Bioinformatics
JF - Current Bioinformatics
IS - 9
ER -