Prediction of m6A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network

Yuxuan Wu; Yuxin Zhang; Ruoqi Wang; Jia Meng; Kunqi Chen; Yiyou Song; Daiyun Huang

doi:10.1145/3469678.3469706

Prediction of m⁶A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network

Yuxuan Wu, Yuxin Zhang, Ruoqi Wang, Jia Meng, Kunqi Chen, Yiyou Song, Daiyun Huang^*

^*Corresponding author for this work

University of Liverpool

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

N6-methyladenosine (m6A), one of the most common post-transcriptional mRNA modifications, has been proved to correlate with multiple biological functions through the process of binding to specific m6A reader proteins. Various m6A readers exist among the genome of human beings, however, owing to the scarce wet experiments related to this topic, the binding specificity of proteins was not elucidated. Therefore, a deep learning approach combined with CNN and RNN frameworks was generated to predict the epitranscriptome-wide targets of six m6A reader proteins (YTHDF1-3, YTHDC1-2, EIF3A). Additionally, layer-wise relevance calculation was conducted to obtain each input feature contribution and tried to explain the model training process. Finally, we achieved superior performance in the classification, with an average AUROC of 0.942 in EIF3A full transcript, higher than the typical conventional machine learning algorithms (SVM) under the same condition. Moreover, we quantified the most optimal sequence length (1001bp) during the m6A reader substrate prediction. This research paves the way for further RNA methylation target prediction and functional characterization of m6A readers.

Original language	English
Title of host publication	Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021
Editors	Bin Chen
Publisher	Association for Computing Machinery
ISBN (Electronic)	9781450389297
DOIs	https://doi.org/10.1145/3469678.3469706
Publication status	Published - 20 Jul 2021
Externally published	Yes
Event	5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021 - Hangzhou, China Duration: 20 Jul 2021 → 21 Jul 2021

Publication series

Name	ACM International Conference Proceeding Series

Conference

Conference	5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021
Country/Territory	China
City	Hangzhou
Period	20/07/21 → 21/07/21

Keywords

Convolutional neural network
Deep learning
Readers
Recurrent neural network
m6A

Access to Document

10.1145/3469678.3469706

Cite this

Wu, Y., Zhang, Y., Wang, R., Meng, J., Chen, K., Song, Y., & Huang, D. (2021). Prediction of m⁶A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network. In B. Chen (Ed.), Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021 Article 3469706 (ACM International Conference Proceeding Series). Association for Computing Machinery. https://doi.org/10.1145/3469678.3469706

@inproceedings{2d78183520474f08baed04b3d169a100,

title = "Prediction of m6A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network",

abstract = "N6-methyladenosine (m6A), one of the most common post-transcriptional mRNA modifications, has been proved to correlate with multiple biological functions through the process of binding to specific m6A reader proteins. Various m6A readers exist among the genome of human beings, however, owing to the scarce wet experiments related to this topic, the binding specificity of proteins was not elucidated. Therefore, a deep learning approach combined with CNN and RNN frameworks was generated to predict the epitranscriptome-wide targets of six m6A reader proteins (YTHDF1-3, YTHDC1-2, EIF3A). Additionally, layer-wise relevance calculation was conducted to obtain each input feature contribution and tried to explain the model training process. Finally, we achieved superior performance in the classification, with an average AUROC of 0.942 in EIF3A full transcript, higher than the typical conventional machine learning algorithms (SVM) under the same condition. Moreover, we quantified the most optimal sequence length (1001bp) during the m6A reader substrate prediction. This research paves the way for further RNA methylation target prediction and functional characterization of m6A readers.",

keywords = "Convolutional neural network, Deep learning, Readers, Recurrent neural network, m6A",

author = "Yuxuan Wu and Yuxin Zhang and Ruoqi Wang and Jia Meng and Kunqi Chen and Yiyou Song and Daiyun Huang",

note = "Publisher Copyright: {\textcopyright} 2021 ACM.; 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021 ; Conference date: 20-07-2021 Through 21-07-2021",

year = "2021",

month = jul,

day = "20",

doi = "10.1145/3469678.3469706",

language = "English",

series = "ACM International Conference Proceeding Series",

publisher = "Association for Computing Machinery",

editor = "Bin Chen",

booktitle = "Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021",

}

Wu, Y, Zhang, Y, Wang, R, Meng, J, Chen, K, Song, Y & Huang, D 2021, Prediction of m⁶A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network. in B Chen (ed.), Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021., 3469706, ACM International Conference Proceeding Series, Association for Computing Machinery, 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021, Hangzhou, China, 20/07/21. https://doi.org/10.1145/3469678.3469706

Prediction of m⁶A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network. / Wu, Yuxuan; Zhang, Yuxin; Wang, Ruoqi et al.
Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021. ed. / Bin Chen. Association for Computing Machinery, 2021. 3469706 (ACM International Conference Proceeding Series).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Prediction of m6A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network

AU - Wu, Yuxuan

AU - Zhang, Yuxin

AU - Wang, Ruoqi

AU - Meng, Jia

AU - Chen, Kunqi

AU - Song, Yiyou

AU - Huang, Daiyun

PY - 2021/7/20

Y1 - 2021/7/20

N2 - N6-methyladenosine (m6A), one of the most common post-transcriptional mRNA modifications, has been proved to correlate with multiple biological functions through the process of binding to specific m6A reader proteins. Various m6A readers exist among the genome of human beings, however, owing to the scarce wet experiments related to this topic, the binding specificity of proteins was not elucidated. Therefore, a deep learning approach combined with CNN and RNN frameworks was generated to predict the epitranscriptome-wide targets of six m6A reader proteins (YTHDF1-3, YTHDC1-2, EIF3A). Additionally, layer-wise relevance calculation was conducted to obtain each input feature contribution and tried to explain the model training process. Finally, we achieved superior performance in the classification, with an average AUROC of 0.942 in EIF3A full transcript, higher than the typical conventional machine learning algorithms (SVM) under the same condition. Moreover, we quantified the most optimal sequence length (1001bp) during the m6A reader substrate prediction. This research paves the way for further RNA methylation target prediction and functional characterization of m6A readers.

AB - N6-methyladenosine (m6A), one of the most common post-transcriptional mRNA modifications, has been proved to correlate with multiple biological functions through the process of binding to specific m6A reader proteins. Various m6A readers exist among the genome of human beings, however, owing to the scarce wet experiments related to this topic, the binding specificity of proteins was not elucidated. Therefore, a deep learning approach combined with CNN and RNN frameworks was generated to predict the epitranscriptome-wide targets of six m6A reader proteins (YTHDF1-3, YTHDC1-2, EIF3A). Additionally, layer-wise relevance calculation was conducted to obtain each input feature contribution and tried to explain the model training process. Finally, we achieved superior performance in the classification, with an average AUROC of 0.942 in EIF3A full transcript, higher than the typical conventional machine learning algorithms (SVM) under the same condition. Moreover, we quantified the most optimal sequence length (1001bp) during the m6A reader substrate prediction. This research paves the way for further RNA methylation target prediction and functional characterization of m6A readers.

KW - Convolutional neural network

KW - Deep learning

KW - Readers

KW - Recurrent neural network

KW - m6A

UR - http://www.scopus.com/inward/record.url?scp=85120522236&partnerID=8YFLogxK

U2 - 10.1145/3469678.3469706

DO - 10.1145/3469678.3469706

M3 - Conference Proceeding

AN - SCOPUS:85120522236

T3 - ACM International Conference Proceeding Series

BT - Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021

A2 - Chen, Bin

PB - Association for Computing Machinery

T2 - 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021

Y2 - 20 July 2021 through 21 July 2021

ER -

Wu Y, Zhang Y, Wang R, Meng J, Chen K, Song Y et al. Prediction of m⁶A Reader Substrate Sites Using Deep Convolutional and Recurrent Neural Network. In Chen B, editor, Proceedings of the 5th International Conference on Biological Information and Biomedical Engineering, BIBE 2021. Association for Computing Machinery. 2021. 3469706. (ACM International Conference Proceeding Series). doi: 10.1145/3469678.3469706