TY - JOUR
T1 - m6A Reader
T2 - Epitranscriptome Target Prediction and Functional Characterization of N6-Methyladenosine (m6A) Readers
AU - Zhen, Di
AU - Wu, Yuxuan
AU - Zhang, Yuxin
AU - Chen, Kunqi
AU - Song, Bowen
AU - Xu, Haiqi
AU - Tang, Yujiao
AU - Wei, Zhen
AU - Meng, Jia
N1 - Publisher Copyright:
© Copyright © 2020 Zhen, Wu, Zhang, Chen, Song, Xu, Tang, Wei and Meng.
PY - 2020/8/11
Y1 - 2020/8/11
N2 - N6-methyladenosine (m6A) is the most abundant post-transcriptional modification in mRNA, and regulates critical biological functions via m6A reader proteins that bind to m6A-containing transcripts. There exist multiple m6A reader proteins in the human genome, but their respective binding specificity and functional relevance under different biological contexts are not yet fully understood due to the limitation of experimental approaches. An in silico study was devised to unveil the target specificity and regulatory functions of different m6A readers. We established a support vector machine-based computational framework to predict the epitranscriptome-wide targets of six m6A reader proteins (YTHDF1-3, YTHDC1-2, and EIF3A) based on 58 genomic features as well as the conventional sequence-derived features. Our model achieved an average AUC of 0.981 and 0.893 under the full-transcript and mature mRNA model, respectively, marking a substantial improvement in accuracy compared to the sequence encoding schemes tested. Additionally, the distinct biological characteristics of each individual m6A reader were explored via the distribution, conservation, Gene Ontology enrichment, cellular components and molecular functions of their target m6A sites. A web server was constructed for predicting the putative binding readers of m6A sites to serve the research community, and is freely accessible at: http://m6areader.rnamd.com.
AB - N6-methyladenosine (m6A) is the most abundant post-transcriptional modification in mRNA, and regulates critical biological functions via m6A reader proteins that bind to m6A-containing transcripts. There exist multiple m6A reader proteins in the human genome, but their respective binding specificity and functional relevance under different biological contexts are not yet fully understood due to the limitation of experimental approaches. An in silico study was devised to unveil the target specificity and regulatory functions of different m6A readers. We established a support vector machine-based computational framework to predict the epitranscriptome-wide targets of six m6A reader proteins (YTHDF1-3, YTHDC1-2, and EIF3A) based on 58 genomic features as well as the conventional sequence-derived features. Our model achieved an average AUC of 0.981 and 0.893 under the full-transcript and mature mRNA model, respectively, marking a substantial improvement in accuracy compared to the sequence encoding schemes tested. Additionally, the distinct biological characteristics of each individual m6A reader were explored via the distribution, conservation, Gene Ontology enrichment, cellular components and molecular functions of their target m6A sites. A web server was constructed for predicting the putative binding readers of m6A sites to serve the research community, and is freely accessible at: http://m6areader.rnamd.com.
KW - N6-methyladenosine
KW - YTH domain
KW - eIF3a
KW - mA reader
KW - machine learning (ML)
UR - http://www.scopus.com/inward/record.url?scp=85089890714&partnerID=8YFLogxK
U2 - 10.3389/fcell.2020.00741
DO - 10.3389/fcell.2020.00741
M3 - Article
AN - SCOPUS:85089890714
SN - 2296-634X
VL - 8
JO - Frontiers in Cell and Developmental Biology
JF - Frontiers in Cell and Developmental Biology
M1 - 741
ER -