TY - JOUR
T1 - WHISTLE
T2 - A high-accuracy map of the human N6-methyladenosine (m6A) epitranscriptome predicted using a machine learning approach
AU - Chen, Kunqi
AU - Wei, Zhen
AU - Zhang, Qing
AU - Wu, Xiangyu
AU - Rong, Rong
AU - Lu, Zhiliang
AU - Su, Jionglong
AU - De Magalhães, João Pedro
AU - Rigden, Daniel J.
AU - Meng, Jia
N1 - Publisher Copyright:
© 2019 The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.
PY - 2019/4/23
Y1 - 2019/4/23
N2 - N 6-methyladenosine (m6A) is the most prevalent post-transcriptional modification in eukaryotes, and plays a pivotal role in various biological processes, such as splicing, RNA degradation and RNA-protein interaction. We report here a prediction framework WHISTLE for transcriptome-wide m6A RNA-methylation site prediction. When tested on six independent datasets, our approach, which integrated 35 additional genomic features besides the conventional sequence features, achieved a major improvement in the accuracy of m6A site prediction (average AUC: 0.948 and 0.880 under the full transcript or mature messenger RNA models, respectively) compared to the state-of-the-art computational approaches MethyRNA (AUC: 0.790 and 0.732) and SRAMP (AUC: 0.761 and 0.706). It also out-performed the existing epitranscriptome databases MeT-DB (AUC: 0.798 and 0.744) and RMBase (AUC: 0.786 and 0.736), which were built upon hundreds of epitranscriptome high-throughput sequencing samples. To probe the putative biological processes impacted by changes in an individual m6A site, a network-based approach was implemented according to the 'guilt-by-association' principle by integrating RNA methylation profiles, gene expression profiles and protein-protein interaction data. Finally, the WHISTLE web server was built to facilitate the query of our high-accuracy map of the human m6A epitranscriptome, and the server is freely available at: www.xjtlu.edu.cn/biologicalsciences/whistle and http://whistle-epitranscriptome.com.
AB - N 6-methyladenosine (m6A) is the most prevalent post-transcriptional modification in eukaryotes, and plays a pivotal role in various biological processes, such as splicing, RNA degradation and RNA-protein interaction. We report here a prediction framework WHISTLE for transcriptome-wide m6A RNA-methylation site prediction. When tested on six independent datasets, our approach, which integrated 35 additional genomic features besides the conventional sequence features, achieved a major improvement in the accuracy of m6A site prediction (average AUC: 0.948 and 0.880 under the full transcript or mature messenger RNA models, respectively) compared to the state-of-the-art computational approaches MethyRNA (AUC: 0.790 and 0.732) and SRAMP (AUC: 0.761 and 0.706). It also out-performed the existing epitranscriptome databases MeT-DB (AUC: 0.798 and 0.744) and RMBase (AUC: 0.786 and 0.736), which were built upon hundreds of epitranscriptome high-throughput sequencing samples. To probe the putative biological processes impacted by changes in an individual m6A site, a network-based approach was implemented according to the 'guilt-by-association' principle by integrating RNA methylation profiles, gene expression profiles and protein-protein interaction data. Finally, the WHISTLE web server was built to facilitate the query of our high-accuracy map of the human m6A epitranscriptome, and the server is freely available at: www.xjtlu.edu.cn/biologicalsciences/whistle and http://whistle-epitranscriptome.com.
UR - http://www.scopus.com/inward/record.url?scp=85064989965&partnerID=8YFLogxK
U2 - 10.1093/nar/gkz074
DO - 10.1093/nar/gkz074
M3 - Article
C2 - 30993345
AN - SCOPUS:85064989965
SN - 0305-1048
VL - 47
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 7
M1 - e41
ER -