TY - GEN
T1 - Ac4CGE
T2 - 9th International Conference on Biomedical Signal and Image Processing, ICBIP 2024
AU - Li, Yuchao
AU - Song, Bowen
AU - Meng, Jia
AU - Zhou, Jingxian
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2024/10/16
Y1 - 2024/10/16
N2 - N4-acetylcytidine(ac4C) is one of the highly conserved epigenetic modifications in RNA, possessing the ability to facilitate mRNA expression, enhance its stability, and influence mRNA decoding efficiency, thereby promoting substrate translation. The identification of ac4C in archaea has been considered a laborious and time-intensive endeavor using conventional biological methods. Therefore, it is essential to develop a computational method with high confidence to explore the ac4C modification sites. In this study, we present the first predictor, ac4CDE, which utilizes both sequence-derived and graph-embedding features to predict ac4C modification sites in an archaea dataset. The result achieves remarkable accuracy and performance. The best feature, NCP-ND, achieves Matthew's correlation coefficient of 0.9921, an F1-score of 0.9929, and a mean squared error of 0.0014. These findings underscore the applicability and robustness of our model as a valuable tool for predicting N4-acetylcytidine sites.
AB - N4-acetylcytidine(ac4C) is one of the highly conserved epigenetic modifications in RNA, possessing the ability to facilitate mRNA expression, enhance its stability, and influence mRNA decoding efficiency, thereby promoting substrate translation. The identification of ac4C in archaea has been considered a laborious and time-intensive endeavor using conventional biological methods. Therefore, it is essential to develop a computational method with high confidence to explore the ac4C modification sites. In this study, we present the first predictor, ac4CDE, which utilizes both sequence-derived and graph-embedding features to predict ac4C modification sites in an archaea dataset. The result achieves remarkable accuracy and performance. The best feature, NCP-ND, achieves Matthew's correlation coefficient of 0.9921, an F1-score of 0.9929, and a mean squared error of 0.0014. These findings underscore the applicability and robustness of our model as a valuable tool for predicting N4-acetylcytidine sites.
KW - ac4C
KW - archaea
KW - graph embedding
KW - machine learning
KW - random forest
UR - http://www.scopus.com/inward/record.url?scp=85212843370&partnerID=8YFLogxK
U2 - 10.1145/3691521.3691524
DO - 10.1145/3691521.3691524
M3 - Conference Proceeding
AN - SCOPUS:85212843370
T3 - ACM International Conference Proceeding Series
SP - 91
EP - 97
BT - Proceedings of the 2024 9th International Conference on Biomedical Signal and Image Processing, ICBIP 2024
PB - Association for Computing Machinery
Y2 - 23 August 2024 through 25 August 2024
ER -