Ac4CGE: Predicting N4-acetylcytidine (ac4C) RNA Modification Sites in Archaea Using Graph-based Machine Learning Approach

Yuchao Li, Bowen Song, Jia Meng, Jingxian Zhou*

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

N4-acetylcytidine(ac4C) is one of the highly conserved epigenetic modifications in RNA, possessing the ability to facilitate mRNA expression, enhance its stability, and influence mRNA decoding efficiency, thereby promoting substrate translation. The identification of ac4C in archaea has been considered a laborious and time-intensive endeavor using conventional biological methods. Therefore, it is essential to develop a computational method with high confidence to explore the ac4C modification sites. In this study, we present the first predictor, ac4CDE, which utilizes both sequence-derived and graph-embedding features to predict ac4C modification sites in an archaea dataset. The result achieves remarkable accuracy and performance. The best feature, NCP-ND, achieves Matthew's correlation coefficient of 0.9921, an F1-score of 0.9929, and a mean squared error of 0.0014. These findings underscore the applicability and robustness of our model as a valuable tool for predicting N4-acetylcytidine sites.

Original languageEnglish
Title of host publicationProceedings of the 2024 9th International Conference on Biomedical Signal and Image Processing, ICBIP 2024
PublisherAssociation for Computing Machinery
Pages91-97
Number of pages7
ISBN (Electronic)9798400717970
DOIs
Publication statusPublished - 16 Oct 2024
Event9th International Conference on Biomedical Signal and Image Processing, ICBIP 2024 - Suzhou, China
Duration: 23 Aug 202425 Aug 2024

Publication series

NameACM International Conference Proceeding Series

Conference

Conference9th International Conference on Biomedical Signal and Image Processing, ICBIP 2024
Country/TerritoryChina
CitySuzhou
Period23/08/2425/08/24

Keywords

  • ac4C
  • archaea
  • graph embedding
  • machine learning
  • random forest

Cite this