M6ACali: Machine learning-powered calibration for accurate m6A detection in MeRIP-Seq

Haokai Ye; Tenglong Li; Daniel J. Rigden; Zhen Wei

doi:10.1093/nar/gkae280

M6ACali: Machine learning-powered calibration for accurate m6A detection in MeRIP-Seq

Haokai Ye, Tenglong Li, Daniel J. Rigden, Zhen Wei^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-Target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.

Original language	English
Pages (from-to)	4830-4842
Number of pages	13
Journal	Nucleic Acids Research
Volume	52
Issue number	9
DOIs	https://doi.org/10.1093/nar/gkae280
Publication status	Published - 22 May 2024

Access to Document

10.1093/nar/gkae280

Cite this

@article{afbf9f0f6f844282a0e2e5ab12b82bad,

title = "M6ACali: Machine learning-powered calibration for accurate m6A detection in MeRIP-Seq",

abstract = "We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-Target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.",

author = "Haokai Ye and Tenglong Li and Rigden, {Daniel J.} and Zhen Wei",

note = "Publisher Copyright: {\textcopyright} 2024 The Author(s). Published by Oxford University Press on behalf of Nucleic Acids Research.",

year = "2024",

month = may,

day = "22",

doi = "10.1093/nar/gkae280",

language = "English",

volume = "52",

pages = "4830--4842",

journal = "Nucleic Acids Research",

issn = "0305-1048",

number = "9",

}

TY - JOUR

T1 - M6ACali

T2 - Machine learning-powered calibration for accurate m6A detection in MeRIP-Seq

AU - Ye, Haokai

AU - Li, Tenglong

AU - Rigden, Daniel J.

AU - Wei, Zhen

PY - 2024/5/22

Y1 - 2024/5/22

N2 - We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-Target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.

AB - We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a genomic feature-based classifier that refines the identification of m6A sites, distinguishing those genuinely present from those that can be detected in in-vitro transcribed (IVT) control experiments. We find that m6ACali effectively identifies non-specific binding peaks reported by exomePeak2 and MACS2 in novel MeRIP-Seq datasets without the need for paired IVT controls. The model interpretation revealed that off-Target antibody binding sites commonly occur at short exons and short mRNAs, originating from high read coverage regions that share the motif sequence with true m6A sites. We also reveal that the ML strategy can efficiently adjust differentially methylated peaks and other antibody-dependent, base-resolution m6A detection techniques. As a result, m6ACali offers a promising method for the universal enhancement of m6A profiles generated by MeRIP-Seq experiments, elevating the benchmark for omics-level m6A data integration.

UR - http://www.scopus.com/inward/record.url?scp=85193986597&partnerID=8YFLogxK

U2 - 10.1093/nar/gkae280

DO - 10.1093/nar/gkae280

M3 - Article

C2 - 38634812

AN - SCOPUS:85193986597

SN - 0305-1048

VL - 52

SP - 4830

EP - 4842

JO - Nucleic Acids Research

JF - Nucleic Acids Research

IS - 9

ER -

M6ACali: Machine learning-powered calibration for accurate m6A detection in MeRIP-Seq

Abstract

Access to Document

Other files and links

Fingerprint

Cite this