MBMM: Moment Estimating Beta Mixture Model-based Clustering Algorithm for m6 A Co-methylation Module Mining

Zhaoyang Liu, Hongsheng Yin*, Shutao Chen, Hui Liu, Jia Meng, Honglei Wang, Lin Zhang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Background: m6A methylation is a ubiquitous post-transcriptional modification that ex-ists in mammals. MeRIP-seq technology makes the acquisition of m6A data in the whole transcrip-tome under different conditions realizable. The specific regulation of the enzyme will present co-methylation module on m6A methylation level data. Thus, mining the co-methylation module from which can help to unveil the mechanism of m6A methylation modification and its mechanism in the occurrence and development of complex diseases such as cancer. Objective: To develop a clustering algorithm that can effectively realize the mining of m6A co-methylation module. Methods: In this study, a novel beta mixture model-based clustering algorithm named MBMM was proposed, which is based on the EM framework and introduces the method of moment estimating in M-step for parameter estimation to tackle the high-dimensional small sample m6A data. Simulation research was employed to evaluate the clustering performance of the proposed algorithm, and by which the co-methylation module mining was done based on real data. Biological significance cor-relation analysis was employed to explore whether the clustering results are co-methylation mod-ules. Results and Conclusion: Simulation research demonstrated that MBMM performed out than other clustering algorithms. In real data, seven co-methylation modules were found by MBMM. Six m6A-related pathways specific analysis showed that six co-methylation modules were enriched in the pathway and were different. Five enzymes substrate-specific analysis revealed that seven co-methylation modules expressed varying degrees of enrichment. Gene Ontology enrichment analysis indicated that these modules may be regulated by enzymes while having potential functional speci-ficity.

Original languageEnglish
Pages (from-to)1244-1256
Number of pages13
JournalCurrent Bioinformatics
Volume16
Issue number10
DOIs
Publication statusPublished - 2021

Keywords

  • Beta mixture model
  • Clustering
  • Co-methylation module
  • Data mining
  • M A
  • Methylation

Cite this