TY - JOUR
T1 - FBCwPlaid
T2 - A Functional Biclustering Analysis of Epi-Transcriptome Profiling Data Via a Weighted Plaid Model
AU - Chen, Shutao
AU - Zhang, Lin
AU - Lu, Lin
AU - Meng, Jia
AU - Liu, Hui
N1 - Publisher Copyright: © 2004-2012 IEEE.
PY - 2022
Y1 - 2022
N2 - Recent studies have shown that in-depth analysis of epi-transcriptomic patterns of N6-methyladenosine (m6A) may help in understanding its complex functions and co-regulatory mechanisms. Because most biclustering algorithms were developed for gene expression analysis, whose data do not share the characteristics of m6A methylation profiles, we propose a weighted Plaid biclustering model (FBCwPlaid), based on the Lagrange multiplier method, to discover potential functional patterns. Each pattern is obtained by minimizing the approximation error between the values predicted by FBCwPlaid and the real data. Because the confidence of a site's methylation level depends on its expression level, FBCwPlaid uses the RNA expression level of each site as a weight, so that sites with lower expression carry less confidence. FBCwPlaid also allows overlapping biclusters, reflecting that some sites may participate in multiple biological functions. FBCwPlaid was then applied to MeRIP-Seq data covering 69,446 methylation sites under 32 experimental conditions, each of which represents a stimulus applied to a particular cell line or environment. Three patterns were discovered, and further pathway analysis and enzyme specificity tests showed that the sites involved in each pattern are highly relevant to m6A methyltransferases. Further detailed analyses showed that some patterns are condition-specific, indicating that the methylation profiles of some sites may occur in specific cell lines or conditions.
AB - Recent studies have shown that in-depth analysis of epi-transcriptomic patterns of N6-methyladenosine (m6A) may help in understanding its complex functions and co-regulatory mechanisms. Because most biclustering algorithms were developed for gene expression analysis, whose data do not share the characteristics of m6A methylation profiles, we propose a weighted Plaid biclustering model (FBCwPlaid), based on the Lagrange multiplier method, to discover potential functional patterns. Each pattern is obtained by minimizing the approximation error between the values predicted by FBCwPlaid and the real data. Because the confidence of a site's methylation level depends on its expression level, FBCwPlaid uses the RNA expression level of each site as a weight, so that sites with lower expression carry less confidence. FBCwPlaid also allows overlapping biclusters, reflecting that some sites may participate in multiple biological functions. FBCwPlaid was then applied to MeRIP-Seq data covering 69,446 methylation sites under 32 experimental conditions, each of which represents a stimulus applied to a particular cell line or environment. Three patterns were discovered, and further pathway analysis and enzyme specificity tests showed that the sites involved in each pattern are highly relevant to m6A methyltransferases. Further detailed analyses showed that some patterns are condition-specific, indicating that the methylation profiles of some sites may occur in specific cell lines or conditions.
KW - Lagrange multiplier method
KW - biclustering
KW - m6A methylation
KW - plaid model
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85099413232&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2021.3049366
DO - 10.1109/TCBB.2021.3049366
M3 - Article
C2 - 33400655
AN - SCOPUS:85099413232
SN - 1545-5963
VL - 19
SP - 1640
EP - 1650
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 3
ER -