TY - JOUR
T1 - Comparing Bayesian‐Based Reconstruction Strategies in Topology‐Based Pathway Enrichment Analysis
AU - Wang, Yajunzi
AU - Li, Jing
AU - Huang, Daiyun
AU - Hao, Yang
AU - Li, Bo
AU - Wang, Kai
AU - Chen, Boya
AU - Li, Ting
AU - Liu, Xin
N1 - Publisher Copyright:
© 2022 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2022/7
Y1 - 2022/7
N2 - The development of high‐throughput omics technologies has enabled the quantification of vast amounts of genes and gene products in the whole genome. Pathway enrichment analysis (PEA) provides an intuitive solution for extracting biological insights from massive amounts of data. Topology‐based pathway analysis (TPA) represents the latest generation of PEA methods, which exploit pathway topology in addition to lists of differentially expressed genes and their expression profiles. A subset of these TPA methods, such as BPA, BNrich, and PROPS, reconstruct pathway structures by training Bayesian networks (BNs) from canonical biological pathways, providing su-perior representations that explain causal relationships between genes. However, these methods have never been compared for their differences in the PEA and their different topology reconstruction strategies. In this study, we aim to compare the BN reconstruction strategies of the BPA, BNrich, PROPS, Clipper, and Ensemble methods and their PEA and performance on tumor and non‐tumor classification based on gene expression data. Our results indicate that they performed equally well in distinguishing tumor and non‐tumor samples (AUC > 0.95) yet with a varying ranking of path-ways, which can be attributed to the different BN structures resulting from the different cyclic structure removal strategies. This can be clearly seen from the reconstructed JAK‐STAT networks by different strategies. In a nutshell, BNrich, which relies on expert intervention to remove loops and cyclic structures, produces BNs that best fit the biological facts. The plausibility of the Clipper strategy can also be partially explained by intuitive biological rules and theorems. Our results may offer an informed reference for the proper method for a given data analysis task.
AB - The development of high‐throughput omics technologies has enabled the quantification of vast amounts of genes and gene products in the whole genome. Pathway enrichment analysis (PEA) provides an intuitive solution for extracting biological insights from massive amounts of data. Topology‐based pathway analysis (TPA) represents the latest generation of PEA methods, which exploit pathway topology in addition to lists of differentially expressed genes and their expression profiles. A subset of these TPA methods, such as BPA, BNrich, and PROPS, reconstruct pathway structures by training Bayesian networks (BNs) from canonical biological pathways, providing su-perior representations that explain causal relationships between genes. However, these methods have never been compared for their differences in the PEA and their different topology reconstruction strategies. In this study, we aim to compare the BN reconstruction strategies of the BPA, BNrich, PROPS, Clipper, and Ensemble methods and their PEA and performance on tumor and non‐tumor classification based on gene expression data. Our results indicate that they performed equally well in distinguishing tumor and non‐tumor samples (AUC > 0.95) yet with a varying ranking of path-ways, which can be attributed to the different BN structures resulting from the different cyclic structure removal strategies. This can be clearly seen from the reconstructed JAK‐STAT networks by different strategies. In a nutshell, BNrich, which relies on expert intervention to remove loops and cyclic structures, produces BNs that best fit the biological facts. The plausibility of the Clipper strategy can also be partially explained by intuitive biological rules and theorems. Our results may offer an informed reference for the proper method for a given data analysis task.
KW - Bayesian network
KW - gene expression
KW - network reconstruction
KW - topology‐based pathway analysis
UR - http://www.scopus.com/inward/record.url?scp=85132899568&partnerID=8YFLogxK
U2 - 10.3390/biom12070906
DO - 10.3390/biom12070906
M3 - Article
C2 - 35883462
AN - SCOPUS:85132899568
SN - 2218-273X
VL - 12
JO - Biomolecules
JF - Biomolecules
IS - 7
M1 - 906
ER -