TY - GEN
T1 - An integrated computational proteomics method to extract protein targets for fanconi anemia studies
AU - Chen, Jake Yue
AU - Pinkerton, Sarah L.
AU - Shen, Changyu
AU - Wang, Mu
PY - 2006
Y1 - 2006
N2 - Fanconi Anemia (FA) is a rare autosomal genetic disease with multiple birth defects and severe childhood complications for its patients. The lack of sequence homology of the entire FA Complementation Group proteins in such as FANCC, FANCG, FANCA makes them extremely difficult to characterize using conventional bioinformatics methods. In this work, we describe how to use computational methods to extract protein targets for FA, using protein interaction data set collected for FANC group C protein (FANCC). We first generated an initial set of 130 FA-interacting proteins as "FANCC seed proteins" by merging an in-house experimental set of FANCC Tandem Affinity Purification (TAP) Pulldown Proteomics data identified from Mass Spectrometry methods with publicly available human FANCC-interacting proteins. Next, we expanded the FANCC seed proteins using a nearest-neighbor method to generate a FANCC protein interaction subnetwork of 948 proteins in 903 protein interactions. We show that this network is statistically significant, with high indices of aggregation and separations. We also show a visualization of the network, support the evidence that many well-connected proteins exists in the network. Further, we developed and applied an interaction network protein scoring algorithm, which allows us to calculate a ranked list of significant FA proteins. Our result has been supporting further biological investigations of disease biologists on our team. We believe our method can be generalized to other disease biology studies with similar problems.
AB - Fanconi Anemia (FA) is a rare autosomal genetic disease with multiple birth defects and severe childhood complications for its patients. The lack of sequence homology of the entire FA Complementation Group proteins in such as FANCC, FANCG, FANCA makes them extremely difficult to characterize using conventional bioinformatics methods. In this work, we describe how to use computational methods to extract protein targets for FA, using protein interaction data set collected for FANC group C protein (FANCC). We first generated an initial set of 130 FA-interacting proteins as "FANCC seed proteins" by merging an in-house experimental set of FANCC Tandem Affinity Purification (TAP) Pulldown Proteomics data identified from Mass Spectrometry methods with publicly available human FANCC-interacting proteins. Next, we expanded the FANCC seed proteins using a nearest-neighbor method to generate a FANCC protein interaction subnetwork of 948 proteins in 903 protein interactions. We show that this network is statistically significant, with high indices of aggregation and separations. We also show a visualization of the network, support the evidence that many well-connected proteins exists in the network. Further, we developed and applied an interaction network protein scoring algorithm, which allows us to calculate a ranked list of significant FA proteins. Our result has been supporting further biological investigations of disease biologists on our team. We believe our method can be generalized to other disease biology studies with similar problems.
KW - Disease target
KW - Fanconi anemia
KW - Protein interaction network
KW - Proteomics
UR - http://www.scopus.com/inward/record.url?scp=33751045168&partnerID=8YFLogxK
U2 - 10.1145/1141277.1141316
DO - 10.1145/1141277.1141316
M3 - Conference Proceeding
AN - SCOPUS:33751045168
SN - 1595931082
SN - 9781595931085
T3 - Proceedings of the ACM Symposium on Applied Computing
SP - 173
EP - 179
BT - Applied Computing 2006 - The 21st Annual ACM Symposium on Applied Computing - Proceedings of the 2006 ACM Symposium on Applied Computing
PB - Association for Computing Machinery
T2 - 2006 ACM Symposium on Applied Computing
Y2 - 23 April 2006 through 27 April 2006
ER -