High-throughput identification of interacting protein-protein binding sites
2007

Identifying Protein-Protein Binding Sites Using Machine Learning

Sample size: 784 publication 10 minutes Evidence: high

Author Information

Author(s): Chung Jo-Lan, Wang Wei, Bourne Philip E

Primary Institution: University of California, San Diego

Hypothesis

Can machine learning techniques effectively predict interacting protein binding sites?

Conclusion

The study demonstrates a high-throughput pipeline capable of identifying protein binding sites and their interactions on a large scale.

Supporting Evidence

  • 87.4% of interacting binding sites were correctly identified.
  • 68.6% of non-interacting binding sites were correctly identified.
  • The method does not require structure templates for predictions.

Takeaway

The researchers created a computer program that can find out if two parts of proteins stick together, which is important for understanding how proteins work.

Methodology

A support vector machine was trained on a dataset of protein dimers to predict interactions based on sequence and structural information.

Potential Biases

Potential bias due to the reliance on available structural data and the training set composition.

Limitations

The method may not perform as well on hetero-dimers compared to homo-dimers due to differences in residue contact preferences.

Participant Demographics

The dataset included 584 homo-dimers and 196 hetero-dimers from the Protein Data Bank.

Statistical Information

P-Value

0.0001

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-8-223

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication