Searching for interpretable rules for disease mutations: a simulated annealing bump hunting strategy
2006

Interpretable Rules for Disease Mutations

Sample size: 9610 publication Evidence: high

Author Information

Author(s): Jiang Rui, Yang Hua, Sun Fengzhu, Chen Ting

Primary Institution: University of Southern California

Hypothesis

Can a novel feature set and simulated annealing bump hunting strategy improve the prediction of amino acid substitutions in proteins?

Conclusion

The proposed feature set and methods significantly improve the prediction accuracy and interpretability of amino acid substitutions related to diseases.

Supporting Evidence

  • The proposed feature set outperformed existing methods in predicting amino acid substitution effects.
  • The simulated annealing bump hunting strategy extracted interpretable rules consistent with biological knowledge.
  • The study validated the methods using experimental data from E. coli and bacteriophage T4 lysozyme.

Takeaway

This study helps scientists understand how changes in proteins can lead to diseases by using simple rules to predict the effects of amino acid changes.

Methodology

The study used support vector machines and random forests with a novel feature set based on physicochemical properties and evolutionary profiles to predict the effects of amino acid substitutions.

Potential Biases

Potential bias due to reliance on existing databases and the complexity of feature interactions.

Limitations

The feature set relies on known protein domains, limiting its applicability to unclassified substitutions.

Participant Demographics

The study focused on human proteins with known amino acid substitutions.

Digital Object Identifier (DOI)

10.1186/1471-2105-7-417

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication