An analysis of single amino acid repeats as use case for application specific background models
2011

Analyzing Single Amino Acid Repeats with Background Models

Sample size: 1900000 publication Evidence: high

Author Information

Author(s): Łabaj Paweł P, Sykacek Peter, Kreil David P

Primary Institution: Boku University Vienna

Hypothesis

Signal peptides show an unusual enrichment of certain repeats (relative to a common background distribution).

Conclusion

The study demonstrates that application-specific background models are necessary for accurately detecting biologically interesting signals in sequence analysis.

Supporting Evidence

  • The study found that traditional models failed to predict the high frequency of single amino acid repeats.
  • Application-specific models outperformed standard models in predicting repeat occurrences.
  • Significant enrichment of leucine repeats was observed in signal peptides compared to other amino acids.

Takeaway

This study looks at how certain patterns in proteins, called single amino acid repeats, can be better understood using special models that fit the data well.

Methodology

The study used zero-inflated relevance vector machine models to analyze single amino acid repeats in eukaryotic proteins.

Potential Biases

Potential biases may arise from the choice of training sets and the specific characteristics of the protein sequences analyzed.

Limitations

The models may not generalize well to all types of proteins or regions outside of the studied context.

Participant Demographics

The study focused on eukaryotic proteins, specifically analyzing 1.9 million sequences.

Statistical Information

P-Value

p < 10^-35

Statistical Significance

p < 0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-12-173

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication