DatabaseRepSeq – A database of amino acid repeats present in lower eukaryotic pathogens
2007

RepSeq: A Database for Amino Acid Repeats in Pathogens

publication Evidence: moderate

Author Information

Author(s): Depledge Daniel P, Lower Ryan PJ, Smith Deborah F

Primary Institution: University of York

Hypothesis

The RepSeq database can effectively identify amino acid repeat-containing proteins in lower eukaryotic pathogens.

Conclusion

The RepSeq database serves as a valuable resource for identifying and analyzing repeat-containing proteins in parasitic protozoa.

Supporting Evidence

  • RepSeq identifies over 98% of repeat-containing proteins.
  • The database allows for both individual and cross-species proteome analyses.
  • Identification of repeat-containing proteins aids in studying pathogenicity and virulence factors.

Takeaway

RepSeq is a tool that helps scientists find proteins with repeating parts, which can be important for understanding diseases caused by tiny organisms.

Methodology

The RepSeq algorithm identifies amino acid repeats by analyzing protein sequences using a sliding window approach.

Limitations

The algorithm may produce false positives, particularly when identifying mismatch repeats.

Digital Object Identifier (DOI)

10.1186/1471-2105-8-122

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication