RepSeq: A Database for Amino Acid Repeats in Pathogens
Author Information
Author(s): Depledge Daniel P, Lower Ryan PJ, Smith Deborah F
Primary Institution: University of York
Hypothesis
The RepSeq database can effectively identify amino acid repeat-containing proteins in lower eukaryotic pathogens.
Conclusion
The RepSeq database serves as a valuable resource for identifying and analyzing repeat-containing proteins in parasitic protozoa.
Supporting Evidence
- RepSeq identifies over 98% of repeat-containing proteins.
- The database allows for both individual and cross-species proteome analyses.
- Identification of repeat-containing proteins aids in studying pathogenicity and virulence factors.
Takeaway
RepSeq is a tool that helps scientists find proteins with repeating parts, which can be important for understanding diseases caused by tiny organisms.
Methodology
The RepSeq algorithm identifies amino acid repeats by analyzing protein sequences using a sliding window approach.
Limitations
The algorithm may produce false positives, particularly when identifying mismatch repeats.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website