FOSTA: A Database for Extracting Functionally Equivalent Proteins
Author Information
Author(s): Lisa EM McMillan, Andrew CR Martin
Primary Institution: University College London
Hypothesis
Can we automate the extraction of functionally equivalent proteins from UniProtKB/Swiss-Prot?
Conclusion
FOSTA successfully automates the extraction of functionally equivalent proteins from UniProtKB/Swiss-Prot, demonstrating high-quality annotations.
Supporting Evidence
- FOSTA was evaluated against manually annotated datasets and performed well.
- FOSTA can highlight inconsistencies in UniProtKB/Swiss-Prot annotations.
- Over half of the species in UniProtKB/Swiss-Prot are not represented in FOSTA.
Takeaway
FOSTA is like a smart robot that helps scientists find proteins that do the same job in different species, making research easier.
Methodology
FOSTA uses a three-stage filtering process to identify functionally equivalent proteins based on sequence similarity and functional annotations.
Potential Biases
The method may miss functional equivalences due to varying annotation formats across species.
Limitations
FOSTA relies on the quality of UniProtKB/Swiss-Prot annotations, which may contain errors or inconsistencies.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website