Automatically extracting functionally equivalent proteins from SwissProt
2008

FOSTA: A Database for Extracting Functionally Equivalent Proteins

publication Evidence: high

Author Information

Author(s): Lisa EM McMillan, Andrew CR Martin

Primary Institution: University College London

Hypothesis

Can we automate the extraction of functionally equivalent proteins from UniProtKB/Swiss-Prot?

Conclusion

FOSTA successfully automates the extraction of functionally equivalent proteins from UniProtKB/Swiss-Prot, demonstrating high-quality annotations.

Supporting Evidence

  • FOSTA was evaluated against manually annotated datasets and performed well.
  • FOSTA can highlight inconsistencies in UniProtKB/Swiss-Prot annotations.
  • Over half of the species in UniProtKB/Swiss-Prot are not represented in FOSTA.

Takeaway

FOSTA is like a smart robot that helps scientists find proteins that do the same job in different species, making research easier.

Methodology

FOSTA uses a three-stage filtering process to identify functionally equivalent proteins based on sequence similarity and functional annotations.

Potential Biases

The method may miss functional equivalences due to varying annotation formats across species.

Limitations

FOSTA relies on the quality of UniProtKB/Swiss-Prot annotations, which may contain errors or inconsistencies.

Digital Object Identifier (DOI)

10.1186/1471-2105-9-418

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication