ProtGraph: a tool for the quick and comprehensive exploration and exploitation of the peptide search space derived from protein sequence databases using graphs
2025

ProtGraph: A Tool for Exploring Peptide Search Space from Protein Sequences

publication Evidence: high

Author Information

Author(s): Dominik Lux, Katrin Marcus-Alic, Martin Eisenacher, Julian Uszkoreit

Primary Institution: Ruhr University Bochum

Hypothesis

Can a graph-based representation of protein sequences improve the exploration of peptide search space in mass spectrometry-based proteomics?

Conclusion

The ProtGraph tool allows for a comprehensive characterization of the peptide search space, enabling the identification of peptides with annotated features that are often overlooked.

Supporting Evidence

  • ProtGraph can generate protein graphs for arbitrary-sized species databases in less than a day.
  • The tool allows for the quick extraction of all possible peptides created by protein graphs using different filters.
  • Peptides with annotated features can be identified, providing new insights into proteins with non-canonical sequences.
  • Using ProtGraph, the total number of possible peptides can be characterized quickly.
  • Variant features contribute significantly to the total number of peptides generated in protein graphs.

Takeaway

This study introduces a new tool called ProtGraph that helps scientists find and analyze different peptide sequences from proteins more easily, which can lead to better understanding of proteins and diseases.

Methodology

The study developed a graph-based representation of protein sequences, allowing for the inclusion of various annotated features and enabling quick retrieval of peptide information.

Limitations

The approach may be more time-consuming than traditional workflows, and the complexity of protein graphs can lead to challenges in managing large datasets.

Digital Object Identifier (DOI)

10.1093/bib/bbae671

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication