Compressing Proteomes: The Relevance of Medium Range Correlations
Author Information
Author(s): Dario Benedetto, Emanuele Caglioti, Claudia Chica
Primary Institution: Dipartimento di Matematica, Università di Roma "La Sapienza"
Hypothesis
The study investigates the nonrandomness of proteome sequences by analyzing correlations between amino acids at short and medium ranges.
Conclusion
The study shows that considering medium-range correlations in protein sequences leads to better compression rates and suggests that this redundancy is linked to the evolutionary origin of proteomes.
Supporting Evidence
- Statistical models that consider medium-range correlations achieve better compression rates.
- The redundancy in protein sequences is linked to their evolutionary origins.
Takeaway
The way proteins are built has patterns that can help us make them smaller in size, and these patterns might be because of how proteins evolved over time.
Methodology
The study analyzes correlations between amino acids located 10 or 100 residues apart to assess the nonrandomness of proteome sequences.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website