Challenges in Adjusting Scoring Matrices for Protein Motifs
Author Information
Author(s): Patryk Jarnot
Primary Institution: Silesian University of Technology
Hypothesis
The gold standard method for scoring matrix adjustment fails to emphasize frequent amino acids in protein sequences.
Conclusion
Adjusting scoring matrices inappropriately reduces the significance of important residues like glycine in collagen-like domains.
Supporting Evidence
- The adjusted scoring matrices decreased the accuracy of collagen-like domain alignments.
- Turning off the scoring matrix adjustment improved the true positive rate significantly.
- The study found that glycine, a crucial residue in collagen, received lower scores in adjusted matrices.
Takeaway
This study shows that the way we adjust scoring systems for protein comparisons can make it harder to find important similarities, especially in proteins with unusual building blocks.
Methodology
The study used BLAST to align collagen-like domains with and without scoring matrix adjustments and compared the results.
Potential Biases
The scoring matrix adjustments may lead to false positives in alignments.
Limitations
The study primarily focuses on collagen-like domains and may not generalize to all protein types.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website