A robust linear regression based algorithm for automated evaluation of peptide identifications from shotgun proteomics by use of reversed-phase liquid chromatography retention time

2008

Automated Evaluation of Peptide Identifications in Proteomics

Sample size: 3166 publication 15 minutes Evidence: high

Author Information

Author(s): Xu Hua, Yang Lanhao, Freitas Michael A

Primary Institution: The Ohio State University

Hypothesis

Can a robust linear regression algorithm improve the evaluation of peptide identifications from shotgun proteomics?

Conclusion

The algorithm significantly reduces false positive peptide matches while maintaining a high number of true positives.

Supporting Evidence

The algorithm improved the R2 value from 0.35 to 0.90 after outlier removal.
96.21% of true peptide matches fell within the 99% confidence band of the trained model.
The algorithm filtered 60.98% of false peptide matches with minimal loss of true positives.

Takeaway

The study created a smart computer program that helps scientists find the right proteins in a mixture by checking their 'retention time' in a special test.

Methodology

The study used a robust linear regression algorithm to evaluate peptide identifications based on their retention times from mass spectrometry data.

Potential Biases

Potential bias may arise from the reliance on the linear regression model, which assumes that true peptide matches follow a specific distribution.

Limitations

The algorithm's performance may vary based on the quality of the training data set and the presence of outliers.

Statistical Information

P-Value

p<0.01

Confidence Interval

99%

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-9-347

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication

Home