Application of nonnegative matrix factorization to improve profile-profile alignment features for fold recognition and remote homolog detection
2008

Using NMF to Improve Protein Analysis

Sample size: 630 publication 10 minutes Evidence: high

Author Information

Author(s): Jung Inkyung, Lee Jaehyung, Lee Soo-Young, Kim Dongsup

Primary Institution: KAIST

Hypothesis

Can nonnegative matrix factorization (NMF) improve the performance of fold recognition and remote homolog detection in biological sequences?

Conclusion

Applying NMF to profile-profile alignments significantly enhances the performance of fold recognition and remote homolog detection.

Supporting Evidence

  • NMF features improved fold recognition performance, achieving > 0.99 ROC scores for 30% of proteins.
  • NMF features detected 25% of remotely related proteins at > 0.90 ROC50 scores, compared to only 1% with original PPA features.
  • NMF basis vectors showed significant overlap with functionally important sites and structurally conserved regions.

Takeaway

This study shows that a special math method called NMF can help scientists find important parts of proteins better, making it easier to understand how they work.

Methodology

The study used nonnegative matrix factorization (NMF) to analyze profile-profile alignment features and compared the performance with traditional methods using ROC scores.

Potential Biases

Potential biases may arise from the selection of training and testing datasets.

Limitations

The study's results may not generalize to all types of protein sequences or alignments.

Participant Demographics

Proteins from the SCOP ASTRAL Compendium version 1.67 were used, with no shared superfamily members in training and testing sets.

Statistical Information

P-Value

0.0001

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-9-298

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication