A Computational Screen for Type I Polyketide Synthases in Metagenomics Shotgun Data
2008

Screening for Type I Polyketide Synthases in Metagenomic Data

Sample size: 6613204 publication Evidence: moderate

Author Information

Author(s): Konrad U. Foerstner, Tobias Doerks, Christopher J. Creevey, Anja Doerks, Peer Bork

Primary Institution: European Molecular Biology Laboratory, Heidelberg, Germany

Hypothesis

Can computational methods effectively identify type I polyketide synthases in metagenomic shotgun data?

Conclusion

The screening approach successfully identified numerous novel type I polyketide synthase sequences in various environments, particularly in Minnesota farm soil.

Supporting Evidence

  • The highest density of PKS I proteins was found in Minnesota farm soil with 136 proteins identified.
  • The study identified 22,106 candidate sequences of PKS I domains from metagenomic samples.
  • The approach improved the annotation of proteins with previously unknown functions.

Takeaway

Scientists used a computer program to find special proteins in dirt and water samples that can help make medicines. They found a lot of these proteins in farm soil.

Methodology

The study used Hidden-Markov-Models to screen predicted proteins from six metagenomic datasets and constructed maximum-likelihood trees for classification.

Potential Biases

Potential contamination in environmental samples could bias the identification of PKS I proteins.

Limitations

The fragmented nature of metagenomic sequences may affect the identification and classification of type I polyketide synthases.

Digital Object Identifier (DOI)

10.1371/journal.pone.0003515

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication