Evaluating the Fidelity of De Novo Short Read Metagenomic Assembly Using Simulated Data
2011

Evaluating Metagenomic Assembly

Sample size: 3270435 publication Evidence: moderate

Author Information

Author(s): Pignatelli Miguel, Moya Andrés

Primary Institution: Unitat Mixta d'Investigació en Genòmica i Salut, Centre Superior d'Investigació en Salut Pública/UVEG-Institut Cavanilles, Valencia, Spain

Hypothesis

Can current de novo short read assembly tools accurately assemble metagenomic data in multi-genome scenarios?

Conclusion

The study found that assembly tools struggle with accuracy in multi-genome scenarios, but higher sequencing coverage can improve results.

Supporting Evidence

  • The assembly process reduces the accuracy of functional classification of metagenomic data.
  • Errors in assembly can be mitigated by increasing the coverage of the studied metagenome.
  • Chimeric contigs were frequently observed, especially in complex datasets.

Takeaway

This study looked at how well different computer programs can put together DNA sequences from many different organisms, and found that using more data helps make better results.

Methodology

The study used simulated datasets of metagenomic reads to evaluate the performance of various assembly tools.

Potential Biases

Potential biases in assembly accuracy due to the inherent complexity of metagenomic samples.

Limitations

The study primarily used simulated data, which may not fully represent real-world complexities in metagenomic assembly.

Digital Object Identifier (DOI)

10.1371/journal.pone.0019984

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication