Evaluation of Methods for De Novo Genome Assembly from High-Throughput Sequencing Reads Reveals Dependencies That Affect the Quality of the Results
2011

Evaluating Genome Assembly from Short Reads

publication Evidence: moderate

Author Information

Author(s): Haiminen Niina, Kuhn David N., Parida Laxmi, Rigoutsos Isidore

Primary Institution: Computational Biology Center, IBM Thomas J. Watson Research Center

Hypothesis

Can de novo genome assembly from short reads be effectively achieved given the challenges posed by read length and sequencing errors?

Conclusion

The study demonstrates that various factors, including sequencing coverage and the choice of assembly program, significantly affect the quality of genome assembly from short reads.

Supporting Evidence

  • The study evaluated the feasibility of de novo genome assembly from short reads through simulations.
  • It was found that factors like genome architecture and sequencing error rates significantly impact assembly quality.
  • The performance of assembly tools varied based on the characteristics of the genomic input.

Takeaway

This study looks at how well we can piece together DNA sequences from short snippets, showing that some methods work better than others depending on the situation.

Methodology

The study involved simulating reads from various genomic sequences and evaluating the performance of several assembly tools under different conditions.

Limitations

The simulations represent ideal conditions, and real data may present additional challenges not accounted for in the study.

Digital Object Identifier (DOI)

10.1371/journal.pone.0024182

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication