Meraculous: De Novo Genome Assembly with Short Paired-End Reads
2011

Meraculous: A New Tool for Genome Assembly

Sample size: 87 publication Evidence: high

Author Information

Author(s): Chapman Jarrod A., Ho Isaac, Sunkara Sirisha, Luo Shujun, Schroth Gary P., Rokhsar Daniel S.

Primary Institution: U.S. Department of Energy Joint Genome Institute

Hypothesis

Can a new algorithm improve the assembly of genomes from short paired-end reads?

Conclusion

The Meraculous algorithm successfully reconstructs 95% of the Pichia stipitis genome with high accuracy and long contigs.

Supporting Evidence

  • Meraculous recovers more than 95% of the genome with no errors.
  • Half of the assembled sequence is in contigs longer than 101 kilobases.
  • The algorithm incorporates a novel memory-efficient hashing scheme.

Takeaway

Scientists created a new tool called Meraculous to help piece together the DNA of a yeast, and it worked really well, putting together most of the yeast's genetic information without mistakes.

Methodology

The study used paired-end sequencing to assemble the genome of Pichia stipitis, employing a new algorithm that avoids explicit error correction.

Limitations

The algorithm assumes data from a haploid genome and may not perform as well with diploid samples due to heterozygous variations.

Digital Object Identifier (DOI)

10.1371/journal.pone.0023501

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication