Anchorage: A New Tool for Assembling Synthetic Long Reads
Author Information
Author(s): Zang Xiaofei, Carl Huck, Li Xiang, Metcalfe Kyle, Ben-Yehezkel Tuval, Kelley Ryan, Shao Mingfu
Primary Institution: The Pennsylvania State University
Hypothesis
Can Anchorage improve the assembly of anchor-enabled, ultra-high-depth sequencing data compared to existing methods?
Conclusion
Anchorage significantly outperforms existing assembly methods, especially in the presence of sequencing artifacts.
Supporting Evidence
- Anchorage achieved an average genome fraction percentage of 97.5%, significantly higher than other methods.
- Anchorage maintained robust performance even with high sequencing depth and artifacts.
- Anchorage identified both anchors for all 7 sequences, indicating full-length assemblies.
Takeaway
Anchorage is a new tool that helps scientists put together long DNA sequences more accurately, especially when there are mistakes in the data.
Methodology
Anchorage uses a kmer-based approach and dynamic programming to assemble sequences from ultra-high-depth sequencing data by leveraging anchor sequences.
Limitations
Anchorage is specifically designed for anchor-labeled single molecules and may require modifications for general-purpose genome assembly.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website