Movi: A fast and cache-efficient full-text pangenome index
Author Information
Author(s): Mohsen Zakeri, Nathaniel K. Brown, Omar Y. Ahmed, Travis Gagie, Ben Langmead
Primary Institution: Johns Hopkins University
Hypothesis
We hypothesized that the move structure, the data structure underlying Movi, would exhibit better cache behavior than SPUMONI.
Conclusion
Movi is the fastest available tool for full-text pangenome indexing and querying, making it well suited for real-time applications like adaptive sampling for nanopore sequencing.
Supporting Evidence
- Movi computes sophisticated matching queries for classification up to 30 times faster than existing methods.
- Movi can handle output from 26,890 nanopores simultaneously.
- Movi's index grows more slowly than other pangenome indexes as genomes are added.
Takeaway
Movi is a new tool that lets scientists quickly analyze DNA sequences from many different organisms at the same time, and it does so much faster than older tools.
Methodology
Movi indexes and queries pangenomes with the move structure, a compressed-index data structure based on the Burrows-Wheeler Transform (BWT).
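To illustrate the general idea behind BWT-based indexing, here is a minimal textbook sketch in Python. It is not Movi's implementation and does not use the move structure. The function names (`bwt`, `run_lengths`, `count_occurrences`) and the `$` sentinel are assumptions made for this example. The sketch builds the BWT of a small text, groups it into runs (repetitive inputs such as pangenomes tend to produce few runs, which is what compressed indexes exploit), and counts pattern occurrences with plain backward search.

```python
def bwt(text: str) -> str:
    """Burrows-Wheeler Transform via sorted rotations (quadratic; illustration only)."""
    # "$" is an assumed sentinel that sorts before every other symbol in the text.
    s = text + "$"
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    return "".join(rotation[-1] for rotation in rotations)


def run_lengths(s: str) -> list[tuple[str, int]]:
    """Group a string into maximal runs of equal symbols."""
    runs: list[tuple[str, int]] = []
    for ch in s:
        if runs and runs[-1][0] == ch:
            runs[-1] = (ch, runs[-1][1] + 1)
        else:
            runs.append((ch, 1))
    return runs


def count_occurrences(bwt_str: str, pattern: str) -> int:
    """Count occurrences of pattern in the original text using backward search."""
    # first_row[c] = number of symbols in the text that are smaller than c.
    counts: dict[str, int] = {}
    for ch in bwt_str:
        counts[ch] = counts.get(ch, 0) + 1
    first_row: dict[str, int] = {}
    total = 0
    for ch in sorted(counts):
        first_row[ch] = total
        total += counts[ch]

    lo, hi = 0, len(bwt_str)  # half-open interval of matching sorted rotations
    for ch in reversed(pattern):
        if ch not in first_row:
            return 0
        # Naive rank queries; a real index answers these in constant time.
        lo = first_row[ch] + bwt_str[:lo].count(ch)
        hi = first_row[ch] + bwt_str[:hi].count(ch)
        if lo >= hi:
            return 0
    return hi - lo


if __name__ == "__main__":
    transformed = bwt("banana")
    print(transformed)
    print(run_lengths(transformed))
    print(count_occurrences(transformed, "ana"))
```

The rank queries here rescan the string on every step, so the sketch only shows the logic of the search; a practical index precomputes the information needed to answer each step quickly.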
Limitations
Movi's index is larger than those of other tools, which may limit its applicability in some scenarios.
Digital Object Identifier (DOI)