Movi: A fast and cache-efficient full-text pangenome index
2024

Sample size: 800000 | publication | Evidence: high

Author Information

Author(s): Mohsen Zakeri, Nathaniel K. Brown, Omar Y. Ahmed, Travis Gagie, Ben Langmead

Primary Institution: Johns Hopkins University

Hypothesis

We hypothesized the move structure would exhibit superior cache characteristics compared to SPUMONI.

Conclusion

Movi is the fastest available tool for full-text pangenome indexing and querying, making it well suited for real-time applications like adaptive sampling for nanopore sequencing.

Supporting Evidence

  • Movi computes sophisticated matching queries for classification up to 30 times faster than existing methods.
  • Movi can handle output from 26,890 nanopores simultaneously.
  • Movi's index grows more slowly than other pangenome indexes as genomes are added.

Takeaway

Movi is a new tool that helps scientists quickly analyze DNA sequences from many different organisms at the same time, and it is much faster than older tools.

Methodology

Movi uses a compressed-index data structure based on the Burrows-Wheeler Transform (BWT) for indexing and querying pangenomes.
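To make the idea of BWT-based indexing and querying concrete, here is a minimal Python sketch. It is an illustration only and does not reproduce Movi's actual data structure or code. The function names (`bwt`, `run_lengths`, `count_occurrences`) are hypothetical, the construction is the naive sorted-rotations method, and the query is a textbook backward search rather than anything specific to Movi.

```python
from collections import Counter


def bwt(text: str, sentinel: str = "$") -> str:
    """Burrows-Wheeler Transform via sorted rotations (naive; for illustration only)."""
    assert sentinel not in text, "sentinel must not occur in the text"
    s = text + sentinel
    rotations = sorted(s[i:] + s[:i] for i in range(len(s)))
    # The BWT is the last column of the sorted rotation matrix.
    return "".join(rotation[-1] for rotation in rotations)


def run_lengths(s: str) -> list[tuple[str, int]]:
    """Collapse a string into (character, run length) pairs."""
    runs: list[tuple[str, int]] = []
    for ch in s:
        if runs and runs[-1][0] == ch:
            runs[-1] = (ch, runs[-1][1] + 1)
        else:
            runs.append((ch, 1))
    return runs


def count_occurrences(bwt_str: str, pattern: str) -> int:
    """Count occurrences of pattern in the original text using backward search."""
    counts = Counter(bwt_str)
    # first_row[c] = number of characters in the text that sort before c.
    first_row: dict[str, int] = {}
    total = 0
    for c in sorted(counts):
        first_row[c] = total
        total += counts[c]

    lo, hi = 0, len(bwt_str)  # half-open range of matching rows
    for c in reversed(pattern):
        if c not in first_row:
            return 0
        # Rank queries done by plain counting here; real indexes precompute them.
        lo = first_row[c] + bwt_str[:lo].count(c)
        hi = first_row[c] + bwt_str[:hi].count(c)
        if lo >= hi:
            return 0
    return hi - lo


transformed = bwt("banana")
print(transformed)
print(run_lengths(transformed))
print(count_occurrences(transformed, "ana"))
```

In this sketch, equal characters tend to cluster into runs in the transformed string, which is the property that run-length-compressed BWT indexes exploit on repetitive collections such as pangenomes.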

Limitations

Movi's index is larger than those of other tools, which may limit its applicability in some scenarios.

Digital Object Identifier (DOI)

10.1016/j.isci.2024.111464
