MCHelper automatically curates transposable element libraries across eukaryotic species
2024

MCHelper: A Tool for Automating Transposable Element Library Curation

Sample size: 6 publication 10 minutes Evidence: high

Author Information

Author(s): Orozco-Arias Simon, Sierra Pío, Durbin Richard, González Josefa

Primary Institution: Institute of Evolutionary Biology, CSIC, UPF

Hypothesis

MCHelper can automate the curation of transposable element libraries to improve genome annotations.

Conclusion

MCHelper significantly enhances the quality and accuracy of transposable element libraries across various eukaryotic species.

Supporting Evidence

  • MCHelper reduces the number of consensus sequences by up to 65%.
  • It removes up to 11.4% of false positive sequences.
  • MCHelper libraries contain longer consensus sequences compared to raw libraries.
  • It can classify up to 48% of previously unclassified transposable elements.
  • MCHelper improves genome-wide TE annotations, reducing overlapping annotations.
  • The tool is easy to install and can be run with minimal input.

Takeaway

MCHelper is a computer program that helps scientists quickly and accurately organize genetic information about jumping genes in different organisms.

Methodology

MCHelper automates the curation process by integrating protocols from manual curation, using outputs from de novo TE identification tools.

Potential Biases

Potential biases may arise from the reliance on existing databases for classification and the inherent variability in TE content across species.

Limitations

The tool's effectiveness may vary based on the quality of the input libraries and the specific characteristics of the genomes being analyzed.

Participant Demographics

The study involved six species: fruit fly, rice, hooded crow, zebrafish, maize, and human.

Digital Object Identifier (DOI)

10.1101/gr.278821.123

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication