ReRep: A Tool for Detecting Repetitive Sequences in Genomes
Author Information
Author(s): Otto Thomas D, Gomes Leonardo HF, Alves-Ferreira Marcelo, de Miranda Antonio B, Degrave Wim M
Primary Institution: Laboratory for Functional Genomics and Bioinformatics, IOC, Fiocruz, Rio de Janeiro, Brazil
Hypothesis
Can a computational pipeline effectively identify repetitive sequences in genome survey sequences?
Conclusion
The ReRep approach for identifying repetitive elements in GSS datasets proved to be straightforward and efficient.
Supporting Evidence
- ReRep was applied to a dataset of 970 GSSs from Leishmania braziliensis.
- The pipeline effectively identified several potential repetitive sequences.
- ReRep was also tested on E. coli data to validate its applicability.
Takeaway
Scientists created a tool called ReRep to help find repeated DNA sequences in genomes, which can make understanding and assembling genomes easier.
Methodology
The ReRep pipeline uses similarity searches and sequence landscapes to identify repetitive sequences in GSS datasets.
Potential Biases
Bias may arise from library construction, sequencing procedures, or data submission, leading to false positives.
Limitations
The accuracy of repeat detection is influenced by the coverage of the dataset and the parameters set for analysis.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website