Algebraic Distribution of Segmental Duplication Lengths in Whole-Genome Sequence Self-Alignments
2011

Algebraic Distribution of Genomic Duplications

publication Evidence: moderate

Author Information

Author(s): Gao Kun, Miller Jonathan, DeSalle Robert

Primary Institution: Okinawa Institute of Science and Technology

Hypothesis

The study investigates the distribution of duplicated sequence lengths in whole-genome sequences and their implications for evolutionary modeling.

Conclusion

The observed distributions of duplicated sequences deviate from exponential forms, suggesting a novel kind of long-distance correlation in genome evolution.

Supporting Evidence

  • The study identifies a heavy, roughly algebraic tail for long sequences in genomic duplications.
  • Duplicated sequence lengths exhibit a power-law distribution rather than an exponential one.
  • Findings suggest that long-range correlations in genome sequences are significant in evolutionary processes.

Takeaway

Scientists looked at how often certain DNA sequences are duplicated in genomes and found that the patterns are more complex than expected, hinting at new ways DNA can change over time.

Methodology

The study involved self-comparisons of several genomes using standard genome alignment methods to analyze the distribution of duplicated sequence lengths.

Limitations

The study may be limited by the quality of genome assemblies and the exclusion of certain repetitive sequences.

Digital Object Identifier (DOI)

10.1371/journal.pone.0018464

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication