Evaluating Context-Dependent Substitution Models in Molecular Evolution
Author Information
Author(s): Lindsay Helen, Yap Von Bing, Hua Ying, Huttley Gavin A
Primary Institution: The Australian National University
Hypothesis
Are the nucleotide frequency weighted (NF) models more suitable than tuple frequency weighted (TF) models for measuring context-dependent substitution effects?
Conclusion
The NF model is more reliable for analyzing context-dependent substitution processes, as it avoids confounding effects related to nucleotide composition.
Supporting Evidence
- The NF model consistently nests the independent nucleotide process, while the TF model does not.
- Simulations showed that TF can falsely identify context effects when none exist.
- Analysis of primate introns revealed overestimation of nucleotide neighbor effects under the TF model.
Takeaway
This study shows that the way we look at how neighboring DNA letters affect mutations can change our understanding of evolution, and one method is better than the other.
Methodology
The study compared NF and TF models using simulations and real biological data from primate introns to assess their effectiveness in measuring context-dependent substitution rates.
Potential Biases
The TF model may lead to misleading conclusions about context effects due to its dependence on nucleotide composition.
Limitations
The study primarily focuses on intron sequences and may not generalize to other genomic regions or organisms.
Participant Demographics
Aligned introns from human, chimpanzee, and macaque genomes were analyzed.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website