Pitfalls of the most commonly used models of context dependent substitution
2008

Evaluating Context-Dependent Substitution Models in Molecular Evolution

Sample size: 470 publication 10 minutes Evidence: high

Author Information

Author(s): Lindsay Helen, Yap Von Bing, Hua Ying, Huttley Gavin A

Primary Institution: The Australian National University

Hypothesis

Are the nucleotide frequency weighted (NF) models more suitable than tuple frequency weighted (TF) models for measuring context-dependent substitution effects?

Conclusion

The NF model is more reliable for analyzing context-dependent substitution processes, as it avoids confounding effects related to nucleotide composition.

Supporting Evidence

  • The NF model consistently nests the independent nucleotide process, while the TF model does not.
  • Simulations showed that TF can falsely identify context effects when none exist.
  • Analysis of primate introns revealed overestimation of nucleotide neighbor effects under the TF model.

Takeaway

This study shows that the way we look at how neighboring DNA letters affect mutations can change our understanding of evolution, and one method is better than the other.

Methodology

The study compared NF and TF models using simulations and real biological data from primate introns to assess their effectiveness in measuring context-dependent substitution rates.

Potential Biases

The TF model may lead to misleading conclusions about context effects due to its dependence on nucleotide composition.

Limitations

The study primarily focuses on intron sequences and may not generalize to other genomic regions or organisms.

Participant Demographics

Aligned introns from human, chimpanzee, and macaque genomes were analyzed.

Digital Object Identifier (DOI)

10.1186/1745-6150-3-52

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication