Improving Exon Prediction with Comparative Genomics
Author Information
Author(s): Wu Jing, Mary Qu Yang, Jack Y Yang, Hamid R Arabnia, Youping Deng
Primary Institution: Purdue University
Hypothesis
Can a statistical approach utilizing evolutionary conservation improve the specificity of exon prediction?
Conclusion
The proposed method significantly enhances the specificity of exon prediction while maintaining high sensitivity.
Supporting Evidence
- The method improved exon specificity by 73% and 32% for GENSCAN and TWINSCAN respectively.
- It retained 98% of correctly predicted RefSeq gene structures while removing 26% of predicted genes in non-coding regions.
- The log odds ratio was used to classify putative exons effectively.
Takeaway
This study shows a new way to find real gene parts in DNA by comparing them with similar DNA from other species, making it easier to tell which parts are important.
Methodology
A statistical model based on evolutionary conservation and codon dependency was developed to classify putative exons.
Potential Biases
The performance is dependent on the quality of the putative exons provided by existing methods.
Limitations
The method does not predict new exons and relies on the sensitivity of existing gene prediction methods.
Statistical Information
Statistical Significance
p<0.05
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website