Analysis of Transcribed Regions in the Human Genome
Author Information
Author(s): Wu Jia Qian, Du Jiang, Rozowsky Joel, Zhang Zhengdong, Urban Alexander E, Euskirchen Ghia, Weissman Sherman, Gerstein Mark, Snyder Michael
Primary Institution: Yale University
Hypothesis
How much of the human genome produces transcripts that are present in the mRNA population?
Conclusion
RACE sequencing reveals that much of the human genome is represented in poly(A)+ RNA and that a fraction of novel RNAs can encode proteins.
Supporting Evidence
- RACE sequencing detected low levels of transcripts in specific cell types that were not detectable by microarrays.
- 60% of novel transcriptionally active regions are connected to other novel TARs or known exons.
- 17% of previously unannotated random regions were shown to produce overlapping transcripts.
- 9% of the novel transcripts identified may encode proteins.
Takeaway
Scientists looked at parts of our DNA to see how much of it makes RNA, and they found that a lot of it does, which is important for understanding how our genes work.
Methodology
The study used RACE sequencing to analyze transcribed loci in selected ENCODE regions.
Potential Biases
Potential artifacts from reverse transcription may affect the detection of antisense transcripts.
Limitations
The study primarily focused on specific cell types and may not represent all human tissues.
Participant Demographics
RNA samples were taken from human cell lines and placental tissue.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website