Systematic analysis of transcribed loci in ENCODE regions using RACE sequencing reveals extensive transcription in the human genome
2008

Analysis of Transcribed Regions in the Human Genome

Sample size: 420 publication 10 minutes Evidence: high

Author Information

Author(s): Wu Jia Qian, Du Jiang, Rozowsky Joel, Zhang Zhengdong, Urban Alexander E, Euskirchen Ghia, Weissman Sherman, Gerstein Mark, Snyder Michael

Primary Institution: Yale University

Hypothesis

How much of the human genome produces transcripts that are present in the mRNA population?

Conclusion

RACE sequencing reveals that much of the human genome is represented in poly(A)+ RNA and that a fraction of novel RNAs can encode proteins.

Supporting Evidence

  • RACE sequencing detected low levels of transcripts in specific cell types that were not detectable by microarrays.
  • 60% of novel transcriptionally active regions are connected to other novel TARs or known exons.
  • 17% of previously unannotated random regions were shown to produce overlapping transcripts.
  • 9% of the novel transcripts identified may encode proteins.

Takeaway

Scientists looked at parts of our DNA to see how much of it makes RNA, and they found that a lot of it does, which is important for understanding how our genes work.

Methodology

The study used RACE sequencing to analyze transcribed loci in selected ENCODE regions.

Potential Biases

Potential artifacts from reverse transcription may affect the detection of antisense transcripts.

Limitations

The study primarily focused on specific cell types and may not represent all human tissues.

Participant Demographics

RNA samples were taken from human cell lines and placental tissue.

Digital Object Identifier (DOI)

10.1186/gb-2008-9-1-r3

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication