Unexpected observations after mapping LongSAGE tags to the human genome
2007

Mapping LongSAGE Tags to the Human Genome

Sample size: 3616090 publication 10 minutes Evidence: moderate

Author Information

Author(s): Keime Céline, Sémon Marie, Mouchiroud Dominique, Duret Laurent, Gandrillon Olivier

Primary Institution: Université de Lyon

Hypothesis

What is the origin of LongSAGE tags that do not match the human genome sequence?

Conclusion

The study found that many LongSAGE tags do not match the human genome, suggesting a more complex human transcriptome than currently annotated.

Supporting Evidence

  • 31% of the tags correspond to unannotated transcripts.
  • Nearly half of the tags mapping to known transcripts are located in antisense or new variants.
  • The study suggests that the level of sequencing error may have been underestimated.

Takeaway

The researchers looked at tags from a method called LongSAGE to see where they fit in the human genome, and found many tags that didn't match, meaning there are likely more genes than we thought.

Methodology

The study involved filtering LongSAGE tags from public libraries, mapping them to the human genome, and analyzing their origins.

Potential Biases

There may be bias due to contamination from murine transcripts in the libraries used.

Limitations

The study's conclusions are limited by the potential underestimation of sequencing errors.

Participant Demographics

The tags were derived from various human tissues, including stem cell lines and tumoral tissues.

Statistical Information

P-Value

0.000744

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-8-154

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication