Characterization and modeling of the Haemophilus influenzae core and supragenomes based on the complete genomic sequences of Rd and 12 clinical nontypeable strains
2007

Understanding the Genomes of Haemophilus influenzae Strains

Sample size: 13 publication 10 minutes Evidence: moderate

Author Information

Author(s): Hogg Justin S, Hu Fen Z, Janto Benjamin, Boissy Robert, Hayes Jay, Keefe Randy, Post J Christopher, Ehrlich Garth D

Primary Institution: Allegheny General Hospital, Allegheny-Singer Research Institute, Center for Genomic Sciences, Pittsburgh, Pennsylvania, USA

Hypothesis

The distributed genome hypothesis (DGH) posits that chronic bacterial pathogens utilize polyclonal infection and reassortment of genic characters to ensure persistence in the face of adaptive host defenses.

Conclusion

The study developed a finite supragenome model that predicts the NTHi supragenome contains between 4,425 and 6,052 genes, supporting the distributed genome hypothesis.

Supporting Evidence

  • 2,786 genes were identified among the strains, with 1,461 common to all strains.
  • Comparisons revealed between 107 and 158 insertions and 100 and 213 deletions per genome.
  • The mean insertion and deletion sizes were 1,356 and 1,020 base-pairs, respectively.

Takeaway

Scientists studied the genes of different strains of a bacteria called Haemophilus influenzae to see how they change and adapt, finding that they share many genes that help them survive.

Methodology

The genomes of 12 clinical nontypeable H. influenzae strains were sequenced and compared with a reference strain to characterize the core and supragenomes.

Potential Biases

The clustering methods may introduce bias due to frame shifts and assembly gaps.

Limitations

The study may have biases due to the use of unfinished genomes and potential sequencing errors.

Participant Demographics

The study involved 12 clinical strains of nontypeable H. influenzae isolated from patients.

Statistical Information

Confidence Interval

4,425 to 6,052

Digital Object Identifier (DOI)

10.1186/gb-2007-8-6-r103

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication