Assessing batch effects of genotype calling algorithm BRLMM for the Affymetrix GeneChip Human Mapping 500 K array set using 270 HapMap samples
2008

Effects of Batch Size and Composition on Genotype Calling in GWAS

Sample size: 270 publication Evidence: high

Author Information

Author(s): Hong Huixiao, Su Zhenqiang, Ge Weigong, Shi Leming, Perkins Roger, Fang Hong, Xu Joshua, Chen James J, Han Tao, Kaput Jim, Fuscoe James C, Tong Weida

Primary Institution: National Center for Toxicological Research, US Food and Drug Administration

Hypothesis

How do batch size and composition affect the accuracy of genotype calling algorithms in genome-wide association studies?

Conclusion

Batch size and composition significantly affect genotype calling results in GWAS using the BRLMM algorithm.

Supporting Evidence

  • Batch size and composition significantly affect genotype calling results.
  • Larger batch sizes lead to more consistent genotype calls.
  • Homogeneous samples in batches yield higher call rates.

Takeaway

When scientists study genes, they need to be careful about how they group their samples, because the way they do it can change the results.

Methodology

The study analyzed the effects of batch size and composition on genotype calling using the BRLMM algorithm with data from 270 HapMap samples.

Potential Biases

Potential bias due to the selection of samples and batch composition.

Limitations

The study may not generalize to other genotype calling algorithms or different datasets.

Participant Demographics

270 HapMap samples from three population groups: European, Asian, and African.

Statistical Information

P-Value

1.736 × 10-6

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-9-S9-S17

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication