Microarray data mining: A novel optimization-based approach to uncover biologically coherent structures
2008

Optimizing Gene Clustering from Microarray Data

Sample size: 5652 publication 15 minutes Evidence: high

Author Information

Author(s): Tan Meng P, Smith Erin N, Broach James R, Floudas Christodoulos A

Primary Institution: Princeton University

Hypothesis

Can an iterative clustering approach improve the biological coherence of gene clusters derived from DNA microarray data?

Conclusion

The iterative clustering method significantly enhances the biological coherence of gene clusters, allowing for better insights into gene functions and regulatory networks.

Supporting Evidence

  • 64% of genes fell into clusters with significant functional coherence.
  • After 6 iterations, over 90% of genes were placed in clusters with p-values of 10^-3 or less.
  • The iterative method improved the average cluster correlation.
  • Comparison with other clustering methods showed superior performance in biological coherence.

Takeaway

This study shows a new way to group genes based on their behavior, helping scientists understand how they work together, especially when studying yeast.

Methodology

The study used an iterative clustering algorithm called EP_GOS_Clust to analyze DNA microarray data from yeast, assessing biological coherence through Gene Ontology annotations.

Potential Biases

Potential bias from reliance on existing Gene Ontology annotations.

Limitations

The method may be less effective in organisms with sparse or inaccurate functional annotations.

Participant Demographics

Yeast (Saccharomyces cerevisiae) gene expression data.

Statistical Information

P-Value

10^-3

Statistical Significance

p<0.001

Digital Object Identifier (DOI)

10.1186/1471-2105-9-268

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication