Fast Gene Ontology Clustering for Microarray Experiments
Author Information
Author(s): Kristian Ovaska, Marko Laakso, Sampsa Hautaniemi
Primary Institution: University of Helsinki
Hypothesis
Can we develop a fast software for advanced gene annotation using semantic similarity for Gene Ontology terms?
Conclusion
The R-based semantic similarity package offers a speed advantage of over 2000-fold compared to existing implementations, facilitating advanced gene annotation.
Supporting Evidence
- The csbl.go package achieved speed gains of 2100-5000-fold compared to other semantic similarity packages.
- The package allows for the analysis of gene sets with hundreds of genes typically seen in microarray experiments.
- Hierarchical clustering and heat map visualization were used to identify gene clusters based on GO annotations.
Takeaway
This study created a tool that helps scientists quickly group genes based on their functions, making it easier to understand how they work together.
Methodology
The study developed a software package that computes semantic similarities between genes based on Gene Ontology annotations and visualizes the results using clustering and heat maps.
Limitations
The package relies on the accuracy of GO annotations and may not capture all biological processes.
Statistical Information
P-Value
0.0063
Statistical Significance
p<0.05
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website