New R Function for Aggregating Gene Expression Data
Author Information
Author(s): Jeremy A. Miller, Chaochao Cai, Peter Langfelder, Daniel H. Geschwind, Sunil M. Kurian, Daniel R. Salomon, Steve Horvath
Primary Institution: UCLA
Hypothesis
How can we effectively summarize multiple related gene expression variables into a single representative value?
Conclusion
The R function collapseRows provides robust methods for aggregating gene expression data, improving reproducibility and biological relevance.
Supporting Evidence
- The collapseRows function improves reproducibility in gene expression analysis.
- Choosing the probe with the highest mean expression leads to better consistency across studies.
- Network-based methods for collapsing gene data can enhance biological interpretation.
Takeaway
Scientists created a new tool in R that helps combine lots of gene data into simpler forms, making it easier to understand and compare.
Methodology
The study introduces the collapseRows function, which implements various statistical and network-based methods for collapsing gene expression data.
Limitations
The study primarily uses weighted correlation networks and does not explore other potential methods for finding representative genes.
Statistical Information
P-Value
p<10-8
Statistical Significance
p<10-8
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website