Strategies for aggregating gene expression data: The collapseRows R function
2011

New R Function for Aggregating Gene Expression Data

publication Evidence: high

Author Information

Author(s): Jeremy A. Miller, Chaochao Cai, Peter Langfelder, Daniel H. Geschwind, Sunil M. Kurian, Daniel R. Salomon, Steve Horvath

Primary Institution: UCLA

Hypothesis

How can we effectively summarize multiple related gene expression variables into a single representative value?

Conclusion

The R function collapseRows provides robust methods for aggregating gene expression data, improving reproducibility and biological relevance.

Supporting Evidence

  • The collapseRows function improves reproducibility in gene expression analysis.
  • Choosing the probe with the highest mean expression leads to better consistency across studies.
  • Network-based methods for collapsing gene data can enhance biological interpretation.

Takeaway

Scientists created a new tool in R that helps combine lots of gene data into simpler forms, making it easier to understand and compare.

Methodology

The study introduces the collapseRows function, which implements various statistical and network-based methods for collapsing gene expression data.

Limitations

The study primarily uses weighted correlation networks and does not explore other potential methods for finding representative genes.

Statistical Information

P-Value

p<10-8

Statistical Significance

p<10-8

Digital Object Identifier (DOI)

10.1186/1471-2105-12-322

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication