MCAM: Multiple Clustering Analysis Methodology for Deriving Hypotheses and Insights from High-Throughput Proteomic Datasets
2011

MCAM: A New Method for Analyzing High-Throughput Proteomic Data

Sample size: 77 publication 10 minutes Evidence: moderate

Author Information

Author(s): Kristen M. Naegle, Roy E. Welsch, Michael B. Yaffe, Forest M. White, Douglas A. Lauffenburger, Jason A. Papin

Primary Institution: Massachusetts Institute of Technology

Hypothesis

Can a new computational framework enhance the analysis of high-throughput proteomic datasets to infer biological insights?

Conclusion

The MCAM framework successfully identifies biological insights from complex proteomic data by employing diverse clustering techniques.

Supporting Evidence

  • MCAM employs diverse data transformations and clustering algorithms to enhance biological insight extraction.
  • The framework was applied to dynamic phosphorylation measurements of the ERBB network.
  • Statistical enrichment analysis was used to evaluate the biological relevance of clustering results.
  • MCAM allows for the exploration of relationships between clustering parameters and biological metrics.
  • Robust biological hypotheses were generated through the analysis of clustering results.
  • MCAM can be applied to various proteomic datasets to improve understanding of molecular networks.
  • Results indicate that clustering parameters significantly affect the biological insights derived from the data.
  • MCAM provides a method for comparing independent measurements of biological systems.

Takeaway

This study created a new way to analyze protein data that helps scientists understand how proteins work together in cells.

Methodology

The study developed a framework that combines various clustering algorithms and metrics to analyze proteomic data and infer biological meaning.

Potential Biases

Potential biases may arise from the choice of clustering parameters and the reliance on existing biological annotations.

Limitations

The study's findings may not generalize to all types of biological datasets due to the specific nature of the clustering parameters used.

Participant Demographics

The study focused on human mammary epithelial cells (HMEC) and their response to EGF stimulation.

Statistical Information

P-Value

p<0.05

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1371/journal.pcbi.1002119

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication