Statistical analysis of genomic protein family and domain controlled annotations for functional investigation of classified gene lists
2007

New Tools for Analyzing Protein Families and Domains in Genes

Sample size: 321 publication 10 minutes Evidence: moderate

Author Information

Author(s): Masseroli Marco, Bellistri Elisa, Franceschini Andrea, Pinciroli Francesco

Primary Institution: Politecnico di Milano

Hypothesis

Can new modules in GFINDer improve the analysis of protein family and domain annotations for gene classification?

Conclusion

The new GFINDer modules enhance the understanding of gene functions by providing better analyses of protein family and domain annotations.

Supporting Evidence

  • The GFINDer modules allow for the annotation of user-classified nucleotide sequences with controlled protein family information.
  • Statistical analysis highlighted significant protein families and domains associated with specific gene classes.
  • Logistic regression analysis identified functional characteristics that explain gene classifications.

Takeaway

The researchers created new tools to help scientists understand what genes do by looking at the proteins they make and how those proteins are related.

Methodology

The study involved developing new modules in GFINDer for statistical analysis of protein family and domain annotations based on user-uploaded gene lists.

Potential Biases

Potential bias may arise from reliance on existing databases for protein annotations, which may not cover all genes comprehensively.

Limitations

The study may be limited by the availability and accuracy of protein family and domain annotations in the databases used.

Statistical Information

P-Value

p = 0.00932, p = 0.02122, p < 0.00001

Statistical Significance

p<0.05

Digital Object Identifier (DOI)

10.1186/1471-2105-8-S1-S14

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication