DWARF – a data warehouse system for analyzing protein families
2006

DWARF: A Data Warehouse for Protein Families

Sample size: 6138 publication Evidence: high

Author Information

Author(s): Fischer Markus, Thai Quan K, Grieb Melanie, Pleiss Jürgen

Primary Institution: Institute of Technical Biochemistry, University of Stuttgart

Hypothesis

The DWARF data warehouse system can effectively integrate and analyze diverse biological data related to protein families.

Conclusion

DWARF serves as a valuable tool for constructing databases of large structurally related protein families and evaluating their sequence-structure-function relationships.

Supporting Evidence

  • DWARF integrates data from various public databases to provide a comprehensive view of protein families.
  • The system has been applied to the family of α/β-hydrolases, hosting the Lipase Engineering database.
  • Release 2.3 contains 6138 sequences and 167 experimentally determined protein structures.

Takeaway

DWARF is like a big library that helps scientists understand how different proteins work together by organizing lots of information about them.

Methodology

DWARF integrates data on sequence, structure, and functional annotation for protein fold families using a relational data model.

Limitations

The functional annotation can be incomplete and inconsistent due to manual integration from publications.

Digital Object Identifier (DOI)

10.1186/1471-2105-7-495

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication