DWARF: A Data Warehouse for Protein Families
Author Information
Author(s): Fischer Markus, Thai Quan K, Grieb Melanie, Pleiss Jürgen
Primary Institution: Institute of Technical Biochemistry, University of Stuttgart
Hypothesis
The DWARF data warehouse system can effectively integrate and analyze diverse biological data related to protein families.
Conclusion
DWARF serves as a valuable tool for constructing databases of large structurally related protein families and evaluating their sequence-structure-function relationships.
Supporting Evidence
- DWARF integrates data from various public databases to provide a comprehensive view of protein families.
- The system has been applied to the family of α/β-hydrolases, hosting the Lipase Engineering database.
- Release 2.3 contains 6138 sequences and 167 experimentally determined protein structures.
Takeaway
DWARF is like a big library that helps scientists understand how different proteins work together by organizing lots of information about them.
Methodology
DWARF integrates data on sequence, structure, and functional annotation for protein fold families using a relational data model.
Limitations
The functional annotation can be incomplete and inconsistent due to manual integration from publications.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website