CATH v4.4: Major Expansion of CATH by Experimental and Predicted Structural Data
Author Information
Author(s): Waman Vaishali P, Bordin Nicola, Lau Andy, Kandathil Shaun, Wells Jude, Miller David, Velankar Sameer, Jones David T, Sillitoe Ian, Orengo Christine
Primary Institution: University College London
Conclusion
The CATH database has significantly expanded its structural information, revealing new folds and architectures, although many of these remain hypothetical until experimentally confirmed.
Supporting Evidence
- CATH has expanded the number of superfamilies from 5841 to 6573.
- The number of folds increased from 1349 to 2078.
- The number of architectures increased from 41 to 77.
- FunFams coverage increased by 276%.
Takeaway
CATH is like a big library that helps scientists understand protein structures better, and now it has a lot more information to help them find new types of proteins.
Methodology
The study involved classifying protein domains using a combination of automated deep-learning approaches and manual curation.
Limitations
Many new folds and architectures are hypothetical and require experimental confirmation.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website