InterPro: the protein sequence classification resource in 2025
Author Information
Author(s): Blum Matthias, Andreeva Antonina, Florentino Laise Cavalcanti, Chuguransky Sara Rocio, Grego Tiago, Hobbs Emma, Pinto Beatriz Lazaro, Orr Ailsa, Paysan-Lafosse Typhaine, Ponamareva Irina, Salazar Gustavo A, Bordin Nicola, Bork Peer, Bridge Alan, Colwell Lucy, Gough Julian, Haft Daniel H, Letunic Ivica, Llinares-López Felipe, Marchler-Bauer Aron, Meng-Papaxanthos Laetitia, Mi Huaiyu, Natale Darren A, Orengo Christine A, Pandurangan Arun P, Piovesan Damiano, Rivoire Catherine, Sigrist Christian J A, Thanki Narmada, Thibaud-Nissen Françoise, Thomas Paul D, Tosatto Silvio C E, Wu Cathy H, Bateman Alex
Primary Institution: European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI)
Conclusion
InterPro has significantly improved its capabilities in protein sequence classification and functional annotation by integrating new data sources and enhancing user accessibility.
Supporting Evidence
- InterPro provides annotations for over 200 million sequences.
- More than 5000 new InterPro entries have been created in the past two years.
- InterPro integrates predictive models from multiple databases to classify protein sequences.
Takeaway
InterPro helps scientists understand proteins by organizing them into families and providing information about their functions, making it easier to study them.
Methodology
InterPro integrates predictive models from various member databases to classify protein sequences and predict the presence of domains and significant sites.
Limitations
The manual curation process remains a significant bottleneck in updating and integrating new data into InterPro.
Digital Object Identifier (DOI)
Want to read the original?
Access the complete publication on the publisher's website