UniProt: the Universal Protein Knowledgebase in 2025
2025

UniProt: the Universal Protein Knowledgebase in 2025

publication Evidence: high

Author Information

Author(s): Alex Bateman, Maria-Jesus Martin, Sandra Orchard, Michele Magrane, Aduragbemi Adesina, Shadab Ahmad, Emily H. Bowler-Barnett, Hema Bye-A-Jee, David Carpentier, Paul Denny, Jun Fan, Penelope Garmiri, Leonardo Jose da Costa Gonzales, Abdulrahman Hussein, Alexandr Ignatchenko, Giuseppe Insana, Rizwan Ishtiaq, Vishal Joshi, Dushyanth Jyothi, Swaathi Kandasaamy, Antonia Lock, Aurelien Luciani, Jie Luo, Yvonne Lussi, Juan Sebastian Martinez Marin, Pedro Raposo, Daniel L. Rice, Rafael Santos, Elena Speretta, James Stephenson, Prabhat Totoo, Nidhi Tyagi, Nadya Urakova, Preethi Vasudev, Kate Warner, Supun Wijerathne, Conny Wing-Heng Yu, Rossana Zaru, Alan J. Bridge, Lucila Aimo, Ghislaine Argoud-Puy, Andreas H. Auchincloss, Kristian B. Axelsen, Parit Bansal, Delphine Baratin, Teresa M. Batista Neto, Marie-Claude Blatter, Jerven T. Bolleman, Emmanuel Boutet, Lionel Breuza, Blanca Cabrera Gil, Cristina Casals-Casas, Kamal Chikh Echioukh, Elisabeth Coudert, Beatrice Cuche, Edouard de Castro, Anne Estreicher, Maria L. Famiglietti, Marc Feuermann, Elisabeth Gasteiger, Pascale Gaudet, Sebastien Gehant, Vivienne Gerritsen, Arnaud Gos, Nadine Gruaz, Chantal Hulo, Nevila Hyka-Nouspikel, Florence Jungo, Arnaud Kerhornou, Philippe Le Mercier, Damien Lieberherr, Patrick Masson, Anne Morgat, Salvo Paesano, Ivo Pedruzzi, Sandrine Pilbout, Lucille Pourcel, Sylvain Poux, Monica Pozzato, Manuela Pruess, Nicole Redaschi, Catherine Rivoire, Christian J. A. Sigrist, Karin Sonesson, Shyamala Sundaram, Anastasia Sveshnikova, Cathy H. Wu, Cecilia N. Arighi, Chuming Chen, Yongxing Chen, Hongzhan Huang, Kati Laiho, Minna Lehvaslaiho, Peter McGarvey, Darren A. Natale, Karen Ross, C. R. Vinayaka, Yuqi Wang, Jian Zhang

Primary Institution: European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI)

Conclusion

The UniProt database will adapt its data processing pipelines to manage the increasing volume of whole genome sequencing data while maintaining high-quality service.

Supporting Evidence

  • UniProt has been awarded Global Core Biodata Resource status for its critical role in molecular biology.
  • The database aims to provide high-quality, non-redundant reference proteomes.
  • Community curation is encouraged to ensure key publications are not missed.
  • Machine learning techniques are being utilized to assist in the curation process.
  • UniProtKB includes annotations to over 12,500 biochemical reactions linked to protein sequence records.

Takeaway

UniProt is a big database that helps scientists find and understand proteins. They are making changes to keep it up-to-date and easy to use as more genomes are sequenced.

Methodology

The publication describes updates to the UniProt production pipeline, including manual curation, machine learning techniques, and community curation.

Digital Object Identifier (DOI)

10.1093/nar/gkae1010

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication