PanKB: An interactive microbial pangenome knowledgebase for research, biotechnological innovation, and knowledge mining
2025

PanKB: A Microbial Pangenome Knowledgebase

Sample size: 8402 publication Evidence: high

Author Information

Author(s): Sun Binhuan, Pashkova Liubov, Pieters Pascal Aldo, Harke Archana Sanjay, Mohite Omkar Satyavan, Santos Alberto, Zielinski Daniel C, Palsson Bernhard O, Phaneuf Patrick Victor

Primary Institution: Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark

Conclusion

PanKB serves as a comprehensive resource for microbial pangenomics, facilitating research and biotechnological applications through interactive analytics and AI-assisted knowledge extraction.

Supporting Evidence

  • PanKB includes 51 pangenomes from 8 industrially relevant microbial families.
  • It comprises 8402 genomes, over 500,000 genes, and over 7 million mutations.
  • PanKB features interactive pangenomic analytics and a global search function.
  • It integrates a bibliome of 833 open-access pangenomic papers.
  • PanKB empowers researchers to harness microbial pangenomics for practical applications.

Takeaway

PanKB is like a big library for scientists that helps them understand the genes of tiny living things called microbes, making it easier to find useful information for things like medicine and food.

Methodology

The PanKB database was constructed using genome data retrieved from NCBI, followed by quality control and pangenome construction using the BGCFlow pipeline.

Potential Biases

The selection of only industrially relevant species may overlook other important microbial strains.

Limitations

The initial collection was manually selected, which may introduce bias in the representation of microbial diversity.

Digital Object Identifier (DOI)

10.1093/nar/gkae1042

Want to read the original?

Access the complete publication on the publisher's website

View Original Publication