Helmholtz Gemeinschaft

Search
Browse
Statistics
Feeds

proGenomes4: providing 2 million accurately and consistently annotated high-quality prokaryotic genomes

[thumbnail of Publisher's Version]
Preview
PDF (Publisher's Version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB

Item Type:Article
Title:proGenomes4: providing 2 million accurately and consistently annotated high-quality prokaryotic genomes
Creators Name:Fullam, Anthony, Letunic, Ivica, Maistrenko, Oleksandr M., Castro, Alexandre Areias, Coelho, Luis Pedro, Grekova, Anastasiia, Schudoma, Christian, Khedkar, Supriya, Robbani, Mahdi, Kuhn, Michael, Schmidt, Thomas S.B., Bork, Peer and Mende, Daniel R.
Abstract:The pervasive availability of publicly available microbial genomes has opened many new avenues for microbiology research, yet it also demands robust quality control and consistent annotation pipelines to ensure meaningful biological insights. proGenomes4 (prokaryotic Genomes v4) addresses this challenge by providing a resource of nearly 2 million high-quality microbial genomes, a doubling in scale from previous versions, encompassing over 7 billion genes. Each genome underwent rigorous quality assessment and comprehensive functional annotation by applying multiple standardized annotation workflows, including the systematic identification of mobile genetic elements and biosynthetic gene clusters. proGenomes4 contains 32 887 species with ecological habitat metadata as well as precomputed pan-genomes. This substantially expanded resource provides the microbiology community with a foundation for large-scale comparative studies and is freely accessible via a newly developed command line interface and at https://progenomes.embl.de/.
Source:Nucleic Acids Research
ISSN:0305-1048
Publisher:Oxford University Press
Page Range:gkaf1208
Date:20 November 2025
Official Publication:https://doi.org/10.1093/nar/gkaf1208
PubMed:View item in PubMed
Related to:

Repository Staff Only: item control page

Downloads

Downloads per month over past year

Open Access
MDC Library