Helmholtz Gemeinschaft

A catalog of small proteins from the global microbiome

Item Type: Article
Title: A catalog of small proteins from the global microbiome
Creators Name: Duan, Y., Santos-Júnior, C.D., Schmidt, T.S., Fullam, A., de Almeida, B.L.S., Zhu, C., Kuhn, M., Zhao, X.M., Bork, P. and Coelho, L.P.
Abstract: Small open reading frames (smORFs) shorter than 100 codons are widespread and perform essential roles in microorganisms, where they encode proteins active in several cell functions, including signal pathways, stress response, and antibacterial activities. However, the ecology, distribution and role of small proteins in the global microbiome remain unknown. Here, we construct a global microbial smORFs catalog (GMSC) derived from 63,410 publicly available metagenomes across 75 distinct habitats and 87,920 high-quality isolate genomes. GMSC contains 965 million non-redundant smORFs with comprehensive annotations. We find that archaea harbor more smORFs proportionally than bacteria. We also provide a tool called GMSC-mapper to identify and annotate small proteins from microbial (meta)genomes. Overall, this publicly available resource demonstrates the immense and underexplored diversity of small proteins.
Keywords: Archaea, Bacteria, Bacterial Proteins, Metagenome, Microbiota, Molecular Sequence Annotation, Open Reading Frames
Source: Nature Communications
ISSN: 2041-1723
Publisher: Nature Publishing Group
Volume: 15
Number: 1
Page Range: 7563
Date: 31 August 2024
Official Publication: https://doi.org/10.1038/s41467-024-51894-6

Open Access
MDC Library