Helmholtz Gemeinschaft


Towards the biogeography of prokaryotic genes

Item Type:Article
Title:Towards the biogeography of prokaryotic genes
Creators Name:Coelho, L.P. and Alves, R. and Del Río, Á.R. and Myers, P.N. and Cantalapiedra, C.P. and Giner-Lamia, J. and Schmidt, T.S. and Mende, D.R. and Orakov, A. and Letunic, I. and Hildebrand, F. and Van Rossum, T. and Forslund, S.K. and Khedkar, S. and Maistrenko, O.M. and Pan, S. and Jia, L. and Ferretti, P. and Sunagawa, S. and Zhao, X.M. and Nielsen, H.B. and Huerta-Cepas, J. and Bork, P.
Abstract:Microbial genes encode the majority of the functional repertoire of life on earth. However, despite increasing efforts in metagenomic sequencing of various habitats, little is known about the distribution of genes across the global biosphere, with implications for human and planetary health. Here we constructed a non-redundant gene catalogue of 303 million species-level genes (clustered at 95% nucleotide identity) from 13,174 publicly available metagenomes across 14 major habitats and use it to show that most genes are specific to a single habitat. The small fraction of genes found in multiple habitats is enriched in antibiotic-resistance genes and markers for mobile genetic elements. By further clustering these species-level genes into 32 million protein families, we observed that a small fraction of these families contain the majority of the genes (0.6% of families account for 50% of the genes). The majority of species-level genes and protein families are rare. Furthermore, species-level genes, and in particular the rare ones, show low rates of positive (adaptive) selection, supporting a model in which most genetic variability observed within each protein family is neutral or nearly neutral.
Keywords:Computational Biology and Bioinformatics, Microbial Ecology
Publisher:Nature Publishing Group
Page Range:252-256
Date:13 January 2022
Official Publication:https://doi.org/10.1038/s41586-021-04233-4
PubMed:View item in PubMed

Repository Staff Only: item control page

Open Access
MDC Library