Helmholtz Gemeinschaft


Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Item Type:Preprint
Title:Fast genome-wide functional annotation through orthology assignment by eggNOG-mapper
Creators Name:Huerta-Cepas, J. and Forslund, K. and Szklarczyk, D. and Jensen, L.J. and von Mering, C. and Bork, P.
Abstract:Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines relatively inaccessible, less precise homology-based functional transfer is still the default for (meta-)genome annotation. We therefore developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from eggNOG. To validate our method, we benchmarked Gene Ontology predictions against two widely used homology-based approaches: BLAST and InterProScan. Compared to BLAST, eggNOG-mapper reduced by 7% the rate of false positive assignments, and increased by 19% the ratio of curated terms recovered over all terms assigned per protein. Compared to InterProScan, eggNOG-mapper achieved similar proteome coverage and precision, while predicting on average 32 more terms per protein and increasing by 26% the rate of curated terms recovered over total term assignments per protein. Through strict orthology assignments, eggNOG-mapper further renders more specific annotations than possible from domain similarity only (e.g. predicting gene family names). eggNOG-mapper runs ~15x than BLAST and at least 2.5x faster than InterProScan. The tool is available standalone or as an online service at http://eggnog-mapper.embl.de.
Publisher:Cold Spring Harbor Laboratory Press
Article Number:076331
Date:22 September 2016
Official Publication:https://doi.org/10.1101/076331
Related to:
https://edoc.mdc-berlin.de/16431/Final version

Repository Staff Only: item control page


Downloads per month over past year

Open Access
MDC Library