Helmholtz Gemeinschaft


InterPro, progress and status in 2005

[img] PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Item Type:Article
Title:InterPro, progress and status in 2005
Creators Name:Mulder, N.J. and Apweiler, R. and Attwood, T.K. and Bairoch, A. and Bateman, A. and Binns, D. and Bradley, P. and Bork, P. and Bucher, P. and Cerutti, L. and Copley, R. and Courcelle, E. and Das, U. and Durbin, R. and Fleischmann, W. and Gough, J. and Haft, D. and Harte, N. and Hulo, N. and Kahn, D. and Kanapin, A. and Krestyaninova, M. and Lonsdale, D. and Lopez, R. and Letunic, I. and Madera, M. and Maslen, J. and McDowall, J. and Mitchell, A. and Nikolskaya, A.N. and Orchard, S. and Pagni, M. and Ponting, C.P. and Quevillon, E. and Selengut, J. and Sigrist, C.J. and Silventoinen, V. and Studholme, D.J. and Vaughan, R. and Wu, C.H.
Abstract:InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is provided in an abstract, Gene Ontology mapping and links to specialized databases. New features of InterPro include extended protein match views, taxonomic range information and protein 3D structure data. One of the new match views is the InterPro Domain Architecture view, which shows the domain composition of protein matches. Two new entry types were introduced to better describe InterPro entries: these are active site and binding site. PIRSF and the structure-based SUPERFAMILY are the latest member databases to join InterPro, and CATH and PANTHER are soon to be integrated. InterPro release 8.0 contains 11 007 entries, representing 2573 domains, 8166 families, 201 repeats, 26 active sites, 21 binding sites and 20 post-translational modification sites. InterPro covers over 78% of all proteins in the Swiss-Prot and TrEMBL components of UniProt. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).
Keywords:Protein Databases, Tertiary Protein Structure, Proteins, Sequence Alignment, Protein Sequence Analysis, Systems Integration
Source:Nucleic Acids Research
Publisher:Oxford University Press
Number:Database Issue
Page Range:D201-D205
Date:1 January 2005
Official Publication:https://doi.org/10.1093/nar/gki106
PubMed:View item in PubMed

Repository Staff Only: item control page


Downloads per month over past year

Open Access
MDC Library