Helmholtz Gemeinschaft


Detection of alpha-rod protein repeats using a neural network and application to huntingtin

[img] PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Item Type:Article
Title:Detection of alpha-rod protein repeats using a neural network and application to huntingtin
Creators Name:Palidwor, G.A. and Shcherbinin, S. and Huska, M.R. and Rasko, T. and Stelzl, U. and Arumughan, A. and Foulle, R. and Porras, P. and Sanchez-Pulido, L. and Wanker, E.E. and Andrade-Navarro, M.A.
Abstract:A growing number of solved protein structures display an elongated structural domain, denoted here as alpha-rod, composed of stacked pairs of anti-parallel alpha-helices. Alpha-rods are flexible and expose a large surface, which makes them suitable for protein interaction. Although most likely originating by tandem duplication of a two-helix unit, their detection using sequence similarity between repeats is poor. Here, we show that alpha-rod repeats can be detected using a neural network. The network detects more repeats than are identified by domain databases using multiple profiles, with a low level of false positives (<10%). We identify alpha-rod repeats in approximately 0.4% of proteins in eukaryotic genomes. We then investigate the results for all human proteins, identifying alpha-rod repeats for the first time in six protein families, including proteins STAG1-3, SERAC1, and PSMD1-2 & 5. We also characterize a short version of these repeats in eight protein families of Archaeal, Bacterial, and Fungal species. Finally, we demonstrate the utility of these predictions in directing experimental work to demarcate three alpha-rods in huntingtin, a protein mutated in Huntington's disease. Using yeast two hybrid analysis and an immunoprecipitation technique, we show that the huntingtin fragments containing alpha-rods associate with each other. This is the first definition of domains in huntingtin and the first validation of predicted interactions between fragments of huntingtin, which sets up directions toward functional characterization of this protein. An implementation of the repeat detection algorithm is available as a Web server with a simple graphical output: http://www.ogic.ca/projects/ard. This can be further visualized using BiasViz, a graphic tool for representation of multiple sequence alignments.
Keywords:Algorithms, Amino Acid Sequence, Binding Sites, Computer Simulation, Chemical Models, Molecular Models, Molecular Sequence Data, Nerve Tissue Proteins, Neural Networks , Nuclear Proteins, Automated Pattern Recognition, Protein Binding, Amino Acid Repetitive Sequences, Protein Sequence Analysis
Source:PLoS Computational Biology
Publisher:Public Library of Science
Page Range:e1000304
Date:13 March 2009
Official Publication:https://doi.org/10.1371/journal.pcbi.1000304
PubMed:View item in PubMed

Repository Staff Only: item control page


Downloads per month over past year

Open Access
MDC Library