Helmholtz Gemeinschaft

Search
Browse
Statistics
Feeds

Binding site discovery from nucleic acid sequences by discriminative learning of hidden Markov models

[thumbnail of Article] PDF (Article) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
533kB
[thumbnail of Supplementary material] PDF (Supplementary material) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3MB

Item Type:Article
Title:Binding site discovery from nucleic acid sequences by discriminative learning of hidden Markov models
Creators Name:Maaskola, J. and Rajewsky, N.
Abstract:We present a discriminative learning method for pattern discovery of binding sites in nucleic acid sequences based on hidden Markov models. Sets of positive and negative example sequences are mined for sequence motifs whose occurrence frequency varies between the sets. The method offers several objective functions, but we concentrate on mutual information of condition and motif occurrence. We perform a systematic comparison of our method and numerous published motif-finding tools. Our method achieves the highest motif discovery performance, while being faster than most published methods. We present case studies of data from various technologies, including ChIP-Seq, RIP-Chip and PAR-CLIP, of embryonic stem cell transcription factors and of RNA-binding proteins, demonstrating practicality and utility of the method. For the alternative splicing factor RBM10, our analysis finds motifs known to be splicing-relevant. The motif discovery method is implemented in the free software package Discrover. It is applicable to genome- and transcriptome-scale data, makes use of available repeat experiments and aside from binary contrasts also more complex data configurations can be utilized.
Keywords:Binding Sites, Chromatin Immunoprecipitation, DNA-Binding Proteins, DNA Sequence Analysis, Embryonic Stem Cells, Markov Chains, Nucleotide Motifs, RNA-Binding Proteins, RNA Sequence Analysis, Transcription Factors, Animals, Mice
Source:Nucleic Acids Research
ISSN:0305-1048
Publisher:Oxford University Press
Volume:42
Number:21
Page Range:12995-13011
Date:1 December 2014
Official Publication:https://doi.org/10.1093/nar/gku1083
PubMed:View item in PubMed

Repository Staff Only: item control page

Downloads

Downloads per month over past year

Open Access
MDC Library