Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields

Item Type: Article
Title: Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
Creators Name: Pruteanu-Malinici, I. and Majoros, W.H. and Ohler, U.
Abstract:
Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and across multiple time points, phenotypes are frequently annotated independently, for individual time points only. In particular, for the analysis of developmental gene expression patterns, it is biologically sensible when images across multiple time points are jointly accounted for, such that spatial and temporal dependencies are captured simultaneously.
Methods: We describe a discriminative undirected graphical model to label gene-expression time-series image data, with an efficient training and decoding method based on the junction tree algorithm. The approach is based on an effective feature selection technique, consisting of a non-parametric sparse Bayesian factor analysis model. The result is a flexible framework, which can handle large-scale data with noisy incomplete samples, i.e. it can tolerate data missing from individual time points.
Results: Using the annotation of gene expression patterns across stages of Drosophila embryonic development as an example, we demonstrate that our method achieves superior accuracy, gained by jointly annotating phenotype sequences, when compared with previous models that annotate each stage in isolation. The experimental results on missing data indicate that our joint learning method successfully annotates genes for which no expression data are available for one or more stages.
Keywords: Algorithms, Bayes Theorem, Computer-Assisted Image Processing, Controlled Vocabulary, Embryonic Development, Gene Expression Profiling, In Situ Hybridization, Messenger RNA, Nonparametric Statistics, Statistical Factor Analysis, Statistical Models, Animals, Drosophila
Source: Bioinformatics
ISSN: 1367-4803
Publisher: Oxford University Press
Volume: 29
Number: 13
Page Range: i27-i35
Date: 1 July 2013
Official Publication: https://doi.org/10.1093/bioinformatics/btt206
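
As a rough illustration of the joint annotation idea in the Methods part of the abstract, the sketch below decodes a single annotation term across a sequence of developmental stages using max-product message passing on a chain, which is essentially what junction tree inference reduces to when the graph is a simple chain. It is not the authors' implementation. The chain structure, the binary absent/present label, the hand-set log-potentials, and the function name `decode_chain` are all assumptions made for illustration. A stage with no image is given uniform potentials, so its label is inferred from the neighbouring stages.

```python
from typing import List, Optional, Sequence


def decode_chain(
    unary: Sequence[Optional[Sequence[float]]],
    pairwise: Sequence[Sequence[float]],
) -> List[int]:
    """Return the highest-scoring label sequence on a chain (Viterbi).

    unary[t][k]    -- log-potential for label k at stage t, or None if the
                      stage has no image (hypothetical encoding of missing data)
    pairwise[a][b] -- log-potential for label a at stage t followed by
                      label b at stage t + 1
    """
    n_labels = len(pairwise)
    # A missing stage contributes no evidence: all labels score 0.0.
    scores = [list(u) if u is not None else [0.0] * n_labels for u in unary]

    best = list(scores[0])          # best score of any path ending in each label
    back: List[List[int]] = []      # back-pointers, one list per transition

    for t in range(1, len(scores)):
        new_best, pointers = [], []
        for b in range(n_labels):
            cand = [best[a] + pairwise[a][b] for a in range(n_labels)]
            a_star = max(range(n_labels), key=cand.__getitem__)
            new_best.append(cand[a_star] + scores[t][b])
            pointers.append(a_star)
        best = new_best
        back.append(pointers)

    # Trace the best path backwards from the best final label.
    label = max(range(n_labels), key=best.__getitem__)
    path = [label]
    for pointers in reversed(back):
        label = pointers[label]
        path.append(label)
    path.reverse()
    return path


# Labels: 0 = term absent, 1 = term present (illustrative values only).
unary = [[0.0, 2.0], None, [0.0, 2.0]]   # the middle stage has no image
pairwise = [[1.0, 0.0], [0.0, 1.0]]      # adjacent stages prefer equal labels
labels = decode_chain(unary, pairwise)
```

In this toy setting the middle stage has no evidence of its own, yet the decoded sequence marks the term as present there because both neighbours support it. This is the kind of behaviour the abstract attributes to joint rather than per-stage annotation.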