Helmholtz Gemeinschaft

Search
Browse
Statistics
Feeds

Interpreting deep neural networks for the prediction of translation rates

[thumbnail of Original Article]
Preview
PDF (Original Article) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
2MB
[thumbnail of Supplementary Information]
Preview
PDF (Supplementary Information) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
722kB

Item Type:Article
Title:Interpreting deep neural networks for the prediction of translation rates
Creators Name:Korbel, F., Eroshok, E. and Ohler, U.
Abstract:BACKGROUND: The 5’ untranslated region of mRNA strongly impacts the rate of translation initiation. A recent convolutional neural network (CNN) model accurately quantifies the relationship between massively parallel synthetic 5’ untranslated regions (5’UTRs) and translation levels. However, the underlying biological features, which drive model predictions, remain elusive. Uncovering sequence determinants predictive of translation output may allow us to develop a more detailed understanding of translation regulation at the 5’UTR. RESULTS: Applying model interpretation, we extract representations of regulatory logic from CNNs trained on synthetic and human 5’UTR reporter data. We reveal a complex interplay of regulatory sequence elements, such as initiation context and upstream open reading frames (uORFs) to influence model predictions. We show that models trained on synthetic data alone do not sufficiently explain translation regulation via the 5’UTR due to differences in the frequency of regulatory motifs compared to natural 5’UTRs. CONCLUSIONS: Our study demonstrates the significance of model interpretation in understanding model behavior, properties of experimental data and ultimately mRNA translation. By combining synthetic and human 5’UTR reporter data, we develop a model (OptMRL) which better captures the characteristics of human translation regulation. This approach provides a general strategy for building more successful sequence-based models of gene regulation, as it combines global sampling of random sequences with the subspace of naturally occurring sequences. Ultimately, this will enhance our understanding of 5’UTR sequences in disease and our ability to engineer translation output.
Keywords:Translation Regulation, 5’ Untranslated Region, Massively Parallel Reporter Assay, Deep Neural Networks, Explainable Artificial Intelligence
Source:BMC Genomics
ISSN:1471-2164
Publisher:BioMed Central
Volume:25
Number:1
Page Range:1061
Date:9 November 2024
Official Publication:https://doi.org/10.1186/s12864-024-10925-8

Repository Staff Only: item control page

Downloads

Downloads per month over past year

Open Access
MDC Library