Preview |
PDF (Original Article)
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
2MB |
Preview |
PDF (Supplementary Information)
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
722kB |
Item Type: | Article |
---|---|
Title: | Interpreting deep neural networks for the prediction of translation rates |
Creators Name: | Korbel, F., Eroshok, E. and Ohler, U. |
Abstract: | BACKGROUND: The 5’ untranslated region of mRNA strongly impacts the rate of translation initiation. A recent convolutional neural network (CNN) model accurately quantifies the relationship between massively parallel synthetic 5’ untranslated regions (5’UTRs) and translation levels. However, the underlying biological features, which drive model predictions, remain elusive. Uncovering sequence determinants predictive of translation output may allow us to develop a more detailed understanding of translation regulation at the 5’UTR. RESULTS: Applying model interpretation, we extract representations of regulatory logic from CNNs trained on synthetic and human 5’UTR reporter data. We reveal a complex interplay of regulatory sequence elements, such as initiation context and upstream open reading frames (uORFs) to influence model predictions. We show that models trained on synthetic data alone do not sufficiently explain translation regulation via the 5’UTR due to differences in the frequency of regulatory motifs compared to natural 5’UTRs. CONCLUSIONS: Our study demonstrates the significance of model interpretation in understanding model behavior, properties of experimental data and ultimately mRNA translation. By combining synthetic and human 5’UTR reporter data, we develop a model (OptMRL) which better captures the characteristics of human translation regulation. This approach provides a general strategy for building more successful sequence-based models of gene regulation, as it combines global sampling of random sequences with the subspace of naturally occurring sequences. Ultimately, this will enhance our understanding of 5’UTR sequences in disease and our ability to engineer translation output. |
Keywords: | Translation Regulation, 5’ Untranslated Region, Massively Parallel Reporter Assay, Deep Neural Networks, Explainable Artificial Intelligence |
Source: | BMC Genomics |
ISSN: | 1471-2164 |
Publisher: | BioMed Central |
Volume: | 25 |
Number: | 1 |
Page Range: | 1061 |
Date: | 9 November 2024 |
Official Publication: | https://doi.org/10.1186/s12864-024-10925-8 |
Repository Staff Only: item control page