Helmholtz Gemeinschaft

Search
Browse
Statistics
Feeds

SetQuence & SetOmic: deep set transformers for whole genome and exome tumour analysis

[thumbnail of Original Article]
Preview
PDF (Original Article) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
3MB

Item Type:Article
Title:SetQuence & SetOmic: deep set transformers for whole genome and exome tumour analysis
Creators Name:Jurenaite, N., León-Periñán, D., Donath, V., Torge, S. and Jäkel, R.
Abstract:In oncology, Deep Learning has shown great potential to personalise tasks such as tumour type classification, based on per-patient omics data-sets. Being high dimensional, incorporation of such data in one model is a challenge, often leading to one-dimensional studies and, therefore, information loss. Instead, we first propose relying on non-fixed sets of whole genome or whole exome variant-associated sequences, which can be used for supervised learning of oncology-relevant tasks by our Set Transformer based Deep Neural Network, SetQuence. We optimise this architecture to improve its efficiency. This allows for exploration of not just coding but also non-coding variants, from large datasets. Second, we extend the model to incorporate these representations together with multiple other sources of omics data in a flexible way with SetOmic. Evaluation, using these representations, shows improved robustness and reduced information loss compared to previous approaches, while still being computationally tractable. By means of Explainable Artificial Intelligence methods, our models are able to recapitulate the biological contribution of highly attributed features in the tumours studied. This validation opens the door to novel directions in multi-faceted genome and exome wide biomarker discovery and personalised treatment among other presently clinically relevant tasks.
Keywords:Multi-Omics, Language Model, Deep Neural Network, Set Representations, Whole-Genome Motifs
Source:Biosystems
ISSN:0303-2647
Publisher:Elsevier
Volume:235
Page Range:105095
Date:January 2024
Official Publication:https://doi.org/10.1016/j.biosystems.2023.105095
PubMed:View item in PubMed

Repository Staff Only: item control page

Downloads

Downloads per month over past year

Open Access
MDC Library