Helmholtz Gemeinschaft

Search
Browse
Statistics
Feeds

Flexynesis: A deep learning framework for bulk multi-omics data integration for precision oncology and beyond

[thumbnail of Preprint]
Preview
PDF (Preprint) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
2MB
[thumbnail of Supplementary Material] Other (Supplementary Material)
80kB
Item Type:Preprint
Title:Flexynesis: A deep learning framework for bulk multi-omics data integration for precision oncology and beyond
Creators Name:Uyar, B., Savchyn, T., Wurmus, R., Sarigun, A., Shaik, M.M., Franke, V. and Akalin, A.
Abstract:Accurate decision making in precision oncology depends on integration of multimodal molecular information, such as the genetic data, gene expression, protein abundance, and epigenetic measurements. Deep learning methods facilitate integration of heterogeneous datasets. However, almost all published deep learning-based bulk multi-omics integration methods have constrained usability. They suffer from lack of transparency, modularity, deployability, and are applicable exclusively to narrow tasks. To address these limitations, we introduce Flexynesis, a versatile tool designed with usability, and adaptability in mind. Flexynesis streamlines data processing, enforces structured data splitting, and ensures rigorous model evaluation. It offers unsupervised feature selection, different omics layer fusion options, and hyperparameter tuning. Users can choose from distinct architectures – fully connected networks, variational autoencoders, multi-triplet networks, graph neural networks, and cross-modality encoding networks. Each model is complemented with a straightforward input interface and standardized training, evaluation, and feature importance quantification methods, enabling easy incorporation into data integration pipelines. For improved user experience, Flexynesis supports features such as on-the-fly task determination and compatibility with regression, classification, and survival modeling. It accommodates multi-task prediction of a mixture of numerical/categorical outcome variables with a tolerance for missing labels. We also developed an extensive benchmarking pipeline, showcasing the tool’s capability across diverse real-life datasets. This toolset should make deep-learning based bulk multi-omics data integration in the context of clinical/pre-clinical data analysis and marker discovery more accessible to a wider audience with or without experience in deep-learning development. Flexynesis is available at https://github.com/BIMSBbioinfo/flexynesis and can be installed from https://pypi.org/project/flexynesis/.
Source:bioRxiv
Publisher:Cold Spring Harbor Laboratory Press
Article Number:2024.07.16.603606
Date:18 July 2024
Official Publication:https://doi.org/10.1101/2024.07.16.603606

Repository Staff Only: item control page

Downloads

Downloads per month over past year

Open Access
MDC Library