Helmholtz Gemeinschaft

Search
Browse
Statistics
Feeds

Enhancing biomarker-based oncology trial matching using large language models

[thumbnail of Preprint]
Preview
PDF (Preprint) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB
Item Type:Preprint
Title:Enhancing biomarker-based oncology trial matching using large language models
Creators Name:Al Khoury, N., Shaik, M., Wurmus, R. and Akalin, A.
Abstract:Clinical trials are an essential component of drug development for new cancer treatments, yet the information required to determine a patient’s eligibility for enrollment is scattered in large amounts of unstructured text. Genomic biomarkers are especially important in precision medicine and targeted therapies, making them essential for matching patients to appropriate trials. Large language models (LLMs) offer a promising solution for extracting this information from clinical trial data, aiding both physicians and patients in identifying suitable matches. In this study, we explore various LLM strategies for extracting genetic biomarkers from oncology trials to improve patient enrollment rates. Our results show that open-source language models, when applied out-of-the-box, effectively capture complex logical expressions and structure genomic biomarkers in disjunctive normal form, outperforming closed-source models such as GPT-4 and GPT-3.5-Turbo. Furthermore, fine-tuning these open-source models with additional data significantly enhances their performance.
Source:bioRxiv
Publisher:Cold Spring Harbor Laboratory Press
Article Number:2024.09.13.612922
Date:19 September 2024
Official Publication:https://doi.org/10.1101/2024.09.13.612922

Repository Staff Only: item control page

Downloads

Downloads per month over past year

Open Access
MDC Library