Metrics matter: why we need to stop using silhouette in single-cell benchmarking

Item Type: Preprint
Title: Metrics matter: why we need to stop using silhouette in single-cell benchmarking
Creators Name: Rautenstrauch, P. and Ohler, U.
Abstract: Current single-cell studies comprise complex data sets affected by nested batch effects arising from technical and biological factors, and they rely on advanced integration methods. Silhouette is an established metric for assessing clustering results that compares within-cluster cohesion to between-cluster separation, and adaptations of it have emerged as the dominant choice for evaluating the success of these integration methods. However, silhouette's assumptions are often violated in single-cell data integration scenarios. We demonstrate that silhouette-based metrics can reliably assess neither batch effect removal nor biological signal conservation and are thus inherently unsuitable for data with (nested) batch effects. We propose alternative, robust evaluation strategies that enable accurate assessment of integration methods, and we call for an update of benchmarking practices.
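
For readers unfamiliar with the metric, the following is a minimal sketch of the classic silhouette width that the abstract describes as comparing within-cluster cohesion to between-cluster separation. It assumes Euclidean distance and the usual textbook definition: for each point, a is the mean distance to the other members of its own cluster, b is the lowest mean distance to the members of any other cluster, and the width is (b - a) / max(a, b). The function name `silhouette_widths` and the toy coordinates are hypothetical and serve only as illustration. This is not the batch-adapted variants the preprint evaluates, nor the authors' code.

```python
import math


def silhouette_widths(points, labels):
    """Return the silhouette width s(i) = (b - a) / max(a, b) for every point.

    a: mean distance from point i to the other members of its own cluster
    b: smallest mean distance from point i to the members of another cluster
    """
    # Group point indices by cluster label.
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    widths = []
    for i, label in enumerate(labels):
        own = [j for j in clusters[label] if j != i]
        if not own:
            # Common convention: a point alone in its cluster gets width 0.
            widths.append(0.0)
            continue
        a = sum(math.dist(points[i], points[j]) for j in own) / len(own)
        b = min(
            sum(math.dist(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        # Guard against all-identical points, where a == b == 0.
        widths.append((b - a) / denom if denom > 0 else 0.0)
    return widths


# Hypothetical toy data: two tight, well-separated groups of two points each.
toy_points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
good_labels = ["A", "A", "B", "B"]   # labels follow the geometry
mixed_labels = ["A", "B", "A", "B"]  # labels cut across the geometry

good = silhouette_widths(toy_points, good_labels)
mixed = silhouette_widths(toy_points, mixed_labels)
print("mean width, labels follow geometry:", sum(good) / len(good))
print("mean width, labels cut across geometry:", sum(mixed) / len(mixed))
```

Widths close to 1 mean a point sits much nearer to its own cluster than to any other, and negative widths mean the reverse. Silhouette-based integration metrics apply this same quantity with cell type or batch annotations as the labels, so they inherit the metric's assumptions about cluster geometry.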
Source: bioRxiv
Publisher: Cold Spring Harbor Laboratory Press
Article Number: 2025.01.21.634098
Date: 24 January 2025
Official Publication: https://doi.org/10.1101/2025.01.21.634098
