Construction of whole genomes from scaffolds using single cell strand-seq data

Original Article (PDF, 2MB)
Supplementary Material (Other, 29MB)

Item Type:Article
Title:Construction of whole genomes from scaffolds using single cell strand-seq data
Creators Name:Hills, M., Falconer, E., O'Neill, K., Sanders, A.D., Howe, K., Guryev, V. and Lansdorp, P.M.
Abstract:Accurate reference genome sequences provide the foundation for modern molecular biology and genomics, as the interpretation of sequence data to study evolution, gene expression, and epigenetics depends heavily on the quality of the genome assembly used for its alignment. Correctly organising sequenced fragments such as contigs and scaffolds in relation to each other is a critical and often challenging step in the construction of robust genome references. We previously identified misoriented regions in the mouse and human reference assemblies using Strand-seq, a single cell sequencing technique that preserves DNA directionality. Here we demonstrate the ability of Strand-seq to build and correct full-length chromosomes by identifying which scaffolds belong to the same chromosome and determining their correct order and orientation, without the need for overlapping sequences. We demonstrate that Strand-seq exquisitely maps assembly fragments into large related groups and chromosome-sized clusters without using new assembly data. Using template strand inheritance as a bi-allelic marker, we employ genetic mapping principles to cluster scaffolds that are derived from the same chromosome and order them within the chromosome based solely on directionality of DNA strand inheritance. We prove the utility of our approach by generating improved genome assemblies for several model organisms including the ferret, pig, Xenopus, zebrafish, Tasmanian devil and the guinea pig.
Keywords:Genome Assembly, Strand-Seq, Genome Scaffolds, Contig Assembly, Reference Genomes, Animals
Source:International Journal of Molecular Sciences
ISSN:1422-0067
Publisher:MDPI
Volume:22
Number:7
Page Range:3617
Date:1 April 2021
Official Publication:https://doi.org/10.3390/ijms22073617
PubMed:View item in PubMed
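
The clustering idea in the abstract (scaffolds from the same chromosome inherit the same template strands in each cell) can be illustrated with a small Python sketch. This is not the authors' implementation. The encoding (+1 for a Watson-only cell, -1 for a Crick-only cell, 0 for mixed or uninformative), the function names, the 0.8 threshold and the toy data are all assumptions made for illustration. Ordering scaffolds within a chromosome is not covered here.

```python
from itertools import combinations

# Assumed encoding per scaffold, one entry per single cell:
# +1 = Watson-only reads, -1 = Crick-only reads, 0 = mixed/uninformative.

def similarity(a, b):
    """Signed agreement over cells where both scaffolds are informative."""
    pairs = [(x, y) for x, y in zip(a, b) if x != 0 and y != 0]
    if not pairs:
        return 0.0
    return sum(x * y for x, y in pairs) / len(pairs)

def cluster_scaffolds(states, threshold=0.8):
    """Group scaffolds whose strand states agree (or are exactly inverted)
    across cells. Uses a simple union-find; threshold is a hypothetical cutoff."""
    names = list(states)
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path compression
            n = parent[n]
        return n

    for a, b in combinations(names, 2):
        # The absolute value lets a misoriented scaffold join its chromosome.
        if abs(similarity(states[a], states[b])) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values())

def relative_orientation(a, b):
    """'inverted' when the two scaffolds show opposite strands in most cells."""
    return "same" if similarity(a, b) >= 0 else "inverted"

# Toy data for six cells (invented for illustration only).
toy_states = {
    "s1": [1, -1, 1, 0, -1, 1],
    "s2": [1, -1, 1, 0, -1, 1],
    "s3": [-1, 1, -1, 0, 1, -1],   # same pattern as s1, but flipped
    "s4": [1, 1, -1, -1, 0, -1],
    "s5": [1, 1, -1, -1, 0, -1],
}
clusters = cluster_scaffolds(toy_states)
```

With the toy data, `s1`, `s2` and `s3` fall into one group and `s4` and `s5` into another, and `relative_orientation(toy_states["s1"], toy_states["s3"])` reports the flipped scaffold as inverted relative to `s1`.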

Open Access
MDC Library