Helmholtz Gemeinschaft

HGVbase: a human sequence variation database emphasizing data quality and a broad spectrum of data sources

Item Type: Article
Title: HGVbase: a human sequence variation database emphasizing data quality and a broad spectrum of data sources
Creators Name: Fredman, D., Siegfried, M., Yuan, Y.P., Bork, P., Lehvaslaiho, H. and Brookes, A.J.
Abstract: HGVbase (Human Genome Variation database; http://hgvbase.cgb.ki.se, formerly known as HGBASE) is an academic effort to provide a high quality and non-redundant database of available genomic variation data of all types, mostly comprising single nucleotide polymorphisms (SNPs). Records include neutral polymorphisms as well as disease-related mutations. Online search tools facilitate data interrogation by sequence similarity and keyword queries, and searching by genome coordinates is now being implemented. Downloads are freely available in XML, Fasta, SRS, SQL and tagged-text file formats. Each entry is presented in the context of its surrounding sequence and many records are related to neighboring human genes and affected features therein. Population allele frequencies are included wherever available. Thorough semi-automated data checking ensures internal consistency and addresses common errors in the source information. To keep pace with recent growth in the field, we have developed tools for fully automated annotation. All variants have been uniquely mapped to the draft genome sequence and are referenced to positions in EMBL/GenBank files. Data utility is enhanced by provision of genotyping assays and functional predictions. Recent data structure extensions allow the capture of haplotype and genotype information, and a new initiative (along with BiSC and HUGO-MDI) aims to create a central repository for the broad collection of clinical mutations and associated disease phenotypes of interest.
Keywords: Base Sequence, Chromosome Mapping, Database Management Systems, Nucleic Acid Databases, Gene Frequency, Inborn Genetic Diseases, Human Genome, Information Storage and Retrieval, Internet, Single Nucleotide Polymorphism, Quality Control, Systems Integration, Variation
Source: Nucleic Acids Research
ISSN: 0305-1048
Publisher: Oxford University Press
Volume: 30
Number: 1
Page Range: 387-391
Date: 1 January 2002
Official Publication: http://nar.oxfordjournals.org/cgi/content/abstract/30/1/387
