Genome Res. 15:1594-1600, 2005
©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00

Inference and analysis of haplotypes from combined genotyping studies deposited in dbSNP

Noah A. Zaitlen,1 Hyun Min Kang,2 Michael L. Feolo,3 Stephen T. Sherry,3 Eran Halperin,4 and Eleazar Eskin1,2,5

1 Bioinformatics Program, University of California, San Diego, La Jolla, California 92093, USA
2 Department of Computer Science, University of California, San Diego, La Jolla, California 92093, USA
3 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
4 International Computer Science Institute, Berkeley, California 94704, USA

In the effort to understand human variation and the genetic basis of complex disease, a tremendous number of single nucleotide polymorphisms (SNPs) have been discovered and deposited into NCBI's dbSNP public database. More than 2.7 million SNPs in the database have genotype information. These data provide an invaluable resource for understanding the structure of human variation and for designing genetic association studies. The genotypes deposited in dbSNP are unphased, so the haplotype information is unknown. We applied the phasing method HAP to obtain the haplotype information, block partitions, and tag SNPs for all publicly available genotype data and deposited this information into the dbSNP database. We also deposited the orthologous chimpanzee reference sequence for each predicted haplotype block, computed using the UCSC BLASTZ alignments of human and chimpanzee. Using dbSNP, researchers can now easily perform analyses using multiple genotype data sets from the same genomic regions. We combined dense and sparse genotype data sets from the same region and found that the number of common haplotypes is significantly underestimated in whole-genome data sets, while the predicted haplotypes over the common SNPs are consistent between studies. To validate the accuracy of the predictions, we benchmarked HAP's running time and phasing accuracy against PHASE. Although HAP is slightly less accurate than PHASE, it is over 1000 times faster, making it suitable for application to the entire set of genotypes in dbSNP.
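To illustrate why unphased genotypes leave the haplotypes unknown, the following minimal Python sketch enumerates every haplotype pair consistent with a single genotype. It is not the HAP or PHASE algorithm; the 0/1/2 genotype coding (copies of the minor allele per SNP) and the function name `compatible_haplotype_pairs` are assumptions made for illustration only.

```python
from itertools import product


def compatible_haplotype_pairs(genotype):
    """Return all unordered haplotype pairs consistent with an unphased genotype.

    genotype: sequence of 0, 1, or 2, the number of minor-allele copies at
    each SNP (an assumed coding, chosen for this illustration).
    Each haplotype is returned as a string of '0' (major) and '1' (minor).
    """
    per_site = []
    for g in genotype:
        if g == 0:
            per_site.append([(0, 0)])          # homozygous major: no ambiguity
        elif g == 2:
            per_site.append([(1, 1)])          # homozygous minor: no ambiguity
        elif g == 1:
            per_site.append([(0, 1), (1, 0)])  # heterozygous: phase is unknown
        else:
            raise ValueError(f"genotype values must be 0, 1, or 2, got {g!r}")

    pairs = set()
    for combo in product(*per_site):
        h1 = "".join(str(a) for a, _ in combo)
        h2 = "".join(str(b) for _, b in combo)
        # The two chromosomes are unordered, so store each pair in sorted order.
        pairs.add(tuple(sorted((h1, h2))))
    return sorted(pairs)


if __name__ == "__main__":
    # Two heterozygous SNPs followed by one homozygous SNP.
    for pair in compatible_haplotype_pairs([1, 1, 0]):
        print(pair)
```

A genotype with k heterozygous SNPs (k ≥ 1) is consistent with 2^(k-1) distinct haplotype pairs, so the ambiguity grows exponentially with the number of heterozygous sites. Phasing methods resolve it by using information shared across individuals in the sample.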


[The sequence data from this study have been submitted to dbSNP under accession nos. phs3.1, vs:3:4136.1–vs:3:835194.1, sh:3:142355.1–sh:3:5247813.1]

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.4297805. Freely available online through the Genome Research Immediate Open Access option.

5 Corresponding author.
E-mail eeskin{at}cs.ucsd.edu; fax (858) 534-7029.




This article has been cited by other articles:


F. Rao, L. Zhang, J. Wessel, K. Zhang, G. Wen, B. P. Kennedy, B. K. Rana, M. Das, J. L. Rodriguez-Flores, D. W. Smith, et al.
Tyrosine Hydroxylase, the Rate-Limiting Enzyme in Catecholamine Biosynthesis: Discovery of Common Human Genetic Variants Governing Transcription, Autonomic Activity, and Blood Pressure In Vivo
Circulation, August 28, 2007; 116(9): 993 - 1006.




Copyright © 2005 by Cold Spring Harbor Laboratory Press.