Genome Research songbird

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Genome Res. 15:1496-1502, 2005
©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Clark, A. G.
Right arrow Articles by Nielsen, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Clark, A. G.
Right arrow Articles by Nielsen, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Letter

Ascertainment bias in studies of human genome-wide polymorphism

Andrew G. Clark1,2,4, Melissa J. Hubisz2, Carlos D. Bustamante2, Scott H. Williamson2 and Rasmus Nielsen3

1 Molecular Biology and Genetics and Computational Biology, Cornell University, Ithaca, New York 14853, USA 2 Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA 3 Center for Bioinformatics, University of Copenhagen, Copenhagen, 2100 KbhØ, Denmark

Large-scale SNP genotyping studies rely on an initial assessment of nucleotide variation to identify sites in the DNA sequence that harbor variation among individuals. This "SNP discovery" sample may be quite variable in size and composition, and it has been well established that properties of the SNPs that are found are influenced by the discovery sampling effort. The International HapMap project relied on nearly any piece of information available to identify SNPs—including BAC end sequences, shotgun reads, and differences between public and private sequences—and even made use of chimpanzee data to confirm human sequence differences. In addition, the ascertainment criteria shifted from using only SNPs that had been validated in population samples, to double-hit SNPs, to finally accepting SNPs that were singletons in small discovery samples. In contrast, Perlegen's primary discovery was a resequencing-by-hybridization effort using the 24 people of diverse origin in the Polymorphism Discovery Resource. Here we take these two data sets and contrast two basic summary statistics, heterozygosity and FST, as well as the site frequency spectra, for 500-kb windows spanning the genome. The magnitude of disparity between these samples in these measures of variability indicates that population genetic analysis on the raw genotype data is ill advised. Given the knowledge of the discovery samples, we perform an ascertainment correction and show how the post-correction data are more consistent across these studies. However, discrepancies persist, suggesting that the heterogeneity in the SNP discovery process of the HapMap project resulted in a data set resistant to complete ascertainment correction. Ascertainment bias will likely erode the power of tests of association between SNPs and complex disorders, but the effect will likely be small, and perhaps more importantly, it is unlikely that the bias will introduce false-positive inferences.


[Supplemental material is available online at www.genome.org.]

Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.4107905. Freely available online through the Genome Research Immediate Open Access option.

4 Corresponding author.
E-mail ac347{at}cornell.edu; fax (607) 255-6249.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Zayed and C. W. Whitfield
A genome-wide signature of positive selection in ancient and recent invasive expansions of the honey bee Apis mellifera
PNAS, March 4, 2008; 105(9): 3421 - 3426.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. L. Kelley and W. J. Swanson
Dietary Change and Adaptive Evolution of enamelin in Humans and Among Primates
Genetics, March 1, 2008; 178(3): 1595 - 1603.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C. Li and M. Li
GWAsimulator: a rapid whole-genome simulation program
Bioinformatics, January 1, 2008; 24(1): 140 - 142.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. Lencz, C. Lambert, P. DeRosse, K. E. Burdick, T. V. Morgan, J. M. Kane, R. Kucherlapati, and A. K. Malhotra
Runs of homozygosity reveal highly penetrant recessive loci in schizophrenia
PNAS, December 11, 2007; 104(50): 19942 - 19947.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
B. E. Madsen, P. Villesen, and C. Wiuf
A periodic pattern of SNPs in the human genome
Genome Res., October 1, 2007; 17(10): 1414 - 1419.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
F. A. Wright, H. Huang, X. Guan, K. Gamiel, C. Jeffries, W. T. Barry, F. Pardo-Manuel de Villena, P. F. Sullivan, K. C. Wilhelmsen, and F. Zou
Simulating association studies: a data-based resampling method for candidate regions or whole genome scans
Bioinformatics, October 1, 2007; 23(19): 2581 - 2588.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
H. Chen, R. E. Green, S. Paabo, and M. Slatkin
The Joint Allele-Frequency Spectrum in Closely Related Species
Genetics, September 1, 2007; 177(1): 387 - 398.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
E. B. Rosenblum and J. Novembre
Ascertainment Bias in Spatially Structured Populations: A Case Study in the Eastern Fence Lizard
J. Hered., July 4, 2007; (2007) esm031v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Gojobori, H. Tang, J. M. Akey, and C.-I Wu
Adaptive evolution in humans revealed by the negative correlation between the polymorphism and fixation phases of evolution
PNAS, March 6, 2007; 104(10): 3907 - 3912.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
K. R. Thornton and J. D. Jensen
Controlling the False-Positive Rate in Multilocus Genome Scans for Selection
Genetics, February 1, 2007; 175(2): 737 - 750.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
B. McEvoy, S. Beleza, and M. D. Shriver
The genetic architecture of normal variation in human pigmentation: an evolutionary perspective and model
Hum. Mol. Genet., October 15, 2006; 15(suppl_2): R176 - R181.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
P. Boursot and K. Belkhir
Mouse SNPs for evolutionary biology: Beware of ascertainment biases
Genome Res., October 1, 2006; 16(10): 1191 - 1192.
[Full Text] [PDF]


Home page
Brief BioinformHome page
M. R. Barnes
Navigating the HapMap
Brief Bioinform, September 1, 2006; 7(3): 211 - 224.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. L. Kelley, J. Madeoy, J. C. Calhoun, W. Swanson, and J. M. Akey
Genomic signatures of positive selection in humans and the limits of outlier approaches
Genome Res., August 1, 2006; 16(8): 980 - 989.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
D. Meyer, R. M. Single, S. J. Mack, H. A. Erlich, and G. Thomson
Signatures of Demographic History and Natural Selection in the Human Major Histocompatibility Complex Loci
Genetics, August 1, 2006; 173(4): 2121 - 2142.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
P. C. Sabeti, S. F. Schaffner, B. Fry, J. Lohmueller, P. Varilly, O. Shamovsky, A. Palma, T. S. Mikkelsen, D. Altshuler, and E. S. Lander
Positive natural selection in the human lineage.
Science, June 16, 2006; 312(5780): 1614 - 1620.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2005 by Cold Spring Harbor Laboratory Press.