Genome Res. 15:1566-1575, 2005
©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00
Methods
Genomic scans for selective sweeps using SNP data
Rasmus Nielsen1,3,5,
Scott Williamson1,
Yuseob Kim4,
Melissa J. Hubisz1,
Andrew G. Clark2 and
Carlos Bustamante1
1 Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
2 Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
3 Center for Bioinformatics and Department of Biology, University of Copenhagen, Copenhagen, Denmark
4 Department of Biology, University of Rochester, Rochester, New York 14627, USA
Detecting selective sweeps from genomic SNP data is complicated by the intricate ascertainment schemes used to discover SNPs, and by the confounding influence of the underlying complex demographics and varying mutation and recombination rates. Current methods for detecting selective sweeps have little or no robustness to the demographic assumptions and varying recombination rates, and provide no method for correcting for ascertainment biases. Here, we present several new tests aimed at detecting selective sweeps from genomic SNP data. Using extensive simulations, we show that a new parametric test, based on composite likelihood, has a high power to detect selective sweeps and is surprisingly robust to assumptions regarding recombination rates and demography (i.e., has low Type I error). Our new test also provides estimates of the location of the selective sweep(s) and the magnitude of the selection coefficient. To illustrate the method, we apply our approach to data from the Seattle SNP project and to Chromosome 2 data from the HapMap project. In Chromosome 2, the most extreme signal is found in the lactase gene, which previously has been shown to be undergoing positive selection. Evidence for selective sweeps is also found in many other regions, including genes known to be associated with disease risk such as DPP10 and COL4A3.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.4252305. Freely available online through the Genome Research Immediate Open Access option.
[The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: J.C. Mullikin.]
5 Corresponding author. E-mail rasmus{at}binf.ku.dk; fax +45 35321300.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
S. Beisswanger and W. Stephan
Evidence that strong positive selection drives neofunctionalization in the tandemly duplicated polyhomeotic genes in Drosophila
PNAS,
April 8, 2008;
105(14):
5447 - 5452.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. J. Cai
PGEToolbox: A Matlab Toolbox for Population Genetics and Evolution
J. Hered.,
February 29, 2008;
(2008)
esm127v1.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. D. Jensen, K. R. Thornton, and C. F. Aquadro
Inferring Selection in Partially Sequenced Regions
Mol. Biol. Evol.,
February 1, 2008;
25(2):
438 - 446.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. J. Hoggart, M. Chadeau-Hyam, T. G. Clark, R. Lampariello, J. C. Whittaker, M. De Iorio, and D. J. Balding
Sequence-Level Population Simulations Over Large Genomic Regions
Genetics,
November 1, 2007;
177(3):
1725 - 1731.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. D. Hernandez, S. H. Williamson, L. Zhu, and C. D. Bustamante
Context-Dependent Mutation Rates May Cause Spurious Signatures of a Fixation Bias Favoring Higher GC-Content in Humans
Mol. Biol. Evol.,
October 1, 2007;
24(10):
2196 - 2202.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. D. Jensen, K. R. Thornton, C. D. Bustamante, and C. F. Aquadro
On the Utility of Linkage Disequilibrium as a Statistic for Identifying Targets of Positive Selection in Nonequilibrium Populations
Genetics,
August 1, 2007;
176(4):
2371 - 2379.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Wagner
Rapid Detection of Positive Selection in Genes and Genomes Through Variation Clusters
Genetics,
August 1, 2007;
176(4):
2451 - 2463.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. D. Hernandez, S. H. Williamson, and C. D. Bustamante
Context Dependence, Ancestral Misidentification, and Spurious Signatures of Natural Selection
Mol. Biol. Evol.,
August 1, 2007;
24(8):
1792 - 1800.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Zeng, S. Shi, and C.-I Wu
Compound Tests for the Detection of Hitchhiking Under Positive Selection
Mol. Biol. Evol.,
August 1, 2007;
24(8):
1898 - 1908.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. McVean
The Structure of Linkage Disequilibrium Around a Selective Sweep
Genetics,
March 1, 2007;
175(3):
1395 - 1406.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. A. Shapiro, W. Huang, C. Zhang, M. J. Hubisz, J. Lu, D. A. Turissini, S. Fang, H.-Y. Wang, R. R. Hudson, R. Nielsen, et al.
Adaptive genic evolution in the Drosophila genomes
PNAS,
February 13, 2007;
104(7):
2271 - 2276.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. R. Thornton and J. D. Jensen
Controlling the False-Positive Rate in Multilocus Genome Scans for Selection
Genetics,
February 1, 2007;
175(2):
737 - 750.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Park, S. Hwang, Y. S. Lee, S.-C. Kim, and D. Lee
SNP@Ethnos: a database of ethnically variant single-nucleotide polymorphisms
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D711 - D715.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. C. Barnes, A. V. Grant, N. N. Hansel, P. Gao, and G. M. Dunston
African Americans with Asthma: Genetic Insights
Proceedings of the ATS,
January 1, 2007;
4(1):
58 - 68.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Bachtrog and P. Andolfatto
Selection, Recombination and Demographic History in Drosophila miranda
Genetics,
December 1, 2006;
174(4):
2045 - 2059.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. L.F. Johnson and M. Slatkin
Inference of population genetic parameters in metagenomics: A clean look at messy data
Genome Res.,
October 1, 2006;
16(10):
1320 - 1327.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. M. Marshall and R. E. Weiss
A Bayesian Heterogeneous Analysis of Variance Approach to Inferring Recent Selective Sweeps
Genetics,
August 1, 2006;
173(4):
2357 - 2370.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. M. Teshima, G. Coop, and M. Przeworski
How reliable are empirical genomic scans for selective sweeps?
Genome Res.,
June 1, 2006;
16(6):
702 - 712.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|