|
|
|
|
Published online before print
November 22, 2006, 10.1101/gr.5629106 Genome Res. 16:1575-1584, 2006 ©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
Methods Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays1 Research Center for Advanced Science and Technology, The University of Tokyo, Meguro, Tokyo 153-8904, Japan; 2 Department of Advanced Interdisciplinary Studies, Graduate School of Engineering, The University of Tokyo, Bunkyo-ku, Tokyo 113-8656, Japan; 3 Affymetrix, Inc., Santa Clara, California 95051, USA; 4 The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, United Kingdom; 5 Department of Pathology, Brigham and Womens Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA; 6 The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, Toronto, Ontario, M5G 1L7, Canada; 7 Japan Science and Technology Agency, Kawaguchi, Saitama, 332-0012, Japan
Recent reports indicate that copy number variations (CNVs) within the human genome contribute to nucleotide diversity to a larger extent than single nucleotide polymorphisms (SNPs). In addition, the contribution of CNVs to human disease susceptibility may be greater than previously expected, although a complete understanding of the phenotypic consequences of CNVs is incomplete. We have recently reported a comprehensive view of CNVs among 270 HapMap samples using high-density SNP genotyping arrays and BAC array CGH. In this report, we describe a novel algorithm using Affymetrix GeneChip Human Mapping 500K Early Access (500K EA) arrays that identified 1203 CNVs ranging in size from 960 bp to 3.4 Mb. The algorithm consists of three steps: (1) Intensity pre-processing to improve the resolution between pairwise comparisons by directly estimating the allele-specific affinity as well as to reduce signal noise by incorporating probe and target sequence characteristics via an improved version of the Genomic Imbalance Map (GIM) algorithm; (2) CNV extraction using an adapted SW-ARRAY procedure to automatically and robustly detect candidate CNV regions; and (3) copy number inference in which all pairwise comparisons are summarized to more precisely define CNV boundaries and accurately estimate CNV copy number. Independent testing of a subset of CNVs by quantitative PCR and mass spectrometry demonstrated a >90% verification rate. The use of high-resolution oligonucleotide arrays relative to other methods may allow more precise boundary information to be extracted, thereby enabling a more accurate analysis of the relationship between CNVs and other genomic features.
In the last several years following completion of the human genome sequence (International Human Genome Sequencing Consortium 2004
High-density DNA oligonucleotide arrays allow unsurpassed levels of genetic information to be acquired in single experiments (Fodor et al. 1991
We have recently used two complementary experimental approaches, namely, BAC-based array CGH and high-density SNP genotyping arrays, to produce a first-generation global CNV map of the human genome, based on analyses of the HapMap population (Redon et al. 2006
The 500K EA arrays, a pre-commercial version of the GeneChip Human Mapping 500K Array Set, contain 534,500 SNPs on two genotyping arrays (see Methods for details of the assay). To minimize the impact of cross-hybridization, probes were removed whose central 21 bases perfectly matched additional locations in the genome, with the exception of segmental duplications, which are enriched for CNVs. Probes corresponding to NspI or StyI restriction fragments in which the enzyme recognition site contains a SNP were also removed. These steps trimmed the total probe content to 474,642 SNPs (88.8% of the original) with a relatively minor effect on genome coverage (Supplemental Fig. 1).
CNV calling algorithm The algorithm described here contains three major parts as depicted in Figure 1. Intensity pre-processing includes probe selection, noise reduction, normalization, and merging signal ratios from the NspI and StyI arrays. CNV detection begins with pairwise comparisons of probe intensities for all possible pairs of samples (i.e., 269 comparisons for each HapMap sample), which are then merged to extract candidate CNV regions for each sample. Homozygous deletions are detected separately using an alternative approach that relies on the discrimination ratio between alternate SNP alleles in lieu of SNP genotypes (see Supplemental Methods for details). The copy number inference step uses signal ratios and SNP information to more precisely define CNV boundaries and the copy number within each region. The final step uses a maximum clique algorithm to define the diploid samples for any given region based on the results from the large reference data. Through a comparison of the test sample to the diploid subset, precise boundaries and accurate copy number inferences can be drawn (Fig. 2). Critical aspects of the CNV calling algorithm are highlighted in detail below.
Intensity pre-processing When intensity signals are compared from two individuals with different SNP genotypes, the signal ratio may be artificially skewed because of differences in the allele-specific probe affinities. Although the original version of GIM does not consider such comparisons, the current algorithm directly estimates the affinity differences using signal ratios between probe A and B in the AB genotype group and corrects the comparison accordingly. This increases the average number of SNPs used in any pairwise comparison from 256,257 to 429,104, resulting in a 67.5% improvement in resolution. GIM has also been improved through the use of robust BIC (Qian and Kunsch 1996
Modification of SW-ARRAY for CNV detection
Four subsets of parameters were extensively studied including (1) intensity ratio threshold value, (2) significance cutoff, (3) constraints on number of SNPs and number of restriction fragments used to define a CNV, and (4) density optimization (Fig. 3). For the intensity ratio threshold, we first used samples with different numbers of X chromosomes, and observed an average intensity ratio of 1.3 between two versus three copies. The use of 1.3 as a stringent cutoff results in detection of 50% of the single-copy gains (based on the X chromosome data) with minimal false-positive signals. We then tested 31 threshold values evenly distributed between 1 and 1.3 using the three replicates of NA15510 and NA10851 and found the optimal threshold to be 1.12 (Fig. 3A). The significance value of 0.01 was derived using a similar approach (Fig. 3B). To increase the confidence of a CNV call, we introduced the new requirement that multiple probes on different fragments show consistent intensity change. Two combinations were examined, namely, four SNPs on three restriction fragments (4 SNPs:3 fragments) or three on two (3 SNPs:2 fragments). The former was chosen because the number and size of CNVs is similar using the two settings (Fig. 3C), while the reproducibility increased from 44% to 48% and the selfself false positives decreased 58% with the 4 SNPs:3 fragments setting. CNV extraction based on a given density cutoff (the fraction of times that the region is called as a CNV when compared with reference samples) is a new parameter added during the summarization step to allow confident CNV regions to be extracted from a given test sample and a large reference set. To optimize this parameter, CNVs were called from the same triplicate experiments with NA15510 and NA10851 as described above, but this time they were compared to all 270 HapMap samples as a reference set. Independently verified regions that include both CNVs and diploid regions were placed into different density bins (Fig. 3D). The 10% cutoff gave the optimal result with 7.7% false positives and 33.3% false negatives.
Copy number inference: Identification of diploid samples
CNV boundary determination
HapMap CNVs The CNV calling algorithm, with an optimized density cutoff of 10%, was applied to 270 HapMap samples, and 6469 sample-level CNVs in total were identified with an average of 24 CNVs per individual (Table 1; Redon et al. 2006
Mendelian inheritance CNVs that were detected from 60 trios from the CEU and YRI populations were analyzed for Mendelian inheritance, and 1229 regions in 60 offspring were identified as CNVs with an inferred copy number of at least three for gains or at most one for losses. The signal intensities were evaluated in the parents for these regions, and 1185 (96.4%) of the CNVs were clearly inherited or displayed a signal intensity profile in one of the parents that is just below the threshold cutoff (Fig. 5A). In addition, 3.6% (44) of CNVs do not show any signal indicative of a possible copy number alteration in one of the parents (Fig. 5B). The latter category may represent de novo CNVs, CNVs present as gains and losses in both parents of the trio (i.e., both parents have one chromosome with two copies, and one chromosome with zero copies), or cell line artifacts. Taken together, these data suggest that at least 96.4% of CNVs display Mendelian inheritance, confirming previous conclusions that CNVs are highly heritable (Locke et al. 2006
Experimental validation and false-positive estimation of HapMap CNV calls In order to estimate the percentage of HapMap CNV calls that are likely to be false positives, we used quantitative PCR (qPCR) and mass spectrometry for experimental validation and compared replicates of the same DNA sample (selfself comparisons). Experimental validation of CNVs called in three replicate experiments for DNA samples NA15510 and NA10851 (each compared to the HapMap reference set) indicated that the average percent of false positives was 2.5% (Table 2). Similarly, selfself comparisons of 10 HapMap samples, each done in triplicate, identified an average of 0.73 CNVs per experiment (Supplemental Table 3). In addition, when these 10 samples are compared to the HapMap reference set, 80% of the CNVs are called in all three replicates (see Redon et al. 2006
HapMap CNVs called in only one individual (singletons) represent a large percentage of the total CNVs found in the population (Redon et al. 2006
Homozygous deletions, which are called using a separate algorithm, were also tested and found to be correctly identified at least 89% of the time, with 11% of homozygous deletion regions giving rise to a PCR product using nonquantitative measurements (Table 2). The reproducibility of the homozygous deletion detection algorithm was assessed using the replicates of NA15510 and NA10851. In total, two homozygous deletion regions were identified and validated (Redon et al. 2006 As mentioned above, the precise delineation of CNV boundaries in each sample is challenging. Estimation of CNV ends can be complicated by chromosomal mosaicism in cell culture, where the CNV ends may exhibit differences from cell to cell, or, in the case of high-frequency CNVs, the edges may be sample-specific, making it difficult to define a single population consensus boundary. Another possibility is that the variability is simply a reflection of experimental noise. To test this idea, and to evaluate the accuracy of border estimations given by the algorithm, a representative sampling of CNVs was experimentally tested in the regions between the flanking SNP and the 10% borders, the 10% and the 90% borders, and within the 90% borders (Supplemental Table 4). In all six cases, the region between the highest confidence 90% borders was confirmed as a true region of copy number change. In four unique sequence CNVs (i.e., not in segmentally duplicated regions), all eight 10% borders were confirmed. For these same CNVs, the regions flanking the 10% borders were altered in four out of eight cases. In contrast, for CNV regions that contain segmental duplications, the border determinations were not as accurate, and in all four examples the 10% specific regions were not confirmed. This shows that borders of CNVs in unique sequence regions can be determined with high confidence, but less so for common CNVs, especially those associated with segmental duplications. Thus, the accuracy of border determination reflects the underlying genomic structure in regions of CNV.
We have developed a multistep algorithm that allows accurate CNV calls to be derived from the GeneChip Human Mapping 500K EA arrays. The method described here has been developed to reduce systematic noise and precisely extract significant intensity information. It is substantially different from the previously developed GIM algorithm in several aspects including (1) an intensity pre-processing step, (2) an allele-specific ratio adjustment, (3) the incorporation of new variables (restriction enzyme recognition site normalization, signal ratio adjustment based on G:C content of SNP-surrounding sequence) to remove systematic noise, and (4) the use of a robust regression with Bayesian Information Criterion (BIC) selection (Qian 1996
When used with DNA samples from the HapMap population, the approach described here led to the identification of 1203 CNV events spanning a broad size range from <1 kb to >3 Mb (Redon et al. 2006
Efforts directed at a global characterization of CNVs are an important first step toward understanding the role of CNVs in the biology of the cell. The CNVs identified using this algorithm, combined with the complementary data derived from the WGTP platform, provide the framework for the first comprehensive global map of human CNVs (Redon et al. 2006
Experiments 500K EA Arrays and the Whole-Genome Sampling Assay (WGSA) The 500K EA arrays contain 534,500 SNPs on two enzyme-specific arrays and are used in conjunction with whole-genome sampling analysis (WGSA). Each array interrogates SNPs residing on NspI or StyI PCR amplicons that range in size from 200 bp to 1000 bp. The method described here should be directly applicable to the commercially available Affymetrix 500K array. As an example, 97.5% of the CNVs identified in this study contain at least four SNPs (the requirement for calling a CNV) that are present on the commercial array. DNA from cell lines derived from the 270 HapMap individuals as well as NA15510 used for parameter tuning were purchased from NIGMS the Human Genetic Cell Repository, Coriell Institute for Medical Research (Camden, NJ). For preparation of the DNA prior to hybridization, we used the pre-commercial or early access version of WGSA, which is identical to the manufacturers commercial protocol (Affymetrix; http://www.affymetrix.com) with the following modifications. For PCR, 5 µL of diluted, adapter-ligated DNA and 3.5 µM primer were used in a total volume of 100 µL, and three reactions were prepared for each DNA sample per enzyme. Sixty micrograms of purified product were fragmented and end-labeled using 0.57 mM DLR (GeneChip DNA Labeling Reagent) and 105 U of TdT (Promega) for 2 h at 37°C. Hybridization onto the 250K Nsp and 250K Sty EA arrays and subsequent washing steps were done exactly as described by the manufacturer (Affymetrix).
For data quality assessment, genotype calls were generated using the DM (Dynamic Modeling) calling algorithm with a cutoff P-value of 0.17 (Di et al. 2005
Validation
Quantitative validation of CNVs using mass spectrometry
PCR validation of homozygous deletions
Summary of data analysis The parameters used for CNV calling required that four SNPs on three restriction fragments gave rise to a signal intensity ratio above 1.12 for insertions or 0.89 for deletions. CNVs were considered significant for P-values <0.01 using 5000 permutations of the data (see Results). For data integration, only CNVs called in at least 10% of the comparisons to the diploid samples were retained.
Data release
We thank Hiroko Meguro for technical assistance. This research was supported by the Core Research for Evolutional Science and Technology (CREST) from the Japan Science and Technology Agency (to H. Aburatani), Grants-in-Aid for Young Scientists, and Grant-in-Aid for Scientific Research on Priority Areas "Applied Genomics" (to S. Ishikawa) from the Ministry of Education, Culture, Sports, Science and Technology of Japan, and Grants-in-Aid from National Institute of Biomedical Innovation (to S. Ihara). S.W.S. is an Investigator of the CIHR and International Scholar of the Howard Hughes Medical Institute.
8 These authors contributed equally to this work. E-mail jing_huang{at}affymetrix.com; fax (408) 732-7025.
E-mail haburata-tky{at}umin.ac.jp; fax 81-3-5452-5355. [Supplemental material is available online at www.genome.org. The array data from this study have been submitted to GEO under accession nos. GSE5013 and GSE5173.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5629106
Bansal, A., van den Boom, D., Kammerer, S., Honisch, C., Adam, G., Cantor, C.R., Kleyn, P., and Braun, A. 2002. Association testing by DNA pooling: An effective initial screen. Proc. Natl. Acad. Sci. 99: 1687116874. Beroukhim, R., Lin, M., Park, Y., Hao, K., Zhao, X., Garraway, L.A., Fox, E.A., Hochberg, E.P., Mellinghoff, I.K., and Hofer, M.D., et al. 2006. Inferring loss-of-heterozygosity from unpaired tumors using high-density oligonucleotide SNP arrays. PLoS Comput. Biol. 2: e41.[CrossRef][Medline] Bignell, G.R., Huang, J., Greshock, J., Watt, S., Butler, A., West, S., Grigorova, M., Jones, K.W., Wei, W., and Stratton, M.R., et al. 2004. High-resolution analysis of DNA copy number using oligonucleotide microarrays. Genome Res. 14: 287295. Conrad, D.F., Andrews, T.D., Carter, N.P., Hurles, M.E., and Pritchard, J.K. 2006. A high-resolution survey of deletion polymorphism in the human genome. Nat. Genet. 38: 7581.[CrossRef][Medline] Di, X., Matsuzaki, H., Webster, T.A., Hubbell, E., Liu, G., Dong, S., Bartell, D., Huang, J., Chiles, R., and Yang, G., et al. 2005. Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarrays. Bioinformatics 21: 19581963. Downes, K., Barratt, B.J., Akan, P., Bumpstead, S.J., Taylor, S.D., Clayton, D.G., and Deloukas, P. 2004. SNP allele frequency estimation in DNA pools and variance components analysis. Biotechniques 36: 840845.[Medline] Feuk, L., Carson, A.R., and Scherer, S.W. 2006a. Structural variation in the human genome. Nat. Rev. Genet. 7: 8597.[Medline] Feuk, L., Marshall, C.R., Wintle, R.F., and Scherer, S.W. 2006b. Structural variants: Changing the landscape of chromosomes and design of disease studies. Hum. Mol. Genet. 15: R57R66. Fodor, S.P., Read, J.L., Pirrung, M.C., Stryer, L., Lu, A.T., and Solas, D. 1991. Light-directed, spatially addressable parallel chemical synthesis. Science 251: 767773. Fodor, S.P., Rava, R.P., Huang, X.C., Pease, A.C., Holmes, C.P., and Adams, C.L. 1993. Multiplexed biochemical assays with biological chips. Nature 364: 555556.[CrossRef][Medline] Fredman, D., White, S.J., Potter, S., Eichler, E.E., Den Dunnen, J.T., and Brookes, A.J. 2004. Complex SNP-related sequence variation in segmental genome duplications. Nat. Genet. 36: 861866.[CrossRef][Medline] Freeman, J.L., Perry, G.H., Feuk, L., Redon, R., McCarroll, S.A., Altshuler, D.M., Aburatani, H., Jones, K.W., Tyler-Smith, C., and Hurles, M.E., et al. 2006. Copy number variation: New insights in genome diversity. Genome Res. 16: 949961. Gonzalez, E., Kulkarni, H., Bolivar, H., Mangano, A., Sanchez, R., Catano, G., Nibbs, R.J., Freedman, B.I., Quinones, M.P., and Bamshad, M.J., et al. 2005. The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science 307: 14341440. Hinds, D.A., Kloek, A.P., Jen, M., Chen, X., and Frazer, K.A. 2006. Common deletions and SNPs are in linkage disequilibrium in the human genome. Nat. Genet. 38: 8285.[Medline] Hoheisel, J.D. 2006. Microarray technology: Beyond transcript profiling and genotype analysis. Nat. Rev. Genet. 7: 200210.[CrossRef][Medline] Huang, J., Wei, W., Zhang, J., Liu, G., Bignell, G.R., Stratton, M.R., Futreal, P.A., Wooster, R., Jones, K.W., and Shapero, M.H. 2004. Whole genome DNA copy number changes identified by high density oligonucleotide arrays. Hum. Genomics 1: 287299.[Medline] Huang, J., Wei, W., Chen, J., Zhang, J., Liu, G., Di, X., Mei, R., Ishikawa, S., Aburatani, H., and Jones, K.W., et al. 2006. CARAT: A novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays. BMC Bioinformatics 7: 83.[CrossRef][Medline] Iafrate, A.J., Feuk, L., Rivera, M.N., Listewnik, M.L., Donahoe, P.K., Qi, Y., Scherer, S.W., and Lee, C. 2004. Detection of large-scale variation in the human genome. Nat. Genet. 36: 949951.[CrossRef][Medline] Inoue, K. and Lupski, J.R. 2002. Molecular mechanisms for genomic disorders. Annu. Rev. Genomics Hum. Genet. 3: 199242.[CrossRef][Medline] . International HapMap Consortium 2005. A haplotype map of the human genome. Nature 437: 12991320.[CrossRef][Medline] . International Human Genome Sequencing Consortium 2004. Finishing the euchromatic sequence of the human genome. Nature 431: 931945.[CrossRef][Medline] Ishikawa, S., Komura, D., Tsuji, S., Nishimura, K., Yamamoto, S., Panda, B., Huang, J., Fukayama, M., Jones, K.W., and Aburatani, H. 2005. Allelic dosage analysis with genotyping microarrays. Biochem. Biophys. Res. Commun. 333: 13091314.[CrossRef][Medline] Kennedy, G.C., Matsuzaki, H., Dong, S., Liu, W.M., Huang, J., Liu, G., Su, X., Cao, M., Chen, W., and Zhang, J., et al. 2003. Large-scale genotyping of complex DNA. Nat. Biotechnol. 21: 12331237.[CrossRef][Medline] Komura, D., Nishimura, K., Ishikawa, S., Panda, B., Huang, J., Nakamura, H., Ihara, S., Hirose, M., Jones, K.W., and Aburatani, H. 2006. Noise reduction from genotyping microarrays using probe level information. In Silico Biol. 6: 9. Laframboise, T., Weir, B.A., Zhao, X., Beroukhim, R., Li, C., Harrington, D., Sellers, W.R., and Meyerson, M. 2005. Allele-specific amplification in cancer revealed by SNP array analysis. PLoS Comput. Biol. 1: e65.[CrossRef][Medline] Locke, D.P., Sharp, A.J., McCarroll, S.A., McGrath, S.D., Newman, T.L., Cheng, Z., Schwartz, S., Albertson, D.G., Pinkel, D., and Altshuler, D.M., et al. 2006. Linkage disequilibrium and heritability of copy-number polymorphisms within duplicated regions of the human genome. Am. J. Hum. Genet. 79: 275290.[CrossRef][Medline] Matsuzaki, H., Dong, S., Loi, H., Di, X., Liu, G., Hubbell, E., Law, J., Berntsen, T., Chadha, M., and Hui, H., et al. 2004a. Genotyping over 100,000 SNPs on a pair of oligonucleotide arrays. Nat. Methods 1: 109111.[CrossRef][Medline] Matsuzaki, H., Loi, H., Dong, S., Tsai, Y.-Y., Fang, J., Law, J., Di, X., Liu, W.-M., Yang, G., and Liu, G., et al. 2004b. Parallel genotyping of over 10,000 SNPs using a one-primer assay on a high density oligonucleotide array. Genome Res. 14: 414425. McCarrol, S.A., Hadnott, T.N., Perry, G.H., Sabeti, P.C., Zody, M.C., Barrett, J.C., Dallaire, S., Gabriel, S.B., Lee, C., and Daly, M.J., et al. 2006. Common deletion polymorphisms in the human genome. Nat. Genet. 38: 8692.[Medline] Midorikawa, Y., Yamamoto, S., Ishikawa, S., Kamimura, N., Igarashi, H., Sugimura, H., Makuuchi, M., and Aburatani, H. 2006. Molecular karyotyping of human hepatocellular carcinoma using single-nucleotide polymorphism arrays. Oncogene 25: 55815590.[CrossRef][Medline] Mohlke, K.L., Erdos, M.R., Scott, L.J., Fingerlin, T.E., Jackson, A.U., Silander, K., Hollstein, P., Boehnke, M., and Collins, F.S. 2002. High-throughput screening for evidence of association by using mass spectrometry genotyping on DNA pools. Proc. Natl. Acad. Sci. 99: 1692816933. Nannya, Y., Sanada, M., Nakazaki, K., Hosoya, N., Wang, L., Hangaishi, A., Kurokawa, M., Chiba, S., Bailey, D.K., and Kennedy, G.C., et al. 2005. A robust algorithm for copy number detection using high-density oligonucleotide single nucleotide polymorphism genotyping arrays. Cancer Res. 65: 60716079. Newman, T.L., Rieder, M.J., Morrison, V.A., Sharp, A.J., Smith, J.D., Sprague, L.J., Kaul, R., Carlson, C.S., Olson, M.V., and Nickerson, D.A., et al. 2006. High-throughput genotyping of intermediate-size structural variation. Hum. Mol. Genet. 15: 11591167. Ouahchi, K., Lindeman, N., and Lee, C. 2006. Copy number variants and pharmacogenomics. Pharmacogenomics 7: 2529.[CrossRef][Medline] Pease, A.C., Solas, D., Sullivan, E.J., Cronin, M.T., Holmes, C.P., and Fodor, S.P. 1994. Light-generated oligonucleotide arrays for rapid DNA sequence analysis. Proc. Natl. Acad. Sci. 91: 50225026. Price, T.S., Regan, R., Mott, R., Hedman, A., Honey, B., Daniels, R.J., Smith, L., Greenfield, A., Tiganescu, A., and Buckle, V., et al. 2005. SW-ARRAY: A dynamic programming solution for the identification of copy-number changes in genomic DNA using array comparative genome hybridization data. Nucleic Acids Res. 33: 34553464. Qian, G. and Kunsch, R.H. 1996. On model selection in robust linear regression. In Technical Report 80, Seminar Fur Statistik. Eidgenossische Technische Hochschule (ETH), Zurich, Switzerland. Redon, R., Ishikawa, S., Fitch, K.R., Feuk, L., Perry, G.H., Andrews, T.D., Fiegler, H., Shapero, M.H., and Chen, W., et al. 2006. Global variation in copy number in the human genome. Nature (in press). Sebat, J., Lakshmi, B., Troge, J., Alexander, J., Young, J., Lundin, P., Maner, S., Massa, H., Walker, M., and Chi, M., et al. 2004. Large-scale copy number polymorphism in the human genome. Science 305: 525528. Sharp, A.J., Locke, D.P., McGrath, S.D., Cheng, Z., Bailey, J.A., Vallente, R.U., Pertz, L.M., Clark, R.A., Schwartz, S., and Segraves, R., et al. 2005. Segmental duplications and copy-number variation in the human genome. Am. J. Hum. Genet. 77: 7888.[CrossRef][Medline] Slater, H.R., Bailey, D.K., Ren, H., Cao, M., Bell, K., Nasioulas, S., Henke, R., Choo, K.H., and Kennedy, G.C. 2005. High-resolution identification of chromosomal abnormalities using oligonucleotide arrays containing 116,204 SNPs. Am. J. Hum. Genet. 77: 709726.[CrossRef][Medline] Speicher, M.R. and Carter, N.P. 2005. The new cytogenetics: Blurring the boundaries with molecular biology. Nat. Rev. Genet. 6: 782792.[CrossRef][Medline] Tuzun, E., Sharp, A.J., Bailey, J.A., Kaul, R., Morrison, V.A., Pertz, L.M., Haugen, E., Hayden, H., Albertson, D., and Pinkel, D., et al. 2005. Fine-scale structural variation of the human genome. Nat. Genet. 37: 727732.[CrossRef][Medline] Urban, A.E., Korbel, J.O., Selzer, R., Richmond, T., Hacker, A., Popescu, G.V., Cubells, J.F., Green, R., Emanuel, B.S., and Gerstein, M.B., et al. 2006. High-resolution mapping of DNA copy alterations in human chromosome 22 using high-density tiling oligonucleotide arrays. Proc. Natl. Acad. Sci. 103: 45344539. Wang, Y., Moorhead, M., Karlin-Neumann, G., Falkowski, M., Chen, C., Siddiqui, F., Davis, R.W., Willis, T.D., and Faham, M. 2005. Allele quantification using molecular inversion probes (MIP). Nucleic Acids Res. 33: e183. Zhao, X., Li, C., Paez, J.G., Chin, K., Janne, P.A., Chen, T.H., Girard, L., Minna, J., Christiani, D., and Leo, C., et al. 2004. An integrated view of copy number and allelic alterations in the cancer genome using single nucleotide polymorphism arrays. Cancer Res. 64: 30603071.
Received June 12, 2006; accepted in revised format August 29, 2006. Related Article
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||