|
|
|
|
Published online before print
July 3, 2007, 10.1101/gr.6307307 Genome Res. 17:1228-1235, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Resource Detecting genetic variation in microarray expression data1 The Salk Institute for Biological Studies, La Jolla, California 92037, USA; 2 Neurosciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA; 3 Biomedical Sciences Graduate Program, School of Medicine, University of California, San Diego, California 92093, USA; 4 Polymorphism Research Laboratory, Department of Psychiatry, University of California, San Diego, California 92093, USA; 5 Genes and Disease Program, Center for Genomic Regulation (CRG-UPF), Barcelona 08003, Spain; 6 Brain Cells, Inc., San Diego, California 92121, USA; 7 Amicus Therapeutics, Cranbury, New Jersey 08512, USA
The use of high-density oligonucleotide arrays to measure the expression levels of thousands of genes in parallel has become commonplace. To take further advantage of the growing body of data, we developed a method, termed "GeSNP," to mine the detailed hybridization patterns in oligonucleotide array expression data for evidence of genetic variation. To demonstrate the performance of the algorithm, the hybridization patterns in data obtained previously from SAMP8/Ta, SAMP10/Ta, and SAMR1/Ta inbred mice and from humans and chimpanzees were analyzed. Genes with consistent strain-specific and species-specific hybridization pattern differences were identified, and 90% of the candidate genes were independently confirmed to harbor sequence differences. Importantly, the quality of gene expression data was also improved by masking the probes of regions with putative sequence differences between species and strains. To illustrate the application to human disease groups, data from an inflammatory bowel disease study were analyzed. GeSNP identified sequence differences in candidate genes previously discovered in independent association and linkage studies and uncovered many promising new candidates. This approach enables the opportunistic extraction of genetic variation information from new or pre-existing gene expression data obtained with high-density oligonucleotide arrays.
High-density oligonucleotide arrays are used routinely to measure quantitative levels of gene expression and to screen thousands of genes for expression differences (Lipshutz et al. 1999
Although gene expression arrays were not designed to detect sequence differences, we reasoned that the underlying multi-probe hybridization patterns could be retrospectively analyzed, probe by probe, to identify possible sequence differences between strains, species, or individuals. In addition, the identification and removal or "masking" of probe pairs that target regions with sequence differences should produce more accurate expression results in comparisons between distinct genetic populations (Cáceres et al. 2003
Here, we demonstrate the performance of a user-friendly Web-based program, "GeSNP" (available at http://porifera.ucsd.edu/~cabney/cgi-bin/geSNP.cgi) that can be used to identify potential sequence variation from gene expression data sets. This algorithm was used previously to identify sequence differences between three rare strains of inbred mice (Carter et al. 2005
Development of the algorithm On an Affymetrix oligonucleotide array, each gene is represented by a probe set, which is comprised of 11–20 different oligonucleotide probe pairs that are designed to hybridize to specific regions of a gene. Each probe pair consists of a matched set of two 25-base oligonucleotide probes, a perfect match (PM) for the gene of interest and a mismatch (MM) containing a single nucleotide substitution in the middle of the probe (position 13). The MM serves as a measure of nonspecific background binding and noise. The GeSNP algorithm compares the detailed hybridization patterns across the oligonucleotide probe pairs for a gene, after normalizing for expression level differences, in order to find probe pairs that show consistent, statistically significant differences between two sets of samples. The algorithm works as follows (see Fig. 1): First, the individual hybridization intensity values are extracted from the cell-by-cell intensity (CEL) file. The difference between the perfect match and the mismatch (PM – MM) intensities is calculated for each probe pair for each CEL file. In order to minimize false predictions of sequence differences due to inadequate hybridization signal, a probe set from a particular CEL file is excluded if <65% of the PM – MM values for the probe set are positive, indicating that the gene was not likely expressed at a detectable level. After eliminating data from samples that do not fulfill this criterion, the PM – MM values for all of the probe sets for each sample are globally scaled to compensate for gene expression differences. The scaling factor is calculated by dividing an arbitrary target value of 200 by the standard deviation of the PM – MM values for a probe set while ignoring the largest and smallest PM – MM values. Next, the scaled values for each sample group are averaged, and an average and a variance are calculated for each probe pair in a probe set. To further reduce false positives, only probe sets for which at least four files in both sample groups have exceeded the pattern quality threshold are analyzed.
To identify statistically significant pattern differences, the Students t-test using the separate variance formula was employed. The t-value for each probe pair (PP) was calculated as follows:
Identification of sequence differences among three rare mouse strains
Identification of sequence differences between species In order to further validate the algorithm and extend its applicability to interspecies comparisons, gene expression data from 10 humans (Homo sapiens) and seven chimpanzees (Pan troglodytes) obtained with human HG-U95Av2 Affymetrix arrays were analyzed (Enard et al. 2002 5). At a t-value threshold of 6, 16 of 19 probe sets (84% true positive rate) or 42 of 59 probe pairs (71% true positive rate) identified as SFPs were independently confirmed to contain sequence differences (Table 2). Similarly, 414 of 431 probe pairs (96% specificity) covering identical sequence between species did not contain SFPs. In addition, taking into account the quality of the current genome sequences, comparable results were obtained based on a genome-wide analysis of the available human and chimpanzee genome sequence (Supplemental Table 7).
Compared with the analyses of closely related mouse strains, the numbers of false negatives are higher and the detection rates are lower in this interspecies comparison. The greater sequence variation in non-isogenic groups and the resulting larger statistical variance can yield a smaller t-value not identified as significant, especially for more subtle hybridization differences. Consistent with this observation, 50% of the false negatives between humans and chimpanzees are due to the sequence difference lying within the first five or last five nucleotides of the probe sequence. Increasing the number of individuals with array data in both the human and chimpanzee groups is likely to increase the detection of these smaller hybridization changes. Furthermore, data from a larger number of individuals will improve the resolution of the results, decreasing the number of false positives due to intraspecies SNPs and sharpening the peaks for the true positives.
Comparison of the GeSNP algorithm with other methods
Improving the quality of array-based gene expression data Sequence differences in gene regions covered by the oligonucleotide probes can affect the quantitative measurement of expression levels. This effect is especially important when mRNA from one species is interrogated with arrays designed for a different species. The ability to identify probes that cover regions with sequence differences and eliminate them from the analysis is essential for accurate gene expression quantification (Cáceres et al. 2003
Identification of disease-causing mutations in human disease groups
The data of 59 Crohns disease patients, 26 ulcerative colitis patients, and 42 healthy controls were analyzed with the GeSNP algorithm in order to identify potential sequence differences between these groups. Several previously identified Crohns disease and ulcerative colitis candidate susceptibility genes showed SFPs based on the GeSNP analysis, including SLC22A4, identified in linkage and association studies (Ma et al. 1999
The results demonstrate that the GeSNP algorithm can identify sequence differences using array-based gene expression data. The approach is general to several Affymetrix gene expression array types and applicable to the analysis of data obtained in different populations of genetically distinct individuals, including humans. With most array designs, the sequence coverage for each gene is incomplete. Usually 100–400 bases of sequence are interrogated for each gene since there are typically 11–20 probe pairs per gene, the probes are 25 bases in length, some of the probes are overlapping, and sequence differences that result in mismatches near the probe ends (e.g., the five bases at either end) are not expected to lead to consistently measurable hybridization differences (Pease et al. 1994
In addition to the identification of genetic variants, the analysis methods described here may find their most immediate application in improving array performance and enabling arrays designed for one strain or species to be used more broadly. We have used this technique successfully in the past to improve the quality of gene expression data by masking probes that cover regions with potential sequence differences in both mouse (Carter et al. 2005
Moreover, as new array designs become more widely used, the GeSNP algorithm could have a much larger impact. For example, Affymetrix recently released the exon arrays to interrogate all putative exons in a genome. The human array contains 1.4 million probe sets with four PM probes per set (5.6 million probes, 140 million nucleotides). Assuming that half of the probe sets pass the pattern quality measure of detectable expression and that only 50% of the covered nucleotides provide information due to probe sequence overlap and lower sensitivity to differences at the probe ends, analysis with the GeSNP algorithm could yield information on In summary, the GeSNP algorithm allows for the unbiased, opportunistic extraction of sequence variation information from array-based gene expression data. This information can be used to improve the quality of gene expression and eQTL analyses and to identify potential disease-causing genes in human disease populations. The GeSNP source code and a Web-based program are available for public implementation.
Computer software The algorithm was written in standard ANSI C++ and compiled to run on UNIX. The extensively commented source code is available for download from Supplemental materials at the Genome Research Web site and the GeSNP Web site, http://porifera.ucsd.edu/~cabney/cgi-bin/geSNP.cgi. In addition, the GeSNP Web site hosts a user-friendly Web-based tool that allows users to upload their expression data in two predefined groups and obtain results online. A user manual and example data are also available at the Web site. The GeSNP program outputs a text file for each comparison with the following columns: Probe set, Probe pair, pspp (probe set with probe pair number appended at the end), N1 (number of files included in group one), Mean1, Var1 (variance of group 1), N2 (number of files included in group two), Mean2, Var2 (variance of group 2), and t-value. In the t-value column, the value "NaN" indicates that there were less than two files included in one or both groups and a t-value could not be calculated. Probe sets where both groups had zero files passing the pattern quality measures are not included in the output.
False positive estimation and choosing a threshold
Analysis of SFPs Since increasing the number of files can improve analytical power, we also compared combined groups. For example, we combined ulcerative colitis and Crohns disease samples and compared this group with control samples. While all the SFPs in common to both the ulcerative colitis versus normal and Crohns disease versus normal lists appear in the combined comparison, additional probe sets are identified as SFPs. These probe sets may have more subtle hybridization differences (such as sequence differences near the ends of the probes) that are enhanced with the larger number of files.
Implementation of the Ronald et al. (2005)
RNA preparation and cDNA synthesis for sequence confirmation
We thank Stephen Heinemann of the Salk Institute for support, Charles Abney for Web programming assistance, Eva Mitter of IMC for technical assistance, Sebastian Fuchs for sequencing assistance, Jo Del Rio for preliminary analysis, James Thomas for help with the human and chimpanzee genome analysis, and Svante Pääbo, Todd Carter, and Michael Burczynski for providing data files. J.A.G. was supported by a generous gift from the Lewin family and the Sprint Corporation, the NIH Neuroplasticity of Aging Training Grant (5 T32 AG00216), and the National Defense Science and Engineering Graduate Fellowship. M.C. was supported by an EMBO Long-Term Fellowship, a Salk Institute Innovation Grant, and the Ramón y Cajal Program (Ministerio de Educación y Ciencia, Spain). Additional funding was provided by the DOD grant DAMD17-99-1-9561 and the Frederick B. Rentschler Developmental Chair to C.B.
8 These authors contributed equally to this work.
E-mail dlockhart{at}amicustherapeutics.com; fax (609) 662-2001. [Supplemental material is available online at www.genome.org. GeSNP can be accessed at http://porifera.ucsd.edu/~cabney/cgi-bin/geSNP.cgi. The Affymetrix CEL files for the mouse studies and the human/chimpanzee array data have been submitted to GEO under accession nos. GSE6238 and GSE7540, respectively.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6307307
Albert, T.J., Dailidiene, D., Dailide, G., Norton, J.E., Kalia, A., Richmond, T.A., Molla, M., Singh, J., Green, R.D., and Berg, D.E. 2005. Mutation discovery in bacterial genomes: Metronidazole resistance in Helicobacter pylori. Nat. Methods 2: 951–953.[CrossRef][Medline] Borevitz, J.O., Liang, D., Plouffe, D., Chang, H.S., Zhu, T., Weigel, D., Berry, C.C., Winzeler, E., and Chory, J. 2003. Large-scale identification of single-feature polymorphisms in complex genomes. Genome Res. 13: 513–523. Burczynski, M.E., Peterson, R.L., Twine, N.C., Zuberek, K.A., Brodeur, B.J., Casciotti, L., Maganti, V., Reddy, P.S., Strahs, A., Immermann, F., et al. 2006. Molecular classification of Crohns disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells. J. Mol. Diagn. 8: 51–61. Bystrykh, L., Weersing, E., Dontje, B., Sutton, S., Pletcher, M.T., Wiltshire, T., Su, A.I., Vellenga, E., Wang, J., Manly, K.F., et al. 2005. Uncovering regulatory pathways that affect hematopoietic stem cell function using 'genetical genomics.' Nat. Genet. 37: 225–232.[CrossRef][Medline] Cáceres, M., Laucher, J., Zapala, M.A., Redmond, J.C., Kudo, L., Geschwind, D.H., Lockhart, D.J., Preuss, T.M., and Barlow, C. 2003. Elevated gene expression levels distinguish human from non-human primate brains. Proc. Natl. Acad. Sci. 100: 13030–13035. Carter, M.J., di Giovine, F.S., Jones, S., Mee, J., Camp, N.J., Lobo, A.J., and Duff, G.W. 2001. Association of the interleukin 1 receptor antagonist gene with ulcerative colitis in Northern European Caucasians. Gut 48: 461–467. Carter, T.A., Greenhall, J.A., Yoshida, S., Fuchs, S., Helton, R., Swaroop, A., Lockhart, D.J., and Barlow, C. 2005. Mechanisms of aging in senescence-accelerated mice. Genome Biol. 6: R48. doi: 10.1186/gb-2005-6-6-r48.[CrossRef][Medline] Cervino, A.C., Li, G., Edwards, S., Zhu, J., Laurie, C., Tokiwa, G., Lum, P.Y., Wang, S., Castellini, L.W., Lusis, A.J., et al. 2005. Integrating QTL and high-density SNP analyses in mice to identify Insig2 as a susceptibility gene for plasma cholesterol levels. Genomics 86: 505–517.[CrossRef][Medline] Chee, M., Yang, R., Hubbell, E., Berno, A., Huang, X.C., Stern, D., Winkler, J., Lockhart, D.J., Morris, M.S., and Fodor, S.P. 1996. Accessing genetic information with high-density DNA arrays. Science 274: 610–614. Chesler, E.J., Lu, L., Shou, S., Qu, Y., Gu, J., Wang, J., Hsu, H.C., Mountz, J.D., Baldwin, N.E., Langston, M.A., et al. 2005. Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function. Nat. Genet. 37: 233–242.[CrossRef][Medline] Enard, W., Khaitovich, P., Klose, J., Zollner, S., Heissig, F., Giavalisco, P., Nieselt-Struwe, K., Muchmore, E., Varki, A., Ravid, R., et al. 2002. Intra- and interspecific variation in primate gene expression patterns. Science 296: 340–343. Fodor, S.P., Read, J.L., Pirrung, M.C., Stryer, L., Lu, A.T., and Solas, D. 1991. Light-directed, spatially addressable parallel chemical synthesis. Science 251: 767–773. Fodor, S.P., Rava, R.P., Huang, X.C., Pease, A.C., Holmes, C.P., and Adams, C.L. 1993. Multiplexed biochemical assays with biological chips. Nature 364: 555–556.[CrossRef][Medline] Franchimont, D., Vermeire, S., El Housni, H., Pierik, M., Van Steen, K., Gustot, T., Quertinmont, E., Abramowicz, M., Van Gossum, A., Deviere, J., et al. 2004. Deficient host-bacteria interactions in inflammatory bowel disease? The toll-like receptor (TLR)-4 Asp299gly polymorphism is associated with Crohns disease and ulcerative colitis. Gut 53: 987–992. Gazouli, M., Mantzaris, G., Kotsinas, A., Zacharatos, P., Papalambros, E., Archimandritis, A., Ikonomopoulos, J., and Gorgoulis, V.G. 2005. Association between polymorphisms in the Toll-like receptor 4, CD14, and CARD15/NOD2 and inflammatory bowel disease in the Greek population. World J. Gastroenterol. 11: 681–685.[Medline] Geschwind, D.H. 2000. Mice, microarrays, and the genetic diversity of the brain. Proc. Natl. Acad. Sci. 97: 10676–10678. Giallourakis, C., Stoll, M., Miller, K., Hampe, J., Lander, E.S., Daly, M.J., Schreiber, S., and Rioux, J.D. 2003. IBD5 is a general risk factor for inflammatory bowel disease: Replication of association with Crohn disease and identification of a novel association with ulcerative colitis. Am. J. Hum. Genet. 73: 205–211.[CrossRef][Medline] Gresham, D., Ruderfer, D.M., Pratt, S.C., Schacherer, J., Dunham, M.J., Botstein, D., and Kruglyak, L. 2006. Genome-wide detection of polymorphisms at nucleotide resolution with a single DNA microarray. Science 311: 1932–1936. Grupe, A., Germer, S., Usuka, J., Aud, D., Belknap, J.K., Klein, R.F., Ahluwalia, M.K., Higuchi, R., and Peltz, G. 2001. In silico mapping of complex disease-related traits in mice. Science 292: 1915–1918. Hacia, J.G., Brody, L.C., Chee, M.S., Fodor, S.P., and Collins, F.S. 1996. Detection of heterozygous mutations in BRCA1 using high density oligonucleotide arrays and two-colour fluorescence analysis. Nat. Genet. 14: 441–447.[CrossRef][Medline] Hazen, S.P., Borevitz, J.O., Harmon, F.G., Pruneda-Paz, J.L., Schultz, T.F., Yanovsky, M.J., Liljegren, S.J., Ecker, J.R., and Kay, S.A. 2005. Rapid array mapping of circadian clock and developmental mutations in Arabidopsis. Plant Physiol. 138: 990–997. Hovatta, I., Zapala, M.A., Broide, R.S., Schadt, E.E., Libiger, O., Schork, N.J., Lockhart, D.J., and Barlow, C. 2007. DNA variation and brain region-specific expression profiles exhibit different relationships between inbred mouse strains: Implications for eQTL mapping studies. Genome Biol. 8: R25. doi: 10.1186/gb-2007-8-2-r25.[CrossRef][Medline] Hu, G.K., Madore, S.J., Moldover, B., Jatkoe, T., Balaban, D., Thomas, J., and Wang, Y. 2001. Predicting splice variant from DNA chip expression data. Genome Res. 11: 1237–1245. Hubner, N., Wallace, C.A., Zimdahl, H., Petretto, E., Schulz, H., Maciver, F., Mueller, M., Hummel, O., Monti, J., Zidek, V., et al. 2005. Integrated transcriptional profiling and linkage analysis for identification of genes underlying disease. Nat. Genet. 37: 243–253.[CrossRef][Medline] Irizarry, R.A., Bolstad, B.M., Collin, F., Cope, L.M., Hobbs, B., and Speed, T.P. 2003. Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 31: e15. doi: 10.1093/nar/gng015. Karaman, M.W., Houck, M.L., Chemnick, L.G., Nagpal, S., Chawannakul, D., Sudano, D., Pike, B.L., Ho, V.V., Ryder, O.A., and Hacia, J.G. 2003. Comparative analysis of gene-expression patterns in human and African great ape cultured fibroblasts. Genome Res. 13: 1619–1630. Khaitovich, P., Muetzel, B., She, X., Lachmann, M., Hellmann, I., Dietzsch, J., Steigele, S., Do, H.H., Weiss, G., Enard, W., et al. 2004. Regional patterns of gene expression in human and chimpanzee brains. Genome Res. 14: 1462–1473. Kumar, S., Tamura, K., and Nei, M. 2004. MEGA3: Integrated software for Molecular Evolutionary Genetics Analysis and sequence alignment. Brief. Bioinform. 5: 150–163. Lamb, J., Crawford, E.D., Peck, D., Modell, J.W., Blat, I.C., Wrobel, M.J., Lerner, J., Brunet, J.P., Subramanian, A., Ross, K.N., et al. 2006. The connectivity map: Using gene-expression signatures to connect small molecules, genes, and disease. Science 313: 1929–1935. Li, C. and Wong, W.H. 2001. Model-based analysis of oligonucleotide arrays: Expression index computation and outlier detection. Proc. Natl. Acad. Sci. 98: 31–36. Lipshutz, R.J., Fodor, S.P., Gingeras, T.R., and Lockhart, D.J. 1999. High density synthetic oligonucleotide arrays. Nat. Genet. 21: 20–24.[CrossRef][Medline] Lockhart, D.J. and Winzeler, E.A. 2000. Genomics, gene expression and DNA arrays. Nature 405: 827–836.[CrossRef][Medline] Lockhart, D.J., Dong, H., Byrne, M.C., Follettie, K.T., Gallo, M.V., Chee, M.S., Mittmann, M., Wang, C., Kobayashi, M., Horton, H., et al. 1996. Expression monitoring by hybridization to high-density oligonucleotide arrays. Nat. Biotechnol. 14: 1675–1680.[CrossRef][Medline] Ma, Y., Ohmen, J.D., Li, Z., Bentley, L.G., McElree, C., Pressman, S., Targan, S.R., Fischel-Ghodsian, N., Rotter, J.I., and Yang, H. 1999. A genome-wide search identifies potential new susceptibility loci for Crohns disease. Inflamm. Bowel Dis. 5: 271–278.[Medline] Nagpal, S., Karaman, M.W., Timmerman, M.M., Ho, V.V., Pike, B.L., and Hacia, J.G. 2004. Improving the sensitivity and specificity of gene expression analysis in highly related organisms through the use of electronic masks. Nucleic Acids Res. doi: 10.1093/nar/gnh048. Noble, C., Nimmo, E., Gaya, D., Russell, R.K., and Satsangi, J. 2006. Novel susceptibility genes in inflammatory bowel disease. World J. Gastroenterol. 12: 1991–1999.[Medline] Pease, A.C., Solas, D., Sullivan, E.J., Cronin, M.T., Holmes, C.P., and Fodor, S.P. 1994. Light-generated oligonucleotide arrays for rapid DNA sequence analysis. Proc. Natl. Acad. Sci. 91: 5022–5026. Peirce, J.L., Li, H., Wang, J., Manly, K.F., Hitzemann, R.J., Belknap, J.K., Rosen, G.D., Goodwin, S., Sutter, T.R., Williams, R.W., et al. 2006. How replicable are mRNA expression QTL? Mamm. Genome 17: 643–656.[CrossRef][Medline] Peltekova, V.D., Wintle, R.F., Rubin, L.A., Amos, C.I., Huang, Q., Gu, X., Newman, B., Van Oene, M., Cescon, D., Greenberg, G., et al. 2004. Functional variants of OCTN cation transporter genes are associated with Crohn disease. Nat. Genet. 36: 471–475.[CrossRef][Medline] Radcliffe, R.A., Lee, M.J., and Williams, R.W. 2006. Prediction of cis-QTLs in a pair of inbred mouse strains with the use of expression and haplotype data from public databases. Mamm. Genome 17: 629–642.[CrossRef][Medline] Rioux, J.D., Silverberg, M.S., Daly, M.J., Steinhart, A.H., McLeod, R.S., Griffiths, A.M., Green, T., Brettin, T.S., Stone, V., Bull, S.B., et al. 2000. Genomewide search in Canadian families with inflammatory bowel disease reveals two novel susceptibility loci. Am. J. Hum. Genet. 66: 1863–1870.[CrossRef][Medline] Rioux, J.D., Daly, M.J., Silverberg, M.S., Lindblad, K., Steinhart, H., Cohen, Z., Delmonte, T., Kocher, K., Miller, K., Guschwan, S., et al. 2001. Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease. Nat. Genet. 29: 223–228.[CrossRef][Medline] Ronald, J., Akey, J.M., Whittle, J., Smith, E.N., Yvert, G., and Kruglyak, L. 2005. Simultaneous genotyping, gene-expression measurement, and detection of allele-specific expression with oligonucleotide arrays. Genome Res. 15: 284–291. Sandberg, R., Yasuda, R., Pankratz, D.G., Carter, T.A., Del Rio, J.A., Wodicka, L., Mayford, M., Lockhart, D.J., and Barlow, C. 2000. Regional and strain-specific gene expression mapping in the adult mouse brain. Proc. Natl. Acad. Sci. 97: 11038–11043. Sartor, R.B. 2006. Mechanisms of disease: pathogenesis of Crohns disease and ulcerative colitis. Nat. Clin. Pract. Gastroenterol. Hepatol. 3: 390–407.[CrossRef][Medline] Schadt, E.E., Monks, S.A., Drake, T.A., Lusis, A.J., Che, N., Colinayo, V., Ruff, T.G., Milligan, S.B., Lamb, J.R., Cavet, G., et al. 2003. Genetics of gene expression surveyed in maize, mouse and man. Nature 422: 297–302.[CrossRef][Medline] Shi, L., Reid, L.H., Jones, W.D., Shippy, R., Warrington, J.A., Baker, S.C., Collins, P.J., de Longueville, F., Kawasaki, E.S., Lee, K.Y., et al. 2006. The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat. Biotechnol. 24: 1151–1161.[CrossRef][Medline] Takeda, T. 1999. Senescence-accelerated mouse (SAM): A biogerontological resource in aging research. Neurobiol. Aging 20: 105–110.[CrossRef][Medline] Tountas, N.A., Casini-Raggi, V., Yang, H., Di Giovine, F.S., Vecchi, M., Kam, L., Melani, L., Pizarro, T.T., Rotter, J.I., and Cominelli, F. 1999. Functional and ethnic association of allele 2 of the interleukin-1 receptor antagonist gene in ulcerative colitis. Gastroenterology 117: 806–813.[CrossRef][Medline] Waller, S., Tremelling, M., Bredin, F., Godfrey, L., Howson, J., and Parkes, M. 2006. Evidence for association of OCTN genes and IBD5 with ulcerative colitis. Gut 55: 809–814. Winzeler, E.A., Richards, D.R., Conway, A.R., Goldstein, A.L., Kalman, S., McCullough, M.J., McCusker, J.H., Stevens, D.A., Wodicka, L., Lockhart, D.J., et al. 1998. Direct allelic variation scanning of the yeast genome. Science 281: 1194–1197. Wodicka, L., Dong, H., Mittmann, M., Ho, M.-H., and Lockhart, D.J. 1997. Genome-wide expression monitoring in Saccharomyces cerevisiae. Nat. Biotechnol. 15: 1359–1367.[CrossRef][Medline] Xia, C., Higuchi, K., Shimizu, M., Matsushita, T., Kogishi, K., Wang, J., Chiba, T., Festing, M.F., and Hosokawa, M. 1999. Genetic typing of the senescence-accelerated mouse (SAM) strains with microsatellite markers. Mamm. Genome 10: 235–238.[CrossRef][Medline] Zhang, L., Miles, M.F., and Aldape, K.D. 2003. A model of molecular interactions on short oligonucleotide microarrays. Nat. Biotechnol. 21: 818–821.[CrossRef][Medline]
Received January 23, 2007; accepted in revised format April 23, 2007.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||