|
|
|
|
Published online before print
June 29, 2006, 10.1101/gr.5320706 Genome Res. 16:973-979, 2006 ©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
Letter Heterozygous carriers of Nijmegen Breakage Syndrome have a distinct gene expression phenotype1 Department of Pediatrics and Department of Genetics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA; 2 Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
Autosomal recessive diseases are those that require mutations in both alleles to exhibit the disorder. Although most recessive conditions are rare, heterozygous carriers of recessive mutations are quite common. In this study, we show that carriers of Nijmegen Breakage Syndrome (NBS) have a distinct gene expression phenotype that differs from that of noncarriers and also from that of carriers of a similar syndrome, Ataxia Telangiectasia (AT). We found 520 genes whose expression levels differ significantly (P 0.001) between NBS carriers and controls. By linear discriminant analysis, we found a combination of 16 genes that allows 100% correct classification of individuals as either NBS carriers or noncarriers in a training set with 25 individuals, and in a test set with 52 individuals. When applied to AT carriers, the discriminant function misclassified only one out of 18 AT carriers as an NBS carrier. Our result shows that NBS carriers have a specific gene expression phenotype. It suggests that heterozygous mutations can contribute significantly to natural variation in gene expression. This has implications for the role that heterozygosity for recessive diseases plays in the overall genetic architecture of complex human traits and diseases.
Nijmegen Breakage Syndrome (NBS; MIM 251260 [OMIM] ) is an autosomal recessive disease caused by mutations in the NBS gene (nibrin) on chromosome 8q21 (Saar et al. 1997
NBS is a rare disease. However, the frequency of heterozygous carriers is estimated to be as high as 1/150 in the Slavic population (Varon et al. 2000
For many years, geneticists have been interested in phenotypic characteristics of carriers of autosomal recessive diseases (Neel 1947
Microarray analysis of cells from NBS carriers and normal controls First, we obtained gene expression profiles of lymphoblastoid cells from nine obligate NBS carriers and 16 normal controls using the Affymetrix Genome Focus arrays that contain 8500 genes. From these, we analyzed 3928 genes that are expressed in lymphoblastoid cells. Using the t-statistic, we identified genes whose expression levels differ significantly between controls and NBS carriers. Among the 3928 genes examined, we found 218 that differ at nominal P 0.0001. At a less stringent threshold of P 0.001, there are 520 genes that differ significantly.
To correct for the 3928 tests performed and the unknown correlation among the tests, we used a permutation analysis to determine the number of significant genes one would expect by chance alone. In the permutation data, the average number of significant genes (P Among these 520 genes, there are 282 genes that are expressed at higher level in NBS carriers than in controls, and 238 genes that are expressed at lower level in NBS carriers than in controls. The distribution of fold differences for the 520 genes is shown in Figure 1. Sixty-nine genes have greater than twofold difference between controls and NBS carriers. The expression level of nibrin, the mutated gene in NBS, was not significantly different between the NBS carriers and controls. Since our goal is to identify genes that are significantly different between the two groups, we ranked the genes by t-scores. Table 1 lists the genes with the highest t-scores and their average fold difference in expression level between controls and NBS carriers.
Next, we performed cluster analysis to determine how the 25 subjects (16 controls and nine NBS carriers) can be grouped by the expression levels of the 520 differentially expressed genes (Fig. 2). All but one of the controls was clustered in one branch of the dendrogram, and the NBS carriers were grouped in the other major branch. The cluster analysis, therefore, supports the above findings that based on these 520 genes, the normal controls are more similar to each other than to the NBS carriers, and likewise, the NBS carriers are more similar to each other than to the control individuals.
Functional classification of differentially expressed genes We explored the function of the differentially expressed genes by using Gene Ontology annotations (Ashburner et al. 2000 0.0001). Differentially expressed genes that belong to RNA/nucleic acid binding include XBP1, EIF4A2, USF2, RFX5, and XRCC1. Those with pre-RNA splicing factor activity include SFRS9, SFPQ, SNRPB2, LSM2, and CUGBP1.
For a more comprehensive analysis of biological pathways that differ between NBS carriers and controls, we combined expression data from the microarray analysis with information in Reactome, a knowledge-base of biological pathways (Joshi-Tope et al. 2005
Nibrin is known to play a role in DNA repair. Thus it is not surprising that among the down-regulated pathways is homologous recombination repair, in which NBS forms a complex with RAD50 and MRE11 at double-strand breaks. In addition, two other repair pathways, the MAD1- and MAD2-mediated mitotic spindle checkpoint, and the XPG- and ERCC1-dependent nucleotide excision repair, are also down-regulated. These findings suggest that although NBS is a recessive disorder, carriers have detectable differences in their DNA repair pathways compared to noncarriers. Besides DNA damage repair, other pathways such as transcription, translation elongation, and nucleotide metabolism are also down-regulated.
While there are down-regulated pathways, we also found ones that are up-regulated in NBS carriers. These include signaling pathways such as MAP-kinase cascade and NOTCH signaling. For example, the NOTCH ligand JAG2 is significantly up-regulated. Base excision repair and polymerase
Discriminant analysis: NBS carriers and normal controls
Three of the 16 genes (DDX21, MYST3, and APRT) are among the 30 genes with highest t-statistic (Tables 1 and 3). The remaining 27 genes with high t-scores were not selected in the discriminant analysis, presumably because their expression levels are highly correlated with one or several of the genes that formed the discriminant functions. For example, the correlation coefficients between UQCRC1 and DDX21 and between PURA and DDX21 are 0.45 and 0.65, respectively. The inclusion of correlated genes would add little to the ability to predict whether an individual is a NBS carrier or not.
Validation of discriminant genes
Next, to examine how well this scheme will work when applied to an independent sample, we considered the preceding 25 samples as a training set and used the discriminant function formed by them to classify two other NBS carriers and 50 additional unrelated individuals who are part of the Center dEtude du Polymorphisme Humain (CEPH) collections. All 50 CEPH individuals were correctly classified as normal individuals, and the two NBS carriers were classified as NBS carriers. The frequency of NBS carriers is 0.7% in the Slavic population and likely to be lower in other populations; so it is very possible that none of the 50 CEPH individuals tested are NBS carriers. In Figure 3B, expression levels for DDX21, MYST3, and KCNMB4 for the new samples are shown along with those for individuals in the training set. Similar to the samples in the training set, the additional NBS carriers and normal controls have different expression levels of DDX21, MYST3, and KCNMB4.
One of the characteristics of NBS is radiosensitivity, thus we compared results from the discriminant analysis with those for another radiosensitivity syndrome, Ataxia Telangiectasia (AT). At the cellular level, AT and NBS are quite similar (van der Burgt et al. 1996
Diseases that are transmitted in recessive fashion are by definition those that have no marked heterozygous phenotypes. However, studies from the early days of human genetics had already suggested that there are subtle manifestations in carriers that are different from the norm (Neel 1947 0.001) between NBS carriers and controls. We show that baseline expression levels of 16 genes allowed discrimination of NBS carriers from noncarriers. This finding is validated on a new sample of 50 normal controls and two carriers. In addition, AT carriers were classified as non-NBS carriers, therefore allowing discrimination between carriers of these two closely related radiosensitivity syndromes.
There are several implications to our findings. First, identification of carriers is important medically. NBS and AT patients are known to be radiosensitive. Studies have suggested that carriers of these diseases are also radiosensitive (Dumon-Jones et al. 2003 Second, our results contribute to an understanding of phenotypic characteristics of carriers of NBS. The 520 genes whose expression levels define the gene expression phenotype of NBS carriers belong to several major DNA repair pathways. In NBS carriers, homologous recombination repair, nucleotide excision repair, and mitotic checkpoint were down-regulated, while steps in DNA damage bypass, processing of double-strand ends, and apoptosis were up-regulated. These data suggest that cells from NBS carriers are less efficient in DNA repair. This is expected in cells from NBS patients but not in cells from carriers. With time or under stress such as radiation exposure, this deficiency in DNA repair may increase the carriers risk of developing malignancies.
Lastly, our results identify heterozygous mutations as a contributor of natural variation in human gene expression. We and others have shown that there is extensive natural variation in human gene expression (Cheung and Spielman 2002 In this study, we showed that NBS carriers have a distinct gene expression phenotype. Although mutations in the nibrin gene are recessive, they result in phenotypes that differ between carriers and noncarriers. This finding suggests the importance of determining clinically important phenotypes in carriers and the contribution of heterozygous carriers to phenotypic diversity.
Study subject and expression phenotyping Lymphoblastoid cell lines (from Coriell Cell Repositories) were grown to a density of 5 x 105 cells/mL in RPMI 1640 with 15% fetal bovine serum (FBS), 100 U penicillin/mL, 100 µg streptomycin/mL sulfate, and 1% L-glutamine. To identify differentially expressed genes between NBS carriers and controls and as a training set for the discriminant analysis, samples from 16 normal controls (GM07013, GM07019, GM07029, GM07042, GM07348, GM10830, GM10842, GM10852, GM10857, GM12740, GM12767, GM12802, GM12817, GM12865, GM12911, GM12912) and nine obligate NBS carriers (GM08036, GM08037, GM15805, GM15806, GM15810, GM15811, GM15815, GM15820, GM15821) were used. Total RNA from each sample was extracted with the RNeasy Mini-Kit (Qiagen) and hybridized onto the Affymetrix Genome Focus arrays per the manufacturers protocol. The growing and processing of cell lines were randomized by genotypes to eliminate batch effects that may contribute to genotype-specific gene expression.
To validate results from the discriminant analysis, expression data for 50 unrelated individuals that were part of another study (Morley et al. 2004 Expression intensity was scaled to 500 using the Affymetrix MAS 5.0 software, log2 transformed, and normalized by Z-transformation. The 3928 genes that were called "Present" by the MAS 5.0 software in 90% or more of the controls or NBS carriers were used for further analysis.
Data analysis
Among the 3928 genes, 218 genes had t-scores that were the highest among the 10,000 replicates, thus those were given a P-value
To correct for the problem of multiple testing and the unknown correlation between the genes, we used the sets of permutation to give us 10,000 estimates of the number of extreme t-scores expected under the null hypothesis that no genes differ in expression level between controls and NBS carriers. Among the replicates, the average number of genes with P
Discriminant analysis Sixteen genes (Table 3) were identified as the minimum set required to best discriminate between the controls and the NBS carriers. Validation with leave-one-out-classification and additional subjects (CEPH individuals and AT carriers) were also carried out using SPSS with the above parameters. The discriminant score for each individual is the value of the discriminant function for the 16 observations.
Clustering
Gene Ontology classification and pathway analysis
We thank Nancy King, Michael Morley, Jason Watts, and Marc Burock for performing the microarray experiments and data analysis; and Richard Spielman for comments. This work is supported by grants from the National Institutes of Health.
3 Corresponding author.
E-mail vcheung{at}mail.med.upenn.edu; fax (215) 590-3709. [Supplemental material is available online at www.genome.org. The microarray data from this study have been submitted to NCBI/GEO under accession no. GSE3894.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.5320706
Ashburner M., Ball C.A., Blake J.A., Botstein D., Butler H., Cherry J.M., Davis A.P., Dolinski K., Dwight S.S., Eppig J.T. et al. 2000. Gene Ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 25: 2529.[CrossRef][Medline] Carney J.P., Maser R.S., Olivares H., Davis E.M., Le Beau M., Yates J.R. III, Hays L., Morgan W.F., Petrini J.H. 1998. The hMre11/hRad50 protein complex and Nijmegen Breakage Syndrome: Linkage of double-strand break repair to the cellular DNA damage response. Cell 93: 477486.[CrossRef][Medline] Cheung V.G. and Spielman R.S. 2002. The genetics of variation in gene expression. Nat. Genet. 32: 522525. Cheung V.G., Conlin L.K., Weber T.M., Arcaro M., Jen K.-Y., Morley M., Spielman R.S. 2003. Natural variation in human gene expression assessed in lymphoblastoid cells. Nat. Genet. 33: 422425.[CrossRef][Medline] Cheung V.G., Spielman R.S., Ewens K.G., Weber T.M., Morley M., Burdick J. 2005. Mapping determinants of human gene expression by regional and genome-wide association. Nature 437: 13651369.[CrossRef][Medline] Cybulski C., Gorski B., Debniak T., Gliniewicz B., Mierzejewski M., Masojc B., Jakubowska A., Matyjasik J., Zlowocka E., Sikorski A. et al. 2004. NBS1 is a prostate cancer susceptibility gene. Cancer Res. 64: 12151219. DeRisi J., Penland L., Brown P.O., Bittner M.L., Meltzer P.S., Ray M., Chen Y., Su Y.A., Trent J.M. 1996. Use of a cDNA microarray to analyse gene expression patterns in human cancer. Nat. Genet. 14: 457460.[CrossRef][Medline] Drabek J., Hajduch M., Gojova L., Weigl E., Mihal V. 2002. Frequency of 657del(5) mutation of the NBS1 gene in the Czech population by polymerase chain reaction with sequence specific primers. Cancer Genet. Cytogenet. 138: 157159.[CrossRef][Medline] Dumon-Jones V., Frappart P.O., Tong W.M., Sajithlal G., Hulla W., Schmid G., Herceg Z., Digweed M., Wang Z.Q. 2003. Nbn heterozygosity renders mice susceptible to tumor formation and ionizing radiation-induced tumorigenesis. Cancer Res. 63: 72637269. Golub T.R., Slonim D.K., Tamayo P., Huard C., Gaasenbeek M., Mesirov J.P., Coller H., Loh M.L., Downing J.R., Caligiuri M.A. et al. 1999. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 286: 531537. Hosack D.A., Dennis G. Jr., Sherman B.T., Lane H.C., Lempicki R.A. 2003. Identifying biological themes within lists of genes with EASE. Genome Biol. 4: R70.[CrossRef][Medline] Hsia D.Y. 1957. The laboratory detection of heterozygotes. Am. J. Hum. Genet. 9: 98112.[Medline] Hsia D.Y. 1966. The diagnosis of carriers of disease-producing genes. Ann. N.Y. Acad. Sci. 66: 946964. Joshi-Tope G., Gillespie M., Vastrik I., DEustachio P., Schmidt E., de Bono B., Jassal B., Gopinath G.R., Wu G.R., Matthews L. et al. 2005. Reactome: A knowledge base of biological pathways. Nucleic Acids Res. 33: D428D432. Lo H.S., Wang Z., Hu Y., Yang H.H., Gere S., Buetow K.H., Lee M.P. 2003. Allelic variation in gene expression is common in the human genome. Genome Res. 13: 18551862. Manly B.J.F. 1997. Randomization, bootstrap and Monte Carlo methods in biology. 2nd ed. Chapman & Hall, London. Morley M., Molony C.M., Weber T.M., Devlin J.L., Ewens K.G., Spielman R.S., Cheung V.G. 2004. Genetic analysis of genome-wide variation in human gene expression. Nature 430: 743747.[CrossRef][Medline] Morton N.E., Crow J.F., Muller H.J. 1956. An estimate of the mutational damage in man from data on consanguineous marriages. Proc. Natl. Acad. Sci. 42: 855863. Neel J.V. 1947. The clinical detection of the genetic carriers of inherited disease. Medicine 26: 135153. Pastinen T., Sladek R., Gurd S., Sammak A., Ge B., Lepage P., Lavergne K., Villeneuve A., Gaudin T., Brandstrom H. et al. 2004. A survey of genetic and epigenetic variation affecting human gene expression. Physiol. Genomics 16: 184193. Resnick I.B., Kondratenko I., Pashanov E., Maschan A.A., Karachunsky A., Togoev O., Timakov A., Polyakov A., Tverskaya S., Evgrafov O. et al. 2003. 657del5 mutation in the gene for Nijmegen Breakage Syndrome (NBS1) in a cohort of Russian children with lymphoid tissue malignancies and controls. Am. J. Med. Genet. A 120: 174179.[Medline] Saar K., Chrzanowska K.H., Stumm M., Jung M., Nurnberg G., Wienker T.F., Seemanova E., Wegner R.D., Reis A., Sperling K. 1997. The gene for the ataxia-telangiectasia variant, Nijmegen Breakage Syndrome, maps to a 1-cM interval on chromosome 8q21. Am. J. Hum. Genet. 60: 605610.[Medline] Schadt E.E., Monks S.A., Drake T.A., Lusis A.J., Che N., Colinayo V., Ruff T.G., Milligan S.B., Lamb J.R., Cavet G. et al. 2003. Genetics of gene expression surveyed in maize, mouse and man. Nature 422: 297302.[CrossRef][Medline] Stranger B.E., Forrest M.S., Clark A.G., Minichiello M.J., Deutsch S., Lyle R., Hunt S., Kahl B., Antonarakis S.E., Tavare S. et al. 2005. Genome-wide associations of gene expression variation in humans. PLoS Genet. 1: e78.[CrossRef][Medline] Tanzanella C., Antoccia A., Spadoni E., di Masi A., Pecile V., Demori E., Varon R., Marseglia G.L., Tiepolo L., Maraschio P. 2003. Chromosome instability and nibrin protein variants in NBS heterozygotes. Eur. J. Hum. Genet. 11: 297303.[CrossRef][Medline] van der Burgt I., Chrzanowska K.H., Smeets D., Weemaes C. 1996. Nijmegen Breakage Syndrome. J. Med. Genet. 33: 153156.[Abstract] Varon R., Vissinga C., Platzer M., Cerosaletti K.M., Chrzanowska K.H., Saar K., Beckmann G., Seemanova E., Cooper P.R., Nowak N.J. et al. 1998. Nibrin, a novel DNA double-strand break repair protein, is mutated in Nijmegen Breakage Syndrome. Cell 93: 467476.[CrossRef][Medline] Varon R., Seemanova E., Chrzanowska K., Hnateyko O., Piekutowska-Abramczuk D., Krajewska-Walasek M., Sykut-Cegielska J., Sperling K., Reis A. 2000. Clinical ascertainment of Nijmegen Breakage Syndrome (NBS) and prevalence of the major mutation, 657del5, in three Slav populations. Eur. J. Hum. Genet. 8: 900902.[CrossRef][Medline] Watts J.A., Morley M., Burdick J.T., Fiori J.L., Ewens W.J., Spielman R.S., Cheung V.G. 2002. Gene expression phenotype in heterozygous carriers of Ataxia Telangiectasia. Am. J. Hum. Genet. 71: 791800.[CrossRef][Medline] Yan H., Yuan W., Velculescu V.E., Vogelstein B., Kinzler K.W. 2002. Allelic variation in human gene expression. Science 297: 1143.
Received March 27, 2006; accepted in revised format May 8, 2006.
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||