|
|
|
|
Published online before print
March 13, 2006, 10.1101/gr.4627606 Genome Res. 16:477-484, 2006 ©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
Letter Word frequency analysis reveals enrichment of dinucleotide repeats on the human X chromosome and [GATA]n in the X escape regionDepartment of Cell Biology, University of Massachusetts Medical School, Worcester, Massachusetts 01655, USA
Most of the human genome encodes neither protein nor known functional RNA, yet available approaches to seek meaningful information in the "noncoding" sequence are limited. The unique biology of the X chromosome, one of which is silenced in mammalian females, can yield clues into sequence motifs involved in chromosome packaging and function. Although autosomal chromatin has some capacity for inactivation, evidence indicates that sequences enriched on the X chromosome render it fully competent for silencing, except in specific regions that escape inactivation. Here we have used a linguistic approach by analyzing the frequency and distribution of nine base-pair genomic "words" throughout the human genome. Results identify previously unknown sequence differences on the human X chromosome. Notably, the dinucleotide repeats [AT]n, [AC]n, and [AG]n are significantly enriched across the X chromosome compared with autosomes. Moreover, a striking enrichment (>10-fold) of [GATA]n is revealed throughout the 10-Mb segment at Xp22 that escapes inactivation, and is confirmed by fluorescence in situ hybridization. A similar enrichment is found in other eutherian genomes. Our findings clearly demonstrate sequence differences relevant to the novel biology and evolution of the X chromosome. Furthermore, they implicate simple sequence repeats, linked to gene regulation and unusual DNA structures, in the regulation and formation of facultative heterochromatin. Results suggest a new paradigm whereby a regional escape from X inactivation is due to the presence of elements that prevent heterochromatinization, rather than the lack of other elements that promote it.
The inactive X chromosome in mammalian females provides a singular opportunity to study putative sequences involved in the structural and functional transformation of essentially a whole chromosome. The XIST gene on one X chromosome produces a stable nuclear RNA that coats the chromosome, thereby initiating a cascade of chromatin remodeling that permanently silences the chromosome. (for review, see Chadwick and Willard 2003
Sequence motifs involved in chromosome structure or regulation would likely be highly represented throughout the genome, and may comprise motifs that are difficult to discriminate from mere "junk." It was long ago suggested that repetitive sequences may be involved in promoting chromosome inactivation (Gartler and Riggs 1983 In this study, we have taken a different strategy; we have searched for any motifs that are abundant, widely distributed, and specifically enriched on the X chromosome. This will identify specific sequences relevant to X chromosome biology, which may be implicated in such basic processes as chromatin folding, regulation of heterochromatic or euchromatic domains, and recombination. The analyses were performed on genomic sequence masked for known interspersed repeat families (e.g., LINEs, SINEs, and LTRs). Although the copious interspersed repeats may well contribute to genome function, their presence in this analysis would obscure other repeated motifs.
To accomplish this we used a linguistic approach, counting the occurrences and distribution of nine base-pair words in the genomic sequence of all individual human chromosomes, with focus on the X chromosome. We divided the X chromosome into two regions: XE, a 7.5-Mb region at Xp22 that includes the pseudoautosomal region and escapes X inactivation (Carrel and Willard 2005
Overview analysis of small word frequenciesin the whole genome There are 131,072 possible nine base complementary word pairs derived from four letters (ACGT). In the masked human genome ( 1500 Mb), each word would be present roughly 11,000 times in the genome, or 7.6 times per Mb, if the frequency of words were random. Figure 1 illustrates the distribution of word frequencies observed. Each possible word is present in the human genome, but there was an extremely wide range of frequencies. While the ubiquitous [A]9 occurs 472,658 times, words containing multiple copies of the underrepresented dinucleotide CpG tend to be quite rare; CGTACGTCG occurs only 47 times in the entire genome. The unique nature of CpG containing words is apparent as they form distinguishable peaks on a frequency distribution histogram (Fig. 1). Of the 10,134 word pairs that occur at low frequencies (between 1 and 1.5 copies per Mb), 96% contain exactly one CpG. In contrast, of 20,765 words that occur at 10-fold higher frequencies, only 0.06% contain any CpG.
The other major word class present at frequencies notably deviant from normal frequency distribution consists of words derived from simple sequence repeats (SSRs) or microsatellites (repeats of 16-bp units). For example, ATATATATA occurs 604 times per MB, in contrast to the median 9mer word frequency of 5.76 words/MB. The abundance and wide distribution of SSRs, which comprise 3% of the genome (Lander et al. 2001
Is bulk X chromosome sequence more different from autosomes than individual autosomes are from each other?
Enrichment of specific SSRs on XS vs. autosomes We adapted our search method to not only identify words with unusual frequencies on X, but to favor more common and widely distributed motifs, screening out those repeated at one or a few sites and rare words, since the lower the copy number of a sequence, the less meaningful enrichment on any individual chromosome would be. Therefore, we ranked words by the difference in word density (words/Mb) on XS and XE vs. autosomes, which takes both abundance and enrichment into account. We also imposed a coefficient of variance cut-off to verify broad physical distribution (see Methods). Under these constraints, the most enriched words on XS relative to autosomes are three dinucleotide repeats, [AT]n, [AC]n, and [AG]n. These are enriched between 1.2 and 1.5 times on the X chromosome. This is more striking when one considers that the copy number is very high already on the autosomes. For example, there are 590 words per MB for [AT]n on autosomes compared with about 900 per MB on XS, with quite uniform distribution. Since the X chromosome is rather A/T rich and [AT]n showed more variation among individual autosomes than did AG or AC, we compared the [AT]n density relative with A/T content for all chromosomes. XS and XE (as well as Y) are clearly outliers in terms of [AT]n enrichment, even when compared with similarly A/T-rich, gene-poor chromosomes (Fig. 2); thus, A/T content does not account for this enrichment.
We also performed comparisons of the deviance of the dinucleotide motif word densities from the autosomal mean among all individual chromosomes, (Fig. 3). The X chromosome, both XS and XE, is strikingly more enriched than any other individual autosome for these three dinucleotides. Unlike [AT]n, the [AC]n and [AG]n dinucleotides do not vary with gene density or A/T content, and they are present at similar levels on each of the autosomes, with the exception of chromosome 19, which is anomalous in other ways; it is the most gene-rich chromosome (Grimwood et al. 2004
Comparison of XE vs. XS In considering the biological significance of any enrichment on XS compared with autosomes, it is important to also consider its distribution on XS relative to XE. Although XE and XS occupy the same physical chromosome, their transcriptional behavior under silencing conditions is quite different. The XE region as defined here is one continuous 7.5-Mb block, in which all of the genes escape inactivation, whereas throughout the 145 Mb of XS, most, but not all genes are silenced. The XE region also includes the important pseudoautosomal region (PAR), which is homologous to the PAR on the Y chromosome with which it engages in meiotic recombination in males. Given that this region undergoes recombination in both sexes, similar to an autosome, it provides clues as to whether any differences in the repeat content of the X chromosome might be explained by the more limited recombination of XS. In this respect, it is important to note that the enrichment of dinucleotide repeats is not restricted to the XS region, but is also seen in XE (Fig. 3). This does not preclude the possibility that this motif could have some role related to X inactivation, particularly since we did not find a spatial difference between escape and silenced genes relative to the XIST RNA territory (C.M. Clemson, L.L. Hall, and J.B. Lawrence, in prep.). However, it does indicate that escape of the XE region from inactivation is not due to depletion of dinucleotide repeats. The fact that both XE and XS are enriched for dinucleotide repeats makes the important point that the enrichment is not easily explained by differences in the rate of recombination of XS and autosomes.
Only eight words that can be described with three motifs are significantly (P' < 0.01) enriched on XS as compared with XE: [AAGGC]n, CCCACCCC, and [CAG]n. However, [AAGGC]n and CCCACCCC are excluded based on their coefficient of variance (V > 100) due to the fact that they are localized large-tandem repeats on XS. The CAG repeat has approximately a twofold enrichment and more even distribution, [CAG]n, but it encodes polyglutamine common in proteins and is highly correlated with gene density, so the enrichment on XS can be attributed to greater gene density than XE (data not shown). Therefore, it is notable that our search found no 9mer or larger words that met the criteria for abundance and distribution that were significantly enriched on XS over XE. This finding is further noteworthy because it contrasts with the distribution of LINE L1 elements in unmasked genomic sequence, which we confirm are enriched (in unmasked sequence) on XS vs. XE (Bailey et al. 2000 The most striking word frequency difference turned out to be a marked enrichment of a specific motif on XE as compared not only with XS, but also to autosomes. The 9mer word frequency analysis revealed that words representing the tetramer repeat [GATA]n are overwhelmingly enriched in XE. While there is some enrichment also for the [ATCC]n repeat, this was less pronounced and also found in specific segments of other chromosomes (J.A. McNeil and J.B. Lawrence, in prep.), whereas the [GATA]n was enriched on XE over all other autosomes and XS. The [GATA]n repeats constitute a remarkable feature of the XE region, being almost 12 times more frequent on XE than the autosomal mean. Although the X chromosome in its entirety initially appears somewhat enriched with this sequence, about 1.3 times the autosomal average, when one considers our finding that [GATA]n frequency shows an inverse correlation with gene density, the XS is not enriched for [GATA]n over autosomes with similarly low-gene density, in contrast to the dramatic enrichment on XE (Fig. 4).
We scanned the RepBase libraries (which is used by RepeatMasker) for human LINES and SINES and found that these interspersed repeats generally do not have GATA, although some do contain dinucleotide repeats. To ensure that the distribution differences seen for these SSRs was not related to differences in distribution of interspersed repeats, a limited analysis on unmasked sequence involving XE, XS, and chromosome 7 was performed. The same patterns were seen and the enrichment on XS and XE were still significant.
Detailed analysis of [GATA]n distribution
In hybridizations to male metaphase spreads, a less-intense [GATA]n signal is also seen on the Y chromosome in the pseudoautosomal region at Yp11.3 (Fig. 5C), and to a lesser degree at Yq11, which is populated by many genes and pseudogenes with homologous partners located in XE. The strata included in XE originally were part of a larger PAR, which was shared by X and Y. These strata became independent of the PAR (and no longer able to recombine in male meiosis) due to rearrangements on the Y that translocated much of the sequence from Yp to Yq (Lahn and Page 1999
We also examined the distribution of [GATA]n at high resolution along the X chromosome by searching for perfect incidences of GATA repeats of any length, rather than nine base words, and creating a physical map (Fig. 6). This analysis shows that [GATA]n is present at a large number of sites in microsatellites scattered throughout the XE segment. [GATA]n is present both near genes and in intergenic regions, with no apparent relationship between the orientation of genes and [GATA]n/[TATC]n. The enrichment of [GATA]n covers
Enrichment of GATA in chimp and dog pseudoautosomal regions Finally, we examined whether the GATA enrichment seen in the escape region of the human X chromosome is also present in the analogous region of other mammalian species. In this case, the mouse model is less informative because mice (Mus musculus) have a much smaller PAR of different evolutionary origin (Perry et al. 2001
More informative is the analysis of dogs and chimps, because they have pseudoautosomal regions similar to the human. While not fully characterized, some evidence has shown that genes in this region (in both dog and chimp) escape inactivation (Jegalian and Page 1998
Despite the enormous success in identifying conventional genes within the human genome, knowledge of how to relate "noncoding" genomic sequence to the structure and function of a chromosome is at a primitive stage. Using an open-ended word frequency approach, we identified distinctive sequence features on the X chromosome; these provide new clues to the unique biology of this chromosome, and to the potential role of certain "junk" DNA. Recently there has been increased interest in the abundant microsatellites throughout the human and other genomes (Subramanian et al. 2003b
In our view, the complex biology of chromosome inactivation (and escape from it) is unlikely to be controlled by or dependent upon any one sequence element. For example, there may be motifs that support the propagation of XIST RNA along the chromosome, others that help retain XIST RNA, and still others required for chromatin modifications, DNA methylation, or architectural changes. Substantial attention has been paid to the proposed involvement of LINE L1 elements in these activities (Gartler and Riggs 1983
A critical consideration, however, is the pattern of enrichment on XS relative to XE and autosomes that should be expected for candidate sequences involved in X inactivation. While it has generally been presumed that the region at Xp22.222.3 would be similar to an autosome and have lower levels of putative "X inactivation motifs," autosomes show substantially more competence for inactivation when in cis with the XIST gene than the XE region, which is markedly resistant. The most dramatic chromosomal sequence difference identified by our comprehensive search was the
It is also important to consider the impact of recombination differences, since most of the X chromosome does not recombine in male meiosis, which may impact the evolution of sequence content. The enrichment of L1 LINES on XS over XE, and their enrichment on Y, could be consistent with the possibility that they accumulate due to lower recombination on XS/Y (Smit 1999
Although there has been substantial interest in identifying sequences involved in mammalian chromosome inactivation, the dinucleotide repeat enrichment of the human X chromosome was not previously recognized. In fact, microsatellites were suggested to be underrepresented (Jarne et al. 1998
Subramanian et al. (Subramanian et al. 2003a
A remarkable feature of the sequence elements identified here is that they all have the capacity to form nonstandard DNA forms, and there is substantial literature linking them to gene regulation. The [GATA]n repeat satisfies the consensus-binding-site motif (WGATAR) for the GATA transcription factor family involved in gene regulation (Patient and McGhee 2002
The dinucleotide motifs also have unusual and labile physical properties, with dual effects of repression or activation. [AG]n has been strongly linked to chromatin regulation and is a polypurine/polypyrimidine motif (PP) capable of forming triplex DNA (Maueler et al. 1998
[AT]n and [AC]n have also been linked to specific gene regulation (for example, Rothenburg et al. 2001 While we focus here on the X chromosome, it is likely that the sequence elements involved in X inactivation are present at significant levels on other chromosomes where they may be involved in the widespread formation of facultative heterochromatin in different regions of the human genome that occurs throughout development. The ubiquitous nature of SSRs throughout the human and other genomes is often taken as indicative of mere "junk." Yet, in human language, common words such as "to" or "the" are critical for syntax, modifying the meaning of more specific, but less common words (e.g., "to puzzle" vs. "the puzzle"). We suggest that common motifs such as [GATA]n and other SSRs are candidates for common words in the human genome that modify the structure and function of chromosomal domains. Thus, the concept of a "regulatory microsatellite" may apply not only to specific instances of individual genes, but more broadly to the regulation of heterochromatin and euchromatin throughout the genome.
Genomic sequences used for word-frequency analysis were derived from NCBI release B33. Two regions of the X chromosome, designated XE and XS, were treated as separate chromosomes. The 7.5-MB XE region includes PAR-1 and most of the evolutionary stratum 4 (defined by Lahn and Page 1999 2 distribution, and P' < 0.01 significance cutoff. The P-values were corrected to compensate for the large number of tests being performed (P' = 1-(1-P)k). Distribution analysis on XS was performed by determining the individual word frequencies in 1-MB bins along the chromosome. The coefficient of variance ((stdev*100)/mean) of bin word frequencies was calculated and a cutoff value (V < 100) was used to eliminate words in highly localized satellites. Words were ranked by the difference between regional (XS, XE) word densities and autosomal word density (words/million words). These analyses were performed on a dual processor pentium III computer running GNU/Linux using custom scripts written in GAWK (release 3.1.1).
For fluorescence in situ hybridization, biotinylated oligonucleotide probes were hybridized (5 pM/uL probe in 5% formamide, 2XSSC, 37°C, overnight) to metaphase preparations of normal human peripheral blood lymphocytes that were denatured in 70% formamide, 2XSSC for 2 min. Hybridization was performed as previously described (Clemson et al. 1996
We recognize the contributions of Meg Byron, Kevin Byron, and John Butler who performed the in situ hybridizations; and David LaPointe and Stephen Baker for assistance with the statistical analysis. We thank Joel Graber for helpful discussions in the initial phases of this project. This work was funded by NIH GM68138 and NIH GM053234
1 Corresponding author.
E-mail jeanne.lawrence{at}umassmed.edu; fax (508) 856-5178. [Supplemental material is available online at www.genome.org.] Article published online ahead of print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4627606
Bailey J.A., Carrel L., Chakravarti A., Eichler E.E. 2000. Molecular evidence for a relationship between LINE-1 elements and X chromosome inactivation: The Lyon repeat hypothesis. Proc. Natl. Acad. Sci. 97: 66346639. Bode J., Kohwi Y., Dickinson L., Joh T., Klehr D., Mielke C., Kohwi-Shigematsu T. 1992. Biological significance of unwinding capability of nuclear matrix-associated DNAs. Science 255: 195197. Brown C.J. and Greally J.M. 2003. A stain upon the silence: Genes escaping X inactivation. Trends Genet. 19: 432438.[CrossRef][Medline] Burgoyne P.S. 1982. Genetic homology and crossing over in the X and Y chromosomes of mammals. Hum. Genet. 61: 8590.[CrossRef][Medline] Carrel L. and Willard H.F. 2005. X-inactivation profile reveals extensive variability in X-linked gene expression in females. Nature 434: 400404.[CrossRef][Medline] Carrel L., Cottle A.A., Goglin K.C., Willard H.F. 1999. A first-generation X-inactivation profile of the human X chromosome. Proc. Natl. Acad. Sci. 96: 1444014444. Chadwick B.P. and Willard H.F. 2003. Barring gene expression after XIST: Maintaining facultative heterochromatin on the inactive X. Semin. Cell Dev. Biol. 14: 359367.[CrossRef][Medline] Chureau C., Prissette M., Bourdet A., Barbe V., Cattolico L., Jones L., Eggen A., Avner P., Duret L. 2002. Comparative sequence analysis of the X-inactivation center region in mouse, human, and bovine. Genome Res. 12: 894908. Clemson C.M., McNeil J.A., Willard H.F., Lawrence J.B. 1996. XIST RNA paints the inactive X chromosome at interphase: Evidence for a novel RNA involved in nuclear/chromosome structure. J. Cell Biol. 132: 259275. de Belle I., Cai S., Kohwi-Shigematsu T. 1998. The genomic sequences bound to special AT-rich sequence-binding protein 1 (SATB1) in vivo in Jurkat T cells are tightly associated with the nuclear matrix at the bases of the chromatin loops. J. Cell Biol. 141: 335348. Dickinson L.A., Joh T., Kohwi Y., Kohwi-Shigematsu T. 1992. A tissue-specific MAR/SAR DNA-binding protein with unusual binding site recognition. Cell 70: 631645.[CrossRef][Medline] Ellegren H. 2004. Microsatellites: Simple sequences with complex evolution. Nat. Rev. Genet. 5: 435445.[CrossRef][Medline] Epplen J.T., Kyas A., Maueler W. 1996. Genomic simple repetitive DNAs are targets for differential binding of nuclear proteins. FEBS Lett. 389: 9295.[CrossRef][Medline] Filippova G.N., Cheng M.K., Moore J.M., Truong J.P., Hu Y.J., Nguyen D.K., Tsuchiya K.D., Disteche C.M. 2005. Boundaries between chromosomal domains of X inactivation and escape bind CTCF and lack CpG methylation during early development. Dev. Cell 8: 3142.[CrossRef][Medline] Gartler S.M. and Riggs A.D. 1983. Mammalian X-chromosome inactivation. Annu. Rev. Genet. 17: 155190.[CrossRef][Medline] Greenberg A.J., Yanowitz J.L., Schedl P. 2004. The Drosophila GAGA factor is required for dosage compensation in males and for the formation of the male-specific-lethal complex chromatin entry site at 12DE. Genetics 166: 279289. Grimwood J., Gordon L.A., Olsen A., Terry A., Schmutz J., Lamerdin J., Hellsten U., Goodstein D., Couronne O., Tran-Gyamfi M.et al. 2004. The DNA sequence and biology of human chromosome 19. Nature 428: 529535.[CrossRef][Medline] Hall L.L. and Lawrence J.B. 2003. The cell biology of a novel chromosomal RNA: Chromosome painting by XIST/Xist RNA initiates a remodeling cascade. Semin. Cell Dev. Biol. 14: 369378.[CrossRef][Medline] Hall L.L., Byron M., Sakai K., Carrel L., Willard H.F., Lawrence J.B. 2002a. An ectopic human XIST gene can induce chromosome inactivation in postdifferentiation human HT-1080 cells. Proc. Natl. Acad. Sci. 99: 86778682. Hall L.L., Clemson C.M., Byron M., Wydner K., Lawrence J.B. 2002b. Unbalanced X;autosome translocations provide evidence for sequence specificity in the association of XIST RNA with chromatin. Hum. Mol. Genet. 11: 31573165. Hammock E.A. and Young L.J. 2005. Microsatellite instability generates diversity in brain and sociobehavioral traits. Science 308: 16301634. Heard E. 2004. Recent advances in X-chromosome inactivation. Curr. Opin. Cell Biol. 16: 247255.[CrossRef][Medline] Hendrich B.D., Plenge R.M., Willard H.F. 1997. Identification and characterization of the human XIST gene promoter: Implications for models of X chromosome inactivation. Nucleic Acids Res. 25: 26612671. Herzing L.B.K., Romer J.T., Horn J.M., Ashworth A. 1997. Xist has properties of the X-chromosome inactivation centre. Nature 386: 272275.[CrossRef][Medline] Jarne P., David P., Viard F. 1998. Microsatellites, transposable elements and the X chromosome. Mol. Biol. Evol. 15: 2834.[Abstract] Jegalian K. and Page D.C. 1998. A proposed path by which genes common to mammalian X and Y chromosomes evolve to become X inactivated. Nature 394: 776780.[CrossRef][Medline] Ke X. and Collins A. 2003. CpG islands in human X-inactivation. Ann. Hum. Genet. 67: 242249.[CrossRef][Medline] Kelley R.L., Meller V.H., Gordadze P.R., Roman G., Davis R.L., Kuroda M.I. 1999. Epigenetic spreading of the Drosophila dosage compensation complex from roX RNA genes into flanking chromatin. Cell 98: 513522.[CrossRef][Medline] Kirkness E.F., Bafna V., Halpern A.L., Levy S., Remington K., Rusch D.B., Delcher A.L., Pop M., Wang W., Fraser C.M.et al. 2003. The dog genome: Survey sequencing and comparative analysis. Science 301: 18981903. Kohwi Y. and Kohwi-Shigematsu T. 1991. Altered gene expression correlates with DNA structure. Genes & Dev. 5: 25472554. Lahn B.T. and Page D.C. 1999. Four evolutionary strata on the human X chromosome. Science 286: 964967. Lander E.S., Linton L.M., Birren B., Nusbaum C., Zody M.C., Baldwin J., Devon K., Dewar K., Doyle M., FitzHugh W.et al. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860921.[CrossRef][Medline] Lee J.T. and Jaenisch R. 1997. Long-range cis effects of ectopic X-inactivation centres on a mouse autosome. Nature 386: 275279.[CrossRef][Medline] Lehmann M. 2004. Anything else but GAGA: A nonhistone protein complex reshapes chromatin structure. Trends Genet. 20: 1522.[CrossRef][Medline] Lien S., Szyda J., Schechinger B., Rappold G., Arnheim N. 2000. Evidence for heterogeneity in recombination in the human pseudoautosomal region: High resolution analysis by sperm typing and radiation-hybrid mapping. Am. J. Hum. Genet. 66: 557566.[CrossRef][Medline] Lowenhaupt K., Rich A., Pardue M.L. 1989. Nonrandom distribution of long mono- and dinucleotide repeats in Drosophila chromosomes: Correlations with dosage compensation, heterochromatin, and recombination. Mol. Cell. Biol. 9: 11731182. Lyon M.F. 1998. X-chromosome inactivation: A repeat hypothesis. Cytogenet. Cell Genet. 80: 133137.[CrossRef][Medline] Lyon M.F. 2003. The Lyon and the LINE hypothesis. Semin. Cell Dev. Biol. 14: 313318.[CrossRef][Medline] Marahens Y., Panning B., Dausman J., Strauss W., Jaenisch R. 1997. Xist deficient mice are defective in dosage compensation but not spermatogenesis. Genes & Dev. 11: 156166. Maueler W., Kyas A., Keyl H.G., Epplen J.T. 1998. A genome-derived (gaa.ttc)24 trinucleotide block binds nuclear protein(s) specifically and forms triple helices. Gene 215: 389403.[CrossRef][Medline] Myers S., Bottolo L., Freeman C., McVean G., Donnelly P. 2005. A fine-scale map of recombination rates and hotspots across the human genome. Science 310: 321324. Nayler O., Stratling W., Bourquin J.P., Stagljar I., Lindemann L., Jasper H., Hartmann A.M., Fackelmayer F.O., Ullrich A., Stamm S. 1998. SAF-B protein couples transcription and pre-mRNA splicing to SAR/MAR elements. Nucleic Acids Res. 26: 35423549. Ohno M., Fukagawa T., Lee J.S., Ikemura T. 2002. Triplex-forming DNAs in the human interphase nucleus visualized in situ by polypurine/polypyrimidine DNA probes and antitriplex antibodies. Chromosoma 111: 201213.[Medline] Patient R.K. and McGhee J.D. 2002. The GATA family (vertebrates and invertebrates). Curr. Opin. Genet. Dev. 12: 416422.[CrossRef][Medline] Perry J., Palmer S., Gabriel A., Ashworth A. 2001. A short pseudoautosomal region in laboratory mice. Genome Res. 11: 18261832. Rich A. and Zhang S. 2003. Timeline: Z-DNA: The long road to biological function. Nat. Rev. Genet. 4: 566572.[Medline] Ross M.T., Grafham D.V., Coffey A.J., Scherer S., McLay K., Muzny D., Platzer M., Howell G.R., Burrows C., Bird C.P.et al. 2005. The DNA sequence of the human X chromosome. Nature 434: 325337.[CrossRef][Medline] Rothenburg S., Koch-Nolte F., Rich A., Haag F. 2001. A polymorphic dinucleotide repeat in the rat nucleolin gene forms Z-DNA and inhibits promoter activity. Proc. Natl. Acad. Sci. 98: 89858990. Sharp A.J., Spotswood H.T., Robinson D.O., Turner B.M., Jacobs P.A. 2002. Molecular and cytogenetic analysis of the spreading of X inactivation in X;autosome translocations. Hum. Mol. Genet. 11: 31453156. Silva J., Mak W., Zvetkova I., Appanah R., Nesterova T.B., Webster Z., Peters A.H., Jenuwein T., Otte A.P., Brockdorff N. 2003. Establishment of histone h3 methylation on the inactive X chromosome requires transient recruitment of Eed-Enx1 polycomb group complexes. Dev. Cell 4: 481495.[CrossRef][Medline] Singh L. and Jones K.W. 1982. Sex reversal in the mouse (Mus musculus) is caused by a recurrent nonreciprocal crossover involving the X and an aberrant Y chromosome. Cell 28: 205216.[CrossRef][Medline] Smit A.F. 1999. Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr. Opin. Genet. Dev. 9: 657663.[CrossRef][Medline] Strutt H., Cavalli G., Paro R. 1997. Co-localization of Polycomb protein and GAGA factor on regulatory elements responsible for the maintenance of homeotic gene expression. EMBO J. 16: 36213632.[CrossRef][Medline] Subramanian S., Mishra R.K., Singh L. 2003a. Genome-wide analysis of Bkm sequences (GATA repeats): Predominant association with sex chromosomes and potential role in higher order chromatin organization and function. Bioinformatics 19: 681685. Subramanian S., Mishra R.K., Singh L. 2003b. Genome-wide analysis of microsatellite repeats in humans: Their abundance and density in specific genomic regions. Genome Biol. 4: R13.[CrossRef][Medline] Tam R., Shopland L.S., Johnson C.V., McNeil J.A., Lawrence J.B. In Applications of RNA FISH for visualizing gene expression and nuclear archetecture. . 2002. Oxford University Press, New York. Tilford C.A., Kuroda-Kawaguchi T., Skaletsky H., Rozen S., Brown L.G., Rosenberg M., McPherson J.D., Wylie K., Sekhon M., Kucaba T.A.et al. 2001. A physical map of the human Y chromosome. Nature 409: 943945.[CrossRef][Medline] Vakoc C.R., Letting D.L., Gheldof N., Sawado T., Bender M.A., Groudine M., Weiss M.J., Dekker J., Blobel G.A. 2005. Proximity among distant regulatory elements at the Wen J., Huang S., Rogers H., Dickinson L.A., Kohwi-Shigematsu T., Noguchi C.T. 2005. SATB1 family protein expressed during early erythroid differentiation modifies globin gene expression. Blood 105: 33303339. White W.M., Willard H.F., Van Dyke D.L., Wolff D.J. 1998. The spreading of X inactivation into autosomal material of an X;autosome translocation: Evidence for a difference between autosomal and X-chromosomal DNA. Am. J. Hum. Genet. 63: 2028.[CrossRef][Medline] Wolff D.J., Schwartz S., Montgomery T., Zackowski J.L. 1998. Random X inactivation in a girl with a balanced t(X;9) and an abnormal phenotype. Am. J. Med. Genet. 77: 401404.[CrossRef][Medline] Wutz A. and Jaenisch R. 2000. A shift from reversible to irreversible X inactivation is triggered during ES cell differentiation. Mol. Cell 5: 695705.[CrossRef][Medline] Yasui D., Miyano M., Cai S., Varga-Weisz P., Kohwi-Shigematsu T. 2002. SATB1 targets chromatin remodelling to regulate genes over long distances. Nature 419: 641645.[CrossRef][Medline]
Received August 30, 2005; accepted in revised format January 17, 2006. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||