|
|
|
|
Genome Res. 17:798-806, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00 OPEN ACCESS ARTICLE Letter Analysis of overrepresented motifs in human core promoters reveals dual regulatory roles of YY11 Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA; 2 Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215, USA
A set of 723 high-quality human core promoter sequences were compiled and analyzed for overrepresented motifs. Beside the two well-characterized core promoter motifs (TATA and Inr), several known motifs (YY1, Sp1, NRF-1, NRF-2, CAAT, and CREB) and one potentially new motif (motif8) were found. Interestingly, YY1 and motif8 mostly reside immediately downstream from the TSS. In particular, the YY1 motif occurs primarily in genes with 5'-UTRs shorter than 40 base pairs (bp) and its locations coincide with the translation start site. We verified that the YY1 motif is bound by YY1 in vitro. We then performed detailed analysis on YY1 chromatin immunoprecipitation data with a whole-genome human promoter microarray (ChIP-chip) and revealed that the thus identified promoters in HeLa cells were highly enriched with the YY1 motif. Moreover, the motif overlapped with the translation start sites on the plus strand of a group of genes, many with short 5'-UTRs, and with the transcription start sites on the minus strand of another distinct group of genes; together, the two groups of genes accounted for the majority of the YY1-bound promoters in the ChIP-chip data. Furthermore, the first group of genes was highly enriched in the functional categories of ribosomal proteins and nuclear-encoded mitochondria proteins. We suggest that the YY1 motif plays a dual role in both transcription and translation initiation of these genes. We also discuss the evolutionary advantages of housing a transcriptional element inside the transcript in terms of the migration of these genes in the human genome.
The core promoter, consisting of 100 bp flanking the transcription start site (TSS), plays an essential role in transcriptional initiation. It facilitates the assembly of the transcription initiation complex around the TSS. The TATA box is the best-characterized motif in this region (Smale and Kadonaga 2003
Typical mammalian transcription regulatory motifs span only 612 bp and are degenerate. When the total base pair of input sequences is large, it is difficult to distinguish the real regulatory motifs in them from random short sequences. Fortunately, core promoters consist of only a short stretch of sequences flanking the transcription start sites. This drastically reduces the sequence search space. Recent efforts in large-scale sequencing of human full-length cDNAs have provided a unique opportunity to identify the precise positions of TSSs. In this study, we extracted high-quality human core promoter sequences (defined as 70 to +50 bp around the TSS throughout this study) from the Database of Transcription Start Sites (DBTSS, http://dbtss.hgc.jp) and analyzed them with the motif-finding program MEME (Bailey and Elkan 1994
One identified motif was particularly interesting. It matched the YY1-binding consensus; however, it occurred downstream from the TSS and often overlapped with the translation start site. We demonstrated experimentally that it could be recognized by YY1 in vitro. The similarity between this YY1 motif and the Kozak sequence (Kozak 1984
We first analyzed overrepresented motifs found in core promoters in terms of their consensus, positional specificity, co-occurrence with TATA, and evolutionary conservation. Then, one overrepresented motif, YY1, was characterized in great detail by using both experimental and computational approaches.
Overrepresented core promoter motifs
All 10 motifs showed positional specificity with respect to the TSS (Table 1). TATA and Inr, as expected, showed extremely strong positional specificity. Similar levels of positional specificity were also observed for YY1 and motif8 and both peaked immediately downstream from the TSS. Co-occurrence of these motifs with TATA was analyzed in the 723 promoter sequences (Supplemental Table 1). YY1, NRF-1, NRF-2, and motif8 showed significantly lower-than-expected co-occurrence with TATA. In the extreme case of NRF-2, only two cases of co-occurrence were observed as opposed to the expected 13 cases. Thus, these motifs might assist transcription initiation in TATA-less genes. The statistics for other motifs including CAAT and CREB were insignificant, possibly due to their lower abundance in the sequence set. All 10 motifs found in humans were also overrepresented in a set of 1849 high-quality mouse core promoter sequences from DBTSS with >10 mapped cDNA sequences. As conservation often suggests functional importance, overrepresentation observed in both humans and the mouse supports the functional relevance of these motifs in core promoters.
The YY1 motif overlaps with the Kozak sequence
The occurrence of the YY1 motif correlates with short 5'-UTR
YY1 sites are conserved between human and mouse Searching against DBTSS, we found 168 genes with a 5'-UTR shorter than 30 bp and a YY1 motif spanning from 4 to +8 bp relative to the translation start site. These 12 nucleotide positions are significantly more conserved between humans and the mouse in the 168 genes than in a set of randomly chosen genes that do not contain YY1 sites (Fig. 2) (P = 0.001 according to the paired Students t-test). In particular, 15%20% fewer mismatches were observed at the 3, 2, and 1 positions for the former set, while much smaller differences were observed for positions outside of the YY1 site (12 to 5 and +9 to +15; data not shown). The additional conservation observed in the YY1 motif-matching region around the translation start site further suggests its function beyond a translation initiation motif.
Predicted YY1 sites are recognized by the YY1 protein in vitro The consensus of the YY1 motif (CAAGATGGCGGC) derived from MEME analysis matches perfectly the reverse complement of the previously reported YY1-binding site consensus GC CATCTTG (Shrivastava and Calame 1994
Genome-wide chromatin immunoprecipitation with HeLa cells indicates that YY1 binds to YY1 motifs in two distinct modes To study YY1-binding sites in living cells, we analyzed YY1 ChIP-chip data on a human genome-wide promoter array (B. Ren, pers. comm.). A total of 24,135 promoters were probed by the array. Among them, 765 showed significant YY1 binding (called ChIP hits), indicated by ChIP signal two standard deviations above the mean. We searched the YY1 PSSM (generated with MEME on the 723 high-quality DBTSS promoters as described above) (Table 1) against all 24,135 promoters. Only the regions covered by the probes were searched, typically 1300 to +200 bp around the TSS. A 10-fold enrichment of the YY1 motif was found on the plus strand of the ChIP hits when compared with non-hits (Table 2A). Such a highly significant enrichment (P = 2.2 x 1016) cannot be trivially explained by a general enrichment of genuine core promoters in the ChIP hits, as no or only slight enrichments (one- to twofold) were observed for three other core promoter motifs, TATA, NRF-2, and CREB (Table 2C,D,E). Surprisingly, a lower but still highly significant fivefold enrichment of the YY1 motif was also observed on the minus strand of the ChIP hits (Table 2B). Of the 723 high-quality DBTSS promoters, 535 were probed by ChIP-chip. These included 47 promoters that contained the YY1 motif, of which 42% were ChIP hits, a fivefold enrichment over the promoters that do not contain the YY1 motif (Table 2F) (P = 2.9 x 1012).
When analyzed for the positional distribution of the YY1 motif in these 765 ChIP hits, different positional specificities were observed on plus and minus strands. On the plus strand, the YY1 motif peaks at +1 to +20 bp downstream the TSS, while on the minus strand it peaks at 40 to +1 bp, mostly upstream from the TSS (Fig. 4; Supplemental Fig. 5). Furthermore, when we averaged the ChIP signal of all ChIP hits that contain YY1 sites on the plus strand, we observed a single peak at +30 to +80 bp downstream of the TSS (thick black curve in Fig. 5), while the average ChIP intensity generated with ChIP hits that contain the YY1 site on the minus strand had a broader peak at 60 to 10 bp upstream the TSS (thin black curve in Fig. 5). This clear correspondence between the positional distributions of the YY1 motif and the YY1 ChIP signal indicates that YY1 is capable of binding to the predicted motif in both orientations in living cells, and these are two distinct binding modes.
Furthermore, the genes corresponding to the ChIP hits with YY1 sites on the plus strand tend to have short 5'-UTRs: 34% (102) of these 294 genes have 5'-UTRs shorter than 40 bp (Fig. 6). This trend is even more pronounced for ChIP hits with YY1 sites overlapping the translational start site on the plus strand: 78% (87) of these 109 genes have 5'-UTRs shorter than 40 bp. In contrast, only 10% of the genes corresponding to ChIP hits with YY1 sites on the minus strand have short 5'-UTRs, at the same level as random. The plus-strand mode is consistent with, although with a lower percentage of short 5-UTRs, our earlier observation in the 723 high-quality DBTSS promoters, where 85% of the genes with downstream YY1 sites have 5'-UTRs shorter than 40 bp (Fig. 1). The higher percentage for the DBTSS promoters likely reflects their higher TSS mapping accuracy than the promoters on the microarray, for which the TSS coordinates were obtained from RefSeq. Indeed, when the TSS coordinates from DBTSS were used (available for 188 of the 294 ChIP hits with YY1 sites on the plus strand), the percentage of genes with <40 bp 5'-UTRs rose to 47% (Supplemental Fig. 6). The positional specificity of the YY1 motif also became sharper when the TSS coordinates from DBTSS were used (cf. Supplemental Figs. 5 and 4).
YY1 PSSM can predict accurately YY1-binding promoters identified by ChIP Four hundred (52%) of the 765 ChIP hits contained a YY1 site on the plus or minus strand. For the remaining 365 ChIP hits, no other obvious motif was found to be enriched in the probe-covered regions when a MEME search was carried out. Some of these promoters might contain weak YY1 sites, as we used a stringent P-value cutoff (<105) for matching the YY1 PSSM. Indeed, when we relaxed the cutoff slightly to 4x105, 534 (70%) of the 765 ChIP hits contained YY1 sites on either strand. The average ChIP signal for these 365 promoters seems to peak even further downstream from the TSS (black curve in Fig. 6). This suggests that additional proteins might be involved in assisting the binding of YY1.
Also of interest are the false-positive rates of the YY1 PSSM at various sensitivity levels. We scanned the 10,000 promoters with the weakest YY1 ChIP signals for YY1 sites at a series of P-value cutoffs. The percentage of promoters that contained at least one YY1 site was defined as the false-positive rate and plotted against the sensitivity at the same P-value cutoff (Fig. 7). On average, there are twice as many YY1 sites on the plus strand (the curve labeled with "YY1") than on the minus strand (the curve labeled with "YY1r," where r indicates reverse). When both strands are considered (the curve labeled with "YY1|YY1r"), a genome-wide specificity of 96.5% is achieved at a sensitivity of 52%, indicating that the YY1 PSSM is highly predictive of YY1 binding in living cells. One might expect that top ChIP hits can be used to optimize the YY1 PSSM in an iterative fashion. We attempted this with the ROVER algorithm (Haverty et al. 2004b
Biological functions of genes with YY1 sites We performed functional enrichment analysis on the genes corresponding to the 765 ChIP hits after dividing them into three sets: genes with YY1 sites on the plus strand (220 annotated), genes with YY1 sites on the minus strand (87 annotated), and genes without YY1 sites (189 annotated). For set 3, a loose P-value cutoff (<0.0001) was used to exclude genes with possible YY1 motif matches on either strand. We searched for the enrichment of any GO terms in each set of genes using GoStat (Beissbarth and Speed 2004
Genes with YY1 sites on the plus strand are highly enriched in the GO term mitochondrion (Table 3). Specifically, 36 genes in set 1 are encoded in the nuclear genome, but their protein products function in the mitochondria, including mitochondrial membrane proteins, enzymes, and a large number of ribosomal proteins. Gene sets 2 and 3 are only marginally enriched in this term. For the genes with YY1 sites on the minus strand, several ribosome-related terms were highly enriched; however, most of them were also among the most enriched terms in gene sets 1 and 3. Only the genes without YY1 sites were enriched in the term RNA binding. Genes annotated with this term are different from the ribosomal proteins found in gene sets 1 and 2; the former are involved in RNA post-transcriptional processing or RNA metabolism, including polyadenylation factors, splicing factors, and RNAases. We note that these enriched functional terms collectively account for only
Accurate TSS mapping and focused search in core promoters led to the discovery of high-quality transcription-factor binding motifs Transcription-factor binding sites are in general short and degenerate. The difficulty in mapping the TSS accurately further complicates the effort in finding core promoter motifs. Our strategy was to use only sequences with accurate TSS locations. This allowed us to interrogate the short stretch of sequences (from 70 to +50 bp) that were most likely to cover the real core promoters. Our approach boosted the core promoter element signal and decreased background noise. We identified eight previously known motifs (TATA, Inr, YY1, Sp1, CAAT, NRF-1, NRF-2, and CREB) and one potentially novel motif. FitzGerald et al. analyzed the positional distribution of all 8-mers around the putative TSSs of 13,010 human genes and reported eight overrepresented motifs and the Kozak sequence (FitzGerald et al. 2004
We estimated that >90% of the 723 high-quality human promoters studied would have their actual TSS located within the 120-bp window. The recent CAGE study by the RIKEN group indicates that CpG promoters tend to have a broad TSS distribution, with variation in TSS positions up to 50 bp (Carninci et al. 2006
The predicted YY1 motif was confirmed by EMSA and ChIP-chip data We were prompted to investigate the distribution of YY1 sites in nonpromoter regions, as the probes on the promoter array covered only 36 M base pairs in total. The YY1 PSSM could detect 50% of ChIP hits at the 97% specificity of rejecting non-YY1 promoters genome-wide. At the same stringency cutoff, 89,000 YY1 sites were found in the 1.5 G-base-pair nonrepetitive portion of the human genome. These many sites are expected at the corresponding 97% specificity, indicating that there is no under-representation of YY1 sites outside promoters. Nonetheless, there is a strong enrichment of YY1 sites in the ±100-bp window around the TSS: 6.3 folds on the plus strand and 3.4 folds on the minus strand. This is compared with the even greater enrichment of YY1 sites observed for ChIP hits vs. non-hits: 11 folds on the plus strand and 5.8 folds on the minus strand (Table 2A,B). Thus, the YY1 motif can explain YY1 binding in living cells, although not entirely, and there are other contributing factors, possibly chromatin structure or the interaction of YY1 with other transcription regulators.
ChIP-chip data reveals two binding modes of YY1, both consistent with YY1 motif enrichment This study represents a systematic assessment of YY1-binding promoters genome-wide in HeLa cells. Our results indicate that the promoters of >3% of the human genes in HeLa cells have detectable YY1 binding with ChIP-chip. We do not observe significant overlap between the ChIP hits of YY1 and SUZ12 in ENCODE regions (The ENCODE Project Consortium 2007). Instead, there is a significant overlap between the ChIP hits of YY1 and TAF1 (a component of TBP) in HeLa cells, with the fold of enrichment even higher than that between TAF1 and POLR2A (16 vs. 13 folds; ENCODE data). These results suggest that YY1 binding frequently indicates active transcription. We extrapolate to speculate that YY1 is more often an activator than a repressor. At least 50% and likely 70% of ChIP hits can be accounted for by the YY1 motif. These ChIP hits are divided into two classes: 2/3 of them contain YY1 sites on the plus strand and 1/3 of them contain YY1 sites on the minus strand. For these two classes, a clear difference in positional distribution of YY1 sites was observed (Fig. 4). Similar difference in positional specificity was also observed for the two classes in average ChIP signal (Fig. 5). The dominance of downstream plus-strand binding mode and the overlap of the YY1 site and the translational start site are novel findings in this work. When occurring on the minus strand, YY1 sites often overlap with the TSS, at the same location as Inr. This is consistent with the previously reported initiator function of YY1. Our results further previous studies by indicating that YY1 may initiate the transcription of many more human genes than previously known. The list of 18 genes for which the YY1 site overlaps Inr is provided in Supplemental Materials. When occurring on the plus strand, a significant portion of the YY1 sites overlap with the Kozak sequence, in which case the associated genes frequently have short 5'-UTRs (<40 bp). In contrast, the ChIP hits with YY1 sites on the minus strand are not enriched with genes with short 5'-UTRs (Fig. 6). Thus, we propose that these downstream plus-strand YY1 sites correspond to a distinct regulatory mechanism, different from the upstream minus-strand sites, or the downstream plus-strand sites in genes with longer 5'-UTRs. The former may also impact translation efficiency. The dual regulatory roles of these sites on both transcription and translation may account for their greater extent of evolutionary conservation than Kozak sequences (Fig. 2). In the next section we speculate on the biological function and evolutionary advantages of these sites.
Biological functions of YY1-regulated genes and speculation on the evolutionary role of YY1
Most strikingly, many nuclear-encoded mitochondria genes have downstream YY1 sites and they tend to have short 5'-UTRs. A similar arrangement was also observed in L1 elements (Becker et al. 1993 There are several intriguing questions associated with the YY1 motif when occurring downstream the TSS. What is the advantage of a downstream core promoter motif compared with a typical upstream one? What is the advantage of having the translation start site overlapping with YY1 motif? Is there any biological significance between the striking similarity of downstream YY1 sites in genes and L1 elements?
It is well known that short 5'-UTRs are indicative of efficient translation initiation (Kozak 1987a
There might be a resemblance between the evolutionary histories of L1 elements and nuclear-encoded mitochondria genes. L1 elements move about the genome via a RNA intermediate by a process termed retrotransposition. Having a downstream core promoter element buried inside the 5'-UTR gives an L1 element the advantage of carrying along its own core promoter during transposition and maintaining an intact core promoter after inserting into a new site in the genome. Nuclear-encoded mitochondrial genes were originally encoded by the mitochondria genome and, at some point of evolution, transferred into the nuclear genome. Little is known about the precise mechanism of the mitochondrial gene transfer, except that this is a highly inefficient process and either RNA or DNA could serve as the transfer intermediate (Blanchard and Lynch 2000
Data sources Various human promoter data sets used in this study are defined and compared in Supplemental Figure 2. The coordinates of human TSSs were downloaded from DBTSS (http://dbtss.hgc.jp; version 4.2). Core promoter sequences from 70 to +50 from the TSS were extracted from UCSC genome browser (http://genome.ucsc.edu) by using genome assembly hg16 and mm3. The accession numbers of the 723 high-quality human promoters are provided in the Supplemental Materials.
Identification of overrepresented motifs
Positional distribution of overrepresented motifs
Conservation of YY1 motif between human and mouse
Defining hits in YY1 ChIP-chip experiment To identify the promoters bound by YY1, or ChIP hits, the hybridization signal of the dye swapping experiments was averaged and associated with the genomic coordinates of the probes. A signal cutoff was set to the mean plus 2.5 standard deviations. A hit was called for a genomic region covered by at least five probes above this cutoff, allowing gaps smaller than 200 bp between consecutive probes. Genes immediately adjacent to the ChIP hits were identified by finding the TSS closest to the boundaries of hits. These genes were later used to analyze 5'-UTR length and enrichment of GO terms.
Positional specificity of ChIP signal
Calculation of 5'-UTR length
We thank Bing Ren for providing us with the YY1 ChIP-chip data set prior to publication. We thank Tom Tullius for letting us use his radioactive facility. We thank The ENCODE Project Consortium for making their data publicly available and Transcriptional Regulation analysis group for providing ChIP-chip and ChIP-sequencing data sets. We thank the Genome Research reviewers for their insightful comments. We thank Jesscia Marie Barros and Enoch Huang for proofreading the manuscript. This work was funded by the ENCODE Consortium grant R01HG03110 from NHGRI, NIH to Z.W.
3 Corresponding author.
E-mail zhiping{at}bu.edu; fax (617) 353-6766. [Supplemental material is available online at www.genome.org.] Article is online at http://www.genome.org/cgi/doi/10.1101/gr.5754707
Athanikar, J.N., Badge, R.M., and Moran, J.V. 2004. A YY1-binding site is required for accurate human LINE-1 transcription initiation. Nucleic Acids Res. 32: 38463855. Bailey, T.L. and Elkan, C. 1994. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc. Int. Conf. Intell. Syst. Mol. Biol. 2: 2836.[Medline] Bailey, T.L., Baker, M.E., and Elkan, C.P. 1997. An artificial intelligence approach to motif discovery in protein sequences: Application to steriod dehydrogenases. J. Steroid Biochem. Mol. Biol. 62: 2944.[CrossRef][Medline] Becker, K.G., Swergold, G.D., Ozato, K., and Thayer, R.E. 1993. Binding of the ubiquitous nuclear transcription factor YY1 to a cis regulatory sequence in the human LINE-1 transposable element. Hum. Mol. Genet. 2: 16971702. Beissbarth, T. and Speed, T.P. 2004. GOstat: Find statistically overrepresented Gene Ontologies within a group of genes. Bioinformatics 20: 14641465. Blanchard, J.L. and Lynch, M. 2000. Organellar genes: Why do they end up in the nucleus? Trends Genet. 16: 315320.[CrossRef][Medline] Burke, T.W. and Kadonaga, J.T. 1996. Drosophila TFIID binds to a conserved downstream basal promoter element that is present in many TATA-box-deficient promoters. Genes & Dev. 10: 711724. Carninci, P., Sandelin, A., Lenhard, B., Katayama, S., Shimokawa, K., Ponjavic, J., Semple, C.A., Taylor, M.S., Engstrom, P.G., Frith, M.C., et al. 2006. Genome-wide analysis of mammalian promoter architecture and evolution. Nat. Genet. 38: 626635.[CrossRef][Medline] Cole, E.G. and Gaston, K. 1997. A functional YY1 binding site is necessary and sufficient to activate Surf-1 promoter activity in response to serum growth factors. Nucleic Acids Res. 25: 37053711. The ENCODE Project Consortium, 2007. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature (in press). FitzGerald, P.C., Shlyakhtenko, A., Mir, A.A., and Vinson, C. 2004. Clustering of DNA sequences in human promoters. Genome Res. 14: 15621574. Hariharan, N., Kelley, D.E., and Perry, R.P. 1991. Delta, a transcription factor that binds to downstream elements in several polymerase II promoters, is a functionally versatile zinc finger protein. Proc. Natl. Acad. Sci. 88: 97999803. Haverty, P.M., Frith, M.C., and Weng, Z. 2004a. CARRIE Web service: Automated transcriptional regulatory network inference and interactive analysis. Nucleic Acids Res. 32: W213W216. Haverty, P.M., Hansen, U., and Weng, Z. 2004b. Computational inference of transcriptional regulatory networks from expression profiling and transcription factor binding site identification. Nucleic Acids Res. 32: 179188. Huber, R., Schlessinger, D., and Pilia, G. 1998. Multiple Sp1 sites efficiently drive transcription of the TATA-less promoter of the human glypican 3 (GPC3) gene. Gene 214: 3544.[CrossRef][Medline] Javahery, R., Khachi, A., Lo, K., Zenzie-Gregory, B., and Smale, S.T. 1994. DNA sequence requirements for transcriptional initiator activity in mammalian cells. Mol. Cell. Biol. 14: 116127. Kozak, M. 1984. Compilation and analysis of sequences upstream from the translational start site in eukaryotic mRNAs. Nucleic Acids Res. 12: 857872. Kozak, M. 1987a. An analysis of 5'-noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. 15: 81258148. Kozak, M. 1987b. At least six nucleotides preceding the AUG initiator codon enhance translation in mammalian cells. J. Mol. Biol. 196: 947950.[CrossRef][Medline] Lavie, L., Maldener, E., Brouha, B., Meese, E.U., and Mayer, J. 2004. The human L1 promoter: Variable transcription initiation sites and a major impact of upstream flanking sequence on promoter activity. Genome Res. 14: 22532260. Li, W.W., Hsiung, Y., Wong, V., Galvin, K., Zhou, Y., Shi, Y., and Lee, A.S. 1997. Suppression of grp78 core promoter element-mediated stress induction by the dbpA and dbpB (YB-1) cold shock domain proteins. Mol. Cell. Biol. 17: 6168.[Abstract] Lim, C.Y., Santoso, B., Boulay, T., Dong, E., Ohler, U., and Kadonaga, J.T. 2004. The MTE, a new core promoter element for transcription by RNA polymerase II. Genes & Dev. 18: 16061617. Mantovani, R. 1998. A survey of 178 NF-Y binding CCAAT boxes. Nucleic Acids Res. 26: 11351143. Ohler, U., Liao, G.C., Niemann, H., and Rubin, G.M. 2002. Computational analysis of core promoters in the Drosophila genome. Genome Biol. 3: RESEARCH0087. Riggs, K.J., Saleque, S., Wong, K.K., Merrell, K.T., Lee, J.S., Shi, Y., and Calame, K. 1993. Yin-yang 1 activates the c-myc promoter. Mol. Cell. Biol. 13: 74877495. Safrany, G. and Perry, R.P. 1995. The relative contributions of various transcription factors to the overall promoter strength of the mouse ribosomal protein L30 gene. Eur. J. Biochem. 230: 10661072.[Medline] Shi, Y., Seto, E., Chang, L.S., and Shenk, T. 1991. Transcriptional repression by YY1, a human GLI-Kruppel-related protein, and relief of repression by adenovirus E1A protein. Cell 67: 377388.[CrossRef][Medline] Shi, Y., Lee, J.S., and Galvin, K.M. 1997. Everything you have ever wanted to know about Yin Yang 1. Biochim. Biophys. Acta 1332: F49F66.[Medline] Shrivastava, A. and Calame, K. 1994. An analysis of genes regulated by the multi-functional transcriptional regulator Yin Yang-1. Nucleic Acids Res. 22: 51515155. Smale, S.T. 1997. Transcription initiation from TATA-less promoters within eukaryotic protein-coding genes. Biochim. Biophys. Acta 1351: 7388.[Medline] Smale, S.T. and Baltimore, D. 1989. The "initiator" as a transcription control element. Cell 57: 103113.[CrossRef][Medline] Smale, S.T. and Kadonaga, J.T. 2003. The RNA polymerase II core promoter. Annu. Rev. Biochem. 72: 449479.[CrossRef][Medline] Smit, A.F. 1996. The origin of interspersed repeats in the human genome. Curr. Opin. Genet. Dev. 6: 743748.[CrossRef][Medline] Smith, E., Meyerrose, T.E., Kohler, T., Namdar-Attar, M., Bab, N., Lahat, O., Noh, T., Li, J., Karaman, M.W., Hacia, J.G., et al. 2005. Leaky ribosomal scanning in mammalian genomes: Significance of histone H4 alternative translation in vivo. Nucleic Acids Res. 33: 12981308. Srinivasan, L. and Atchison, M.L. 2004. YY1 DNA binding and PcG recruitment requires CtBP. Genes & Dev. 18: 25962601. Thomas, M.J. and Seto, E. 1999. Unlocking the mechanisms of transcription factor YY1: Are chromatin modifying enzymes the key? Gene 236: 197208.[CrossRef][Medline] Wei, C.L., Wu, Q., Vega, V.B., Chiu, K.P., Ng, P., Zhang, T., Shahab, A., Yong, H.C., Fu, Y., Weng, Z., et al. 2006. A global map of p53 transcription-factor binding sites in the human genome. Cell 124: 207219.[CrossRef][Medline]
Received July 13, 2006; accepted in revised format January 9, 2007. This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||