|
|
|
Published online before print
April 11, 2001, 10.1101/gr.GR-1640R
Vol. 11, Issue 5, 677-684, May 2001
REPORTS
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
To understand the mechanism of transcriptional regulation, it is essential to identify and characterize the promoter, which is located proximal to the mRNA start site. To identify the promoters from the large volumes of genomic sequences, we used mRNA start sites determined by a large-scale sequencing of the cDNA libraries constructed by the "oligo-capping" method. We aligned the mRNA start sites with the genomic sequences and retrieved adjacent sequences as potential promoter regions (PPRs) for 1031 genes. The PPR sequences were searched to determine the frequencies of major promoter elements. Among 1031 PPRs, 329 (32%) contained TATA boxes, 872 (85%) contained initiators, 999 (97%) contained GC box, and 663 (64%) contained CAAT box. Furthermore, 493 (48%) PPRs were located in CpG islands. This frequency of CpG islands was reduced in TATA+/Inr+ PPRs and in the PPRs of ubiquitously expressed genes. In the PPRs of the CGM2 gene, the DRA gene, and the TM30pl genes, which showed highly colon specific expression patterns, the consensus sequences of E boxes were commonly observed. The PPRs were also useful for exploring promoter SNPs.
[The nucleotide sequences described in this paper have been deposited in the DDBJ, EMBL, and GenBank data libraries under accession nos. AU098358-AU100608.]
| |
INTRODUCTION |
|---|
|
|
|---|
To understand the mechanism of transcriptional
regulation, it is indispensable to identify and
characterize the promoter. The promoter is usually located just
proximal to or overlapping the transcription initiation site and
contains several sequence motifs with which transcription factors (TFs)
interact in a sequence-specific manner. When recruited, these TFs serve
as molecular switches, which turn the transcription of the gene on or
off. The combinations of the TF-binding motifs in promoters vary
depending on the gene, so that an appropriate subset of genes can be
expressed according to tissue types or developmental stages (Mitchell
and Tjian 1989
; Novina and Roy 1996
). Among many TF-binding motifs,
TATA box and initiator (Inr) are considered to be especially important
because only these motifs are directly recognized by the general
transcription factors (Roeder 1996
; Smale 1997
). GC box and CAAT box
are also thought to be important promoter elements besides TATA box and Inr.
Whether the promoter is located in CpG islands or not is also important
for transcriptional regulation. CpG islands are defined as dispersed
regions of DNA with a high frequency of CpG dinucleotide relative to
the bulk genome (Gardiner-Garden and Frommer 1987
; Larsen et al. 1992
).
When CpG islands remain unmethylated, TF-binding sites can be
recognized by TF. In contrast, when methylated, the presence of
5-methylcytosine in CpG islands interferes with the binding of TFs and
thus suppresses transcription (Cross and Bird 1995
; Costello et al 2000
).
Despite the important roles of the promoters, the number of genes whose
promoters have been identified is limited. In the Eukaryotic Promoter
Database (EPD; Rel. 62; http://www.epd.isb-sib.ch; Perier et al. 2000
),
which accumulates previously-characterized promoter sequences, only 273 human promoters have been registered. This may be due to the fact that
the exact mRNA start sites have not been identified for most of the
genes. The conventional methods used to identify the mRNA start site,
such as S1 mapping, primer extension, or 5' RACE (Berk and Sharp 1977
;
McKnight and Kingsbury 1982
; Schaefer 1995
) are technically difficult
and often lead to the inaccurate identification of the mRNA start sites.
Previously, we developed a novel method to construct a full-length
enriched and 5'-end enriched cDNA library (Maruyama and Sugano 1994
;
Suzuki et al. 1997
). This "oligo-capping" method uses the cap
structure of mRNA, which is the characteristic structure of the 5' end
of eukaryotic mRNAs. By three sequential enzyme reactions, the
oligo-capping method replaces the cap structure of mRNA with synthetic
oligoribonucleotide (Fig. 1). Using this 5'
oligoribonucleotide as a sequence tag, cDNAs that originally contained
the cap structure are selectively cloned. This type of library
(oligo-capped cDNA libraries) contained 50%-80% of the full-length
cDNAs whose 5' ends correspond to the mRNA start sites (Suzuki et al.
1997
, 2000
).
|
The oligo-capped cDNA libraries are found to be good resources for identification of the mRNA start site for many genes. We have constructed oligo-capped cDNA libraries from 34 kinds of human tissues and cultured cells and sequenced the 5&prime ends of 100,000 clones from these cDNA libraries. By clustering the sequence data, we identified the mRNA start sites at least for 2251 genes. We aligned these transcriptional start sites onto the genomic sequences and retrieved adjacent sequences as the potential promoter regions (PPRs) for 1031 genes. Here we report the identification and characterization of our first 1031 PPRs.
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
Identification of the PPRs of 1031 Kinds of Genes
We searched for the promoters using the mRNA start sites of 2251 kinds of genes identified by the oligo-capped method (Suzuki et al.
2000
). The genomic sequences in Genbank were searched by BLASTN (Altschul et al. 1990
) and aligned by
CLUSTALW (Thompson et al. 1999). The genomic sequences
were obtained from GenBank on February 8, 2000, when draft and finished
sequences altogether had covered ~60% of the entire human genome.
Repetitive sequences, such as Alu, were masked with
CENSOR (Jurka et al. 1996
). As a result of the alignment, the
mRNA start sites of 1031 genes were mapped onto the genomic sequences.
For each gene, the genomic sequence between 500 bp upstream and 100 bp downstream of the mRNAs start site was retrieved as PPR.
To check the validity of the retrieved PPRs, we searched EPD with the
corresponding gene names of these PPRs. Among 1031 genes, 44 genes had
their promoters registered in EPD. Forty promoters, such as the
promoter of
actin, serum albumin, and ferritin
heavy chain, completely coincided with the registered promoters.
Only four PPRs differed from registered promoters. These four PPRs seems to be derived from sequences adjacent to pseudogenes. Pseudogenes sometimes gives the highest score in BLAST search because they tend to lack introns. Considering these results, it is likely that
a high proportion of the PPRs identified by our approach actually
coincide with the real promoters.
Functional Classification of the Corresponding Genes
To show that PPRs of genes involved in various biological functions
were included in the retrieved PPRs, we categorized the corresponding genes of the PPR, according to their functional annotations. We used Human Info Base (HIB) at MIPS
(http://www.mips.biochem.mpg.de/proj/human/selec_view.html), in which
human genes are functionally classified based on their homologies to
the yeast genes. Among 1031 corresponding genes, 426 genes appeared in
HIB. As shown in Figure 2, they are
distributed along all the categories in HIB. The most frequently
observed genes were those involved in cellular organization, such as
actin and keratin 6. Although the numbers
were small, the PPRs of genes involved in signal transduction and cell
death such as PDGF receptor and TRAIL
(TNF-related
apoptosis-inducing ligand) were
also included in our data set. This suggests that there is no strong
bias in the PPRs with regard to the functions of the corresponding genes.
|
Analysis of TF-Binding Sites and CpG Islands in the PPRs
To characterize the sequence elements in the PPRs, the PPR sequences
were searched for possible TF-binding sites and for CpG islands. For
the search of TF-binding sites, a TF-binding prediction program,
TFBIND (Tsunoda and Takagi 1999
) was used. CpG islands
were defined as continuous regions >200 bp with GC content [%(G + C)] >50% and CpG ratio >0.6 (for more details, see Methods).
First, we analyzed important TF-binding sites as well as CpG islands.
Table 1 shows that 329 PPRs (32%) out of
1031 PPRs contained TATA boxes, 872 (85%) contained Inrs, 999 (97%)
contained GC boxes, and 663 (64%) contained CAAT boxes. For 493 (48%), the PPRs were located in CpG islands. As for the TATA box and
the initiator, we also examined what fraction of genes contained both, either, or neither of them. The TATA+Inr+,
TATA+Inr
, TATA
Inr+, and
TATA
Inr
genes constituted 28%, 4%, 56%, and
12% of the genes, respectively.
|
Although these promoter elements are considered to be important, there have been almost no reports that describe what fraction of promoters contains these elements. In this respect, it is interesting to note that frequency of GC box is almost 100% whereas that of TATA box is 32%. This might suggest that GC box plays more fundamental roles in transcription than TATA box.
Next, we examined the relation between TATA boxes, Inrs, and CpG
islands. We calculated the frequencies of PPRs with (+) or without (
)
CpG islands separately for TATA+Inr+,
TATA+Inr
, TATA
Inr+, and TATA
Inr
PPRs (Fig.
3). For TATA+Inr
, TATA
Inr+, and
TATA
Inr
PPRs, the numbers of PPRs in CpG island were similar to that out of 1031 PPRs. In contrast, two-thirds of TATA+Inr+
PPRs were located outside of CpG islands. It is known that TATA box and
Inr are the most preferred docking platforms created for the RNA
polymerase II complex, which drives the most active transcription (Roeder 1996
; Smale 1997
). Our results may suggest that
TATA+Inr+ promoters need not be located in CpG
islands because of their strong activity.
|
Association Between the PPRs with the Expression Profiles
To study the relationship between the promoter element in the PPRs
and the transcriptional level of genes, we associated the PPRs with the
expression profiles obtained by iAFLP. iAFLP is an RT-PCR-mediated gene
quantification method (Kato 1997
; Kawamoto et al. 1999
) and now
expression levels among 30 kinds of human tissues are available for
many human genes (http://bodymap.ims.u-tokyo.ac.jp). We searched the
iAFLP database and retrieved expression profiles for 350 PPRs.
Some of the genes showed highly tissue-specific expression patterns. We classified the 350 PPRs according to the tissue specificity of the expression using GeneRank (http://bodymap.ims.u-tokyo.ac.jp). GeneRank is a program that calculates the scores of tissue specificity based on rigorous indexes as to whether the expression pattern is statistically biased (in prep.). We selected the top 10% of the genes (GeneRank nos. 1-35) according to the GeneRank scores and defined these genes as tissue-specific genes. The most significant tissue-specific expression pattern was observed for chitinase 3-like 1 in osteoblast (Fig. 4A). The genes ranked at bottom 10% (GeneRank nos. 316-350) were defined as ubiquitously expressed genes and the genes ranked in between (GeneRank nos. 36-315) were defined as middle genes. The expression patterns of chitinase 3-like 1 (GeneRank no. 1), lipoamide beta (GeneRank no. 35), SUPT4H1 (GeneRank no. 316), and mitochondrial-processing peptidase beta (GeneRank no. 350) are shown in Figure 4A.
|
We examined the association between the presence of the CpG islands in
the PPRs and the tissue-specific expression of the corresponding genes
because CpG islands are thought to be associated with housekeeping
genes (Larsen et al. 1992
; Cross and Bird 1995
). The frequencies of CpG
(+) PPRs were almost the same with that of CpG (
) PPRs for
tissue-specific genes or for middle genes (Fig. 4B; see also Kusuda et
al. 1993
). This coincides with the result of all the 1031 PPRs (48%,
see Table 1). However, the frequency of CpG (+) PPRs was significantly
reduced for ubiquitous genes. This discrepancy with the previous view
was unexpected. At present, we do not have a good explanation for this observation.
Association Between the TF-Binding Sites in the PPRs and the Expression Profile
We examined whether there is correlation between the presence of a
certain TF-binding sites in PPRs and the expression profiles revealed
by iAFLP. We selected the CGM2 gene (carcinoembryonic antigen family member 2), DRA gene (colon mucosa-associated
gene), and TM30pl gene [fibroblast tropomyosin TM30 (pl)]
genes, which showed highly colon-specific expression patterns
(GeneRank nos. 11, 36, and 50, respectively). In Figure
5, the potential promoter structure and the
expression profile of each gene are shown. In each of the PPRs, the
consensus sequence of the E box was observed. The CGM2 gene
has other members of a closely related gene family, the CEA
gene (carcinoembryonic antigen) and the BGP gene (biliary
glycoprotein), both of which are also expressed in colon (Thompson et
al. 1991
). As for the CEA gene and the BGP gene,
the promoter structures have been well characterized and the E boxes in
their promoters have been reported to play essential roles in their
transcriptional regulations (Hauck et al. 1994
; Hauck and Stanners
1995
). Because sequence homology of the PPRs of the CGM2
gene and the promoter of the CEA gene was >80% (data not
shown), it is suggested that the E boxes in the PPR of the CGM2 gene are also involved in the transcriptional
regulation. Also, the E boxes were observed in the PPRs of the
DRA gene and the TM30p1 gene, which showed
similar expression patterns with the CGM2 gene. It is likely
that these E boxes have functional roles for these genes. Although more
experimental analyses should be done before it can concluded that these
E boxes actually have functional roles, this approach should give a
useful clue to infer the involvement of certain TFs in the
transcriptional regulation of a gene.
|
Mapping of the SNPs onto the PPR Sequences
Promoter regions are also important targets for exploring single
nucleotide polymorphisms (SNPs; Brookes 1999
). Recently, several
attempts have been reported to associate SNPs in the promoters with
disease predispositions. One of the most successful results may be
analysis of a SNP in the promoter of the TNF gene. This SNP
has been shown to affect the affinity of a transcription factor, OCT-1, and increase the susceptibility to severe malaria
infection (Knight et al. 1999
).
We searched the PPRs for SNPs reported in dbSNP (http://www.ncbi.nlm.nih.gov/SNP/index.html). We downloaded 595,893 SNP data (as of September 25, 2000) and mapped 119 SNPs successfully onto the PPRs. Some of these SNPs turned out to be located in the TF-binding consensus sequences. Figure 6 shows the case of the CYB5RP (delta-6 fatty acid desaturase) gene. When the allele contains C instead of T, the consensus sequence of AP-1-binding site would be destroyed because of this SNP. Although it has not been reported that the AP-1 is involved in the transcriptional regulation of the CYB5RP gene, this information should be a useful clue to identify the promoter SNPs that have functional consequences.
|
In this report, we identified and characterized the PPRs of 1031 kinds of human genes. In addition to the promoter elements described here, other factors should also have important roles in transcription. For example, enhancers located distantly from the mRNA start site, and whether the chromatin in the promoter is formed or remodeled, should also be considered. Even though our data are based on the sequence adjacent to the transcription start sites only, because almost no reports have described genome-wide features of promoters, the present study should lay groundwork for better understanding of the mechanism of transcription.
| |
METHODS |
|---|
|
|
|---|
Construction of Oligo-Capped cDNA Libraries and Sequencing Analysis
Oligo-capped cDNA libraries were constructed as reported previously
(Suzuki et al. 1997
, 2000
; Sambrook et al. 1989
). Clones (100,000) were
randomly isolated from 34 kinds of these cDNA libraries and one-pass
sequenced from their 5' ends using an ABI 377 XL Auto-Sequencer. The
list of cDNA libraries was described elsewhere (Suzuki et al. 2000
).
Identification of the PPRs
Sequence similarity was searched against Genbank (Release 102.0)
using BLASTN (Altschul et al. 1990
). The cDNAs matched with known genes were collected after removing the oligo-capped 5'-oligonucleotide sequence from each 5' end. The cDNAs lacking the
reported translation initiator ATG were removed from the database as
erroneous products of the oligo-capping method. The selected cDNAs were
clustered to create a nonredundant set of 2251 cDNAs by round-robin
search of BLASTN. When the multiple start sites were
observed (Y. Suzuki, H. Taira, S. Tsunoda, J. Sese, J.S. Mizushima, H. Hata, T. Ota, T. Isogai, T. Tanaka, Y. Sakaki, A. Suyama, S. Morishita,
Y. Nakamura, K. Okubo, and S. Sugano, in prep.), cDNAs containing the
longest 5' ends were selected as representatives. For alignment of the
5' end of the cDNA with the genome sequence, genome sequences were
downloaded from FTP sites of GenBank
(ftp://ncbi.nlm.nih.gov/genbank/genomes/H_sapiens/) on February 8, 2000, when draft and finished sequences altogether had covered about
60% of the entire human genome. The genomic sequences were first
roughly searched with BLAST, using 100-bp sequences from
the 5' ends of oligo-capped cDNAs. The exact alignment between cDNAs
and genome sequences was confirmed with CLUSTALW (Thompson
et al. 1994
). When 23 bp out of 25 bp from the 5' ends of the cDNAs
were matched, the corresponding genomic sequences were retrieved. The
promoters were defined as the sequences extending from 500 bp upstream
to 100 bp downstream of the mapped 5' ends of the oligo-capped cDNAs.
In the numbering scheme used here, the mRNA start site identified by
the longest oligo-capped cDNA is designated as +0. Negative and
positive integers indicate 3' and 5' relative to +0, respectively.
Repetitive sequence elements, such as Alu, were masked using
CENSOR (Jurka et al. 1996
). Four PPRs that had been
derived from the nonspecific genomic sequences with high homology to
the 5' ends of the cDNAs used for the search, such as sequences
adjacent to pseudogenes were excluded from the data set as erroneous products.
Identification of the TF-Binding Sites and CpG Islands in the PPRs
The search of possible TF-binding sites using TFBIND
was performed as described previously (Tsunoda and Takagi 1999
). For
calculating matching scores, Bucher's calculating method (Bucher 1990
)
and TF frequency matrices in TRANSFAC (Rel. 4.0; http://transfac.gbf.de/index.html) were used. The optimized cutoff values, preferred region, and searched region of each TF-binding motif
are shown in Table 1. On each TF-binding matrix, the number of
promoters whose matching scores were above the optimized cut-off values
were counted. The methods for optimizing the cutoff values and
determining the preferred regions were described elsewhere (Tsunoda and
Takagi 1999
).
For the analysis of CpG islands, the moving average for %(G + C) and
the CpG ratio were calculated for each sequence, using a 100-bp window
moving along the sequence at 1-bp intervals. The CpG ratio was
calculated according to the standard method (Gardiner-Garden and
Frommer 1987
): (number of CG × N)/(number of C × number of G),
where N is the total number of nucleotides in the sequence being
analyzed. CpG islands were defined as regions >200 bp with %(G + C) > 50% and CpG ratio > 0.6. Repetitive sequence
elements were not included in the calculation.
Resources for Databases and Computer Programs
Human genomic sequences were from
ftp://ncbi.nlm.nih.gov/genbank/genomes/H_sapiens/ as of February 8, 2000. HIB were from http://www.mips.biochem.mpg.de/proj/human/selec_view.html. All of the
iAFLP data was from http://bodymap.ims.u-tokyo.ac.jp.
TRANSFAC was from http://transfac.gbf.de/index.html
(Heinemeyer et al. 1999
). TFBIND was from T. Tsunoda
(Tsunoda and Takagi 1999
). GenRank was from J. Sese
(University of Tokyo). BLASTN, CLUSTALW, and
CENSOR were from the Human Genome Center in the Institute
of Medical Sciences, University of Tokyo.
| |
ACKNOWLEDGMENTS |
|---|
We thank Y. Shirai, Y. Takahashi, T. Tsunoda, T. Mizuno, M. Morinaga, M. Kawamura, and K. Mizuno for their excellent sequencing work. We are also grateful to M. Hida, M. Sasaki, and T. Ishihara for their technical support, D. Ramana, M. Zhang, S. Watanabe, K. Kataoka, J. Imai, T. Komatsu, M. Watanabe, T. Togashi, and N. Osada for helpful discussions, and E. Nakajima for critical reading of the manuscript. This work was supported by a Grant-in-Aid for Scientific Research on Priority Areas from the Ministry of Education, Science, Sports, and Culture of Japan and by special coordination funds for promoting science and technology (SCF) from the Science and Technology Agency (STA) of Japan.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
9 Corresponding author.
E-MAIL ysuzuki{at}ims.u-tokyo.ac.jp; FAX 81 3 5449 5416.
Article published on-line before print: Genome Res., 10.1101/gr.164001.
Article and publication are at www.genome.org/cgi/doi/10.1101/gr.164001.
| |
REFERENCES |
|---|
|
|
|---|
a program for identification and elimination of repetitive elements from DNA sequences.
Computers Chem.
20:
119-122.Received September 5, 2000; accepted in revised form February 5, 2001.
This article has been cited by other articles:
![]() |
M. Reuter, J. Engelstadter, P. Fontanillas, and L. D. Hurst A Test of the Null Model for 5' UTR Evolution Based on GC Content Mol. Biol. Evol., May 1, 2008; 25(5): 801 - 804. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ceribelli, D. Dolfini, D. Merico, R. Gatta, A. M. Vigano, G. Pavesi, and R. Mantovani The Histone-Like NF-Y Is a Bifunctional Transcription Factor Mol. Cell. Biol., March 15, 2008; 28(6): 2047 - 2058. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Benatti, V. Basile, D. Merico, L. I. Fantoni, E. Tagliafico, and C. Imbriano A balance between NF-Y and p53 governs the pro- and anti-apoptotic transcriptional response Nucleic Acids Res., March 1, 2008; 36(5): 1415 - 1428. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Abeel, Y. Saeys, E. Bonnet, P. Rouze, and Y. Van de Peer Generic eukaryotic core promoter prediction using structural features of DNA Genome Res., February 1, 2008; 18(2): 310 - 323. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Wakaguri, R. Yamashita, Y. Suzuki, S. Sugano, and K. Nakai DBTSS: database of transcription start sites, progress report 2008 Nucleic Acids Res., January 11, 2008; 36(suppl_1): D97 - D101. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Y. Yamamoto, H. Ichida, T. Abe, Y. Suzuki, S. Sugano, and J. Obokata Differentiation of core promoter architecture between plants and mammals revealed by LDSS analysis Nucleic Acids Res., September 25, 2007; 35(18): 6219 - 6226. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Malecova, P. Gross, M. Boyer-Guittaut, S. Yavuz, and T. Oelgeschlager The Initiator Core Promoter Element Antagonizes Repression of TATA-directed Transcription by Negative Cofactor NC2 J. Biol. Chem., August 24, 2007; 282(34): 24767 - 24776. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. F. Guerra, L. Imperadori, R. Mantovani, D. D. Dunlap, and L. Finzi DNA Compaction by the Nuclear Factor-Y Biophys. J., July 1, 2007; 93(1): 176 - 182. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Sakakibara, T. Irie, Y. Suzuki, R. Yamashita, H. Wakaguri, A. Kanai, J. Chiba, T. Takagi, J. Mizushima-Sugano, S.-i. Hashimoto, et al. Intrinsic Promoter Activities of Primary DNA Sequences in the Human Genome DNA Res, May 23, 2007; (2007) dsm006v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-C. Chiang, H.-M. Chen, Y.-H. Lee, H.-H. Chang, Y.-C. Wu, B.-W. Soong, C.-M. Chen, Y.-R. Wu, C.-S. Liu, D.-M. Niu, et al. Dysregulation of C/EBP{alpha} by mutant Huntingtin causes the urea cycle deficiency in Huntington's disease Hum. Mol. Genet., March 1, 2007; 16(5): 483 - 498. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Tokusumi, Y. Ma, X. Song, R. H. Jacobson, and S. Takada The New Core Promoter Element XCPE1 (X Core Promoter Element 1) Directs Activator-, Mediator-, and TATA-Binding Protein-Dependent but TFIID-Independent RNA Polymerase II Transcription from TATA-Less Promoters Mol. Cell. Biol., March 1, 2007; 27(5): 1844 - 1858. [Abstract] [Full Text] [PDF] |
||||
![]() |
N.-K. Kim, K. Tharakaraman, and J. L. Spouge Adding sequence context to a Markov background model improves the identification of regulatory elements Bioinformatics, December 1, 2006; 22(23): 2870 - 2875. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Ohler Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction Nucleic Acids Res., November 6, 2006; 34(20): 5943 - 5950. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Iwatani, K. Ikegami, Y. Kremenska, N. Hattori, S. Tanaka, S. Yagi, and K. Shiota Dimethyl Sulfoxide Has an Impact on Epigenetic Profile in Mouse Embryoid Body Stem Cells, November 1, 2006; 24(11): 2549 - 2556. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ogura, M. Azuma, Y. Tsuboi, Y. Kabe, Y. Yamaguchi, T. Wada, H. Watanabe, and H. Handa TFII-I down-regulates a subset of estrogen-responsive genes through its interaction with an initiator element and estrogen receptor {alpha} Genes Cells, April 1, 2006; 11(4): 373 - 381. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. C. Yaden, M. Garcia III, T. P. L. Smith, and S. J. Rhodes Two Promoters Mediate Transcription from the Human LHX3 Gene: Involvement of Nuclear Factor I and Specificity Protein 1 Endocrinology, January 1, 2006; 147(1): 324 - 337. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Lipovich and M.-C. King Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22 Genome Res., January 1, 2006; 16(1): 45 - 54. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Kimura, A. Wakamatsu, Y. Suzuki, T. Ota, T. Nishikawa, R. Yamashita, J.-i. Yamamoto, M. Sekine, K. Tsuritani, H. Wakaguri, et al. Diversification of transcriptional modulation: Large-scale identification and characterization of putative alternative promoters of human genes Genome Res., January 1, 2006; 16(1): 55 - 65. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Schmid, R. Perier, V. Praz, and P. Bucher EPD in its twentieth year: towards complete promoter coverage of selected model organisms Nucleic Acids Res., January 1, 2006; 34(suppl_1): D82 - D85. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cheong, Y. Yamada, R. Yamashita, T. Irie, A. Kanai, H. Wakaguri, K. Nakai, T. Ito, I. Saito, S. Sugano, et al. Diverse DNA Methylation Statuses at Alternative Promoters of Human Genes in Various Tissues DNA Res, January 1, 2006; 13(4): 155 - 167. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Suhre, S. Audic, and J.-M. Claverie Mimivirus gene promoters exhibit an unprecedented conservation among all eukaryotes PNAS, October 11, 2005; 102(41): 14689 - 14693. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer Large-scale structural analysis of the core promoter in mammalian and plant genomes Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Somboonthum, H. Ohta, S. Yamada, M. Onishi, A. Ike, Y. Nishimune, and M. Nozaki cAMP-responsive element in TATA-less core promoter is essential for haploid-specific gene expression in mouse testis Nucleic Acids Res., June 10, 2005; 33(10): 3401 - 3411. [Abstract] [Full Text] [PDF] |
||||