|
|
|
|
Published online before print
September 4, 2007, 10.1101/gr.6224007 Genome Res. 17:1438-1447, 2007 ©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
Letter Computational identification and functional validation of regulatory motifs in cartilage-expressed genes1 Department of Orthopaedic Surgery, Washington University School of Medicine, St. Louis, Missouri 63110, USA; 2 Department of Biomedical Engineering, Washington University, St. Louis, Missouri 63130, USA; 3 Department of Pediatrics, University of Texas Medical School at Houston, Houston, Texas 77030, USA; 4 Shriners Hospital for Children, Houston, Texas 77030, USA; 5 Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA; 6 Department of Cell Biology and Physiology, Washington University School of Medicine, St. Louis, Missouri 63110, USA
Chondrocyte gene regulation is important for the generation and maintenance of cartilage tissues. Several regulatory factors have been identified that play a role in chondrogenesis, including the positive transacting factors of the SOX family such as SOX9, SOX5, and SOX6, as well as negative transacting factors such as C/EBP and delta EF1. However, a complete understanding of the intricate regulatory network that governs the tissue-specific expression of cartilage genes is not yet available. We have taken a computational approach to identify cis-regulatory, transcription factor (TF) binding motifs in a set of cartilage characteristic genes to better define the transcriptional regulatory networks that regulate chondrogenesis. Our computational methods have identified several TFs, whose binding profiles are available in the TRANSFAC database, as important to chondrogenesis. In addition, a cartilage-specific SOX-binding profile was constructed and used to identify both known, and novel, functional paired SOX-binding motifs in chondrocyte genes. Using DNA pattern-recognition algorithms, we have also identified cis-regulatory elements for unknown TFs. We have validated our computational predictions through mutational analyses in cell transfection experiments. One novel regulatory motif, N1, found at high frequency in the COL2A1 promoter, was found to bind to chondrocyte nuclear proteins. Mutational analyses suggest that this motif binds a repressive factor that regulates basal levels of the COL2A1 promoter.
Gene expression and its regulation are important for the coordination of various activities of a cell. This complex regulatory circuit involves a multitude of transcription factors (TFs) and their corresponding cis-acting regulatory elements whose interaction in a variety of permutational and combinatorial events enable the function and maintenance of tissues. An understanding of the transcriptional regulatory network (Covert et al. 2004
The successful use of DNA pattern recognition programs in inferring regulatory circuits has been evident in the detection of novel regulatory elements involved in heat-shock response (GuhaThakurta et al. 2002a
Many genes such as collagen type II (COL2A1), collagen type IX (COL9A1), collagen type XI (COL11A2), and melanoma inhibitory activity (MIA) that are normally expressed predominantly and almost exclusively in cartilage are known to share common regulatory factors (Okazaki et al. 2002
Analyses of previously characterized TF-binding motifs in cartilage characteristic genes Our analyses included steps to identify known TF-binding motifs, novel binding motifs, and paired SOX-binding sites within cartilage enhancer sequences by examining the conserved sequence in 18 orthologous pairs of human and mouse genes. The results of the analysis of the characterized TF binding motifs from TRANSFAC are shown in Table 1A. Transcription factor-binding motifs were ranked by the log ratio of the probability scores, taking into account multiple predicted binding sites within a promoter, calculated using cartilage genes and background genes, respectively (see Methods). Based on this model, a higher log ratio value indicates a higher probability that the TF will bind to the promoter of these cartilage genes. The log ratios of the probability scores for TFs with documented roles in chondrogenesis are given in Table 1B.
Identification of paired SOX-binding motifs in cartilage characteristic genes One TF we expected to detect with a high log ratio in the analyses above is SOX9, a master regulator of chondrogenesis (Wagner et al. 1994
To test the sensitivity of our SOX model, we used this motif to predict experimentally identified SOX-binding sites in our training set of cartilage regulatory regions of the Mia, Col2a1, and Col11a2 genes (Fig. 2A–C). Although using the SOX profile to identify the contributing SOX-binding sites seems circular, the purpose of this test is to identify (1) any SOX site in the training set that is significantly different from other SOX sites and (2) additional unknown SOX sites in the known cartilage regulatory regions. All of the sites used to train the motif were also predicted from our search algorithm with the exception of one in the 183-bp Mia gene promoter element, indicating that our search criteria were functioning appropriately. When the orientation and spacing of the predicted sites were examined, we found at least one tandem pair of sites with opposite orientations, with a short spacer sequence in between (3–8 bp), for each of the regulatory elements used to develop our model motifs (Fig. 2A–C).
Multiple sites were predicted in the 183-bp element, located at –2251 to –2058 of the Mia promoter, an essential part of the cartilage regulatory module (Okazaki et al. 2006
Since experimental data suggested that pairing of SOX motifs is an important part of the chondrocyte regulatory module, we refined our search criteria to include not just the conservation between the mouse and human sequences, but also the close positioning of the sites that are 3–8 bp apart. A summary of all the conserved sites predicted in the promoters of all the searched cartilage genes is shown in Supplemental Table 2. As the table indicates, the conservation between human and mouse orthologous sequences significantly reduced the number of motifs that would be predicted to be functional. There were only three genes in our cartilage set (COL9A2, COMP, FGF18) that did not have any conserved pair of SOX sites. To test the specificity of our search criteria, we also collected the promoters of 13 previously reported liver-specific genes (Krivan and Wasserman 2001
In addition to SOX factors, our previous results have implicated the repressor C/EBP beta as a TF that may work in these regulatory sequences to restrict cartilage gene expression (Okazaki et al. 2006
Since the SOX-binding module seemed a predominant feature in our gene set, we then decided to test these predicted sites for functional activity in another well-characterized cartilage gene promoter, COL9A1, that is known to be activated by SOX9 in chondrocytes (Zhang et al. 2003
Identification of a novel motif N1 in cartilage-expressed genes Although the TRANSFAC database curates 500 vertebrate TF-binding profiles, there are estimated to be 2000 TF proteins in the human genome. Therefore, the binding profiles of many TFs are still not available, and the computational analysis described above using the TRANSFAC motifs may have missed other chondrocyte cis-regulatory elements. To circumvent this problem, we applied the DNA pattern recognition algorithm CONSENSUS (Stormo and Hartzell 1989
The location of the N1 motif in the promoter regions of both the COL2A1 and COL11A2 genes made this motif a good candidate for experimental validation of its function. The consensus sequence determined for this motif is CCAGAGCCC. Significantly, evolutionarily conserved N1 motifs were detected in the proximal promoter region of 11 of the 18 cartilage genes analyzed in both human and mouse (Table 2). In some genes, multiple conserved N1 motifs were detected. For example, in the COL2A1 promoter, four conserved N1 motifs were identified within –1 kb (N-125/126, N-135, N-147, and N-991), and one conserved N1 motif was identified at the +16 (N+16) position relative to the transcriptional start site.
To assess the function of the N1 motif and to determine its relevance in the regulation of chondrogenesis, oligonucleotides (15 bp in length) were generated from the promoter sequences of the cartilage genes in which this motif was recognized (see Table 2) and used as probes in electromobility gel shift assays using nuclear extracts from the RCS cells to determine protein-binding capacity (Fig. 6). Significantly, a major DNA–protein complex with similar mobility was generated for all the 16 oligonucleotides tested (Fig. 6, O1–O16, arrowhead), suggesting that these protein–DNA complexes probably have an identical TF binding to the N1 motif. In all of these cases, the binding could be competed by the addition of nonradiolabeled oligonucleotides to the reaction, demonstrating the specific nature of this binding. For the oligonucleotides O6, O13, O14, and O16, an additional, faster migrating complex (double arrowhead) was also observed, suggesting additional TFs may bind to these sequences as well. Four conserved N1 sites were identified in the COL2A1 promoter within –1 kb from the transcription start site (O6, O7, O8, O9). The N1 site at –125/126 was found to be present on both strands of the COL2A1 promoter sequence at the indicated positions. Interestingly, this motif, CCCCGGAGCCC, partially overlaps a previously identified EGR1 site (with overlap shown in bold/underlined) that mediates repression of the COL2A1 gene by interleukin 1 beta (IL1B) (Tan et al. 2003
To test the functional significance of the N1 motif in the COL2A1 gene and validate the computational search, a mutational analysis of the N1 motif was performed. To avoid complications in the analysis caused by the binding of other factors to the site at –125/126, the more distal N1 motif at –147 in a COL2A1 promoter-luciferase reporter construct was mutated to test its function in the RCS cell line in which strong COL2A1 expression is normally observed. This mutational analysis involved deletion of the four most conserved bases from the N1 motif (CCAGTGCCC to CCA----CC) at the –147 position. Deletion of these four core bases increased the basal COL2A1 promoter activity in our reporter construct as measured by relative luciferase activity (Fig. 7A), suggesting that this is probably a motif that binds to a repressor resulting in negative regulation. As IL1B is known to repress chondrogenesis, the response of the mutated COL2A1 promoter was also tested by measuring its response to IL1B (Fig. 7A). As expected, the wild-type COL2A1 promoter demonstrated a reduction in its basal activity (by 62%) in the presence of IL1B. The mutant promoter activity was also repressed (by 54%–63%), suggesting that the regulation to inflammatory mediators was still active. Interestingly, the repression of the mutant promoter in the presence of IL1B results in its activity being equal to the activity of the wild-type promoter in the absence of IL1B again, further suggesting that the –147 N1 motif probably binds to a negatively regulating TF.
The functional significance of the N1 motifs found in the promoter of an additional cartilage characteristic gene, COMP, was also analyzed. When oligonucleotides representing an N1 motif in the COMP promoter were tested by EMSA as above, a high-molecular-weight DNA–protein complex similar to that of the other tested oligonucleotides was observed. This DNA–protein complex was competed away with non-radiolabeled oligonucleotides representing the wild-type N1 motif, but not by oligonucleotides with mutation(s) of the core nucleotides within the N1 motif (data not shown; for details, see Supplemental Methods). Furthermore, to test the function of this motif in vitro, analogous mutations in the human COMP gene promoter-luciferase reporter construct were made, mutating the N1 sequence motif from CACCACAGCCC to CACTGTGGCCC to form N1Mut (mutational changes are underlined) (Fig. 7B). The RCS cell line transfected with this mutant construct also showed an increase in activity compared to the wild-type promoter, again demonstrating a negative transacting cis-regulatory function for this motif.
In this study, we have taken a computational approach to annotate and validate important cis-regulatory motifs in cartilage-specific genes. The use of standard molecular biology techniques such as systematic sequence deletions and other mutagenesis approaches to define cartilage-specific regulatory regions would have been time-consuming given the size of the promoter region and the number of genes that we have used in this study. To circumvent this problem, we used computational methods that utilized position weight matrices and phylogenetic footprinting to increase the sensitivity of our predictions. Using this approach, we screened our cartilage-specific gene set for previously characterized TF-binding motifs from the TRANSFAC database and novel TF-binding sites with customized binding motifs. Statistical analysis of motif enrichment was performed to predict those that are most likely to regulate the chondrocyte genes.
Several TRANSFAC motifs were highly represented in our gene set for factors that play critical roles in patterning required for skeletal development (Nissen et al. 2003
Using this information, we were able to refine our computational search and predict novel SOX-binding motifs for experimental validation more reliably. We found many high-scoring SOX-binding sites in our cartilage genes using the customized SOX-binding profile, but only some of them were present in a tandem pair with opposite orientations. We hypothesized that this is the preferred architecture of the chondrocyte regulatory module. Our analysis identified five novel SOX-binding motifs in the COL9A1 promoter in addition to one that is already published. Mutations in any of these SOX-binding motifs significantly decreased promoter activity in luciferase reporter assays, validating our computational approach. We also identified a novel SOX-binding motif in the cartilage-specific Mia gene and similarly validated its function by mutational analyses. Although we did not test it for functional activity, our model also predicted an additional SOX site (+130 bp), oppositely orientated and adjacent to, a documented site (+120 bp) in the HAPLN1 gene (also called cartilage linking protein 1 gene) (Kou and Ikegawa 2004
Another challenge to defining biologically functional SOX-binding sites through computational approaches is that in vivo, some SOX sites function only in the context of other cis-acting motifs and TFs (Kamachi et al. 2000
One drawback of performing computational promoter analysis based on the TRANSFAC database is that currently we do not have a complete collection of the binding profiles of all transcription factors. Although recent studies have proposed methods to computationally identify mammalian TF-binding profiles using comparative genomics approaches (Tan et al. 2005
In summary, there are several challenges in the use of computational approaches to identify TF-binding motifs in mammalian organisms. One major challenge is the fact that mammalian species have very long intergenic sequence, and it is very difficult to detect regulatory sequences in such a large search space. In our approach, we used phylogenetic footprinting methods (Wasserman et al. 2000
Definition of promoter sequences A set of 18 orthologous human and mouse genes with documented expression in cartilage either during development or maintenance of the tissue was selected for this analysis: ACAN, CILP, COL11A1, COL11A2, COL2A1, COL9A1, COL9A2, COL9A3, COMP, HAPLN1, CTGF, FGF18, IHH, MATN1, MATN3, MIA (Cdrap), PRG4, and SOX9. An additional set of 13 previously reported liver-specific genes was also collected for testing specificity. The promoter sequences from these gene sets were selected for the analyses as follows. For most genes, the promoter sequence was defined as the 10-kb upstream and the 5-kb downstream sequence according to the transcriptional start site. For some genes, the promoter sequence was truncated when an upstream gene was encountered (e.g., the COL11A2 gene) or when the translation start site was reached. Focusing on the 15-kb genomic sequence around the transcription start site for each gene in our analysis is keeping within the limit of current DNA recognition pattern programs. For each human gene, the mouse ortholog was determined by the NCBIs HomoloGene database (http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=homologene). The mouse ortholog was further verified for the reciprocal best match of the protein sequences using WU-BLAST (http://blast.wustl.edu/). Promoter sequences of human and mouse genes were retrieved from the UCSC genome browser (http://www.genome.ucsc.edu). Repetitive elements in the promoter sequences were masked by the program RepeatMasker (http://www.repeatmasker.org) with the slow and sensitive mode.
Determination of conserved regions in promoters
Identification of evolutionarily conserved TF-binding motifs
Enrichment of binding motifs
Construction of a cartilage-specific SOX-binding profile
Identification of paired SOX-binding sites in cartilage genes
Identification of novel regulatory motifs in cartilage-characteristic genes
Cell culture
Electromobility gel shift assays (EMSA) and transient transfections
We thank Hua Yu for her technical assistance in preparation of mutant constructs. We also thank David Stokes (Thomas Jefferson University, Philadelphia) for providing us with the COL9A1 promoter construct (p846). This work was supported by NIH grants RO1 AR36994, R01 AR45550, and RO1 AR50847 to L.J.S. and NIH grants HG00249 and GM63340 to G.D.S.
7 Present address: Division of Oncology, Washington University School of Medicine, St. Louis, MO 63110, USA.
E-mail sandelll{at}wudosis.wustl.edu; fax (314) 454-5900. [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6224007
Akiyama, H., Chaboissier, M.C., Martin, J.F., Schedl, A., and de Crombrugghe, B. 2002. The transcription factor SOX9 has essential roles in successive steps of the chondrocyte differentiation pathway and is required for expression of SOX5 and SOX6. Genes & Dev. 16: 2813–2828. Ao, W., Gaudet, J., Kent, W.J., Muttumu, S., and Mango, S.E. 2004. Environmentally induced foregut remodeling by PHA-4/FoxA and DAF-12/NHR. Science 305: 1743–1746. Bi, W., Deng, J.M., Zhang, Z., Behringer, R.R., and de Crombrugghe, B. 1999. SOX9 is required for cartilage formation. Nat. Genet. 22: 85– 89.[CrossRef][Medline] Blanchette, M. and Tompa, M. 2002. Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res. 12: 739–748. Blanchette, M., Bataille, A.R., Chen, X., Poitras, C., Laganiere, J., Lefebvre, C., Deblois, G., Giguere, V., Ferretti, V., Bergeron, D., et al. 2006. Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression. Genome Res. 16: 656–668. Bridgewater, L.C., Walker, M.D., Miller, G.C., Ellison, T.A., Holsinger, L.D., Potter, J.L., Jackson, T.L., Chen, R.K., Winkel, V.L., Zhang, Z., et al. 2003. Adjacent DNA sequences modulate SOX9 transcriptional activation at paired SOX sites in three chondrocyte-specific enhancer elements. Nucleic Acids Res. 31: 1541–1553. Chang, L.W., Nagarajan, R., Magee, J.A., Milbrandt, J., and Stormo, G.D. 2006. A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles. Genome Res. 16: 405–413. Covert, M.W., Knight, E.M., Reed, J.L., Herrgard, M.J., and Palsson, B.O. 2004. Integrating high-throughput and computational data elucidates bacterial networks. Nature 429: 92–96.[CrossRef][Medline] Davies, S.R., Li, J., Okazaki, K., and Sandell, L.J. 2004. Tissue-restricted expression of the Cdrap/Mia gene within a conserved multigenic housekeeping locus. Genomics 83: 667–678.[CrossRef][Medline] Furumatsu, T., Tsuda, M., Taniguchi, N., Tajima, Y., and Asahara, H. 2005. Smad3 induces chondrogenesis through the activation of SOX9 via CREB-binding protein/p300 recruitment. J. Biol. Chem. 280: 8343–8350. GuhaThakurta, D., Palomar, L., Stormo, G.D., Tedesco, P., Johnson, T.E., Walker, D.W., Lithgow, G., Kim, S., and Link, C.D. 2002a. Identification of a novel cis-regulatory element involved in the heat shock response in Caenorhabditis elegans using microarray gene expression and computational methods. Genome Res. 12: 701–712. GuhaThakurta, D., Schriefer, L.A., Hresko, M.C., Waterston, R.H., and Stormo, G.D. 2002b. Identifying muscle regulatory elements and genes in the nematode Caenorhabditis elegans. Pac. Symp. Biocomput. 2002: 425–436. Hertz, G.Z. and Stormo, G.D. 1999. Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics 15: 563–577. Hu, Y., Wang, T., Stormo, G.D., and Gordon, J.I. 2004. RNA interference of achaete-scute homolog 1 in mouse prostate neuroendocrine cells reveals its gene targets and DNA binding sites. Proc. Natl. Acad. Sci. 101: 5559–5564. Huang, W., Lu, N., Eberspaecher, H., and De Crombrugghe, B. 2002. A new long form of c-Maf cooperates with SOX9 to activate the type II collagen gene. J. Biol. Chem. 277: 50668–50675. Imamura, T., Imamura, C., Iwamoto, Y., and Sandell, L.J. 2005. Transcriptional co-activators CREB-binding protein/p300 increase chondrocyte Cd-rap gene expression by multiple mechanisms including sequestration of the repressor CCAAT/enhancer-binding protein. J. Biol. Chem. 280: 16625–16634. Inada, M., Yasui, T., Nomura, S., Miyake, S., Deguchi, K., Himeno, M., Sato, M., Yamagiwa, H., Kimura, T., Yasui, N., et al. 1999. Maturational disturbance of chondrocytes in Cbfa1-deficient mice. Dev. Dyn. 214: 279–290.[CrossRef][Medline] Kamachi, Y., Uchikawa, M., and Kondoh, H. 2000. Pairing SOX off: With partners in the regulation of embryonic development. Trends Genet. 16: 182–187.[CrossRef][Medline] Karsenty, G. and Wagner, E.F. 2002. Reaching a genetic and molecular understanding of skeletal development. Dev. Cell 2: 389–406.[CrossRef][Medline] King, K.B. and Kimura, J.H. 2003. The establishment and characterization of an immortal cell line with a stable chondrocytic phenotype. J. Cell. Biochem. 89: 992–1004.[CrossRef][Medline] Kou, I. and Ikegawa, S. 2004. SOX9-dependent and -independent transcriptional regulation of human cartilage link protein. J. Biol. Chem. 279: 50942–50948. Krivan, W. and Wasserman, W.W. 2001. A predictive model for regulatory sequences directing liver-specific transcription. Genome Res. 11: 1559–1566. Kucharska, A.M., Kuettner, K.E., and Kimura, J.H. 1990. Biochemical characterization of long-term culture of the Swarm rat chondrosarcoma chondrocytes in agarose. J. Orthop. Res. 8: 781–792.[CrossRef][Medline] Lefebvre, V. and de Crombrugghe, B. 1998. Toward understanding SOX9 function in chondrocyte differentiation. Matrix Biol. 16: 529–540.[CrossRef][Medline] Lefebvre, V., Huang, W., Harley, V.R., Goodfellow, P.N., and de Crombrugghe, B. 1997. SOX9 is a potent activator of the chondrocyte-specific enhancer of the pro alpha1(II) collagen gene. Mol. Cell. Biol. 17: 2336–2346.[Abstract] Lefebvre, V., Behringer, R.R., and de Crombrugghe, B. 2001. L-Sox5, Sox6 and Sox9 control essential steps of the chondrocyte differentiation pathway. Osteoarthritis Cartilage 9: S69–S75.[Medline] Loots, G.G., Ovcharenko, I., Pachter, L., Dubchak, I., and Rubin, E.M. 2002. rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res. 12: 832–839. Matys, V., Fricke, E., Geffers, R., Gossling, E., Haubrock, M., Hehl, R., Hornischer, K., Karas, D., Kel, A.E., Kel-Margoulis, O.V., et al. 2003. TRANSFAC: Transcriptional regulation, from patterns to profiles. Nucleic Acids Res. 31: 374–378. Miller, J., Horner, A., Stacy, T., Lowrey, C., Lian, J.B., Stein, G., Nuckolls, G.H., and Speck, N.A. 2002. The core-binding factor Mukhopadhyay, K., Lefebvre, V., Zhou, G., Garofalo, S., Kimura, J.H., and de Crombrugghe, B. 1995. Use of a new rat chondrosarcoma cell line to delineate a 119-base pair chondrocyte-specific enhancer element and to define active promoter segments in the mouse pro-alpha 1(II) collagen gene. J. Biol. Chem. 270: 27711–27719. Nelander, S., Larsson, E., Kristiansson, E., Mansson, R., Nerman, O., Sigvardsson, M., Mostad, P., and Lindahl, P. 2005. Predictive screening for regulators of conserved functional gene modules (gene batteries) in mammals. BMC Genomics 6: 68. doi: 10.1186/1471-2164-6-68.[CrossRef][Medline] Ng, L.J., Wheatley, S., Muscat, G.E., Conway-Campbell, J., Bowles, J., Wright, E., Bell, D.M., Tam, P.P., Cheah, K.S., and Koopman, P. 1997. SOX9 binds DNA, activates transcription, and coexpresses with type II collagen during chondrogenesis in the mouse. Dev. Biol. 183: 108–121.[CrossRef][Medline] Nissen, R.M., Yan, J., Amsterdam, A., Hopkins, N., and Burgess, S.M. 2003. Zebrafish foxi one modulates cellular responses to Fgf signaling required for the integrity of ear and jaw patterning. Development 130: 2543–2554. Okazaki, K. and Sandell, L.J. 2004. Extracellular matrix gene regulation. Clin. Orthop. Relat. Res. 427: S123–S128 Suppl:.[CrossRef][Medline] Okazaki, K., Li, J., Yu, H., Fukui, N., and Sandell, L.J. 2002. CCAAT/enhancer-binding proteins Okazaki, K., Yu, H., Davies, S.R., Imamura, T., and Sandell, L.J. 2006. A promoter element of the CD-RAP gene is required for repression of gene expression in non-cartilage tissues in vitro and in vivo. J. Cell. Biochem. 97: 857–868.[CrossRef][Medline] Peirano, R.I. and Wegner, M. 2000. The glial transcription factor SOX10 binds to DNA both as monomer and dimer with different functional consequences. Nucleic Acids Res. 28: 3047–3055. Qiu, P., Qin, L., Sorrentino, R.P., Greene, J.R., Wang, L., and Partridge, N.C. 2003. Comparative promoter analysis and its application in analysis of PTH-regulated gene expression. J. Mol. Biol. 326: 1327–1336.[CrossRef][Medline] Schneider, T.D. and Stephens, R.M. 1990. Sequence logos: A new way to display consensus sequences. Nucleic Acids Res. 18: 6097–6100. Sock, E., Pagon, R.A., Keymolen, K., Lissens, W., Wegner, M., and Scherer, G. 2003. Loss of DNA-dependent dimerization of the transcription factor SOX9 as a cause for campomelic dysplasia. Hum. Mol. Genet. 12: 1439–1447. Staden, R. 1989. Methods for discovering novel motifs in nucleic acid sequences. Comput. Appl. Biosci. 5: 293–298. Stormo, G.D. 2000. DNA binding sites: Representation and discovery. Bioinformatics 16: 16–23. Stormo, G.D. and Fields, D.S. 1998. Specificity, free energy and information content in protein–DNA interactions. Trends Biochem. Sci. 23: 109–113.[CrossRef][Medline] Stormo, G.D. and Hartzell, G.W. 1989. Identifying protein-binding sites from unaligned DNA fragments. Proc. Natl. Acad. Sci. 86: 1183–1187. Stormo, G.D., Schneider, T.D., Gold, L., and Ehrenfeucht, A. 1982. Use of the Perceptron algorithm to distinguish translational initiation sites in E. coli. Nucleic Acids Res. 10: 2997–3011. Tagle, D.A., Koop, B.F., Goodman, M., Slightom, J.L., Hess, D.L., and Jones, R.T. 1988. Embryonic Takagi, T., Moribe, H., Kondoh, H., and Higashi, Y. 1998. Tan, L., Peng, H., Osaki, M., Choy, B.K., Auron, P.E., Sandell, L.J., and Goldring, M.B. 2003. Egr-1 mediates transcriptional repression of COL2A1 promoter activity by interleukin-1 Tan, K., McCue, L.A., and Stormo, G.D. 2005. Making connections between novel transcription factors and their DNA motifs. Genome Res. 15: 312–320. Wagner, T., Wirth, J., Meyer, J., Zabel, B., Held, M., Zimmer, J., Pasantes, J., Bricarelli, F.D., Keutel, J., Hustert, E., et al. 1994. Autosomal sex reversal and campomelic dysplasia are caused by mutations in and around the SRY-related gene SOX9. Cell 79: 1111–1120.[CrossRef][Medline] Wang, T. and Stormo, G.D. 2003. Combining phylogenetic data with co-regulated genes to identify regulatory motifs. Bioinformatics 19: 2369–2380. Wang, H., Zhang, Y., Cheng, Y., Zhou, Y., King, D.C., Taylor, J., Chiaromonte, F., Kasturi, J., Petrykowska, H., Gibb, B., et al. 2006. Experimental validation of predicted mammalian erythroid cis-regulatory modules. Genome Res. 16: 1480–1492. Wasserman, W.W., Palumbo, M., Thompson, W., Fickett, J.W., and Lawrence, C.E. 2000. Human–mouse genome comparisons to locate regulatory sites. Nat. Genet. 26: 225–228.[CrossRef][Medline] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||