|
Vol. 10, Issue 6, 744-757, June 2000
Conservation of DNA Regulatory Motifs and Discovery of New Motifs in Microbial Genomes
Abigail Manson
McGuire,
Jason D.
Hughes, and
George M.
Church1
Graduate Program in Biophysics, and Department of Genetics, Lipper Center
for Computational Genetics, Harvard Medical School, Boston, MA 02115 USA
Regulatory motifs can be found by local multiple alignment of
upstream regions from coregulated sets of genes, or regulons. We
searched for regulatory motifs using the program AlignACE together with
a set of filters that helped us choose the motifs most likely to be
biologically relevant in 17 complete microbial genomes. We searched the
upstream regions of potentially coregulated genes grouped by three
methods: (1) genes that make up functional pathways; (2) genes
homologous to regulons from a well-studied species (Escherichia coli); and (3) groups of genes derived from conserved operons. This
last group is based on the observation that genes making up homologous
regulons in different species are often assorted into coregulated
operons in different combinations. This allows partial reconstruction
of regulons by looking at operon structure across several species.
Unlike other methods for predicting regulons, this method does not
depend on the availability of experimental data other than the genome
sequence and the locations of genes. New, statistically significant
motifs were found in the genome sequence of each organism using each
grouping method. The most significant new motif was found upstream of
genes in the methane-metabolism functional group in
Methanobacterium thermoautotrophicum. We found that at least
27% of the known E. coli DNA-regulatory motifs are conserved
in one or more distantly related eubacteria. We also observed
significant motifs that differed from the E. coli motif in
other organisms upstream of sets of genes homologous to known E. coli regulons, including Crp, LexA, and ArcA in Bacillus
subtilis; four anaerobic regulons in Archaeoglobus
fulgidus (NarL, NarP, Fnr, and ModE); and the PhoB, PurR, RpoH, and
FhlA regulons in other archaebacterial species. We also used motif
conservation to aid in finding new motifs by grouping upstream regions
from closely related bacteria, thus increasing the number of instances of the motif in the sequence to be aligned. For example, by grouping upstream sequences from three archaebacterial species, we found a
conserved motif that may regulate ferrous ion transport that was not
found in individual genomes. Discovery of conserved motifs becomes
easier as the number of closely related genome sequences increases.
1
Corresponding author.
10:744-757 ©2000 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/00 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
W. S.W. Wong and R. Nielsen
Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models
Bioinformatics,
August 15, 2007;
23(16):
2031 - 2037.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Tian, T. Kasuga, M. S. Sachs, and N. L. Glass
Transcriptional Profiling of Cross Pathway Control in Neurospora crassa and Comparative Analysis of the Gcn4 and CPC1 Regulons
Eukaryot. Cell,
June 1, 2007;
6(6):
1018 - 1029.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Ma, Y. Pan, J. Zheng, A. J. Levine, and R. Nussinov
Sequence analysis of p53 response-elements suggests multiple binding modes of the p53 tetramer to DNA targets
Nucleic Acids Res.,
May 14, 2007;
35(9):
2986 - 3001.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. R. Boverhof, L. D. Burgoon, C. Tashiro, B. Sharratt, B. Chittim, J. R. Harkema, D. L. Mendrick, and T. R. Zacharewski
Comparative Toxicogenomic Analysis of the Hepatotoxic Effects of TCDD in Sprague Dawley Rats and C57BL/6 Mice
Toxicol. Sci.,
December 1, 2006;
94(2):
398 - 416.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Rawal, V. B. R. Kummarasetti, J. Ravindran, N. Kumar, K. Halder, R. Sharma, M. Mukerji, S. K. Das, and S. Chowdhury
Genome-wide prediction of G4 DNA as regulatory motifs: Role in Escherichia coli global regulation.
Genome Res.,
May 1, 2006;
16(5):
644 - 655.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Wels, C. Francke, R. Kerkhoven, M. Kleerebezem, and R. J. Siezen
Predicting cis-acting elements of Lactobacillus plantarum by comparative genomics with different taxonomic subgroups
Nucleic Acids Res.,
April 13, 2006;
34(7):
1947 - 1958.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Su, F. Mao, P. Dam, H. Wu, V. Olman, I. T. Paulsen, B. Palenik, and Y. Xu
Computational inference and experimental validation of the nitrogen assimilation regulatory network in cyanobacterium Synechococcus sp. WH 8102
Nucleic Acids Res.,
February 10, 2006;
34(3):
1050 - 1065.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Wang, S.-E. Lu, Q. Yang, S.-H. Sze, and D. C. Gross
Identification of the syr-syp Box in the Promoter Regions of Genes Dedicated to Syringomycin and Syringopeptin Production by Pseudomonas syringae pv. syringae B301D
J. Bacteriol.,
January 1, 2006;
188(1):
160 - 168.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Kasuga, J. P. Townsend, C. Tian, L. B. Gilbert, G. Mannhaupt, J. W. Taylor, and N. L. Glass
Long-oligomer microarray profiling in Neurospora crassa reveals the transcriptional program underlying biochemical and physiological events of conidial germination
Nucleic Acids Res.,
November 14, 2005;
33(20):
6469 - 6485.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. M. Rommens, O. Bougri, H. Yan, J. M. Humara, J. Owen, K. Swords, and J. Ye
Plant-Derived Transfer DNAs
Plant Physiology,
November 1, 2005;
139(3):
1338 - 1349.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. L. Rowe, G. L. Starnes, and P. T. Chivers
Complex Transcriptional Control Links NikABCDE-Dependent Nickel Transport with Hydrogenase Expression in Escherichia coli
J. Bacteriol.,
September 15, 2005;
187(18):
6317 - 6323.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Su, V. Olman, F. Mao, and Y. Xu
Comparative genomics analysis of NtcA regulons in cyanobacteria: regulation of nitrogen assimilation and its coupling to photosynthesis
Nucleic Acids Res.,
September 12, 2005;
33(16):
5156 - 5171.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Hu, B. Li, and D. Kihara
Limitations and potentials of current motif discovery algorithms
Nucleic Acids Res.,
September 2, 2005;
33(15):
4899 - 4913.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. N. Price, K. H. Huang, E. J. Alm, and A. P. Arkin
A novel method for accurate operon predictions in all sequenced prokaryotes
Nucleic Acids Res.,
February 8, 2005;
33(3):
880 - 892.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. I. Johnson and S. W. Chisholm
Properties of overlapping genes are conserved across microbial genomes
Genome Res.,
November 1, 2004;
14(11):
2268 - 2272.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. V. Sun, D. R. Boverhof, L. D. Burgoon, M. R. Fielden, and T. R. Zacharewski
Comparative analysis of dioxin response elements in human, mouse and rat genomic sequences
Nucleic Acids Res.,
August 24, 2004;
32(15):
4512 - 4523.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. B.L. Alkema, B. Lenhard, and W. W. Wasserman
Regulog Analysis: Detection of Conserved Regulatory Networks Across Bacteria: Application to Staphylococcus aureus
Genome Res.,
July 1, 2004;
14(7):
1362 - 1373.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. K. Hottes, M. Meewan, D. Yang, N. Arana, P. Romero, H. H. McAdams, and C. Stephens
Transcriptional Profiling of Caulobacter crescentus during Growth on Complex and Minimal Media
J. Bacteriol.,
March 1, 2004;
186(5):
1448 - 1461.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Liu, X. S. Liu, L. Wei, R. B. Altman, and S. Batzoglou
Eukaryotic Regulatory Element Conservation Analysis and Identification Using Comparative Genomics
Genome Res.,
March 1, 2004;
14(3):
451 - 458.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. C. Frith, U. Hansen, J. L. Spouge, and Z. Weng
Finding functional sequence elements by multiple local alignment
Nucleic Acids Res.,
January 2, 2004;
32(1):
189 - 200.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. P. Fischer, N. A. Brunner, B. Wieland, J. Paquette, L. Macko, K. Ziegelbauer, and C. Freiberg
Identification of Antibiotic Stress-Inducible Promoters: A Systematic Approach to Novel Pathway-Specific Reporter Assays for Antibacterial Drug Discovery
Genome Res.,
January 1, 2004;
14(1):
90 - 98.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Pritsker, Y.-C. Liu, M. A. Beer, and S. Tavazoie
Whole-Genome Discovery of Transcription Factor Binding Sites by Network-Level Conservation
Genome Res.,
January 1, 2004;
14(1):
99 - 108.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Lapidot and Y. Pilpel
Comprehensive quantitative analyses of the effects of promoter sequence elements on mRNA transcription
Nucleic Acids Res.,
July 1, 2003;
31(13):
3824 - 3828.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. R. Chhabra, K. R. Shockley, S. B. Conners, K. L. Scott, R. D. Wolfinger, and R. M. Kelly
Carbohydrate-induced Differential Gene Expression Patterns in the Hyperthermophilic Bacterium Thermotoga maritima
J. Biol. Chem.,
February 21, 2003;
278(9):
7540 - 7552.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. A. McCue, W. Thompson, C. S. Carmack, and C. E. Lawrence
Factors Influencing the Identification of Transcription Factor Binding Sites by Cross-Species Comparison
Genome Res.,
October 1, 2002;
12(10):
1523 - 1532.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Li, V. Rhodius, C. Gross, and E. D. Siggia
Identification of the binding sites of regulatory proteins in bacterial genomes
PNAS,
September 3, 2002;
99(18):
11772 - 11777.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. van Nimwegen, M. Zavolan, N. Rajewsky, and E. D. Siggia
Probabilistic clustering of sequences: Inferring new bacterial regulons by comparative genomics
PNAS,
May 28, 2002;
99(11):
7323 - 7328.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Blanchette and M. Tompa
Discovery of Regulatory Elements by a Computational Method for Phylogenetic Footprinting
Genome Res.,
May 1, 2002;
12(5):
739 - 748.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. H. Graber, G. D. McAllister, and T. F. Smith
Probabilistic prediction of Saccharomyces cerevisiae mRNA 3'-processing sites
Nucleic Acids Res.,
April 15, 2002;
30(8):
1851 - 1858.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. I. Su, M. P. Cooke, K. A. Ching, Y. Hakak, J. R. Walker, T. Wiltshire, A. P. Orth, R. G. Vega, L. M. Sapinoso, A. Moqrich, et al.
Large-scale analysis of the human and mouse transcriptomes
PNAS,
April 2, 2002;
99(7):
4465 - 4470.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. A. Mirny and M. S. Gelfand
Structural analysis of conserved base pairs in protein-DNA complexes
Nucleic Acids Res.,
April 1, 2002;
30(7):
1704 - 1711.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. M. Panina, A. A. Mironov, and M. S. Gelfand
Comparative analysis of FUR regulons in gamma-proteobacteria
Nucleic Acids Res.,
December 15, 2001;
29(24):
5195 - 5206.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. F. Cliften, L. W. Hillier, L. Fulton, T. Graves, T. Miner, W. R. Gish, R. H. Waterston, and M. Johnston
Surveying Saccharomyces Genomes to Identify Functional Elements by Comparative DNA Sequence Analysis
Genome Res.,
July 1, 2001;
11(7):
1175 - 1186.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Tan, G. Moreno-Hagelsieb, J. Collado-Vides, and G. D. Stormo
A Comparative Genomics Approach to Prediction of New Members of Regulons
Genome Res.,
April 1, 2001;
11(4):
566 - 584.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
J. Qiu and J. D. Helmann
The {-}10 Region Is a Key Promoter Specificity Determinant for the Bacillus subtilis Extracytoplasmic-Function {sigma} Factors {sigma}X and {sigma}W
J. Bacteriol.,
March 15, 2001;
183(6):
1921 - 1927.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
L. A. McCue, W. Thompson, C. S. Carmack, M. P. Ryan, J. S. Liu, V. Derbyshire, and C. E. Lawrence
Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes
Nucleic Acids Res.,
February 1, 2001;
29(3):
774 - 782.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Manson McGuire and G. M. Church
Predicting regulons and their cis-regulatory motifs by comparative genomics
Nucleic Acids Res.,
November 15, 2000;
28(22):
4523 - 4530.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. I. Su, M. P. Cooke, K. A. Ching, Y. Hakak, J. R. Walker, T. Wiltshire, A. P. Orth, R. G. Vega, L. M. Sapinoso, A. Moqrich, et al.
Large-scale analysis of the human and mouse transcriptomes
PNAS,
April 2, 2002;
99(7):
4465 - 4470.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Rajewsky, N. D. Socci, M. Zapotocky, and E. D. Siggia
The Evolution of DNA Regulatory Regions for Proteo-Gamma Bacteria by Interspecies Comparisons
Genome Res.,
February 1, 2002;
12(2):
298 - 308.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|