Published online before print
January 14, 2005, 10.1101/gr.3069205
Genome Res. 15:312-320, 2005
©2005 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/05 $5.00
Methods
Making connections between novel transcription factors and their DNA motifs
Kai Tan1,
Lee Ann McCue2 and
Gary D. Stormo1,3
1 Department of Genetics, Washington University School of Medicine, Saint Louis, Missouri 63110, USA
2 The Wadsworth Center, New York State Department of Health, Albany, New York 12201-0509, USA
The key components of a transcriptional regulatory network are the connections between trans-acting transcription factors and cis-acting DNA-binding sites. In spite of several decades of intense research, only a fraction of the estimated 300 transcription factors in Escherichia coli have been linked to some of their binding sites in the genome. In this paper, we present a computational method to connect novel transcription factors and DNA motifs in E. coli. Our method uses three types of mutually independent information, two of which are gleaned by comparative analysis of multiple genomes and the third one derived from similarities of transcription-factor-DNA-binding-site interactions. The different types of information are combined to calculate the probability of a given transcription-factor-DNA-motif pair being a true pair. Tested on a study set of transcription factors and their DNA motifs, our method has a prediction accuracy of 59% for the top predictions and 85% for the top three predictions. When applied to 99 novel transcription factors and 70 novel DNA motifs, our method predicted 64 transcription-factor-DNA-motif pairs. Supporting evidence for some of the predicted pairs is presented. Functional annotations are made for 23 novel transcription factors based on the predicted transcription-factor-DNA-motif connections.
3 Corresponding author. E-mail stormo{at}genetics.wustl.edu; fax (314) 362-7855.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3069205. Article published online ahead of print in January 2005.
[Supplemental material is available online at www.genome.org.]

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
S. R. Davies, L.-W. Chang, D. Patra, X. Xing, K. Posey, J. Hecht, G. D. Stormo, and L. J. Sandell
Computational identification and functional validation of regulatory motifs in cartilage-expressed genes
Genome Res.,
October 1, 2007;
17(10):
1438 - 1447.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Kolesov, Z. Wunderlich, O. N. Laikova, M. S. Gelfand, and L. A. Mirny
How gene order is influenced by the biophysics of transcription regulation
PNAS,
August 28, 2007;
104(35):
13948 - 13953.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. GuhaThakurta
Computational identification of transcriptional regulatory elements in DNA sequence
Nucleic Acids Res.,
July 19, 2006;
34(12):
3585 - 3598.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. D. MacIsaac, D. B. Gordon, L. Nekludova, D. T. Odom, J. Schreiber, D. K. Gifford, R. A. Young, and E. Fraenkel
A hypothesis-based approach for identifying the binding specificity of regulatory proteins from chromatin immunoprecipitation data
Bioinformatics,
February 15, 2006;
22(4):
423 - 429.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Conlan, C. Lawrence, and L. A. McCue
Rhodopseudomonas palustris Regulons Detected by Cross-Species Analysis of Alphaproteobacterial Genomes
Appl. Envir. Microbiol.,
November 1, 2005;
71(11):
7442 - 7452.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. N. Vemuri and A. A. Aristidou
Metabolic Engineering in the -omics Era: Elucidating and Modulating Regulatory Networks
Microbiol. Mol. Biol. Rev.,
June 1, 2005;
69(2):
197 - 216.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|