Genome Res. 14:1967-1974, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Methods
Decoding Human Regulatory Circuits
William Thompson1,5,
Michael J. Palumbo1,
Wyeth W. Wasserman2,
Jun S. Liu3 and
Charles E. Lawrence1,4
1 Center for Bioinformatics, The Wadsworth Center, New York State Department of Health, Albany, New York 12208, USA
2 Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia V5Z 4H4, Canada
3 Department of Statistics, Harvard University, Cambridge, Massachusetts 02138, USA
4 Computer Science Department, Rensselaer Polytechnic Institute, Troy, New York 12180, USA
Clusters of transcription factor binding sites (TFBSs) which direct gene expression constitute cis-regulatory modules (CRMs). We present a novel algorithm, based on Gibbs sampling, which locates, de novo, the cis features of these CRMs, their component TFBSs, and the properties of their spatial distribution. The algorithm finds 69% of experimentally reported TFBSs and 85% of the CRMs in a reference data set of regions upstream of genes differentially expressed in skeletal muscle cells. A discriminant procedure based on the output of the model specifically discriminated regulatory sequences in muscle-specific genes in an independent test set. Application of the method to the analysis of 2710 10-kb fragments upstream of annotated human genes identified 17 novel candidate modules with a false discovery rate 0.05, demonstrating the applicability of the method to genome-scale data.
5 Corresponding author. E-MAIL thompson{at}wadsworth.org; FAX (518) 402-4623.
[Supplemental material is available online at www.genome.org.]
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2589004.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
V. Gotea and I. Ovcharenko
DiRE: identifying distant regulatory elements of co-expressed genes
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W133 - W139.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Q. Zhou and J. S. Liu
Extracting sequence features to predict protein-DNA interactions: a comparative study
Nucleic Acids Res.,
June 13, 2008;
(2008)
gkn361v1.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Hannenhalli
Eukaryotic transcription factor binding sites--modeling and integrative search methods
Bioinformatics,
June 1, 2008;
24(11):
1325 - 1331.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. T. Glover, J. Kriakov, S. J. Garforth, A. D. Baughn, and W. R. Jacobs Jr.
The Two-Component Regulatory System senX3-regX3 Regulates Phosphate-Dependent Gene Expression in Mycobacterium smegmatis
J. Bacteriol.,
August 1, 2007;
189(15):
5495 - 5503.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. A. Newberg, W. A. Thompson, S. Conlan, T. M. Smith, L. A. McCue, and C. E. Lawrence
A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction
Bioinformatics,
July 15, 2007;
23(14):
1718 - 1727.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. A. Thompson, L. A. Newberg, S. Conlan, L. A. McCue, and C. E. Lawrence
The Gibbs Centroid Sampler
Nucleic Acids Res.,
July 13, 2007;
35(suppl_2):
W232 - W237.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Vardhanabhuti, J. Wang, and S. Hannenhalli
Position and distance specificity are important determinants of cis-regulatory motifs in addition to evolutionary conservation
Nucleic Acids Res.,
May 11, 2007;
35(10):
3203 - 3213.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. A. Pennacchio, G. G. Loots, M. A. Nobrega, and I. Ovcharenko
Predicting tissue-specific enhancers in the human genome
Genome Res.,
February 1, 2007;
17(2):
201 - 211.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. Ferretti, C. Poitras, D. Bergeron, B. Coulombe, F. Robert, and M. Blanchette
PReMod: a database of genome-wide mammalian cis-regulatory module predictions
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D122 - D126.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Ji, S. A. Vokes, and W. H. Wong
A comparative analysis of genome-wide chromatin immunoprecipitation data for mammalian transcription factors
Nucleic Acids Res.,
December 4, 2006;
34(21):
e146 - e146.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. GuhaThakurta
Computational identification of transcriptional regulatory elements in DNA sequence
Nucleic Acids Res.,
July 19, 2006;
34(12):
3585 - 3598.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Blanchette, A. R. Bataille, X. Chen, C. Poitras, J. Laganiere, C. Lefebvre, G. Deblois, V. Giguere, V. Ferretti, D. Bergeron, et al.
Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression
Genome Res.,
May 1, 2006;
16(5):
656 - 668.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Q. Sun, G. Chen, J. W. Streb, X. Long, Y. Yang, C. J. Stoeckert Jr., and J. M. Miano
Defining the mammalian CArGome
Genome Res.,
February 1, 2006;
16(2):
197 - 207.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I. Ovcharenko and M. A. Nobrega
Identifying synonymous regulatory elements in vertebrate genomes
Nucleic Acids Res.,
July 1, 2005;
33(suppl_2):
W403 - W407.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|