Genome Res. 14:2041-2047, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Letter
An ORFeome-based Analysis of Human Transcription Factor Genes and the Construction of a Microarray to Interrogate Their Expression
David N. Messina1,
Jarret Glasscock1,
Warren Gish and
Michael Lovett2
Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
Transcription factors (TFs) are essential regulators of gene expression, and mutated TF genes have been shown to cause numerous human genetic diseases. Yet to date, no single, comprehensive database of human TFs exists. In this work, we describe the collection of an essentially complete set of TF genes from one depiction of the human ORFeome, and the design of a microarray to interrogate their expression. Taking 1468 known TFs from TRANSFAC, InterPro, and FlyBase, we used this seed set to search the ScriptSure human transcriptome database for additional genes. ScriptSure's genome-anchored transcript clusters allowed us to work with a nonredundant high-quality representation of the human transcriptome. We used a high-stringency similarity search by using BLASTN, and a protein motif search of the human ORFeome by using hidden Markov models of DNA-binding domains known to occur exclusively or primarily in TFs. Four hundred ninety-four additional TF genes were identified in the overlap between the two searches, bringing our estimate of the total number of human TFs to 1962. Zinc finger genes are by far the most abundant family (762 members), followed by homeobox (199 members) and basic helix-loop-helix genes (117 members). We designed a microarray of 50-mer oligonucleotide probes targeted to a unique region of the coding sequence of each gene. We have successfully used this microarray to interrogate TF gene expression in species as diverse as chickens and mice, as well as in humans.
1 These authors contributed equally to this work.
2 Corresponding author. E-MAIL Lovett{at}genetics.wustl.edu; FAX (314) 747-2489.
[Supplemental material is available online at www.genome.org.]
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2584104.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
B. Hooghe, P. Hulpiau, F. van Roy, and P. De Bleser
ConTra: a promoter alignment analysis tool for identification of transcription factor binding sites across species
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W128 - W132.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Wilson, V. Charoensawan, S. K. Kummerfeld, and S. A. Teichmann
DBD--taxonomically broad transcription factor predictions: new content and functionality
Nucleic Acids Res.,
January 18, 2008;
36(suppl_1):
D88 - D92.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Zeng, J. Yan, T. Wang, D. Mosbrook-Davis, K. T. Dolan, R. Christensen, G. D. Stormo, D. Haussler, R. H. Lathrop, R. K. Brachmann, et al.
Genome wide screens in yeast to identify potential binding sites and target genes of DNA-binding proteins
Nucleic Acids Res.,
January 17, 2008;
36(1):
e8 - e8.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. C. Hollenhorst, A. A. Shah, C. Hopkins, and B. J. Graves
Genome-wide analyses reveal properties of redundant and specific promoter occupancy within the ETS gene family
Genes & Dev.,
August 1, 2007;
21(15):
1882 - 1894.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. D. Southall and A. H. Brand
Chromatin profiling in model organisms
Brief Funct Genomic Proteomic,
July 24, 2007;
(2007)
elm013v1.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Chen, T. R. Hughes, and Q. Morris
RankMotif++: a motif-search algorithm that accounts for relative ranks of K-mers in binding transcription factors
Bioinformatics,
July 1, 2007;
23(13):
i72 - i79.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Richardt, D. Lang, R. Reski, W. Frank, and S. A. Rensing
PlanTAPDB, a Phylogeny-Based Resource of Plant Transcription-Associated Proteins
Plant Physiology,
April 1, 2007;
143(4):
1452 - 1466.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Elnitski, V. X. Jin, P. J. Farnham, and S. J.M. Jones
Locating mammalian transcription factor binding sites: A survey of computational and experimental techniques
Genome Res.,
December 1, 2006;
16(12):
1455 - 1464.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Yu, J. Lin, D. J. Zack, and J. Qian
Computational analysis of tissue-specific combinatorial gene regulation: predicting interaction between transcription factors in human tissues
Nucleic Acids Res.,
October 18, 2006;
34(17):
4925 - 4936.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Y. Choi, A. I. Romer, M. Hu, M. Lepourcelet, A. Mechoor, A. Yesilaltay, M. Krieger, P. A. Gray, and R. A. Shivdasani
A dynamic expression survey identifies transcription factors relevant in mouse digestive tract development
Development,
October 15, 2006;
133(20):
4119 - 4129.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Huntley, D. M. Baggott, A. T. Hamilton, M. Tran-Gyamfi, S. Yang, J. Kim, L. Gordon, E. Branscomb, and L. Stubbs
A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors
Genome Res.,
May 1, 2006;
16(5):
669 - 677.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. D. Hawkins and B. Ren
Genome-wide location analysis: insights on transcriptional regulation.
Hum. Mol. Genet.,
April 15, 2006;
15(suppl_1):
R1 - R7.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. H. Kasper, T. Fukuyama, M. A. Biesen, F. Boussouar, C. Tong, A. de Pauw, P. J. Murray, J. M. A. van Deursen, and P. K. Brindle
Conditional Knockout Mice Reveal Distinct Functions for the Global Transcriptional Coactivators CBP and p300 in T-Cell Development
Mol. Cell. Biol.,
February 1, 2006;
26(3):
789 - 809.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. K. Kummerfeld and S. A. Teichmann
DBD: a transcription factor prediction database
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D74 - D81.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. S. Siddiqui, J. Khattra, A. D. Delaney, Y. Zhao, C. Astell, J. Asano, R. Babakaiff, S. Barber, J. Beland, S. Bohacec, et al.
A mouse atlas of gene expression: Large-scale digital gene-expression profiles from precisely defined developing C57BL/6J mouse tissues and cells
PNAS,
December 20, 2005;
102(51):
18485 - 18490.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Saksouk, M. M. Bhatti, S. Kieffer, A. T. Smith, K. Musset, J. Garin, W. J. Sullivan Jr., M.-F. Cesbron-Delauw, and M.-A. Hakimi
Histone-Modifying Complexes Regulate Gene Expression Pertinent to the Differentiation of the Protozoan Parasite Toxoplasma gondii
Mol. Cell. Biol.,
December 1, 2005;
25(23):
10301 - 10314.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. K. Gillette, D. Esposito, P. H. Frank, M. Zhou, L.-R. Yu, C. Jozwik, X. Zhang, B. McGowan, D. M. Jacobowitz, H. B. Pollard, et al.
Pooled ORF Expression Technology (POET): Using Proteomics to Screen Pools of Open Reading Frames for Protein Expression
Mol. Cell. Proteomics,
November 1, 2005;
4(11):
1647 - 1652.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. E. Cusick, N. Klitgord, M. Vidal, and D. E. Hill
Interactome: gateway into systems biology
Hum. Mol. Genet.,
October 15, 2005;
14(suppl_2):
R171 - R181.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|