Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Down, T. A.
Right arrow Articles by Hubbard, T. J. P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Down, T. A.
Right arrow Articles by Hubbard, T. J. P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 3, 458-461, March 2002

METHODS
Computational Detection and Location of Transcription Start Sites in Mammalian Genomic DNA

Thomas A. Down,1 and Tim J. P. Hubbard

Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, United Kingdom

Transcription, the process whereby RNA copies are made from sections of the DNA genome, is directed by promoter regions. These define the transcription start site, and also the set of cellular conditions under which the promoter is active. At least in more complex species, it appears to be common for genes to have several different transcription start sites, which may be active under different conditions. Eukaryotic promoters are complex and fairly diffuse structures, which have proven hard to detect in silico. We show that a novel hybrid machine-learning method is able to build useful models of promoters for >50% of human transcription start sites. We estimate specificity to be >70%, and demonstrate good positional accuracy. Based on the structure of our learned models, we conclude that a signal resembling the well known TATA box, together with flanking regions of C-G enrichment, are the most important sequence-based signals marking sites of transcriptional initiation at a large class of typical promoters.


1 Corresponding author.


12:458-461 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
RNAHome page
J. Xu and C. Wong
A computational screen for mouse signaling pathways targeted by microRNA clusters
RNA, July 1, 2008; 14(7): 1276 - 1283.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Sonnenburg, A. Zien, P. Philips, and G. Ratsch
POIMs: positional oligomer importance matrices--understanding support vector machine-based signal detectors
Bioinformatics, July 1, 2008; 24(13): i6 - i14.
[Abstract] [PDF]


Home page
BioinformaticsHome page
T. Abeel, Y. Saeys, P. Rouze, and Y. Van de Peer
ProSOM: core promoter prediction based on unsupervised clustering of DNA physical profiles
Bioinformatics, July 1, 2008; 24(13): i24 - i31.
[Abstract] [PDF]


Home page
Nucleic Acids ResHome page
A. Perez, F. Lankas, F. J. Luque, and M. Orozco
Towards a molecular dynamics consensus view of B-DNA flexibility
Nucleic Acids Res., April 1, 2008; 36(7): 2379 - 2394.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
G. S. Vernikos and J. Parkhill
Resolving the structural features of genomic islands: A machine learning approach
Genome Res., February 1, 2008; 18(2): 331 - 342.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
T. Abeel, Y. Saeys, E. Bonnet, P. Rouze, and Y. Van de Peer
Generic eukaryotic core promoter prediction using structural features of DNA
Genome Res., February 1, 2008; 18(2): 310 - 323.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Griffiths-Jones, H. K. Saini, S. van Dongen, and A. J. Enright
miRBase: tools for microRNA genomics
Nucleic Acids Res., January 11, 2008; 36(suppl_1): D154 - D158.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. C. Frith, E. Valen, A. Krogh, Y. Hayashizaki, P. Carninci, and A. Sandelin
A code for transcription initiation in mammalian genomes
Genome Res., January 1, 2008; 18(1): 1 - 12.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. K. Saini, S. Griffiths-Jones, and A. J. Enright
Genomic analysis of human microRNA transcripts
PNAS, November 6, 2007; 104(45): 17719 - 17724.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
P. Kundu, A. Alioua, E. Stefani, and L. Toro
Regulation of Mouse Slo Gene Expression: MULTIPLE PROMOTERS, TRANSCRIPTION START SITES, AND GENOMIC ACTION OF ESTROGEN
J. Biol. Chem., September 14, 2007; 282(37): 27478 - 27492.
[Abstract] [Full Text] [PDF]


Home page
MutagenesisHome page
M. Pastoriza-Gallego, J. Armier, and A. Sarasin
Transcription through 8-oxoguanine in DNA repair-proficient and Csb /Ogg1 DNA repair-deficient mouse embryonic fibroblasts is dependent upon promoter strength and sequence context
Mutagenesis, September 1, 2007; 22(5): 343 - 351.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
G. Roma, G. Cobellis, P. Claudiani, F. Maione, P. Cruz, G. Tripoli, M. Sardiello, I. Peluso, and E. Stupka
A novel view of the transcriptome revealed from gene trapping in mouse embryonic stem cells
Genome Res., July 1, 2007; 17(7): 1051 - 1060.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
C. M. Koch, R. M. Andrews, P. Flicek, S. C. Dillon, U. Karaoz, G. K. Clelland, S. Wilcox, D. M. Beare, J. C. Fowler, P. Couttet, et al.
The landscape of histone modifications across 1% of the human genome in five human cell lines
Genome Res., June 1, 2007; 17(6): 691 - 707.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. S. Rozowsky, D. Newburger, F. Sayward, J. Wu, G. Jordan, J. O. Korbel, U. Nagalakshmi, J. Yang, D. Zheng, R. Guigo, et al.
The DART classification of unannotated transcription within the ENCODE regions: Associating transcription with known and novel loci
Genome Res., June 1, 2007; 17(6): 732 - 745.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Lardenois, F. Chalmel, L. Bianchetti, J.-A. Sahel, T. Leveillard, and O. Poch
PromAn: an integrated knowledge-based web server dedicated to promoter analysis.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W578 - W583.
[Abstract] [Full Text] [PDF]


Home page
DNA ResHome page
A. A. Sharov, D. B. Dudekula, and M. S. H. Ko
CisView: A Browser and Database of cis-regulatory Modules Predicted in the Mouse Genome
DNA Res, January 1, 2006; 13(3): 123 - 134.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer
Large-scale structural analysis of the core promoter in mammalian and plant genomes
Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
T. Aranyi, M. Ratajewski, V. Bardoczy, L. Pulaski, A. Bors, A. Tordai, and A. Varadi
Identification of a DNA Methylation-dependent Activator Sequence in the Pseudoxanthoma Elasticum Gene, ABCC6
J. Biol. Chem., May 13, 2005; 280(19): 18643 - 18650.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
R. H. Brown, S. S. Gross, and M. R. Brent
Begin at the beginning: Predicting genes with 5' UTRs
Genome Res., May 1, 2005; 15(5): 742 - 747.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Burden, Y.-X. Lin, and R. Zhang
Improving promoter prediction Improving promoter prediction for the NNPP2.2 algorithm: a case study using Escherichia coli DNA sequences
Bioinformatics, March 1, 2005; 21(5): 601 - 607.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. A. Shahmuradov, V. V. Solovyev, and A. J. Gammerman
Plant promoter prediction with confidence estimation
Nucleic Acids Res., February 18, 2005; 33(3): 1069 - 1076.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. L. Ashurst, C.-K. Chen, J. G. R. Gilbert, K. Jekosch, S. Keenan, P. Meidl, S. M. Searle, J. Stalker, R. Storey, S. Trevanion, et al.
The Vertebrate Genome Annotation (Vega) database
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D459 - D465.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
S. C. Potter, L. Clarke, V. Curwen, S. Keenan, E. Mongin, S. M.J. Searle, A. Stabenau, R. Storey, and M. Clamp
The Ensembl Analysis Pipeline
Genome Res., May 1, 2004; 14(5): 934 - 941.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
V. Curwen, E. Eyras, T. D. Andrews, L. Clarke, E. Mongin, S. M.J. Searle, and M. Clamp
The Ensembl Automatic Gene Annotation System
Genome Res., May 1, 2004; 14(5): 942 - 950.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Marino-Ramirez, J. L. Spouge, G. C. Kanga, and D. Landsman
Statistical analysis of over-represented words in human promoter sequences
Nucleic Acids Res., February 12, 2004; 32(3): 949 - 958.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
G. Delhon, M. P. Moraes, Z. Lu, C. L. Afonso, E. F. Flores, R. Weiblen, G. F. Kutish, and D. L. Rock
Genome of Bovine Herpesvirus 5
J. Virol., October 1, 2003; 77(19): 10339 - 10347.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
V. B. Bajic and S. H. Seah
Dragon Gene Start Finder: An Advanced System for Finding Approximate Locations of the Start of Gene Transcriptional Units
Genome Res., August 1, 2003; 13(8): 1923 - 1929.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Solovyev and I. Shahmuradov
PromH: promoters identification using orthologous genomic sequences
Nucleic Acids Res., July 1, 2003; 31(13): 3540 - 3545.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. S. Halees, D. Leyfer, and Z. Weng
PromoSer: a large-scale mammalian promoter and transcription start site identification service
Nucleic Acids Res., July 1, 2003; 31(13): 3554 - 3559.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. B. Bajic and S. H. Seah
Dragon Gene Start Finder identifies approximate locations of the 5' ends of genes
Nucleic Acids Res., July 1, 2003; 31(13): 3560 - 3563.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Rombauts, K. Florquin, M. Lescot, K. Marchal, P. Rouze, and Y. Van de Peer
Computational Approaches to Identify Promoters and cis-Regulatory Elements in Plant Genomes
Plant Physiology, July 1, 2003; 132(3): 1162 - 1176.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
A. Volz, A. Ehlers, R. Younger, S. Forbes, J. Trowsdale, D. Schnorr, S. Beck, and A. Ziegler
Complex Transcription and Splicing of Odorant Receptor Genes
J. Biol. Chem., May 23, 2003; 278(22): 19691 - 19701.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Aerts, G. Thijs, B. Coessens, M. Staes, Y. Moreau, and B. D. Moor
Toucan: deciphering the cis-regulatory logic of coregulated genes
Nucleic Acids Res., March 15, 2003; 31(6): 1753 - 1764.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
N. D. Trinklein, S. J. F. Aldred, A. J. Saldanha, and R. M. Myers
Identification and Functional Analysis of Human Transcriptional Promoters
Genome Res., February 1, 2003; 13(2): 308 - 312.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Clamp, D. Andrews, D. Barker, P. Bevan, G. Cameron, Y. Chen, L. Clark, T. Cox, J. Cuff, V. Curwen, et al.
Ensembl 2002: accommodating comparative genomics
Nucleic Acids Res., January 1, 2003; 31(1): 38 - 42.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Mathe, M.-F. Sagot, T. Schiex, and P. Rouze
Current methods of gene prediction, their strengths and weaknesses
Nucleic Acids Res., October 1, 2002; 30(19): 4103 - 4117.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.