|
Vol. 12, Issue 9, 1418-1427, September 2002
METHODS
GAZE: A Generic Framework for the Integration of Gene-Prediction Data by Dynamic Programming
Kevin L.
Howe,
Tom
Chothia, and
Richard
Durbin1
The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus,
Hinxton, Cambridge CB10 1SA, UK
We describe a method (implemented in a program, GAZE) for assembling
arbitrary evidence for individual gene components (features) into
predictions of complete gene structures. Our system is generic in that
both the features themselves, and the model of gene structure against
which potential assemblies are validated and scored, are external to
the system and supplied by the user. GAZE uses a dynamic programming
algorithm to obtain the highest scoring gene structure according to the
model and posterior probabilities that each input feature is part of a
gene. A novel pruning strategy ensures that the algorithm has a
run-time effectively linear in sequence length. To demonstrate the
flexibility of our system in the incorporation of additional evidence
into the gene prediction process, we show how it can be used to both
represent nonstandard gene structures (in the form of
trans-spliced genes in Caenorhabditis elegans), and
make use of similarity information (in the form of Expressed Sequence Tag alignments), while requiring no change to
the underlying software. GAZE is available at
http://www.sanger.ac.uk/Software/analysis/GAZE.
1
Corresponding author.
12:1418-1427 ©2002 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/02 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
Q. Liu, A. J. Mackey, D. S. Roos, and F. C. N. Pereira
Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction
Bioinformatics,
March 1, 2008;
24(5):
597 - 605.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. DeCaprio, J. P. Vinson, M. D. Pearson, P. Montgomery, M. Doherty, and J. E. Galagan
Conrad: Gene prediction using conditional random fields
Genome Res.,
September 1, 2007;
17(9):
1389 - 1398.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Coghlan and R. Durbin
Genomix: a method for combining gene-finders' predictions, which uses evolutionary conservation of sequence and intron exon structure
Bioinformatics,
June 15, 2007;
23(12):
1468 - 1475.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Wei, P. Lamesch, M. Arumugam, J. Rosenberg, P. Hu, M. Vidal, and M. R. Brent
Closing in on the C. elegans ORFeome by cloning TWINSCAN predictions
Genome Res.,
April 1, 2005;
15(4):
577 - 582.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Birney, M. Clamp, and R. Durbin
GeneWise and Genomewise
Genome Res.,
May 1, 2004;
14(5):
988 - 995.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. E. Allen, M. Pertea, and S. L. Salzberg
Computational Gene Prediction Using Multiple Sources of Evidence
Genome Res.,
January 1, 2004;
14(1):
142 - 148.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|