Genome Research


Vol. 10, Issue 4, 511-515, April 2000

METHODS
GeneID in Drosophila

Genís Parra, Enrique Blanco, and Roderic Guigó1

Grup de Recerca en Informàtica Mèdica, Institut Municipal d'Investigació Mèdica (IMIM), Universitat Pompeu Fabra, E-08003 Barcelona, Spain

GeneID is a program, designed with a hierarchical structure, to predict genes in anonymous genomic sequences. In the first step, splice sites and start and stop codons are predicted and scored along the sequence using position weight matrices (PWMs). In the second step, exons are built from these sites. Each exon is scored as the sum of the scores of its defining sites plus the log-likelihood ratio of a Markov model for coding DNA. In the last step, the gene structure is assembled from the set of predicted exons so as to maximize the sum of the scores of the assembled exons. In this paper we describe how the PWMs for sites and the Markov model of coding DNA were derived for Drosophila melanogaster. We also compare other models of coding DNA with the Markov model. Finally, we present and discuss the results obtained when GeneID is used to predict genes in the Adh region. These results show that the accuracy of GeneID predictions is currently comparable to that of other existing tools, but that GeneID is likely to be more efficient in terms of speed and memory usage. GeneID is available at http://www1.imim.es/~eblanco/GeneId.
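
The three-step hierarchy described above can be illustrated with a minimal Python sketch. It is not GeneID's actual code: the function names (pwm_score, exon_score, assemble), the uniform background frequency, and the toy matrix and exon values are all assumptions made for illustration. The assembly step is simplified to choosing the highest-scoring set of non-overlapping exons, and it ignores constraints such as reading-frame compatibility that a real gene assembler must respect.

```python
import math
from bisect import bisect_left

def pwm_score(site, pwm, background=0.25):
    """Log-likelihood ratio of `site` under a PWM versus a uniform background.

    `pwm` is a list with one dict per position mapping base -> probability
    (hypothetical layout chosen for this sketch).
    """
    return sum(math.log(pwm[i][base] / background) for i, base in enumerate(site))

def exon_score(start_site_score, end_site_score, coding_llr):
    """Exon score: sum of the two defining site scores plus the coding-model
    log-likelihood ratio (passed in here as a precomputed number)."""
    return start_site_score + end_site_score + coding_llr

def assemble(exons):
    """Pick non-overlapping exons maximizing the total score.

    `exons` is a list of (start, end, score) with inclusive coordinates.
    Returns (best_total, chosen_exons). Simplified: no frame or
    exon-type compatibility checks.
    """
    exons = sorted(exons, key=lambda e: e[1])
    ends = [e[1] for e in exons]
    best = [(0.0, [])]  # best[k]: optimum using only the first k exons
    for k, (start, end, score) in enumerate(exons):
        # j = number of earlier exons that end strictly before this one starts
        j = bisect_left(ends, start, 0, k)
        take_total = best[j][0] + score
        if take_total > best[k][0]:
            best.append((take_total, best[j][1] + [(start, end, score)]))
        else:
            best.append(best[k])
    return best[-1]

# Toy donor-site matrix strongly favouring "GT" (made-up probabilities).
toy_pwm = [
    {"A": 0.01, "C": 0.01, "G": 0.97, "T": 0.01},
    {"A": 0.01, "C": 0.01, "G": 0.01, "T": 0.97},
]

# Toy candidate exons (made-up coordinates and scores).
toy_exons = [(1, 100, 5.0), (50, 150, 7.0), (120, 200, 4.0), (160, 300, 3.0)]
total, chosen = assemble(toy_exons)
```

With the toy input, the exon at 1-100 overlaps the one at 50-150, so the sketch has to trade off individual exon scores against compatibility with neighbouring exons, which is the essence of the assembly step.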


1 Corresponding author.


10:511-515 ©2000 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/00 $5.00






