Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Genome Res. 16:678-685, 2006
©2006 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/06 $5.00
OPEN ACCESS ARTICLE
This Article
OPEN ACCESS ARTICLE
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by van Baren, M. J.
Right arrow Articles by Brent, M. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by van Baren, M. J.
Right arrow Articles by Brent, M. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Resource

Iterative gene prediction and pseudogene removal improves genome annotation

Marijke J. van Baren and Michael R. Brent1

Laboratory for Computational Genomics, Department of Computer Science Washington University, Saint Louis, Missouri 63130, USA

Correct gene prediction is impaired by the presence of processed pseudogenes: nonfunctional, intronless copies of real genes found elsewhere in the genome. Gene prediction programs frequently mistake processed pseudogenes for real genes or exons, leading to biologically irrelevant gene predictions. While methods exist to identify processed pseudogenes in genomes, no attempt has been made to integrate pseudogene removal with gene prediction, or even to provide a freestanding tool that identifies such erroneous gene predictions. We have created PPFINDER (for Processed Pseudogene finder), a program that integrates several methods of processed pseudogene finding in mammalian gene annotations. We used PPFINDER to remove pseudogenes from N-SCAN gene predictions, and show that gene prediction improves substantially when gene prediction and pseudogene masking are interleaved. In addition, we used PPFINDER with gene predictions as a parent database, eliminating the need for libraries of known genes. This allows us to run the gene prediction/PPFINDER procedure on newly sequenced genomes for which few genes are known.


1 Corresponding author.

E-mail brent{at}cse.wustl.edu.

[Supplemental material is available online at www.genome.org. N-SCAN and PPFINDER are open source software and may be obtained from http://genes.cse.wustl.edu/.]

Article is online at http://www.genome.org/cgi/doi/10.1101/gr.4766206


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
J EndocrinolHome page
L. L Espey, R. A Garcia, H. Kondo, B. Ishizuka, S. Yoshioka, S. Fujii, S. Hampton, and J. S Richards
Expression of paralogs of cytochrome P45021a1 pseudogene (Cyp21a1-ps) and endogenous retrovirus SC1 (SC1) in the rat ovary during the ovulatory process
J. Endocrinol., July 1, 2008; 198(1): 231 - 241.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Stanke, M. Diekhans, R. Baertsch, and D. Haussler
Using native and syntenically mapped cDNA alignments to improve de novo gene finding
Bioinformatics, March 1, 2008; 24(5): 637 - 644.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al.
Targeted discovery of novel human exons by comparative genomics
Genome Res., December 1, 2007; 17(12): 1763 - 1773.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
D. Zheng, A. Frankish, R. Baertsch, P. Kapranov, A. Reymond, S. W. Choo, Y. Lu, F. Denoeud, S. E. Antonarakis, M. Snyder, et al.
Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolution
Genome Res., June 1, 2007; 17(6): 839 - 851.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
Rhesus Macaque Genome Sequencing and Analysis Cons, R. A. Gibbs, J. Rogers, M. G. Katze, R. Bumgarner, G. M. Weinstock, E. R. Mardis, K. A. Remington, R. L. Strausberg, J. C. Venter, et al.
Evolutionary and Biomedical Insights from the Rhesus Macaque Genome
Science, April 13, 2007; 316(5822): 222 - 234.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2006 by Cold Spring Harbor Laboratory Press.