Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


Genome Res. 17:839-851, 2007
©2007 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/07 $5.00
OPEN ACCESS ARTICLE
This Article
OPEN ACCESS ARTICLE
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zheng, D.
Right arrow Articles by Gerstein, M. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zheng, D.
Right arrow Articles by Gerstein, M. B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Letter

Pseudogenes in the ENCODE regions: Consensus annotation, analysis of transcription, and evolution

Deyou Zheng1,13, Adam Frankish2, Robert Baertsch3, Philipp Kapranov4, Alexandre Reymond5,6, Siew Woh Choo7, Yontao Lu3, France Denoeud8, Stylianos E. Antonarakis6, Michael Snyder9, Yijun Ruan7, Chia-Lin Wei7, Thomas R. Gingeras4, Roderic Guigó8,10, Jennifer Harrow2, and Mark B. Gerstein1,11,12,13

1 Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA; 2 Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1HH, United Kingdom; 3 Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, California 95064, USA; 4 Affymetrix, Inc., Santa Clara, California 92024, USA; 5 Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland; 6 Department of Genetic Medicine and Development, University of Geneva Medical School, 1211 Geneva, Switzerland; 7 Genome Institute of Singapore, Singapore 138672, Singapore; 8 Grup de Recerca en Informática Biomèdica, Institut Municipal d’Investigació Mèdica/Universitat Pompeu Fabra, Passeig Marítim de la Barceloneta, 37-49, 08003, Barcelona, Catalonia, Spain; 9 Molecular, Cellular & Developmental Biology Department, Yale University, New Haven, Connecticut 06520, USA; 10 Center for Genomic Regulation, Passeig Marítim de la Barceloneta, 37-49, 08003, Barcelona, Catalonia, Spain; 11 Department of Computer Science, Yale University, New Haven, Connecticut 06520, USA; 12 Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA

Arising from either retrotransposition or genomic duplication of functional genes, pseudogenes are "genomic fossils" valuable for exploring the dynamics and evolution of genes and genomes. Pseudogene identification is an important problem in computational genomics, and is also critical for obtaining an accurate picture of a genome’s structure and function. However, no consensus computational scheme for defining and detecting pseudogenes has been developed thus far. As part of the ENCyclopedia Of DNA Elements (ENCODE) project, we have compared several distinct pseudogene annotation strategies and found that different approaches and parameters often resulted in rather distinct sets of pseudogenes. We subsequently developed a consensus approach for annotating pseudogenes (derived from protein coding genes) in the ENCODE regions, resulting in 201 pseudogenes, two-thirds of which originated from retrotransposition. A survey of orthologs for these pseudogenes in 28 vertebrate genomes showed that a significant fraction (~80%) of the processed pseudogenes are primate-specific sequences, highlighting the increasing retrotransposition activity in primates. Analysis of sequence conservation and variation also demonstrated that most pseudogenes evolve neutrally, and processed pseudogenes appear to have lost their coding potential immediately or soon after their emergence. In order to explore the functional implication of pseudogene prevalence, we have extensively examined the transcriptional activity of the ENCODE pseudogenes. We performed systematic series of pseudogene-specific RACE analyses. These, together with complementary evidence derived from tiling microarrays and high throughput sequencing, demonstrated that at least a fifth of the 201 pseudogenes are transcribed in one or more cell lines or tissues.


13 Corresponding authors.

E-mail Mark.Gerstein{at}yale.edu; fax (360) 838-7861.

E-mail deyou.zheng{at}yale.edu; fax (360) 838-7861.

[Supplemental material is available online at www.genome.org and http://www.pseudogene.org/ENCODE/supplement/.]

Article is online at http://www.genome.org/cgi/doi/10.1101/gr.5586307


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
K. Okamura and K. Nakai
Retrotransposition as a Source of New Promoters
Mol. Biol. Evol., June 1, 2008; 25(6): 1231 - 1238.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
J. Cai, R. Zhao, H. Jiang, and W. Wang
De Novo Origination of a New Protein-Coding Gene in Saccharomyces cerevisiae
Genetics, May 1, 2008; 179(1): 487 - 496.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
Z. D. Zhang, P. Cayting, G. Weinstock, and M. Gerstein
Analysis of Nuclear Receptor Pseudogenes in Vertebrates: How the Silent Tell Their Stories
Mol. Biol. Evol., January 1, 2008; 25(1): 131 - 143.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al.
Targeted discovery of novel human exons by comparative genomics
Genome Res., December 1, 2007; 17(12): 1763 - 1773.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. F. Furlong, R. Younger, M. Kasahara, R. Reinhardt, M. Thorndyke, and P. W. H. Holland
A Degenerate ParaHox Gene Cluster in a Degenerate Vertebrate
Mol. Biol. Evol., December 1, 2007; 24(12): 2681 - 2686.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. B. Gerstein, C. Bruce, J. S. Rozowsky, D. Zheng, J. Du, J. O. Korbel, O. Emanuelsson, Z. D. Zhang, S. Weissman, and M. Snyder
What is a gene, post-ENCODE? History and updated definition
Genome Res., June 1, 2007; 17(6): 669 - 681.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
T. R. Gingeras
Origin of phenotypes: Genes and transcripts
Genome Res., June 1, 2007; 17(6): 682 - 690.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
F. Denoeud, P. Kapranov, C. Ucla, A. Frankish, R. Castelo, J. Drenkow, J. Lagarde, T. Alioto, C. Manzano, J. Chrast, et al.
Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions
Genome Res., June 1, 2007; 17(6): 746 - 759.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.
Copyright © 2007 by Cold Spring Harbor Laboratory Press.