Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Zhang, Z.
Right arrow Articles by Gerstein, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Zhang, Z.
Right arrow Articles by Gerstein, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 10, 1466-1482, October 2002

Identification and Analysis of Over 2000 Ribosomal Protein Pseudogenes in the Human Genome

Zhaolei Zhang, Paul Harrison, and Mark Gerstein1

Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA

Mammals have 79 ribosomal proteins (RP). Using a systematic procedure based on sequence-homology, we have comprehensively identified pseudogenes of these proteins in the human genome. Our assignments are available at http://www.pseudogene.org or http://bioinfo.mbb.yale.edu/genome/pseudogene. In total, we found 2090 processed pseudogenes and 16 duplications of RP genes. In relation to the matching parent protein, each of the processed pseudogenes has an average relative sequence length of 97% and an average sequence identity of 76%. A small number (258) of them do not contain obvious disablements (stop codons or frameshifts) and, therefore, could be mistaken as functional genes, and 178 are disrupted by one or more repetitive elements. On average, processed pseudogenes have a longer truncation at the 5' end than the 3' end, consistent with the target-primed-reverse-transcription (TPRT) mechanism. Interestingly, on chromosome 16, an RPL26 processed pseudogene was found in the intron region of a functional RPS2 gene. The large-scale distribution of RP pseudogenes throughout the genome appears to result, chiefly, from random insertions with the numbers on each chromosome, consequently, proportional to its size. In contrast to RP genes, the RP pseudogenes have the highest density in GC-intermediate regions (41%-46%) of the genome, with the density pattern being between that of LINEs and Alus. This can be explained by a negative selection theory as we observed that GC-rich RP pseudogenes decay faster in GC-poor regions. Also, we observed a correlation between the number of processed pseudogenes and the GC content of the associated functional gene, i.e., relatively GC-poor RPs have more processed pseudogenes. This ranges from 145 pseudogenes for RPL21 down to 3 pseudogenes for RPL14. We were able to date the RP pseudogenes based on their sequence divergence from present-day RP genes, finding an age distribution similar to that for Alus. The distribution is consistent with a decline in retrotransposition activity in the hominid lineage during the last 40 Myr. We discuss the implications for retrotransposon stability and genome dynamics based on these new findings.


1 Corresponding author.


12:1466-1482 ©2002 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/02 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
Z. D. Zhang, P. Cayting, G. Weinstock, and M. Gerstein
Analysis of Nuclear Receptor Pseudogenes in Vertebrates: How the Silent Tell Their Stories
Mol. Biol. Evol., January 1, 2008; 25(1): 131 - 143.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
T. R. Gingeras
Origin of phenotypes: Genes and transcripts
Genome Res., June 1, 2007; 17(6): 682 - 690.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
J. Flygare and S. Karlsson
Diamond-Blackfan anemia: erythropoiesis lost in translation
Blood, April 15, 2007; 109(8): 3152 - 3154.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. E. Karro, Y. Yan, D. Zheng, Z. Zhang, N. Carriero, P. Cayting, P. Harrrison, and M. Gerstein
Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D55 - D60.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
G. Drouin
Processed Pseudogenes Are More Abundant in Human and Mouse X Chromosomes than in Autosomes
Mol. Biol. Evol., September 1, 2006; 23(9): 1652 - 1655.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Z. Zhang, N. Carriero, D. Zheng, J. Karro, P. M. Harrison, and M. Gerstein
PseudoPipe: an automated pseudogene identification pipeline
Bioinformatics, June 15, 2006; 22(12): 1437 - 1439.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. J. van Baren and M. R. Brent
Iterative gene prediction and pseudogene removal improves genome annotation.
Genome Res., May 1, 2006; 16(5): 678 - 685.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. Vinckenbosch, I. Dupanloup, and H. Kaessmann
Evolutionary fate of retroposed gene copies in the human genome
PNAS, February 28, 2006; 103(9): 3220 - 3225.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. Pyne, S. Skiena, and B. Futcher
Copy Correction and Concerted Evolution in the Conservation of Yeast Genes
Genetics, August 1, 2005; 170(4): 1501 - 1513.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
H. Diekmann, M. Klinger, T. Oertle, D. Heinz, H.-M. Pogoda, M. E. Schwab, and C. A. O. Stuermer
Analysis of the Reticulon Gene Family Demonstrates the Absence of the Neurite Growth Inhibitor Nogo-A in Fish
Mol. Biol. Evol., August 1, 2005; 22(8): 1635 - 1648.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. M. Harrison, D. Zheng, Z. Zhang, N. Carriero, and M. Gerstein
Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability
Nucleic Acids Res., April 28, 2005; 33(8): 2374 - 2383.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Perreault, J.-F. Noel, F. Briere, B. Cousineau, J.-F. Lucier, J.-P. Perreault, and G. Boire
Retropseudogenes derived from the human Ro/SS-A autoantigen-associated hY RNAs
Nucleic Acids Res., April 7, 2005; 33(6): 2032 - 2041.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
S. R. Schulze, D. A. R. Sinclair, K. A. Fitzpatrick, and B. M. Honda
A Genetic and Molecular Characterization of Two Proximal Heterochromatic Genes on Chromosome 3 of Drosophila melanogaster
Genetics, April 1, 2005; 169(4): 2165 - 2177.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Adel, D. Laurent, and M. Dominique
HOPPSIGEN: a database of human and mouse processed pseudogenes
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D59 - D66.
[Abstract] [Full Text] [PDF]


Home page
DevelopmentHome page
E. R. Oliver, T. L. Saunders, S. A. Tarle, and T. Glaser
Ribosomal protein L24 defect in Belly spot and tail (Bst), a mouse Minute
Development, August 15, 2004; 131(16): 3907 - 3920.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. R. Weil, P. Widlak, J. D. Minna, and H. R. Garner
Global Survey of Chromatin Accessibility Using DNA Microarrays
Genome Res., July 1, 2004; 14(7): 1374 - 1381.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
J. J. Emerson, H. Kaessmann, E. Betran, and M. Long
Extensive Gene Traffic on the Mammalian X Chromosome
Science, January 23, 2004; 303(5657): 537 - 540.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Nakao, M. Yoshihama, and N. Kenmochi
RPG: the Ribosomal Protein Gene database
Nucleic Acids Res., January 1, 2004; 32(90001): D168 - 170.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
Z. Zhang, P. M. Harrison, Y. Liu, and M. Gerstein
Millions of Years of Evolution Preserved: A Comprehensive Catalog of the Processed Pseudogenes in the Human Genome
Genome Res., December 1, 2003; 13(12): 2541 - 2558.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
D. Torrents, M. Suyama, E. Zdobnov, and P. Bork
A Genome-Wide Survey of Human Pseudogenes
Genome Res., December 1, 2003; 13(12): 2559 - 2567.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. J. Fleishman, T. Dagan, and D. Graur
pANT: A Method for the Pairwise Assessment of Nonfunctionalization Times of Processed Pseudogenes
Mol. Biol. Evol., November 1, 2003; 20(11): 1876 - 1880.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Z. Zhang and M. Gerstein
Patterns of nucleotide substitution, insertion and deletion in the human genome inferred from pseudogenes
Nucleic Acids Res., September 15, 2003; 31(18): 5338 - 5348.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
L. Z. Strichman-Almashanu, M. Bustin, and D. Landsman
Retroposed Copies of the HMG Genes: A Window to Genome Dynamics
Genome Res., May 1, 2003; 13(5): 800 - 812.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. M. Harrison, D. Milburn, Z. Zhang, P. Bertone, and M. Gerstein
Identification of pseudogenes in the Drosophila melanogaster genome
Nucleic Acids Res., February 1, 2003; 31(3): 1033 - 1037.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.