|
Vol. 12, Issue 10, 1466-1482, October 2002
Identification and Analysis of Over 2000 Ribosomal Protein Pseudogenes in the Human Genome
Zhaolei
Zhang,
Paul
Harrison, and
Mark
Gerstein1
Department of Molecular Biophysics and Biochemistry, Yale
University, New Haven, Connecticut 06520, USA
Mammals have 79 ribosomal proteins (RP). Using a systematic
procedure based on sequence-homology, we have comprehensively identified pseudogenes of these proteins in the human genome. Our
assignments are available at http://www.pseudogene.org or http://bioinfo.mbb.yale.edu/genome/pseudogene. In total, we found 2090 processed pseudogenes and 16 duplications of RP genes. In relation to
the matching parent protein, each of the processed pseudogenes has an
average relative sequence length of 97% and an average sequence
identity of 76%. A small number (258) of them do not contain obvious
disablements (stop codons or frameshifts) and, therefore, could be
mistaken as functional genes, and 178 are disrupted by one or more
repetitive elements. On average, processed pseudogenes have a longer
truncation at the 5' end than the 3' end, consistent with the
target-primed-reverse-transcription (TPRT) mechanism. Interestingly, on
chromosome 16, an RPL26 processed pseudogene was found in the intron
region of a functional RPS2 gene. The large-scale distribution of RP
pseudogenes throughout the genome appears to result, chiefly, from
random insertions with the numbers on each chromosome, consequently,
proportional to its size. In contrast to RP genes, the RP pseudogenes
have the highest density in GC-intermediate regions (41%-46%) of the genome, with the density pattern being between that of LINEs and Alus.
This can be explained by a negative selection theory as we observed
that GC-rich RP pseudogenes decay faster in GC-poor regions. Also, we
observed a correlation between the number of processed pseudogenes and
the GC content of the associated functional gene, i.e., relatively
GC-poor RPs have more processed pseudogenes. This ranges from 145 pseudogenes for RPL21 down to 3 pseudogenes for RPL14. We were able to
date the RP pseudogenes based on their sequence divergence from
present-day RP genes, finding an age distribution similar to that for
Alus. The distribution is consistent with a decline in
retrotransposition activity in the hominid lineage during the last 40 Myr. We discuss the implications for retrotransposon stability and
genome dynamics based on these new findings.
1
Corresponding author.
12:1466-1482 ©2002 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/02 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
Z. D. Zhang, P. Cayting, G. Weinstock, and M. Gerstein
Analysis of Nuclear Receptor Pseudogenes in Vertebrates: How the Silent Tell Their Stories
Mol. Biol. Evol.,
January 1, 2008;
25(1):
131 - 143.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. R. Gingeras
Origin of phenotypes: Genes and transcripts
Genome Res.,
June 1, 2007;
17(6):
682 - 690.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Flygare and S. Karlsson
Diamond-Blackfan anemia: erythropoiesis lost in translation
Blood,
April 15, 2007;
109(8):
3152 - 3154.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. E. Karro, Y. Yan, D. Zheng, Z. Zhang, N. Carriero, P. Cayting, P. Harrrison, and M. Gerstein
Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D55 - D60.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Drouin
Processed Pseudogenes Are More Abundant in Human and Mouse X Chromosomes than in Autosomes
Mol. Biol. Evol.,
September 1, 2006;
23(9):
1652 - 1655.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Zhang, N. Carriero, D. Zheng, J. Karro, P. M. Harrison, and M. Gerstein
PseudoPipe: an automated pseudogene identification pipeline
Bioinformatics,
June 15, 2006;
22(12):
1437 - 1439.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. J. van Baren and M. R. Brent
Iterative gene prediction and pseudogene removal improves genome annotation.
Genome Res.,
May 1, 2006;
16(5):
678 - 685.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Vinckenbosch, I. Dupanloup, and H. Kaessmann
Evolutionary fate of retroposed gene copies in the human genome
PNAS,
February 28, 2006;
103(9):
3220 - 3225.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Pyne, S. Skiena, and B. Futcher
Copy Correction and Concerted Evolution in the Conservation of Yeast Genes
Genetics,
August 1, 2005;
170(4):
1501 - 1513.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Diekmann, M. Klinger, T. Oertle, D. Heinz, H.-M. Pogoda, M. E. Schwab, and C. A. O. Stuermer
Analysis of the Reticulon Gene Family Demonstrates the Absence of the Neurite Growth Inhibitor Nogo-A in Fish
Mol. Biol. Evol.,
August 1, 2005;
22(8):
1635 - 1648.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. M. Harrison, D. Zheng, Z. Zhang, N. Carriero, and M. Gerstein
Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability
Nucleic Acids Res.,
April 28, 2005;
33(8):
2374 - 2383.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Perreault, J.-F. Noel, F. Briere, B. Cousineau, J.-F. Lucier, J.-P. Perreault, and G. Boire
Retropseudogenes derived from the human Ro/SS-A autoantigen-associated hY RNAs
Nucleic Acids Res.,
April 7, 2005;
33(6):
2032 - 2041.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. R. Schulze, D. A. R. Sinclair, K. A. Fitzpatrick, and B. M. Honda
A Genetic and Molecular Characterization of Two Proximal Heterochromatic Genes on Chromosome 3 of Drosophila melanogaster
Genetics,
April 1, 2005;
169(4):
2165 - 2177.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Adel, D. Laurent, and M. Dominique
HOPPSIGEN: a database of human and mouse processed pseudogenes
Nucleic Acids Res.,
January 1, 2005;
33(suppl_1):
D59 - D66.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. R. Oliver, T. L. Saunders, S. A. Tarle, and T. Glaser
Ribosomal protein L24 defect in Belly spot and tail (Bst), a mouse Minute
Development,
August 15, 2004;
131(16):
3907 - 3920.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. R. Weil, P. Widlak, J. D. Minna, and H. R. Garner
Global Survey of Chromatin Accessibility Using DNA Microarrays
Genome Res.,
July 1, 2004;
14(7):
1374 - 1381.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. J. Emerson, H. Kaessmann, E. Betran, and M. Long
Extensive Gene Traffic on the Mammalian X Chromosome
Science,
January 23, 2004;
303(5657):
537 - 540.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Nakao, M. Yoshihama, and N. Kenmochi
RPG: the Ribosomal Protein Gene database
Nucleic Acids Res.,
January 1, 2004;
32(90001):
D168 - 170.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Zhang, P. M. Harrison, Y. Liu, and M. Gerstein
Millions of Years of Evolution Preserved: A Comprehensive Catalog of the Processed Pseudogenes in the Human Genome
Genome Res.,
December 1, 2003;
13(12):
2541 - 2558.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Torrents, M. Suyama, E. Zdobnov, and P. Bork
A Genome-Wide Survey of Human Pseudogenes
Genome Res.,
December 1, 2003;
13(12):
2559 - 2567.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. J. Fleishman, T. Dagan, and D. Graur
pANT: A Method for the Pairwise Assessment of Nonfunctionalization Times of Processed Pseudogenes
Mol. Biol. Evol.,
November 1, 2003;
20(11):
1876 - 1880.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Zhang and M. Gerstein
Patterns of nucleotide substitution, insertion and deletion in the human genome inferred from pseudogenes
Nucleic Acids Res.,
September 15, 2003;
31(18):
5338 - 5348.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Z. Strichman-Almashanu, M. Bustin, and D. Landsman
Retroposed Copies of the HMG Genes: A Window to Genome Dynamics
Genome Res.,
May 1, 2003;
13(5):
800 - 812.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. M. Harrison, D. Milburn, Z. Zhang, P. Bertone, and M. Gerstein
Identification of pseudogenes in the Drosophila melanogaster genome
Nucleic Acids Res.,
February 1, 2003;
31(3):
1033 - 1037.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|