Published online before print
December 30, 2002, 10.1101/gr.695703
Vol 13, Issue 1, 27-36, January 2003
Reevaluating Human Gene Annotation: A Second-Generation Analysis of Chromosome 22
John E. Collins,
Melanie E. Goward,
Charlotte G. Cole,
Luc J. Smink1,
Elizabeth J. Huckle,
Sarah Knowles,
Jacqueline M. Bye,
David M. Beare and
Ian Dunham2
The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus,
Hinxton, Cambridge CB10 1SA, UK
We report a second-generation gene annotation of human chromosome
22. Using expressed sequence databases, comparative sequence analysis,
and experimental verification, we have extended genes, fused previously
fragmented structures, and identified new genes. The total length in
exons of annotation was increased by 74% over our previously published
annotation and includes 546 protein-coding genes and 234 pseudogenes.
Thirty-two potential protein-coding annotations are partial copies of
other genes, and may represent duplications on an evolutionary path to
change or loss of function. We also identified 31 non-protein-coding
transcripts, including 16 possible antisense RNAs. By extrapolation, we
estimate the human genome contains 29,00036,000 protein-coding genes,
21,300 pseudogenes, and 1500 antisense RNAs. We suggest that our
revised annotation criteria provide a paradigm for future annotation of
the human genome.
[Supplemental material is available
online at www.genome.org. The sequence data from this study have been
submitted to GenBank under accession nos. AL009266, AL021682-3,
AL021708, AL022729, AL035081-2, AL035364, AL035366, AL035545, AL049654,
AL050253-8, AL050345-6, AL079310, AL096779-81, AL096879-81, AL096883,
AL096886, AL138578, AL157851, AL159142-3, AL160111-2, AL160131-2,
AL160311, AL355092, AL355192, AL355841, AL359401, AL359403, AL365511-5,
AL442116, AL449243, AL449244, AL450314, AL589866-7, AL590120,
AL590887-8, BU583989BU585359. The following individuals kindly
provided reagents, samples, or unpublished information as indicated in
the paper: J. Seilhamer, L. Stuve, H. Roest-Crollius, A. Levine, G.
Slater, and J. Kent.]
1 Present address: JDRF/WT Diabetes and Inflammation
Laboratory, Cambridge Institute for Medical Research, University of
Cambridge, Wellcome Trust/MRC Building, Addenbrookes Hospital,
Cambridge CB2 2XY, UK.
2 Corresponding author.
E-MAIL id1{at}sanger.ac.uk; FAX +44 (0) 1223 494919
Article and publication are at
http://www.genome.org/cgi/doi/10.1101/gr.695703. Article published online before print in December
2002.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
V. D. Winn, R. Haimov-Kochman, A. C. Paquet, Y. J. Yang, M. S. Madhusudhan, M. Gormley, K.-T. V. Feng, D. A. Bernlohr, S. McDonagh, L. Pereira, et al.
Gene Expression Profiling of the Human Maternal-Fetal Interface Reveals Dramatic Changes between Midgestation and Term
Endocrinology,
March 1, 2007;
148(3):
1059 - 1079.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. E. Karro, Y. Yan, D. Zheng, Z. Zhang, N. Carriero, P. Cayting, P. Harrrison, and M. Gerstein
Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation
Nucleic Acids Res.,
January 12, 2007;
35(suppl_1):
D55 - D60.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Damaraju, D. Murray, J. Dufour, D. Carandang, S. Myrehaug, G. Fallone, C. Field, R. Greiner, J. Hanson, C. E. Cass, et al.
Association of DNA Repair and Steroid Metabolism Gene Polymorphisms with Clinical Late Toxicity in Patients Treated with Conformal Radiotherapy for Prostate Cancer
Clin. Cancer Res.,
April 15, 2006;
12(8):
2545 - 2554.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Lipovich and M.-C. King
Abundant novel transcriptional units and unconventional gene pairs on human chromosome 22
Genome Res.,
January 1, 2006;
16(1):
45 - 54.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. M.J. Searle, J. Gilbert, V. Iyer, and M. Clamp
The Otter Annotation System
Genome Res.,
May 1, 2004;
14(5):
963 - 970.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Kampa, J. Cheng, P. Kapranov, M. Yamanaka, S. Brubaker, S. Cawley, J. Drenkow, A. Piccolboni, S. Bekiranov, G. Helt, et al.
Novel RNAs Identified From an In-Depth Analysis of the Transcriptome of Human Chromosomes 21 and 22
Genome Res.,
March 1, 2004;
14(3):
331 - 342.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. M. Porcel, O. Delfour, V. Castelli, V. De Berardinis, L. Friedlander, C. Cruaud, A. Ureta-Vidal, C. Scarpelli, P. Wincker, V. Schachter, et al.
Numerous Novel Annotations of the Human Genome Sequence Supported by a 5'-End-Enriched cDNA Collection
Genome Res.,
March 1, 2004;
14(3):
463 - 471.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Woodfine, H. Fiegler, D. M. Beare, J. E. Collins, O. T. McCann, B. D. Young, S. Debernardi, R. Mott, I. Dunham, and N. P. Carter
Replication timing of the human genome
Hum. Mol. Genet.,
January 15, 2004;
13(2):
191 - 202.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Babcock, A. Pavlicek, E. Spiteri, C. D. Kashork, I. Ioshikhes, L. G. Shaffer, J. Jurka, and B. E. Morrow
Shuffling of Genes Within Low-Copy Repeats on 22q11 (LCR22) by Alu-Mediated Recombination Events During Evolution
Genome Res.,
December 1, 2003;
13(12):
2519 - 2532.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Torrents, M. Suyama, E. Zdobnov, and P. Bork
A Genome-Wide Survey of Human Pseudogenes
Genome Res.,
December 1, 2003;
13(12):
2559 - 2567.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Martone, G. Euskirchen, P. Bertone, S. Hartman, T. E. Royce, N. M. Luscombe, J. L. Rinn, F. K. Nelson, P. Miller, M. Gerstein, et al.
Distribution of NF-{kappa}B-binding sites across human chromosome 22
PNAS,
October 14, 2003;
100(21):
12247 - 12252.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. J. Kent, R. Baertsch, A. Hinrichs, W. Miller, and D. Haussler
Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes
PNAS,
September 30, 2003;
100(20):
11484 - 11489.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
F. Mignone, G. Grillo, S. Liuni, and G. Pesole
Computational identification of protein coding potential of conserved sequence tags through cross-species evolutionary analysis
Nucleic Acids Res.,
August 1, 2003;
31(15):
4639 - 4645.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. B. Bajic and S. H. Seah
Dragon Gene Start Finder: An Advanced System for Finding Approximate Locations of the Start of Gene Transcriptional Units
Genome Res.,
August 1, 2003;
13(8):
1923 - 1929.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Chong, G. Zhang, and V. B. Bajic
FIE2: a program for the extraction of genomic DNA sequences around the start and translation initiation site of human genes
Nucleic Acids Res.,
July 1, 2003;
31(13):
3546 - 3553.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|