|
|
|
|
Published online before print
February 12, 2004, 10.1101/gr.1481104 Genome Res. 14:463-471, 2004 ©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Resources Numerous Novel Annotations of the Human Genome Sequence Supported by a 5'-EndEnriched cDNA Collection1 Genoscope-Centre National de Séquençage and CNRS UMR-8030, 91000 Evry, France
A collection of 90,000 human cDNA clones generated to increase the fraction of "full-length" cDNAs available was analyzed by sequence alignment on the human genome assembly. Five hundred fifty-two gene models not found in LocusLink, with coding regions of at least 300 bp, were defined by using this collection. Exon composition proposed for novel genes showed an average of 4.7 exons per gene. In 20% of the cases, at least half of the exons predicted for new genes coincided with evolutionary conserved regions defined by sequence comparisons with the pufferfish Tetraodon nigroviridis. Among this subset, CpG islands were observed at the 5' end of 75%. In-frame stop codons upstream of the initiator ATG were present in 49% of the new genes, and 16% contained a coding region comprising at least 50% of the cDNA sequence. This cDNA resource also provided candidate small protein-coding genes, usually not included in genome annotations. In addition, analysis of a sample from this cDNA collection indicates that
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1481104. Article published online before print in February 2004.
5 Corresponding author. 2 Present address: LGI-BioInformatic, Aventis Pharma S.A., 94400, Vitry-Sur-Seine, France 3 Present address: European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB101SD, UK 4 Present address: Genomining, 92120, Montrouge, France.
[The sequence data from this study have been submitted to EMBL under accession nos. BX323813
This article has been cited by other articles:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||