|
|
|
|
Genome Res. 13:1552-1553, 2003 ©2003 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/03 $5.00 Abstract Exon Structure Analysis, Ortholog Identification, and SNP Candidate Screening by Mapping Mouse RIKEN Sequences to Multiple Genome AssembliesGenomics Institute of the Novartis Research Foundation (GNF), San Diego, California 92121, USA
Mapping RIKEN full-length cDNAs (The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team 2002
We have mapped 60,770 RIKEN clones by BLAT (Kent 2002
The single-exon clones longer than 1 Kb are candidates for further investigation as bona fide intronless genes, retransposition events, or possible genomic DNA contamination. For this investigation, the expression profile of an intronless clone can be very informative (Su et al. 2002 This analysis also allows us to roughly compare the completeness of the assemblies. Of 60,172 RIKEN cDNAs containing >100 non-masked bases, >99% were mapped at >70% length to both latest assemblies, much more complete than the earlier assemblies. Figure 1 illustrates the comparison of the mapping to four assemblies. MGSC v.3 in green as left bars, Celera R13 in blue as right bars for each chromosome. Where RIKEN clones mapped to both assemblies, a cyan line connects the mapping positions, whereas triangles mark clones mapped exclusively to one assembly. The large-scale discrepancies are marked in red. One can observe a 10-Mb contig inversion on chromosome X (later detected and corrected by MGSC) and smaller ones on chr.5, 17, 18, 19, etc. The up-to-date scalable version of the mapping comparison is available at http://www.gnf.org/RIKEN/.
The extra bars represent superimposed syntenic regions identified by mapping to human assemblies, NCBI v.28 left from MGSC v.3, Celera R26h right from the mouse Celera R13. The two-digit electric color code for human chromosomes is shown at the bottom. Several cases of different syntenic assignment deserve further investigation. Footnotes Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1458903.
1 Corresponding author. REFERENCES
The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and Phase II Team. 2002. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420:563 -573.[CrossRef][Medline]
Kent, W.J. 2002. BLATThe BLAST-Like Alignment Tool. Genome Res. 12:656
-664. Mouse Genome Sequencing Consortium. 2002. Initial sequencing and comparative analysis of the mouse genome. Nature 420:520 -562.[CrossRef][Medline]
Su, A.I., Cooke, M.P., Ching, K.A., Hakak, Y., Walker, J.R., Wiltshire, T., Orth, A.P., Vega, R.G., Sapinoso, L.M., Moqrich, A., et al. 2002. Large-scale analysis of the human and mouse transcriptomes. Proc. Natl. Acad. Sci.
99:4465
-4470.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||