Genome Res. 14:685-692, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Methods
Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human
Michael Brudno1,
Alexander Poliakov2,
Asaf Salamov3,4,
Gregory M. Cooper5,
Arend Sidow5,6,
Edward M. Rubin2,3,
Victor Solovyev3,4,
Serafim Batzoglou1,7 and
Inna Dubchak2,3,7
1 Department of Computer Science, Stanford University, Stanford, California 94305, USA
2 Genomics Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
3 U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598, USA
4 Softberry Inc., Mount Kisco, New York 10549, USA
5 Department of Genetics, Stanford University, Stanford, California 94305-5324, USA
6 Department of Pathology, Stanford University, Stanford, California 94305-5324, USA
We have built a whole-genome multiple alignment of the three currently available mammalian genomes using a fully automated pipeline that combines the local/global approach of the Berkeley Genome Pipeline and the LAGAN program. The strategy is based on progressive alignment and consists of two main steps: (1) alignment of the mouse and rat genomes, and (2) alignment of human to either the mouse-rat alignments from step 1, or the remaining unaligned mouse and rat sequences. The resulting alignments demonstrate high sensitivity, with 87% of all human gene-coding areas aligned in both mouse and rat. The specificity is also high: <7% of the rat contigs are aligned to multiple places in human, and 97% of all alignments with human sequence >100 kb agree with a three-way synteny map built independently, using predicted exons in the three genomes. At the nucleotide level <1% of the rat nucleotides are mapped to multiple places in the human sequence in the alignment, and 96.5% of human nucleotides within all alignments agree with the synteny map. The alignments are publicly available online, with visualization through the novel Multi-VISTA browser that we also present.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2067704.
7 Corresponding authors. E-MAIL ildubchak{at}lbl.gov; FAX (510) 486-5717. E-MAIL serafim{at}cs.stanford.edu; FAX (650) 725-1449.
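
The two-step progressive strategy described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `step2_targets`, the use of plain sequence identifiers, and the assumption that step 1 yields a list of (mouse, rat) identifier pairs are all hypothetical and chosen only for illustration. The sketch shows only the bookkeeping of step 2, namely which targets human would be aligned against: the mouse-rat alignments from step 1, plus any mouse or rat sequences that step 1 left unaligned.

```python
from typing import List, Optional, Tuple

# A target is (kind, mouse_id, rat_id); ids are hypothetical placeholders.
Target = Tuple[str, Optional[str], Optional[str]]


def step2_targets(
    mouse_ids: List[str],
    rat_ids: List[str],
    mouse_rat_pairs: List[Tuple[str, str]],
) -> List[Target]:
    """List what human would be aligned against in step 2.

    mouse_rat_pairs stands in for the output of step 1 (mouse-rat
    alignment); here it is simply assumed to be given.
    """
    aligned_mouse = {m for m, _ in mouse_rat_pairs}
    aligned_rat = {r for _, r in mouse_rat_pairs}

    # Human is aligned to each mouse-rat alignment from step 1 ...
    targets: List[Target] = [("mouse-rat", m, r) for m, r in mouse_rat_pairs]
    # ... or to the mouse and rat sequences that remained unaligned.
    targets += [("mouse", m, None) for m in mouse_ids if m not in aligned_mouse]
    targets += [("rat", None, r) for r in rat_ids if r not in aligned_rat]
    return targets


if __name__ == "__main__":
    for target in step2_targets(["m1", "m2"], ["r1", "r2"], [("m1", "r1")]):
        print(target)
```

In this toy input, `m1` and `r1` were aligned in step 1, so human would be aligned once to that mouse-rat alignment and separately to the leftover sequences `m2` and `r2`.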
