Genome Res. 14:507-516, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Reconstructing the Genomic Architecture of Ancestral Mammals: Lessons From Human, Mouse, and Rat Genomes
Guillaume Bourque1,
Pavel A. Pevzner2 and
Glenn Tesler3,4
1 Centre de Recherches Mathématiques, Université de Montréal, Canada H3C 3J7
2 Department of Computer Science and Engineering, University of CaliforniaSan Diego, La Jolla, California 92093, USA
3 Department of Mathematics, University of CaliforniaSan Diego, La Jolla, California 92093, USA
Recent analysis of genome rearrangements in human and mouse genomes revealed evidence for more rearrangements than thought previously and shed light on previously unknown features of mammalian evolution, like breakpoint reuse and numerous microrearrangements. However, two-way analysis cannot reveal the genomic architecture of ancestral mammals or assign rearrangement events to different lineages. Thus, the "original synteny" problem introduced by Nadeau and Sankoff previously, remains unsolved, as at least three mammalian genomes are required to derive the ancestral mammalian karyotype. We show that availability of the rat genome allows one to reconstruct a putative genomic architecture of the ancestral murid rodent genome. This reconstruction suggests that this ancestral genome retained many previously postulated chromosome associations in the placental ancestor and reveals others that were beyond the resolution of cytogenetic, radiation hybrid mapping, and chromosome painting techniques. Three-way analysis of rearrangements leads to a reliable reconstruction of the genomic architecture of specific regions in the murid ancestor, including the X chromosome, and for the first time allows one to assign major rearrangement events to one of human, mouse, and rat lineages. Our analysis implies that the rate of rearrangements is much higher in murid rodents than in the human lineage and confirms the existence of rearrangement hot-spots in all three lineages.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1975204.
4 Corresponding author. E-MAIL gptesler{at}ucsd.edu; FAX (858) 534-7029.
[Supplemental material is available online at www.genome.org.]
5 Table 1 illustrates some potential pitfalls of the artificial separation of large-scale and small-scale rearrangements. Increasing the minimum synteny block length filters out smaller blocks and reduces the number of large-scale rearrangements of those that remain. We concurrently increased the gap threshold (described later) by a proportionate amount to the minimum synteny block length, which allowed more anchors into these larger synteny blocks and resulted in more microrearrangements. Apart from these issues, there is ambiguity in how to treat the chromosome ends; because the teleomeres have not been sequenced (too many repeats), we do not know whether rearrangements involving them are bounded at the exact end of the chromosome, or somewhere within the end regions. The lack of sequence data precludes us from computing the blocks (if any) at the ends and the rearrangements involving them. Some chromosome ends do not participate in rearrangements, so there is ambiguity in whether to call them breakpoint regions.
6 In Pevzner and Tesler (2003a), we used anchors produced by Michael Kamal (Waterston et al. 2002) for using another procedure, which did not output the signs of the anchors nor information on the repeat familes. We used heuristics to guess the signs of those anchors when possible, and discarded the 2% of the anchors for which this determination was not possible. Some of the guessed anchor signs may have been incorrect, leading to an undercount of microrearrangements.
7 Although the Hannenhalli-Pevzner algorithm finds a most parsimonious rearrangement scenario for two genomes, the real scenario is not necessarily a most parsimonious one, and the order of rearrangement events within a most parsimonious scenario usually remains uncertain. Availability of more than two genomes remedies some of these limitations and provides a means to infer the gene order in the mammalian ancestor (Bourque and Pevzner 2002).
8 In many cases, MGR does not provide an exact solution. It attempts to determine a most parsimonious tree based on macrorearrangements of the synteny block orders in the genomes. The synteny block order of the computed ancestral nodes is only approximate, because there are other possible orders that give identical tree scores, although exploration of neighboring alternative solutions suggests that most of the adjacencies (including all of the chromosome associations) are valid. Localizing the analysis to particular regions of the genome, or including data from appropriate additional species acting as outgroups, can help resolve these issues.
9 The rate of rearrangements on the path from the placental ancestor to human may be significantly smaller than 1.6, as most of the rearrangements on the human branch in Figure 3 may have been acquired on the path from the placental to the murid rodent ancestor.
10 The scenario being optimal, and the ancestor being unique are not typical. With arbitrary data, MGR approximates an optimal scenario, but does not guarantee it will achieve it. In general, distances between two genomes in the computed tree equal or exceed their pairwise distance if the tree constraints are ignored; in this case they are equal, proving the scenario is optimal. Also in general, the recovered ancestor may not be unique.
11 We emphasize that by reusing breakpoints, we do not mean multiple use of exactly the same genomic position as an endpoint of rearrangements, but rather the fact that the breakpoint regions host endpoints for multiple rearrangement events.
12 The breakpoint reuse analysis depends crucially on the accurate generation of synteny blocks and separation between macro- and microrearrangements (Sankoff and Nadeau 2003). Our breakpoint reuse estimates assume this separation is accurate and is not blurred by microrearrangements on the border between breakpoint regions and synteny blocks.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
B.-Y. Liao and J. Zhang
Coexpression of Linked Genes in Mammalian Genomes Is Generally Disadvantageous
Mol. Biol. Evol.,
August 1, 2008;
25(8):
1555 - 1565.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Darai-Ramqvist, A. Sandlund, S. Muller, G. Klein, S. Imreh, and M. Kost-Alimova
Segmental duplications and evolutionary plasticity at tumor chromosome break-prone regions
Genome Res.,
March 1, 2008;
18(3):
370 - 379.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. U. Pontius, J. C. Mullikin, D. R. Smith, Agencourt Sequencing Team, K. Lindblad-Toh, S. Gnerre, M. Clamp, J. Chang, R. Stephens, B. Neelam, et al.
Initial sequence and comparative analysis of the cat genome
Genome Res.,
November 1, 2007;
17(11):
1675 - 1689.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Gordon, S. Yang, M. Tran-Gyamfi, D. Baggott, M. Christensen, A. Hamilton, R. Crooijmans, M. Groenen, S. Lucas, I. Ovcharenko, et al.
Comparative analysis of chicken chromosome 28 provides new clues to the evolutionary fragility of gene-rich vertebrate regions
Genome Res.,
November 1, 2007;
17(11):
1603 - 1613.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Radhakrishnan, M. A. Fares, F. S. French, and S. H. Hall
Comparative genomic analysis of a mammalian {beta}-defensin gene cluster
Physiol Genomics,
August 20, 2007;
30(3):
213 - 222.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Semon and K. H. Wolfe
Rearrangement Rate following the Whole-Genome Duplication in Teleosts
Mol. Biol. Evol.,
March 1, 2007;
24(3):
860 - 867.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Gonzalez, F. Casals, and A. Ruiz
Testing Chromosomal Phylogenies and Inversion Breakpoint Reuse in Drosophila
Genetics,
January 1, 2007;
175(1):
167 - 177.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Sun, G. Skogerbo, and R. Chen
Conserved distances between vertebrate highly conserved elements
Hum. Mol. Genet.,
October 1, 2006;
15(19):
2911 - 2922.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Backstrom, M. Brandstrom, L. Gustafsson, A. Qvarnstrom, H. Cheng, and H. Ellegren
Genetic Mapping in a Natural Population of Collared Flycatchers (Ficedula albicollis): Conserved Synteny but Gene Order Rearrangements on the Avian Z Chromosome
Genetics,
September 1, 2006;
174(1):
377 - 386.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Kirkpatrick and N. Barton
Chromosome Inversions, Local Adaptation and Speciation
Genetics,
May 1, 2006;
173(1):
419 - 434.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. N. Dewey and L. Pachter
Evolution at the nucleotide level: the problem of multiple whole-genome alignment.
Hum. Mol. Genet.,
April 15, 2006;
15(suppl_1):
R51 - R56.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Froenicke, M. G. Caldes, A. Graphodatsky, S. Muller, L. A. Lyons, T. J. Robinson, M. Volleth, F. Yang, and J. Wienberg
Are molecular cytogenetics and bioinformatics suggesting diverging models of ancestral mammalian genomes?
Genome Res.,
March 1, 2006;
16(3):
306 - 310.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Bourque, G. Tesler, and P. A. Pevzner
The convergence of cytogenetics and rearrangement-based models for ancestral genome reconstruction.
Genome Res.,
March 1, 2006;
16(3):
311 - 313.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Takahashi and H. Nakashima
Negative Correlation of G+C Content at Silent Substitution Sites Between Orthologous Human and Mouse Protein-Coding Sequences
DNA Res,
January 1, 2006;
13(4):
135 - 140.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Karlin
Colloquium Perspective: Statistical signals in bioinformatics
PNAS,
September 20, 2005;
102(38):
13355 - 13362.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. Kai, K. Kikuchi, M. Fujita, H. Suetake, A. Fujiwara, Y. Yoshiura, M. Ototake, B. Venkatesh, K. Miyaki, and Y. Suzuki
A Genetic Linkage Map for the Tiger Pufferfish, Takifugu rubripes
Genetics,
September 1, 2005;
171(1):
227 - 238.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. J. Murphy, D. M. Larkin, A. E.-v. der Wind, G. Bourque, G. Tesler, L. Auvil, J. E. Beever, B. P. Chowdhary, F. Galibert, L. Gatzke, et al.
Dynamics of Mammalian Chromosome Evolution Inferred from Multispecies Comparative Maps
Science,
July 22, 2005;
309(5734):
613 - 617.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Wood, T. Udagawa, Y. Hida, and R. J. D'Amato
X-Linked Dominant Growth Suppression of Transplanted Tumors in C57BL/6J-scid Mice
Cancer Res.,
July 1, 2005;
65(13):
5690 - 5695.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Misceo, M. F. Cardone, L. Carbone, P. D'Addabbo, P. J. de Jong, M. Rocchi, and N. Archidiacono
Evolutionary History of Chromosome 20
Mol. Biol. Evol.,
February 1, 2005;
22(2):
360 - 366.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Bourque, E. M. Zdobnov, P. Bork, P. A. Pevzner, and G. Tesler
Comparative architectures of mammalian and chicken genomes reveal highly variable rates of genomic rearrangements across different lineages
Genome Res.,
January 1, 2005;
15(1):
98 - 110.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Blanchette, E. D. Green, W. Miller, and D. Haussler
Reconstructing large regions of an ancestral mammalian genome in silico
Genome Res.,
December 1, 2004;
14(12):
2412 - 2423.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Zhao, J. Shetty, L. Hou, A. Delcher, B. Zhu, K. Osoegawa, P. de Jong, W. C. Nierman, R. L. Strausberg, and C. M. Fraser
Human, Mouse, and Rat Genome Large-Scale Rearrangements: Stability Versus Speciation
Genome Res.,
October 1, 2004;
14(10a):
1851 - 1860.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A.-M. Mallon, L. Wilming, J. Weekes, J. G.R. Gilbert, J. Ashurst, S. Peyrefitte, L. Matthews, M. Cadman, R. McKeone, C. A. Sellick, et al.
Organization and Evolution of a Gene-Rich Region of the Mouse Genome: A 12.7-Mb Region Deleted in the Del(13)Svea36H Mouse
Genome Res.,
October 1, 2004;
14(10a):
1888 - 1901.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. A. Pevzner, H. Tang, and G. Tesler
De Novo Repeat Classification and Fragment Assembly
Genome Res.,
September 1, 2004;
14(9):
1786 - 1796.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|