|
|
|
|
Vol. 10, Issue 5, 624-633, May 2000
REPORTS
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
The success of the ongoing Human Genome Project has resulted in accelerated plans for completing the human genome sequence and the earlier-than-anticipated initiation of efforts to sequence the mouse genome. As a complement to these efforts, we are utilizing the available human sequence to refine human-mouse comparative maps and to assemble sequence-ready mouse physical maps. Here we describe how the first glimpses of genomic sequence from human chromosome 7 are directly facilitating these activities. Specifically, we are actively enhancing the available human-mouse comparative map by analyzing human chromosome 7 sequence for the presence of orthologs of mapped mouse genes. Such orthologs can then be precisely positioned relative to mapped human STSs and other genes. The chromosome 7 sequence generated to date has allowed us to more than double the number of genes that can be placed on the comparative map. The latter effort reveals that human chromosome 7 is represented by at least 20 orthologous segments of DNA in the mouse genome. A second component of our program involves systematically analyzing the evolving human chromosome 7 sequence for the presence of matching mouse genes and expressed-sequence tags (ESTs). Mouse-specific hybridization probes are designed from such sequences and used to screen a mouse bacterial artificial chromosome (BAC) library, with the resulting data used to assemble BAC contigs based on probe-content data. Nascent contigs are then expanded using probes derived from newly generated BAC-end sequences. This approach produces BAC-based sequence-ready maps that are known to contain a gene(s) and are homologous to segments of the human genome for which sequence is already available. Our ongoing efforts have thus far resulted in the isolation and mapping of >3,800 mouse BACs, which have been assembled into >100 contigs. These contigs include >250 genes and represent ~40% of the mouse genome that is homologous to human chromosome 7. Together, these approaches illustrate how the availability of genomic sequence directly facilitates studies in comparative genomics and genome evolution.
| |
INTRODUCTION |
|---|
|
|
|---|
The rapid advances and monumental achievements of
the Human Genome Project are igniting a new era in biomedical research. The availability of the complete genomic sequence for the model organism Caenorhabditis elegans (The C. elegans
Sequencing Consortium 1998
), in conjunction with the impending
completion of the Drosophila and human genomic sequences, provides an
unprecedented view of the genetic essence of multicellular organisms.
In the coming century, interpretation and utilization of complete
genome sequences will provide new insights into old problems and
inspire the development and testing of novel hypotheses. Adapting to a
scientific environment with such rapidly changing and powerful new
resources presents a significant but rewarding challenge to those
studying biology.
Genome mapping is historically dependent on species-specific resources
and reagents as well as species-independent technologies. For instance,
genome mapping in mouse has been performed principally with genetic
mapping methods using inbred strains (Reeves & D'Eustachio 1999
). This
valuable mouse-specific resource combined with technological advances
(e.g., PCR) has increased the number of available genetic markers
(Dietrich et al. 1992
; Dietrich et al. 1994
; Dietrich et al. 1996
) and
the rate at which they can be assembled into maps. The recent
development of radiation hybrid mapping (Cox et al. 1990
; Matise et al.
1999
) and markers that are conserved between species (Lyons et al.
1997
) reduces the dependence on species-specific resources, thereby
increasing the number of genomes amenable to systematic mapping. The
complete sequence of the human genome will provide a reference
vertebrate `genetic blueprint' and, as such, should accelerate the
mapping of other genomes.
Here we provide a general overview of the approaches we are developing and implementing to exploit available human genomic sequence for the purpose of mapping a related mammal, the mouse. Importantly, our preliminary results demonstrate how genomic sequence greatly enhances comparative genome analyses.
| |
Overview of Experimental Strategy |
|---|
|
|
|---|
As a complement to the ongoing and accelerated efforts to sequence
the human genome (Collins et al. 1998
; Waterston & Sulston 1998
; The
Sanger Centre & The Washington University Genome Sequencing Center
1998
) , we are developing a generalized strategy for using human
sequence data to facilitate human-mouse comparative mapping. A
schematic overview of our evolving approach is depicted in Figure 1. In the Human Genome Project, individual human
bacterial-based clones [mostly bacterial artificial chromosomes (BACs)
(Shizuya et al. 1992
)] are being sequenced (Waterston & Sulston 1998
;
The Sanger Centre & The Washington University Genome Sequencing Center 1998). Most often, these have been mapped relative to existing sets of
sequence-tagged sites (STSs) (Olson et al. 1989
), ordered by yeast
artificial chromosome (YAC)-based STS-content mapping (Green & Olson
1990
; Green & Green 1991
) and/or radiation hybrid mapping (Cox et al.
1990
). By simple electronic detection of STSs within the genomic
sequence, the BACs and their corresponding sequence can be reliably
organized relative to the starting STS map. In addition to providing
base-pair-level resolution, the genomic sequence enhances the STS map
by identifying and positioning additional genes, expressed-sequence
tags (ESTs), and other sequence elements. Such efforts, which require
electronic analyses only, produce a more detailed and accurate
sequence-based map.
|
A variant of the above strategy can also be used to assemble refined
comparative maps. Comparative mapping involves establishing the
relative genomic positions of orthologous gene pairs in two species,
such as human and mouse. Traditionally, this information comes from
genetic, physical, radiation hybrid, and/or cytogenetic mapping data
and can include genes/ESTs present on the human STS map that have also
been mapped in mouse. A comparative map built from a single human STS
map has the virtue of providing a high confidence marker order but is
limited by the number of included genes (Hudson et al. 1995
; Nagaraja
et al. 1997
; Bouffard et al. 1997
). In contrast, an integrated
STS-sequence map represents a robust framework for including a much
larger set of genes (theoretically all genes) in the comparative map
of a human genomic segment. Specifically, this involves the
identification of genes in the human sequence that have been mapped
either in human and mouse or only in mouse (Fig. 1). In the case of an
ortholog pair already mapped in both species, only the identification
of the gene in the human genomic sequence is required. In the case of a
mapped mouse gene whose ortholog has not yet been mapped in human, the gene pair can be included on the comparative map provided the sequences
are determined to be orthologous.
For the comparative mapping studies described below, we have
established a hierarchical, stepwise approach to determine if a mouse
gene sequence is orthologous to a human sequence. First, a given human
genomic sequence is compared by BLAST with all available
mouse mRNA and EST sequences, with any human-mouse DNA matches at or
above the reported average for human-mouse orthologs (Makalowski et al.
1996
; Makalowski & Boguski 1998
) selected for further consideration.
Second, candidate orthologs are evaluated with respect to their
chromosomal location (if known) in mouse. As a DNA sequence-independent
resource, such mapping data helps to remove nonorthologous mouse
sequences that are similar to related gene family members or are
pseudogenes as well as those detected because of some artifact
associated with the human sequence (e.g., a chimeric clone). Third,
candidate mouse orthologs with the highest similarity to a specific
human sequence are then compared to all other available human genomic
sequence; if no greater similarity is found, then the matching mouse
sequence is operationally considered to be orthologous to the initial
human sequence (used in the first step). This last criterion will
become increasingly relevant and powerful as the human genome sequence
reaches completion in the coming months. It is worth noting that even a
mapped mouse EST with no known human ortholog can be readily placed on
the comparative map based on these criteria.
A virtue of a refined human-mouse comparative map is that it provides accurate and detailed insights into the patterns of conservation between orthologous regions of the human and mouse genomes. Such patterns provide the basis for defining conserved segments containing genes linked in both human and mouse. By aligning groups of mouse genes with the human map, the boundaries between the conserved segments can be localized. These boundaries define genomic segments that have undergone rearrangement (i.e., evolutionary breakpoints; see Fig. 1) in either the mouse or human lineage since their last common ancestor.
In a similar fashion, human genomic sequence can be used for organizing
and constructing higher resolution physical maps of mouse DNA.
Specifically, the human sequence can be analyzed for the presence of
orthologous mouse genes and ESTs (Marra et al. 1999
) (also see
http://www.ncbi.nlm.nih.gov/dbEST). Mouse-specific hybridization probes
can then be designed from these sequences and used to construct mouse
BAC contigs (Fig. 1). Importantly, the refined comparative map
(described above) can be used as an organizational framework for
mapping all regions of the mouse genome orthologous to a particular
region of the human genome.
The general approaches described above and in Figure 1 thus produce two
immediate products: refined human-mouse comparative maps and nascent
mouse BAC physical maps. Our experience implementing these strategies
has focused on human chromosome 7, which contains ~5% of the human
genome and is associated with a detailed STS map [containing 2150 mapped STSs (Bouffard et al. 1997
) (see
http://genome.nhgri.nih.gov/chr7)] and considerable amounts of
sequence data. In fact, as of this writing, >50% of the ~170-Mb
chromosome is available as high-accuracy, finished sequence, while an
additional 15-20% of the chromosome is available as `working draft'
sequence (see http://www.ncbi.nlm.nih.gov/genome/seq). Below we
summarize our comparative mapping of the mouse genomic regions
corresponding to human chromosome 7.
| |
Assembly of a Refined Human-Mouse Comparative Map |
|---|
|
|
|---|
Human-mouse comparative maps provide an important resource for
the analysis of mammalian genomes (Copeland et al. 1993
; Carver & Stubbs 1997
; Nadeau & Sankoff 1998
) (see
http://www.informatics.jax.org). As such, the human gene database
OMIM (http://ncbi.nlm.nih.gov/omim) and the Mouse Genome Informatics
database (MGI; http://www.informatics.jax.org) include comparative
mapping information. In particular, MGI provides an updated and
reliable database of human-mouse comparative mapping data.
Unfortunately, the predicted human gene locations catalogued in MGI are
cytogenetically based, precluding the high-confidence ordering of genes
along a human chromosome. Another, older source of human-mouse (and
mouse-human) comparative mapping data is the Davis Human/Mouse Homology
Map compiled by DeBry and Seldin (DeBry & Seldin 1996
) (see
http://www.ncbi.nlm.nih.gov/Homology); for example, this map depicts 14 segments of the mouse genome known to be orthologous to human
chromosome 7. However, the rapid generation of human chromosome 7 sequence coupled with ongoing mouse mapping efforts makes these various
sources of comparative mapping data incomplete. Thus, we are routinely
using the available STS map and evolving human chromosome 7 sequence to
assemble a more detailed and up-to-date human-mouse comparative map.
This refined comparative map, which at present contains 154 genes, is
available as a Web supplement to this paper (see
http://genome.nhgri.nih.gov/chr7/comparative).
The genes on our current refined comparative map were included based on
publicly available mouse mapping data and the human chromosome 7 STS
(Bouffard et al. 1997
) and sequence (see
http://genome.nhgri.nih.gov/chr7) maps. Specifically, genes included
both in previous comparative maps (see http://www.informatics.jax.org
and http://www.ncbi.nlm.nih.gov/Homology) and in either the human
chromosome 7 STS map (n=58) or mapped chromosome 7 genomic sequence
(n=58) were ordered first. As part of a comprehensive effort to compare
all mouse mRNA sequences against human chromosome 7 genomic sequence,
additional mapped mouse genes orthologous to human chromosome 7 sequence were then added to the map (n=25). To better define regions of
interest and enhance the comparative map, a small number of mouse
orthologs of genes detected in human genomic sequence were genetically
(n=8) or cytogenetically (n=5) mapped in mouse.
A small region of the resulting comparative map is depicted in Figure
2 to illustrate the types of differences seen between the MGI cytogenetic-based map, the Davis Human/Mouse Homology Map, and
our refined map. Numerous new genes are present on the MGI-derived and
refined maps but absent on the older Davis map, with very few genes
uniquely present on either the MGI or Davis map compared to the refined
map. Within the individual conserved genomic segments, a comparison of
gene order is either not possible (e.g., between the MGI and refined
maps) or reveals multiple differences [e.g., as seen between the Davis
and refined maps (Fig. 2)]. These differences were expected because
many of the human genes had not been ordered previously relative to one
another, and thus in the Davis map, their positions were based on the
inferred mouse order (rather than the human order), which could differ
between species (DeBry & Seldin 1996
). In addition, the order of
conserved segments is occasionally different between the Davis and
refined maps. For instance, the positions of the conserved segments
from proximal MMU6 and proximal MMU5 are inverted between the Davis and
refined maps (see below). Finally, in some cases, the Davis map
incorrectly assigns a gene to a conserved segment. For example, GNB2 is placed in the distal MMU5 conserved segment near the
top of Figure 2 (corresponding to human 7q11), but in fact resides within a
more telomeric segment with homology to distal MMU5 (corresponding to 7q22).
|
Taken together, our efforts to date have provided a more refined human-mouse comparative map for human chromosome 7. A global view of this is provided in Figure 3, which for illustrative purposes shows the sizes and relative order of the orthologous segments on the Davis map and our refined map. The former contains 14 conserved segments from seven distinct regions of the mouse genome, while the latter (to date) contains 20 conserved segments from ten distinct regions of the mouse genome. The six unique conserved segments on the refined map reflect four instances where new mapping data led to the addition of a conserved segment and two instances where simple misordering of genes on the Davis map resulted in a failure to detect conserved segments.
Comparison of the previous comparative maps with our refined map
reveals two instances where the relative order of adjacent conserved
segments is inverted: one on 7p between proximal MMU13 and distal MMU2
and the other shown in Figure 2 involving proximal MMU5 and proximal
MMU6 (discussed above). Interestingly, we have encountered only a
single case of potential conflict in the assignment of a gene to a
conserved segment. Calumenin (CALU), whose cytogenetic-based assignment to 7q32 (Yabe et al. 1998
) can be confirmed by its identification in genomic sequence from that region, has been mapped to
MMU7 by genetic mapping (Yabe et al. 1997
) but falls in the middle of a
large conserved segment homologous to proximal MMU6. Preliminary
physical mapping in the mouse is consistent with the linkage of
calumenin (Calu) to proximal MMU6; as such, this gene is
currently not included on the refined comparative map. It is also worth
noting that three conserved segments (the proximal MMU5 segment on 7p,
the distal MMU2 segment on 7p, and the distal MMU6 segment on 7q) are
based on a single gene, making them somewhat tentative. However,
indirect support for the presence of the proximal MMU5 segment comes
from other conserved segments homologous to this region on human chromosome 7.
Although there are 20 conserved segments on the refined comparative map, these correspond to just ten nonadjacent regions of the mouse genome. This reflects the fact that five conserved segments in mouse (proximal MMU5, proximal MMU6, distal MMU5, distal MMU12, and proximal MMU12) are homologous to 15 distinct segments on human chromosome 7. The latter provides strong evidence for the occurrence of intrachromosomal rearrangements, particularly inversions, during the evolution of human chromosome 7. For example, at least three inversions likely occurred to produce the four human chromosome 7 segments homologous to proximal MMU5. Thus, intrachromosomal rearrangements can be hypothesized to be the mechanism producing roughly half of the conserved segments on human chromosome 7.
Because the refined comparative map is based on the linear STS and
sequence maps, the size of each conserved segment can be estimated, as
can the sizes of the gaps (corresponding to regions for which the
orthologous mouse region is unknown) (Fig. 3). Based on STS content
[and assuming an average inter-STS spacing of 79 kb (Bouffard et al.
1997
)], the average size of the conserved segments is ~7.1 Mb, with
the largest and smallest being ~37 and <0.1 Mb, respectively. It
is not possible to compare directly the sizes of the chromosome 7 conserved segments between the Davis and refined comparative maps since
the former reflects a compilation of data from multiple maps. However,
the average size of conserved segments across the mouse genome has been
estimated at 8.1 (±1.6) cM (Nadeau & Taylor 1984
). Given their
assumption of a ~1600-cM mouse genome (Nadeau & Taylor 1984
) and the
additional assumption that the mouse genome spans ~3000 Mb, one
might postulate that the average size of a conserved segment in the
mouse genome is 15.2 ( ± 3.0) Mb. This genome-wide estimate is over
twice that observed with the current refined comparative map of human
chromosome 7, even after inflating our estimated average conserved
segment size by ~15% to account for the portion of the chromosome
that cannot currently be assigned to a position in mouse (see below). Whether this difference reflects refined mapping data, specific features of chromosome 7, or some combination of these should become
clearer with additional mapping and sequencing of both genomes.
|
Our efforts have resulted in the predicted assignment of ~85% of human chromosome 7 to a defined position in the mouse genome. The remaining portions of the chromosome (`gaps') correspond to regions residing between conserved segments; these average ~1.4 Mb, with the largest and smallest estimated to be ~5 and <0.1 Mb, respectively. To reduce the amount of chromosome 7 without an established corresponding position in mouse, orthologous genes and ESTs within these gaps are being identified in human genomic sequence and then genetically mapped in mouse. In five of the first seven cases, such an effort simply extended one of the adjacent conserved segments; in the remaining two cases, a new conserved segment (homologous to proximal MMU9 on 7p) was defined. The amount of chromosome 7 not represented on the refined comparative map should be reduced further by the additional mapping of orthologous genes and ESTs in the mouse (e.g., by the clone-based physical mapping detailed below) and eventually by the sequencing of the corresponding regions of the mouse genome.
| |
Construction of Mouse BAC-based Sequence-ready Maps |
|---|
|
|
|---|
A major program to sequence the mouse genome as part of the ongoing
Human Genome Project has recently been launched (Battey et al. 1999
)
(see http://www.nih.gov/science/models/mouse). To facilitate the
sequencing of regions of the mouse genome corresponding to human
chromosome 7, we are extending our comparative mapping efforts to
include the construction of mouse BAC-based physical maps (Fig. 1).
Similar to constructing refined comparative maps, the key computational step for this endeavor involves analyzing all new human genomic sequence for the presence of orthologous mouse sequences. In this case, however, the orthologous sequences (occasionally known genes but most often ESTs) need not be mapped in the mouse genome for use in the construction of corresponding mouse physical maps. Although by no means assured, the central working assumptions are that gene content and order within a conserved segment are identical between human and mouse and that a single mouse BAC contig can be assembled for that region. Such assumptions provide a framework for evaluating probe order and content in the assembled mouse contigs, directing contig mergers, estimating the sizes of gaps between adjacent mouse contigs, and detecting potential differences in gene order between human and mouse. This general approach also allows blocks of human genomic sequence as well as orthologous, unmapped mouse genes/ESTs to be assigned predicted positions in the mouse genome based on their relationship with already mapped, flanking genes. In addition, the relative confidence for such predicted assignments can be made based on proximity to the informative flanking markers.
The analysis of human genomic sequence for detecting the presence of
orthologous mouse sequences involves the following steps. Available
human sequence, either in a preliminary working draft or high-accuracy
finished form, is compared to a series of GenBank databases (nr, month,
htgs, and dbEST; see http://www.ncbi.nlm.nih.gov) using the program
PowerBLAST (Zhang & Madden 1997
) (Fig. 4) and/or to a local, customized database consisting
of all publicly available mouse and rat mRNA sequences using
WebBLAST (Ferlanti et al. 1999
). The resulting sequence
alignments provide valuable overviews of the gene content and structure
of each region (see Fig. 4).
|
While PowerBLAST output provides a highly detailed and comprehensive analysis of a region, the actual computer time and file storage can be limiting when applied on a large scale. Thus, by reviewing carefully the results from PowerBLAST analysis of ~200 BAC sequences, we have been able to establish stringent criteria for establishing human-mouse orthology. These criteria can be applied to the more streamlined and time-efficient WebBLAST program to compare our local rodent mRNA database with new human genomic sequence.
Based mainly on our PowerBLAST analyses and using a cutoff
value of E=1e-25, we have found that ~50% of human BAC sequences contain at least one matching orthologous rodent sequence. This figure is highly variable from region to region, reflecting significant differences in gene density and an incomplete collection of
rodent mRNA sequences. However, these findings suggest that our general
approach should yield nascent physical maps for mouse genomic segments
corresponding to at least 50% of sequenced human regions. Our analyses
can also be used to estimate the percentage of all human genes with at
least one matching orthologous rodent mRNA sequence. We have found this
number to be >80%, consistent with the findings of Marra et al. (1999)
.
For mouse BAC library screening, we have emphasized the development and
use of `overgo'-type hybridization probes. Such probes are derived by
subjecting two ~22-base oligonucleotides with an 8-base
complementary overlap to a radioactive primer extension reaction,
thereby producing a double-stranded ~36-bp probe (Vollrath 1999
)
(see http://genome.wustl.edu/gsc). Specifically, the aligned rodent
gene/EST and human genomic sequence is examined, and overgo probes that
are fully contained within individual exons are designed (so as to
insure contiguity of the probe and mouse genomic sequence). Probes are
generated for all independent genes within a region and, when possible,
an attempt is made to develop probes at somewhat regular spacing (Fig.
4), albeit with the caveat that the mouse gene structure, order, and
size are inferred. For the RPCI-23 mouse BAC library (see
http://bacpac.med.buffalo.edu), with its average BAC size of ~200 kb
(J.W. Thomas, V.V. Braden Maduro, and E.D. Green, unpubl.), an
inter-probe spacing of ~65 kb would be desirable (since clones would
then contain an average of at least two probe sequences). In cases
where more than one probe are developed for a gene, a fixed probe order
can be imposed, given that exon order between orthologs should not
vary. This data becomes useful in evaluating the integrity of the
nascent BAC contig maps. Similarly, achieving high probe densities
helps to verify that positive clones contain the true orthologous
sequence as opposed to a closely related sequence (e.g., a pseudogene). For example, clones containing the authentic gene should often be
positive for another nearby probe(s), unlike clones containing only a
related sequence. Finally, in our experience >95% of the designed
mouse overgo probes successfully identify at least one positive mouse
clone within the RPCI-23 library.
To screen the mouse RPCI-23 BAC library on a large scale, we have
implemented a two-stage hybridization-based approach. The first round
involves hybridizing filters containing arrays of all the clones in the
library with pools of 30-50 overgo probes. Single-colony isolates of
all candidate positive clones are then rearrayed onto new (secondary)
filters, which in turn are analyzed with combinatorial pools derived
from the starting set of overgo probes. These second-round
combinatorial pools are constructed by arraying the overgo probes and
pooling individual rows and columns (i.e., two-dimensional pooling).
Each row/column pool is then hybridized to an individual secondary
filter. The data from the second round of hybridizations can then be
disambiguated to produce the probe-clone relationships. This entire
two-stage screening procedure is greatly facilitated by use of the
program ComboScreen (Jamison et al. 2000
) (see
http://genome.nhgri.nih.gov/comboscreen), which has been developed in
conjunction with our ongoing mouse mapping efforts. This program is
tailored specifically for assimilating the data from both rounds of
hybridization, including the design of secondary filters and
combinatorial pools, as well as analyzing the results to deduce
probe-clone relationships.
The resulting probe-content data can be analyzed with the program
SEGMAP (Green & Green 1991
) (see
http://www.genome.washington.edu/UWGC/analysistools/segmap.htm) to
assemble nascent contig maps that reflect both the predicted probe
order and the relative overlaps among BACs. Such contigs can then be
readily expanded by developing new overgo probes from insert-end
sequences of BACs residing at contig ends [e.g., generated by
bubble-PCR (Riley et al. 1990
)] and rescreening the BAC library. Figure 4 depicts a representative example of a
SEGMAP-constructed BAC contig spanning >1 Mb of
proximal MMU6 containing Pon1, Pon2, Pon3,
and Dncic1. Newly mapped and ordered genes in this contig include the orthologs of the rat neurabin gene and human PDK4 and SLC25A13. Note that the deduced order for the six mapped
genes is the same in human and mouse.
In addition to being a tool for contig expansion, BAC insert-end sequences can often provide mapping information based on their similarity to human genomic sequence. For example, ~7% of our nonredundant BAC-end sequences (n= >1000) have significant similarity to human genomic sequence. Such matching sequence can be utilized to solidify the evolving human-mouse comparative map. Another cross-species resource involves designing overgo probes from rat mRNA sequences. In our preliminary experience, >60% of such rat probes can be used to isolate corresponding mouse BACs (with a similar percentage of mouse probes successfully isolating corresponding rat BACs).
The above strategy for constructing BAC contigs has been implemented
for mapping mouse genomic regions homologous to human chromosome 7. To
date, we have analyzed the sequence for greater than half of human
chromosome 7 (~100 Mb), resulting in the design of >400
gene/EST-based overgo probes. In addition, we have developed >400
probes from generated BAC-end sequences. Together, these probes have
been used to assemble >3800 BACs from the RPCI-23 library into 110 contigs that are estimated to span a total of ~61 Mb of the mouse
genome. Thus, our firstpass mapping efforts appear to have yielded
mouse BAC contigs corresponding to more than half of the human sequence
analyzed. Additional rounds of contig expansion coupled with the
increasing availability of genome-wide mapping resources
(http://www.nih.gov/science/models/mouse), human genomic sequence
(http://www.ncbi.nlm.nih.gov/genome/seq), and mouse ESTs
(http://www.ncbi.nlm.nih.gov/dbEST) should facilitate expansion of
these mouse physical maps. Finally, groups of BACs already assigned to
individual contigs (such as those depicted in Fig. 4) can be readily
subjected to restriction enzyme digest-based fingerprint analysis
(Marra et al. 1997
), allowing the assembly of higher-resolution contig
maps (i.e., `sequence-ready maps') and the rational selection of
minimally overlapping clones suitable for systematic sequencing.
In summary, the approaches we describe here represent effective strategies for utilizing newly generated reference genomic sequence from one species, in this case human, to refine comparative maps and to facilitate the rapid construction of `sequence-ready' physical maps. These methods are efficient and can be implemented in a relatively small-scale fashion, with all the results mentioned above generated with ~3 person-years of effort. Together, these mapping studies yield detailed information about the relationships among homologous mouse and human genomic segments, provide well-characterized clone sets for use in genomic sequencing, and, in the long run, should markedly enhance our understanding of mammalian genome structure and evolution. Finally, while human-mouse comparative mapping has been emphasized here, our approaches can be readily adapted for use in the comparative analyses of other vertebrate species.
| |
ACKNOWLEDGMENTS |
|---|
We thank the Washington University Genome Sequencing Center and University of Washington Genome Center for generating human chromosome 7 sequence data. We also thank Bill Pavan for critical review of this manuscript.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
2 Corresponding author.
E-MAIL egreen{at}nhgri.nih.gov; FAX (301) 402-4735.
| |
REFERENCES |
|---|
|
|
|---|
Received December 14, 1999; accepted in revised form March 21,2000.
This article has been cited by other articles:
![]() |
S. W. Scherer and E. D. Green Human chromosome 7 circa 2004: a model for structural and functional studies of the human genome Hum. Mol. Genet., October 1, 2004; 13(suppl_2): R303 - R313. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Frazer, L. Elnitski, D. M. Church, I. Dubchak, and R. C. Hardison Cross-Species Sequence Comparisons: A Review of Methods and Available Resources Genome Res., January 1, 2003; 13(1): 1 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Pevzner and G. Tesler Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse Genomes Genome Res., January 1, 2003; 13(1): 37 - 45. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Thomas, M. G. Schueler, T. J. Summers, R. W. Blakesley, J. C. McDowell, P. J. Thomas, J. R. Idol, V. V.B. Maduro, S.-Q. Lee-Lin, J. W. Touchman, et al. Pericentromeric Duplications in the Laboratory Mouse Genome Res., January 1, 2003; 13(1): 55 - 63. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Thomas, A. B. Prasad, T. J. Summers, S.-Q. Lee-Lin, V. V.B. Maduro, J. R. Idol, J. F. Ryan, P. J. Thomas, J. C. McDowell, and E. D. Green Parallel Construction of Orthologous Sequence-Ready Clone Contig Maps in Multiple Species Genome Res., August 1, 2002; 12(8): 1277 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. DeSilva, L. Elnitski, J. R. Idol, J. L. Doyle, W. Gan, J. W. Thomas, S. Schwartz, N. L. Dietrich, S. M. Beckstrom-Sternberg, J. C. McDowell, et al. Generation and Comparative Analysis of ~3.3 Mb of Mouse Genomic Sequence Orthologous to the Region of Human Chromosome 7q11.23 Implicated in Williams Syndrome Genome Res., January 1, 2002; 12(1): 3 - 15. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. E. Kwitek, P. J. Tonellato, D. Chen, J. Gullings-Handley, Y. S. Cheng, S. Twigger, T. E. Scheetz, T. L. Casavant, M. Stoll, M. A. Nobrega, et al. Automated Construction of High-Density Comparative Maps Between Rat, Human, and Mouse Genome Res., November 1, 2001; 11(11): 1935 - 1943. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Wilson, C. Riemer, D. W. Martindale, P. Schnupf, A. P. Boright, T. L. Cheung, D. M. Hardy, S. Schwartz, S. W. Scherer, L.-C. Tsui, et al. Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5 Nucleic Acids Res., March 15, 2001; 29(6): 1352 - 1365. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Ranz, F. Casals, and A. Ruiz How Malleable is the Eukaryotic Genome? Extreme Rate of Chromosomal Rearrangement in the Genus Drosophila Genome Res., February 1, 2001; 11(2): 230 - 239. [Abstract] [Full Text] |
||||
| |||||||||||||||||||||||||||||||||||||||||