Vol. 10, Issue 7, 1020-1030, July 2000
LETTER
Parking Strategies for Genome Sequencing
Jared C.
Roach,1,4
Vesteinn
Thorsson,1,2 and
Andrew F.
Siegel1,2,3
1 The Institute for Systems Biology, Seattle, Washington
98105 USA; 2 Department of Molecular Biotechnology, University
of Washington, Seattle, Washington 98195 USA; 3 Departments of
Management Science, Finance, and Statistics, University of Washington,
Seattle, Washington 98195 USA
The parking strategy is an iterative approach to DNA sequencing.
Each iteration consists of sequencing a novel portion of target DNA
that does not overlap any previously sequenced region. Subject to the
constraint of no overlap, each new region is chosen randomly. A parking
strategy is often ideal in the early stages of a project for rapidly
generating unique data. As a project progresses, parking becomes
progressively more expensive and eventually prohibitive. We present a
mathematical model with a generalization to allow for overlaps. This
model predicts multiple parameters, including progress, costs, and the
distribution of gap sizes left by a parking strategy. The highly
fragmented nature of the gaps left after an initial parking strategy
may make it difficult to finish a project efficiently. Therefore, in
addition to our parking model, we model gap closing by walking. Our
gap-closing model is generalizable to many other strategies. Our
discussion includes modified parking strategies and hybrids with other
strategies. A hybrid parking strategy has been employed for portions of
the Human Genome Project.
4
Corresponding author. Present address: The Institute for
Systems Biology, 4225 Roosevelt Way NE, Suite 200, Seattle, Washington 98105 USA.
10:1020-1030 ©2000 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/00 $5.00