Vol. 11, Issue 2, 274-280, February 2001
METHODS
Theories and Applications for Sequencing Randomly Selected Clones
Michael C.
Wendl,1,3
Marco A.
Marra,2
LaDeana W.
Hillier,1
Asif T.
Chinwalla,1
Richard K.
Wilson,1 and
Robert H.
Waterston1
1 Genome Sequencing Center, Washington University, St.
Louis, Missouri 63108, USA; 2 British Columbia Cancer Research
Centre, Vancouver, British Columbia V5Z 1L3, Canada
Theory is developed for the process of sequencing randomly selected
large-insert clones. Genome size, library depth, clone size, and clone
distribution are considered relevant properties and perfect overlap
detection for contig assembly is assumed. Genome-specific and nonrandom
effects are neglected. Order of magnitude analysis indicates library
depth is of secondary importance compared to the other variables,
especially as clone size diminishes. In such cases, the well-known
Poisson coverage law is a good approximation. Parameters derived from
these models are used to examine performance for the specific case of
sequencing random human BAC clones. We compare coverage and redundancy
rates for libraries possessing uniform and nonuniform clone
distributions. Results are measured against data from map-based
human-chromosome-2 sequencing. We conclude that the map-based approach
outperforms random clone sequencing, except early in a project.
However, simultaneous use of both strategies can be beneficial if a
performance-based estimate for halting random clone sequencing is made.
Results further show that the random approach yields maximum
effectiveness using nonbiased rather than biased libraries.
3
Corresponding author.
11:274-280 ©2001 by Cold Spring Harbor Laboratory Press ISSN 1088-9051/01 $5.00