Vol 13, Issue 5, 813-820, May 2003
LETTER
Quantitative Estimates of Sequence Divergence for Comparative Analyses of Mammalian Genomes
Gregory M. Cooper1,
Michael Brudno2,
NISC Comparative Sequencing Program3,
Eric D. Green3,
Serafim Batzoglou2 and
Arend Sidow1,4,5
1Department of Genetics, Stanford University, Stanford,
California 94305, USA; 2Department of Computer Science,
Stanford University, Stanford, California 94305, USA; 3Genome
Technology Branch and National Institutes of Health Intramural
Sequencing Center, National Human Genome Research Institute, National
Institutes of Health, Bethesda, Maryland 20892, USA;4
Department of Pathology, Stanford University,
Stanford, California 94305, USA
Comparative sequence analyses on a collection of carefully chosen
mammalian genomes could facilitate identification of functional
elements within the human genome and allow quantification of
evolutionary constraint at the single nucleotide level. High-resolution
quantification would be informative for determining the distribution of
important positions within functional elements and for evaluating the
relative importance of nucleotide sites that carry single nucleotide
polymorphisms (SNPs). Because the level of resolution in comparative
sequence analyses is a direct function of sequence diversity, we
propose that the information content of a candidate mammalian genome be
defined as the sequence divergence it would add relative to
already-sequenced genomes. We show that reliable estimates of genomic
sequence divergence can be obtained from small genomic regions. On the
basis of a multiple sequence alignment of 1.4 megabases each from
eight mammals, we generate such estimates for five unsequenced mammals.
Estimates of the neutral divergence in these data suggest that a small
number of diverse mammalian genomes in addition to human, mouse, and
rat would allow single nucleotide resolution in comparative sequence
analyses.
[The multiple sequence alignment of the
CFTR region and a spreadsheet with the calculations performed,
will be available as supplementary information online at
www.genome.org.]
5 Corresponding author.
E-MAIL arend{at}stanford.edu; FAX (650) 725-4905.
Article and publication are at
http://www.genome.org/cgi/doi/10.1101/gr.1064503.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
M. D. Rasmussen and M. Kellis
Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes
Genome Res.,
December 1, 2007;
17(12):
1932 - 1942.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Pheasant and J. S. Mattick
Raising the estimate of functional human sequences
Genome Res.,
September 1, 2007;
17(9):
1245 - 1253.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Prabhakar, F. Poulin, M. Shoukry, V. Afzal, E. M. Rubin, O. Couronne, and L. A. Pennacchio
Close sequence comparisons are sufficient to identify human cis-regulatory elements
Genome Res.,
July 1, 2006;
16(7):
855 - 863.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. J. Urban, R. Sebro, E. H. Hurowitz, M. K. Leabman, I. Badagnani, L. L. Lagpacan, N. Risch, and K. M. Giacomini
Functional genomics of membrane transporters in human populations
Genome Res.,
February 1, 2006;
16(2):
223 - 230.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. A. Grice, E. S. Rochelle, E. D. Green, A. Chakravarti, and A. S. McCallion
Evaluation of the RET regulatory landscape reveals the biological relevance of a HSCR-implicated enhancer
Hum. Mol. Genet.,
December 15, 2005;
14(24):
3837 - 3845.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. M. Cooper, E. A. Stone, G. Asimenos, NISC Comparative Sequencing Program, E. D. Green, S. Batzoglou, and A. Sidow
Distribution and intensity of constraint in mammalian genomic sequence
Genome Res.,
July 1, 2005;
15(7):
901 - 913.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. D. McAuliffe, M. I. Jordan, and L. Pachter
Subtree power analysis and species selection for comparative genomics
PNAS,
May 31, 2005;
102(22):
7900 - 7905.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Blanchette, E. D. Green, W. Miller, and D. Haussler
Reconstructing large regions of an ancestral mammalian genome in silico
Genome Res.,
December 1, 2004;
14(12):
2412 - 2423.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. A. Kellogg and J. L. Bennetzen
The evolution of nuclear genome structure in seed plants
Am. J. Botany,
October 1, 2004;
91(10):
1709 - 1725.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I. Ovcharenko, M. A. Nobrega, G. G. Loots, and L. Stubbs
ECR Browser: a tool for visualizing and accessing data from comparisons of multiple vertebrate genomes
Nucleic Acids Res.,
July 1, 2004;
32(suppl_2):
W280 - W286.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I. Ovcharenko, D. Boffelli, and G. G. Loots
eShadow: A Tool for Comparing Closely Related Sequences
Genome Res.,
June 1, 2004;
14(6):
1191 - 1198.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. M. Cooper, M. Brudno, E. A. Stone, I. Dubchak, S. Batzoglou, and A. Sidow
Characterization of Evolutionary Rates and Constraints in Three Mammalian Genomes
Genome Res.,
April 1, 2004;
14(4):
539 - 548.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. B. Yap and L. Pachter
Identification of Evolutionary Hotspots in the Rodent Genomes
Genome Res.,
April 1, 2004;
14(4):
574 - 579.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Brudno, A. Poliakov, A. Salamov, G. M. Cooper, A. Sidow, E. M. Rubin, V. Solovyev, S. Batzoglou, and I. Dubchak
Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human
Genome Res.,
April 1, 2004;
14(4):
685 - 692.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. P. Wilder, M.-T. Bihoreau, K. Argoud, T. K. Watanabe, M. Lathrop, and D. Gauguier
Integration of the Rat Recombination and EST Maps in the Rat Genomic Sequence and Comparative Mapping Analysis With the Mouse Genome
Genome Res.,
April 1, 2004;
14(4):
758 - 765.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Nagpal, M. W. Karaman, M. M. Timmerman, V. V. Ho, B. L. Pike, and J. G. Hacia
Improving the sensitivity and specificity of gene expression analysis in highly related organisms through the use of electronic masks
Nucleic Acids Res.,
March 18, 2004;
32(5):
e51 - e51.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Siepel and D. Haussler
Phylogenetic Estimation of Context-Dependent Substitution Rates by Maximum Likelihood
Mol. Biol. Evol.,
March 1, 2004;
21(3):
468 - 488.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. A. Chapman, I. J. Donaldson, J. Gilbert, D. Grafham, J. Rogers, A. R. Green, and B. Gottgens
Analysis of Multiple Genomic Sequence Alignments: A Web Resource, Online Tools, and Lessons Learned From Analysis of Mammalian SCL Loci
Genome Res.,
February 1, 2004;
14(2):
313 - 318.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. H. Margulies, M. Blanchette, NISC Comparative Sequencing Program, D. Haussler, and E. D. Green
Identification and Characterization of Multi-Species Conserved Sequences
Genome Res.,
December 1, 2003;
13(12):
2507 - 2518.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|