Published online before print
December 30, 2002, 10.1101/gr.809403
Vol 13, Issue 1, 103-107, January 2003
METHODS
HumanMouse Alignments with BLASTZ
Scott Schwartz1,
W. James Kent2,
Arian Smit3,
Zheng Zhang4,
Robert Baertsch2,
Ross C. Hardison5,
David Haussler6 and
Webb Miller1,7
1Department of Computer Science and Engineering, The
Pennsylvania State University, University Park, Pennsylvania 16802,
USA; 2Center for Biomolecular Science and Engineering,
University of California, Santa Cruz, California 95064, USA;3
Institute for Systems Biology, Seattle, Washington 98103,
USA; 4Paracel Inc., Pasadena, California 91106, USA;5
Department of Biochemistry and Molecular Biology, The
Pennsylvania State University, University Park, Pennsylvania 16802,
USA; 6Howard Hughes Medical Institute, 321 Applied Sciences,
University of California, Santa Cruz, California 95064, USA
The Mouse Genome Analysis Consortium aligned the human and mouse
genome sequences for a variety of purposes, using alignment programs
that suited the various needs. For investigating issues regarding
genome evolution, a particularly sensitive method was needed to permit
alignment of a large proportion of the neutrally evolving regions. We
selected a program called BLASTZ, an independent implementation of the
Gapped BLAST algorithm specifically designed for aligning two long
genomic sequences. BLASTZ was subsequently modified, both to attain
efficiency adequate for aligning entire mammalian genomes and to
increase its sensitivity. This work describes BLASTZ, its
modifications, the hardware environment on which we run it, and several
empirical studies to validate its results.
7 Corresponding author.
E-MAIL webb{at}bio.cse.psu.edu; FAX (814) 865-3176.
Article and publication are at
http://www.genome.org/cgi/doi/10.1101/gr.809403. Article published online before print in December
2002.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
H-L Wong, W-P Koh, N M Probst-Hensch, D Van den Berg, M C Yu, and S A Ingles
Insulin-like growth factor-1 promoter polymorphisms and colorectal cancer: a functional genomics approach
Gut,
August 1, 2008;
57(8):
1090 - 1096.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J.-T. Li, Y. Zhang, L. Kong, Q.-R. Liu, and L. Wei
Trans-natural antisense transcripts including noncoding RNAs in 10 species: implications for expression regulation
Nucleic Acids Res.,
July 24, 2008;
(2008)
gkn470v1.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Hooghe, P. Hulpiau, F. van Roy, and P. De Bleser
ConTra: a promoter alignment analysis tool for identification of transcription factor binding sites across species
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W128 - W132.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. Gotea and I. Ovcharenko
DiRE: identifying distant regulatory elements of co-expressed genes
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W133 - W139.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Lu, Y. Fu, S. Kumar, Y. Shen, K. Zeng, A. Xu, R. Carthew, and C.-I Wu
Adaptive Evolution of Newly Emerged Micro-RNA Genes in Drosophila
Mol. Biol. Evol.,
May 1, 2008;
25(5):
929 - 938.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Stanke, M. Diekhans, R. Baertsch, and D. Haussler
Using native and syntenically mapped cDNA alignments to improve de novo gene finding
Bioinformatics,
March 1, 2008;
24(5):
637 - 644.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Lunter, A. Rocco, N. Mimouni, A. Heger, A. Caldeira, and J. Hein
Uncertainty in homology inferences: Assessing and improving genomic sequence alignment
Genome Res.,
February 1, 2008;
18(2):
298 - 309.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. A. Ayoub and C. Y. Hayashi
Multiple Recombining Loci Encode MaSp1, the Primary Constituent of Dragline Silk, in Widow Spiders (Latrodectus: Theridiidae)
Mol. Biol. Evol.,
February 1, 2008;
25(2):
277 - 286.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Wang, J. Gu, M. Q. Zhang, and Y. Li
Identification of phylogenetically conserved microRNA cis-regulatory elements across 12 Drosophila species
Bioinformatics,
January 15, 2008;
24(2):
165 - 171.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Bao, M. Zhou, and Y. Cui
CTCFBSDB: a CTCF-binding site database for characterization of vertebrate genomic insulators
Nucleic Acids Res.,
January 11, 2008;
36(suppl_1):
D83 - D87.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Matsuya, R. Sakate, Y. Kawahara, K. O. Koyanagi, Y. Sato, Y. Fujii, C. Yamasaki, T. Habara, H. Nakaoka, F. Todokoro, et al.
Evola: Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees
Nucleic Acids Res.,
January 11, 2008;
36(suppl_1):
D787 - D792.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. Miller, K. Rosenbloom, R. C. Hardison, M. Hou, J. Taylor, B. Raney, R. Burhans, D. C. King, R. Baertsch, D. Blankenberg, et al.
28-Way vertebrate alignment and conservation track in the UCSC Genome Browser
Genome Res.,
December 1, 2007;
17(12):
1797 - 1808.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al.
Targeted discovery of novel human exons by comparative genomics
Genome Res.,
December 1, 2007;
17(12):
1763 - 1773.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Wang, J. Zeng, C. B. Lowe, R. G. Sellers, S. R. Salama, M. Yang, S. M. Burgess, R. K. Brachmann, and D. Haussler
Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53
PNAS,
November 20, 2007;
104(47):
18613 - 18618.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Spudich, X. M. Fernandez-Suarez, and E. Birney
Genome browsing with Ensembl: a practical overview
Brief Funct Genomic Proteomic,
October 29, 2007;
(2007)
elm025v1.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. N. Bainbridge, R. L. Warren, A. He, M. Bilenky, A. G. Robertson, and S. J.M. Jones
THOR: targeted high-throughput ortholog reconstructor
Bioinformatics,
October 1, 2007;
23(19):
2622 - 2624.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Szklarczyk, J. Heringa, S. K. Pond, and A. Nekrutenko
Rapid asymmetric evolution of a dual-coding tumor suppressor INK4a/ARF locus contradicts its function
PNAS,
July 31, 2007;
104(31):
12807 - 12812.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Asthana, W. S. Noble, G. Kryukov, C. E. Grant, S. Sunyaev, and J. A. Stamatoyannopoulos
Widely distributed noncoding purifying selection in the human genome
PNAS,
July 24, 2007;
104(30):
12410 - 12415.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Brudno, A. Poliakov, S. Minovitsky, I. Ratnere, and I. Dubchak
Multiple whole genome alignments and novel biomedical applications at the VISTA portal
Nucleic Acids Res.,
July 13, 2007;
35(suppl_2):
W669 - W674.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. J. Ho Sui, D. L. Fulton, D. J. Arenillas, A. T. Kwon, and W. W. Wasserman
oPOSSUM: integrated tools for analysis of regulatory motif over-representation
Nucleic Acids Res.,
July 13, 2007;
35(suppl_2):
W245 - W252.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. B. Palmer, P. Majumder, M. R. Green, P. A. Wade, and J. M. Boss
A 3' Enhancer Controls Snail Expression in Melanoma Cells
Cancer Res.,
July 1, 2007;
67(13):
6113 - 6120.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. Lunter
Probabilistic whole-genome alignments reveal high indel rates in the human and mouse genomes
Bioinformatics,
July 1, 2007;
23(13):
i289 - i296.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Garg and P. Green
Differing patterns of selection in alternative and constitutive splice sites
Genome Res.,
July 1, 2007;
17(7):
1015 - 1022.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Vishnoi, R. Roy, and A. Bhattacharya
Comparative analysis of bacterial genomes: identification of divergent regions in mycobacterial strains using an anchor-based approach
Nucleic Acids Res.,
June 28, 2007;
35(11):
3654 - 3667.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. O. Korbel, A. E. Urban, F. Grubert, J. Du, T. E. Royce, P. Starr, G. Zhong, B. S. Emanuel, S. M. Weissman, M. Snyder, et al.
Systematic prediction and validation of breakpoints associated with copy-number variants in the human genome
PNAS,
June 12, 2007;
104(24):
10110 - 10115.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. H. Margulies, G. M. Cooper, G. Asimenos, D. J. Thomas, C. N. Dewey, A. Siepel, E. Birney, D. Keefe, A. S. Schwartz, M. Hou, et al.
Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome
Genome Res.,
June 1, 2007;
17(6):
760 - 774.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. C. King, J. Taylor, Y. Zhang, Y. Cheng, H. A. Lawson, J. Martin, ENCODE groups for Transcriptional Regulation and M, F. Chiaromonte, W. Miller, and R. C. Hardison
Finding cis-regulatory elements using comparative genomics: Some lessons from ENCODE data
Genome Res.,
June 1, 2007;
17(6):
775 - 786.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. B. Lowe, G. Bejerano, and D. Haussler
Thousands of human mobile element fragments undergo strong purifying selection near developmental genes
PNAS,
May 8, 2007;
104(19):
8005 - 8010.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Ponjavic, C. P. Ponting, and G. Lunter
Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs
Genome Res.,
May 1, 2007;
17(5):
556 - 565.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Wang, Y. Zhang, Y. Cheng, Y. Zhou, D. C. King, J. Taylor, F. Chiaromonte, J. Kasturi, H. Petrykowska, B. Gibb, et al.
Experimental validation of predicted mammalian erythroid cis-regulatory modules
Genome Res.,
December 1, 2006;
16(12):
1480 - 1492.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Johnson, R. J. Gamblin, L. Ooi, A. W. Bruce, I. J. Donaldson, D. R. Westhead, I. C. Wood, R. M. Jackson, and N. J. Buckley
Identification of the REST regulon reveals extensive transposable element-mediated binding site duplication
Nucleic Acids Res.,
September 1, 2006;
34(14):
3862 - 3877.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. T. P. Gilbert, J. Binladen, W. Miller, C. Wiuf, E. Willerslev, H. Poinar, J. E. Carlson, J. H. Leebens-Mack, and S. C. Schuster
Recharacterization of ancient DNA miscoding lesions: insights in the era of sequencing-by-synthesis
Nucleic Acids Res.,
August 18, 2006;
(2006)
gkl483v1.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. GuhaThakurta
Computational identification of transcriptional regulatory elements in DNA sequence
Nucleic Acids Res.,
July 19, 2006;
34(12):
3585 - 3598.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. N. Dewey and L. Pachter
Evolution at the nucleotide level: the problem of multiple whole-genome alignment.
Hum. Mol. Genet.,
April 15, 2006;
15(suppl_1):
R51 - R56.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Elango, J. W. Thomas, NISC Comparative Sequencing Program, and S. V. Yi
Variable molecular clocks in hominoids
PNAS,
January 31, 2006;
103(5):
1370 - 1375.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. N. Poinar, C. Schwarz, J. Qi, B. Shapiro, R. D. E. MacPhee, B. Buigues, A. Tikhonov, D. H. Huson, L. P. Tomsho, A. Auch, et al.
Metagenomics to Paleogenomics: Large-Scale Sequencing of Mammoth DNA
Science,
January 20, 2006;
311(5759):
392 - 394.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. SCHATTNER, S. BARBERAN-SOLER, and T. M. LOWE
A computational screen for mammalian pseudouridylation guide H/ACA RNAs
RNA,
January 1, 2006;
12(1):
15 - 25.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Tran, P. Havlak, and J. Miller
MicroRNA enrichment among short 'ultraconserved' sequences in insects.
Nucleic Acids Res.,
January 1, 2006;
34(9):
e65 - e65.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. McAlinden, N. Havlioglu, L. Liang, S. R. Davies, and L. J. Sandell
Alternative Splicing of Type II Procollagen Exon 2 Is Regulated by the Combination of a Weak 5' Splice Site and an Adjacent Intronic Stem-loop Cis Element
J. Biol. Chem.,
September 23, 2005;
280(38):
32700 - 32711.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Baek and P. Green
Sequence conservation, relative isoform frequencies, and nonsense-mediated decay in evolutionarily conserved alternative splicing
PNAS,
September 6, 2005;
102(36):
12813 - 12818.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. J. Welch, J. A. Watts, C. R. Vakoc, Y. Yao, H. Wang, R. C. Hardison, G. A. Blobel, L. A. Chodosh, and M. J. Weiss
Global regulation of erythroid gene expression by transcription factor GATA-1
Blood,
November 15, 2004;
104(10):
3136 - 3147.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|