Published online before print
December 30, 2002, 10.1101/gr.789803
Vol 13, Issue 1, 97-102, January 2003
METHODS
AVID: A Global Alignment Program
Nick Bray1,2,
Inna Dubchak1 and
Lior Pachter2,3
1Lawrence Berkeley National Laboratory, Berkeley,
California 94720, USA; 2Department of Mathematics, University
of California, Berkeley, California 94720, USA
In this paper we describe a new global alignment method called AVID.
The method is designed to be fast, memory efficient, and practical for
sequence alignments of large genomic regions up to megabases long. We
present numerous applications of the method, ranging from the
comparison of assemblies to alignment of large syntenic genomic regions
and whole genome human/mouse alignments. We have also performed a
quantitative comparison of AVID with other popular alignment tools. To
this end, we have established a format for the representation of
alignments and methods for their comparison. These formats and methods
should be useful for future studies. The tools we have developed for
the alignment comparisons, as well as the AVID program, are publicly
available. See Web Site References section for AVID Web address and Web
addresses for other programs discussed in this paper.
3 Corresponding author.
E-MAIL lpachter{at}math.berkeley.edu; FAX (510) 642-8204.
Article and publication are at
http://www.genome.org/cgi/doi/10.1101/gr.789803. Article published online before print in December
2002.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
N. Warthmann, S. Das, C. Lanz, and D. Weigel
Comparative Analysis of the MIR319a MicroRNA Locus in Arabidopsis and Related Brassicaceae
Mol. Biol. Evol.,
May 1, 2008;
25(5):
892 - 902.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. C. Tzika, R. Helaers, Y. Van de Peer, and M. C. Milinkovitch
MANTIS: a phylogenetic framework for multi-species genome comparisons
Bioinformatics,
January 15, 2008;
24(2):
151 - 157.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. M. Bergman and H. Quesneville
Discovering and detecting transposable elements in genome sequences
Brief Bioinform,
November 1, 2007;
8(6):
382 - 392.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Dong, C. Ivascu, H.-D. Chang, P. Wu, R. Angeli, L. Maggi, F. Eckhardt, L. Tykocinski, C. Haefliger, B. Mowes, et al.
IL-10 Is Excluded from the Functional Cytokine Memory of Human CD4+ Memory T Lymphocytes
J. Immunol.,
August 15, 2007;
179(4):
2389 - 2396.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Boyd and A. Naray-Fejes-Toth
Steroid-Mediated Regulation of the Epithelial Sodium Channel Subunits in Mammary Epithelial Cells
Endocrinology,
August 1, 2007;
148(8):
3958 - 3967.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Vishnoi, R. Roy, and A. Bhattacharya
Comparative analysis of bacterial genomes: identification of divergent regions in mycobacterial strains using an anchor-based approach
Nucleic Acids Res.,
June 28, 2007;
35(11):
3654 - 3667.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Huang and D. L. Brutlag
Dynamic use of multiple parameter sets in sequence alignment
Nucleic Acids Res.,
January 28, 2007;
35(2):
678 - 686.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Yuan, A. Tie, M. Tarnopolsky, and M. Bakovic
Genomic organization, promoter activity, and expression of the human choline transporter-like protein 1
Physiol Genomics,
September 14, 2006;
26(1):
76 - 90.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. F. Mulley, C.-h. Chiu, and P. W. H. Holland
Breakup of a homeobox cluster after genome duplication in teleosts
PNAS,
July 5, 2006;
103(27):
10369 - 10372.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. Sauer, E. Shelest, and E. Wingender
Evaluating phylogenetic footprinting for human-rodent comparisons
Bioinformatics,
February 15, 2006;
22(4):
430 - 437.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Huang, S.-P. Yang, A. T. Chinwalla, L. W. Hillier, P. Minx, E. R. Mardis, and R. K. Wilson
Application of a superword array in genome assembly
Nucleic Acids Res.,
January 5, 2006;
34(1):
201 - 205.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Blanco, D. Farre, M. M. Alba, X. Messeguer, and R. Guigo
ABS: a database of Annotated regulatory Binding Sites from orthologous promoters
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D63 - D67.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
W. Huang, D. M. Umbach, and L. Li
Accurate anchoring alignment of divergent sequences
Bioinformatics,
January 1, 2006;
22(1):
29 - 34.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. A. Grice, E. S. Rochelle, E. D. Green, A. Chakravarti, and A. S. McCallion
Evaluation of the RET regulatory landscape reveals the biological relevance of a HSCR-implicated enhancer
Hum. Mol. Genet.,
December 15, 2005;
14(24):
3837 - 3845.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Vanrompay, T. Q. T. Hoang, L. De Vos, K. Verminnen, T. Harkinezhad, K. Chiers, S. A. Morre, and E. Cox
Specific-Pathogen-Free Pigs as an Animal Model for Studying Chlamydia trachomatis Genital Infection
Infect. Immun.,
December 1, 2005;
73(12):
8317 - 8321.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. A. Jones and R. A. Flavell
Distal Enhancer Elements Transcribe Intergenic RNA in the IL-10 Family Gene Cluster
J. Immunol.,
December 1, 2005;
175(11):
7437 - 7446.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
X. Pan, L. Stein, and V. Brendel
SynBrowse: a synteny browser for comparative sequence analysis
Bioinformatics,
September 1, 2005;
21(17):
3461 - 3468.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. P. Hermann and L. L. Heckert
Silencing of Fshr Occurs through a Conserved, Hypersensitive Site in the First Intron
Mol. Endocrinol.,
August 1, 2005;
19(8):
2112 - 2131.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. J. Gaffney and P. D. Keightley
The scale of mutational variation in the murid genome
Genome Res.,
August 1, 2005;
15(8):
1086 - 1094.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Aerts, P. Van Loo, G. Thijs, H. Mayer, R. de Martin, Y. Moreau, and B. De Moor
TOUCAN 2: the all-inclusive open source workbench for regulatory sequence analysis
Nucleic Acids Res.,
July 1, 2005;
33(suppl_2):
W393 - W396.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Berezikov, V. Guryev, and E. Cuppen
CONREAL web server: identification and visualization of conserved transcription factor binding sites
Nucleic Acids Res.,
July 1, 2005;
33(suppl_2):
W447 - W450.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. S. Pappu, E. J. Ostrin, B. W. Middlebrooks, B. T. Sili, R. Chen, M. R. Atkins, R. Gibbs, and G. Mardon
Dual regulation and redundant function of two eye-specific enhancers of the Drosophila retinal determination gene dachshund
Development,
June 15, 2005;
132(12):
2895 - 2905.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Negre, S. Casillas, M. Suzanne, E. Sanchez-Herrero, M. Akam, M. Nefedov, A. Barbadilla, P. de Jong, and A. Ruiz
Conservation of regulatory sequences and gene expression patterns in the disintegrating Drosophila Hox gene complex
Genome Res.,
May 1, 2005;
15(5):
692 - 700.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. D. Wu and C. K. Watanabe
GMAP: a genomic mapping and alignment program for mRNA and EST sequences
Bioinformatics,
May 1, 2005;
21(9):
1859 - 1875.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Bomblies and J. F. Doebley
Molecular Evolution of FLORICAULA/LEAFY Orthologs in the Andropogoneae (Poaceae)
Mol. Biol. Evol.,
April 1, 2005;
22(4):
1082 - 1094.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Richards, Y. Liu, B. R. Bettencourt, P. Hradecky, S. Letovsky, R. Nielsen, K. Thornton, M. J. Hubisz, R. Chen, R. P. Meisel, et al.
Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution
Genome Res.,
January 1, 2005;
15(1):
1 - 18.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I. Ovcharenko, G. G. Loots, B. M. Giardine, M. Hou, J. Ma, R. C. Hardison, L. Stubbs, and W. Miller
Mulan: Multiple-sequence local alignment and visualization for studying function and evolution
Genome Res.,
January 1, 2005;
15(1):
184 - 194.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Belshaw and A. Katzourakis
BlastAlign: a program that uses blast to align problematic nucleotide sequences
Bioinformatics,
January 1, 2005;
21(1):
122 - 123.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J.-Y. Lee, S. F. Baum, J. Alvarez, A. Patel, D. H. Chitwood, and J. L. Bowman
Activation of CRABS CLAW in the Nectaries and Carpels of Arabidopsis
PLANT CELL,
January 1, 2005;
17(1):
25 - 36.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. T. Webster, N. G.C. Smith, M. J. Lercher, and H. Ellegren
Gene Expression, Synteny, and Local Similarity in Human Noncoding Mutation Rates
Mol. Biol. Evol.,
October 1, 2004;
21(10):
1820 - 1830.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. G. Burleigh and S. Mathews
Phylogenetic signal in nucleotide data from seed plants: implications for resolving the seed plant tree of life
Am. J. Botany,
September 1, 2004;
91(10):
1599 - 1613.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. J. Hatcher, N. Y.S.-G. Diman, M.-S. Kim, D. Pennisi, Y. Song, M. M. Goldstein, T. Mikawa, and C. T. Basson
A role for Tbx5 in proepicardial cell migration during cardiogenesis
Physiol Genomics,
July 8, 2004;
18(2):
129 - 140.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. C.E. Darling, B. Mau, F. R. Blattner, and N. T. Perna
Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements
Genome Res.,
July 1, 2004;
14(7):
1394 - 1403.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. A. Frazer, L. Pachter, A. Poliakov, E. M. Rubin, and I. Dubchak
VISTA: computational tools for comparative genomics
Nucleic Acids Res.,
July 1, 2004;
32(suppl_2):
W273 - W279.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. P. Hardy and L. A. J. O'Neill
The Murine Irak2 Gene Encodes Four Alternatively Spliced Isoforms, Two of Which Are Inhibitory
J. Biol. Chem.,
June 25, 2004;
279(26):
27699 - 27708.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Simillion, K. Vandepoele, Y. Saeys, and Y. Van de Peer
Building Genomic Profiles for Uncovering Segmental Homology in the Twilight Zone
Genome Res.,
June 1, 2004;
14(6):
1095 - 1106.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. B. Rankin, W. Xu, D. G. Silberg, and E. Suh
Putative intestine-specific enhancers located in 5' sequence of the CDX1 gene regulate CDX1 expression in the intestine
Am J Physiol Gastrointest Liver Physiol,
May 1, 2004;
286(5):
G872 - G880.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Wang, B. Jones Voy, S. Urs, S. Kim, M. Soltani-Bejnood, N. Quigley, Y.-R. Heo, M. Standridge, B. Andersen, M. Dhar, et al.
The Human Fatty Acid Synthase Gene and De Novo Lipogenesis Are Coordinately Regulated in Human Adipose Tissue
J. Nutr.,
May 1, 2004;
134(5):
1032 - 1038.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Hou, M. Eren, C. A. Painter, J. W. Covington, J. D. Dixon, J. A. Schoenhard, and D. E. Vaughan
Tumor Necrosis Factor {alpha} Activates the Human Plasminogen Activator Inhibitor-1 Gene through a Distal Nuclear Factor {kappa}B Site
J. Biol. Chem.,
April 30, 2004;
279(18):
18127 - 18136.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. J. Kalafus, A. R. Jackson, and A. Milosavljevic
Pash: Efficient Genome-Scale Sequence Anchoring by Positional Hashing
Genome Res.,
April 1, 2004;
14(4):
672 - 678.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Bray and L. Pachter
MAVID: Constrained Ancestral Alignment of Multiple Sequences
Genome Res.,
April 1, 2004;
14(4):
693 - 699.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Chakrabarti and L. Pachter
Visualization of Multiple Genome Annotations and Alignments With the K-BROWSER
Genome Res.,
April 1, 2004;
14(4):
716 - 720.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. E. Abbas and S. P. Holmes
Bioinformatics and Management Science: Some Common Tools and Techniques
Operations Research,
March 1, 2004;
52(2):
165 - 190.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
P. D. Keightley and T. Johnson
MCALIGN: Stochastic Alignment of Noncoding DNA Sequences Based on an Evolutionary Model of Sequence Evolution
Genome Res.,
March 1, 2004;
14(3):
442 - 450.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Istrail, G. G. Sutton, L. Florea, A. L. Halpern, C. M. Mobarry, R. Lippert, B. Walenz, H. Shatkay, I. Dew, J. R. Miller, et al.
Whole-genome shotgun assembly and comparison of human genome assemblies
PNAS,
February 17, 2004;
101(7):
1916 - 1921.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. B. Major and D. A. Jones
Identification of a gadd45{beta} 3' Enhancer That Mediates SMAD3- and SMAD4-dependent Transcriptional Induction by Transforming Growth Factor {beta}
J. Biol. Chem.,
February 13, 2004;
279(7):
5278 - 5287.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. M. Clark, E. Linton, J. Messing, and J. F. Doebley
Inaugural Article: Pattern of diversity in the genomic region near the maize domestication gene tb1
PNAS,
January 20, 2004;
101(3):
700 - 707.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Berezikov, V. Guryev, R. H.A. Plasterk, and E. Cuppen
CONREAL: Conserved Regulatory Elements Anchored Alignment Algorithm for Identification of Transcription Factor Binding Sites by Phylogenetic Footprinting
Genome Res.,
January 1, 2004;
14(1):
170 - 178.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. A. F. T. van Hijum, A. L. Zomer, O. P. Kuipers, and J. Kok
Projector: automatic contig mapping for gap closure purposes
Nucleic Acids Res.,
November 15, 2003;
31(22):
e144 - e144.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Alonso, R. Mahmood, S. Li, F. Cheung, K. Yoda, and P. E. Warburton
Genomic microarray analysis reveals distinct locations for the CENP-A binding domains in three human chromosome 13q32 neocentromeres
Hum. Mol. Genet.,
October 16, 2003;
12(20):
2711 - 2721.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. G. Hogan, L. Chen, J. Nardone, and A. Rao
Transcriptional regulation by calcium, calcineurin, and NFAT
Genes & Dev.,
September 15, 2003;
17(18):
2205 - 2232.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. A. Lyons, H. Wittenburg, R. Li, K. A. Walsh, M. R. Leonard, G. A. Churchill, M. C. Carey, and B. Paigen
New quantitative trait loci that contribute to cholesterol gallstone formation detected in an intercross of CAST/Ei and 129S1/SvImJ inbred mice
Physiol Genomics,
August 15, 2003;
14(3):
225 - 239.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. C.-C. Shih and W.-H. Li
GS-Aligner: A Novel Tool for Aligning Genomic Sequences Using Bit-Level Operations
Mol. Biol. Evol.,
August 1, 2003;
20(8):
1299 - 1309.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Cawley, L. Pachter, and M. Alexandersson
SLAM web server for comparative gene finding and alignment
Nucleic Acids Res.,
July 1, 2003;
31(13):
3507 - 3509.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Bray and L. Pachter
MAVID multiple alignment server
Nucleic Acids Res.,
July 1, 2003;
31(13):
3525 - 3526.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Brudno, C. B. Do, G. M. Cooper, M. F. Kim, E. Davydov, N. C. S. Program, E. D. Green, A. Sidow, and S. Batzoglou
LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNA
Genome Res.,
April 1, 2003;
13(4):
721 - 731.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Alexandersson, S. Cawley, and L. Pachter
SLAM: Cross-Species Gene Finding and Alignment with a Generalized Pair Hidden Markov Model
Genome Res.,
March 1, 2003;
13(3):
496 - 502.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|