Genome Res. 14:1188-1190, 2004
©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Resources
WebLogo: A Sequence Logo Generator
Gavin E. Crooks1,
Gary Hon1,
John-Marc Chandonia2 and
Steven E. Brenner1,2,3
1 Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
2 Berkeley Structural Genomics Center, Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
 |
ABSTRACT
|
|---|
WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization.
Sequence logos were invented by Tom Schneider and Mike Stephens (Schneider and Stephens 1990 ; Shaner et al. 1993 ) to display patterns in sequence conservation, and to assist in discovering and analyzing those patterns. As an example, the accompanying figure (Fig. 1) shows how WebLogo can help interpret the sequence-specific binding of the protein CAP to its DNA recognition site (Schultz et al. 1991 ). Homodimeric DNA-binding proteins typically display a symmetric double hump in the DNA binding-site logo (Schneider and Stephens 1990 ), as shown in the figure. Deviations from this basic pattern can indicate additional features; a highly conserved residue in the center of such a pattern may indicate DNA distortion or base flipping (Schneider 2001 ); an unexpectedly high-sequence conservation may be due to overlapping binding sites (Schneider et al. 1986 ). Protein logos can illuminate patterns of amino acid conservation that are often of structural or functional importance (Galperin et al. 2001 ; Rigden et al. 2003 ). Sequence logos have also been used to display patterns in the BLOCKS protein sequence database (Henikoff et al. 1995 ), and in DNA-binding site motifs (Robison et al. 1998 ; Nelson et al. 2002 ), to analyze splice sites (Stephens and Schneider 1992 ; Emmert et al. 2001 ), and in a variety of other contexts. Additional examples, and the raw data for the example presented here, can be found on the WebLogo examples page (http://weblogo.berkeley.edu/examples.html).

View larger version (44K):
[in this window]
[in a new window]
|
Figure 1 (A) CAP (Catabolite Activator Protein, also known as CRP) acts as a transcription promoter by binding at more than 100 sites within the Escherichia coli genome. We rendered the PDB structure 1CGP
[PDB]
(Schultz et al. 1991 ) using Chimera (Huang et al. 1996 ). (B) The two DNA recognition helices of the CAP homodimer insert themselves into consecutive turns of the major groove. Several consequences can be observed in this CAP binding-site logo. The logo is approximately palindromic, which provides two very similar recognition sites, one for each subunit of the dimer. However, the binding site lacks perfect symmetry, possibly due to the inherent asymmetry of the operon promoter region. The displacement of the two halves is 11 bp, or approximately one full turn of the DNA helix. Additional interactions occur between the protein and the first and last two bases within the DNA minor groove, where the protein cannot easily distinguish A from T, or G from C (Seeman et al. 1976 ). The data for this logo consists of 59 binding sites determined by DNA footprinting (Robison et al. 1998 ). (C) The helix-turn-helix motif from the CAP family of homodimeric DNA binding proteins (Brennan and Matthews 1989 ; Schultz et al. 1991 ). Positions 180, 181, and 185 are known to interact directly with bases in the major groove (Schultz et al. 1991 ; Parkinson et al. 1996 ) and are critical to the sequence-specific binding of the protein. The conserved glycine at position 177 is located inside of the turn between the helices, where packing effects prevent the insertion of a side chain. Partially or completely buried positions (labeled B) frequently contain hydrophobic amino acids, which are colored black. The data for this logo consists of 100 sequences from the full Pfam (Bateman et al. 2002 ) alignment of this family (Accession no. PF00325). We removed a few sequences with rare insertions for convenience.
|
|
The logo generation form (http://weblogo.berkeley.edu/logo.cgi) can process RNA, DNA, or protein multiple sequence alignments provided in either FASTA (Pearson and Lipman 1988 ) or CLUSTAL (Higgins and Sharp 1988 ) formats. If the user does not explicitly specify the sequence type, then WebLogo will make a determination on the basis of the symbols found within the sequences. A logo represents each column of the alignment by a stack of letters, with the height of each letter proportional to the observed frequency of the corresponding amino acid or nucleotide, and the overall height of each stack proportional to the sequence conservation, measured in bits, at that position. The letters of each stack are ordered from most to least frequent, so that one may read the consensus sequence from the tops of the stacks. For example, the figure shows that the CAP bindingsite consensus sequence is AA-TGTGA------TCACA-TT.
Schneider and Stephens (1990 ) define the sequence conservation at a particular position in the alignment, Rseq, as the difference between the maximum possible entropy and the entropy of the observed symbol distribution:
Here, pn is the observed frequency of symbol n at a particular sequence position and N is the number of distinct symbols for the given sequence type, either four for DNA/RNA or 20 for protein. Consequently, the maximum sequence conservation per site is log2 4 = 2 bits for DNA/RNA and log2 20 4.32 bits for proteins. If we neglect the intersite correlations and assume a uniform background symbol distribution, then the total entropy of the logo, the sum of the sequence conservation at each position, measures the information content of the logo. For binding sites, this total entropy has, in many cases, been shown to be approximately equal to the amount of information needed to locate the binding site within the relevant stretch of DNA (Schneider et al 1986 ). For a nonuniform background distribution, such as found in protein sequences or the genomes of many hyperthermophiles, the information content would be given by the relative entropy between the observed and background distributions (Cover and Thomas 1991 ; Gorodkin et al. 1997 ; Stormo 1998 ).
Limited sequence data results in a systematic underestimation of the entropy, which becomes significant if the multiple alignment contains fewer than about 20 nucleotide or 40 protein sequences. By default, WebLogo incorporates a small sample correction (Schneider et al. 1986 ), which can, in part, ameliorate this bias. In addition, WebLogo can optionally display error bars with heights twice this correction, which gives some idea of the sampling errors made. Note that the error bars may not have uniform height across the logo, as the magnitude of the small sample correction depends on the number of symbols observed at each position. This will vary due to the presence of gaps in the alignment.
A standard sequence logo does not provide any indication of correlations between different positions of the alignment. In general, such intersite correlations are relatively insignificant in biological sequences (Schneider 1997 ; Stormo 1998 ), but there are exceptions, such as base-paired sites in folded RNA structures. Structural logos (Gorodkin et al. 1997 ), an extension of the sequence logo idea, display part of this additional level of detail.
The symbols that compose the stacks display colors according to the chemical species they represent. The default colors for nucleotides are G, orange; T and U, red; C, blue; and A, green. Amino acids have colors according to their chemical properties (Lewin 1994 ); polar amino acids (G, S, T, Y, C, Q, N) show as green, basic (K, R, H) blue, acidic (D, E) red, and hydrophobic (A, V, L, I, P, W, F, M) amino acids as black. The user may customize the coloring scheme, or select a simple black and white option.
WebLogo can create output in several common graphics' formats, including the bitmap formats GIF and PNG, suitable for on-screen display, and the vector formats EPS and PDF, more suitable for printing, publication, and further editing. Additional graphics options include bitmap resolution, titles, optional axis, and axis labels, antialiasing, error bars, and alternative symbol formats.
The Web site is available to all users without fee. Those who would prefer to run WebLogo on a local server may obtain a command line interface version with source code (distributed under an Open Source license). We welcome bug reports and suggestions for additional features. Please send these to logo{at}compbio.berkeley.edu.
 |
Acknowledgements
|
|---|
WebLogo uses PostScript code and ideas from the programs alpro and makelogo, both part of Tom Schneider's delila package (Schneider et al. 1982 ). Many thanks to him for making this software freely available, for encouraging its use, and for feedback on WebLogo. We are also grateful for the enthusiastic encouragement of Michael Galperin. Grants from the NIH (1-K22-HG00056) and the Searle Scholars program (01-L-116) support this work. JMC was supported by NIH grant 1-P50-GM62412, and DOE contract DE-AC03-76SF00098. GEC received funding from the Sloan/DOE postdoctoral fellowship in computational molecular biology.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
 |
Footnotes
|
|---|
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.849004.
3 Corresponding author. E-MAIL brenner{at}compbio.berkeley.edu; FAX (208) 279-8978. 
 |
REFERENCES
|
|---|
Bateman, A., Birney, E., Cerruti, L., Durbin, R., Etwiller, L., Eddy, S.R., Griffiths-Jones, S., Howe, K.L., Marshall, M., and Sonnhammer, E.L. 2002. The Pfam protein families database. Nucleic Acids Res. 30: 276280.[Abstract/Free Full Text]
Brennan, R.G. and Matthews, B.W. 1989. Structural basis of DNA-protein recognition. Trends in Biochem. Sci. 14: 286290.[CrossRef][Medline]
Cover, T.M. and Thomas, J.A. 1991. Elements of information theory. John Wiley & Sons, New York.
Emmert, S., Schneider, T.D., Khan, S.G., and Kraemer, K.H. 2001. The human XPG gene: Gene architecture, alternative splicing and single nucleotide polymorphisms. Nucleic Acids Res. 29: 14431452.[Abstract/Free Full Text]
Galperin, M.Y., Nikolskaya, A.N., and Koonin, E.V. 2001. Novel domains of the prokaryotic two-component signal transduction systems. FEMS Microbiol. Lett. 203: 1121.[CrossRef][Medline]
Gorodkin, J., Heyer, L.J., Brunak, S., and Stormo, G.D. 1997. Displaying the information contents of structural RNA alignments: The structure logos. Comput. Appl. Biosci. 13: 583586.[Abstract/Free Full Text]
Henikoff, S., Henikoff, J.G., Alford, W.J., and Pietrokovski, S. 1995. Automated construction and graphical presentation of protein blocks from unaligned sequences. Gene 163: GC17GC26.[CrossRef][Medline]
Higgins, D.G. and Sharp, P.M. 1988. CLUSTAL: A package for performing multiple sequence alignment on a microcomputer. Gene 73: 237244.[CrossRef][Medline]
Huang, C.C., Couch, G.S., Pettersen, E.F., and Ferrin, T.E. 1996. Chimera: An extensible molecular modeling application constructed using standard components. In Pacific symposium on biocomputing, Vol. 1, pp. 724. http://www.cgl.ucsf.edu/chimera.
Lewin, B. 1994. Genes V. Oxford University Press, New York.
Nelson, P.S., Clegg, N., Arnold, H., Ferguson, C., Bonham, M., White, J., Hood, L., and Lin, B. 2002. The program of androgen-responsive genes in neoplastic prostate epithelium. Proc. Natl. Acad. Sci. 99: 1189011895.[Abstract/Free Full Text]
Parkinson, G., Gunasekera, A., Vojtechovsky, J., Zhang, X., Kunkel, T.A., Berman, H., and Ebright, R.H. 1996. Aromatic hydrogen bond in sequence-specific protein DNA recognition. Nat. Struct. Biol. 3: 837841.[CrossRef][Medline]
Pearson, W.R. and Lipman, D.J. 1988. Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. 85: 24442448.[Abstract/Free Full Text]
Rigden, D.J., Jedrzejas, M.J., and Galperin, M.Y. 2003. An extracellular calcium-binding domain in bacteria with a distant relationship to EF-hands. FEMS Microbiol. Lett. 221: 103110.[CrossRef][Medline]
Robison, K., McGuire, A.M., and Church, G.M. 1998. A comprehensive library of DNA-binding site matrices for 55 proteins applied to the complete Escherichia coli K-12 genome. J. Mol. Biol. 284: 241254.[CrossRef][Medline]
Schneider, T.D. 1997. Information content of individual genetic sequences. J. Theor. Biol. 189: 427441.[CrossRef][Medline]
Schneider, T.D. 2001. Strong minor groove base conservation in sequence logos implies DNA distortion or base flipping during replication and transcription initiation. Nucleic Acid Res. 29: 48814891.[Abstract/Free Full Text]
Schneider, T.D. and Stephens, R.M. 1990. Sequence logos: A new way to display consensus sequences. Nucleic Acids Res. 18: 60976100.[Abstract/Free Full Text]
Schneider, T.D., Stormo, G.D., Haemer, J.S., and Gold, L. 1982. A design for computer nucleic-acid sequence storage, retrieval, and manipulation. Nucleic Acids Res. 10: 30133024.[Abstract/Free Full Text]
Schneider, T.D., Stormo, G.D., Gold, L., and Ehrenfeucht, A. 1986. Information content of binding sites on nucleotide sequences. J. Mol. Biol. 188: 415431.[CrossRef][Medline]
Schultz, S.C., Shields, G.C., and Steitz, T.A. 1991. Crystal structure of a CAP-DNA complex: The DNA is bent by 90°. Science 253: 10011007.[Abstract/Free Full Text]
Seeman, N.C., Rosenberg, J.M., and Rich, A. 1976. Sequence-specific recognition of double helical nucleic acids by proteins. Proc. Natl. Acad. Sci. 73: 804808.[Abstract/Free Full Text]
Shaner, M.C., Blair, I.M., and Schneider, T.D. 1993. Sequence logos: A powerful, yet simple, tool. Proceedings of the twenty-sixth annual Hawaii international conference on system sciences. In Architecture and biotechnology computing (eds. T.N. Mudge et al.) Vol 1., pp. 813821. IEEE Computer Society Press, Los Alamitos, CA.
Stephens, R.M. and Schneider, T.D. 1992. Features of spliceosome evolution and function inferred from an analysis of the information at human splice sites. J. Mol. Biol. 228: 11241136.[CrossRef][Medline]
Stormo, G.D. 1998. Information content and free energy in DNA-protein interactions. J. Theor. Biol. 195: 135137.[CrossRef][Medline]
 |
WEB SITE REFERENCES
|
|---|
http://weblogo.berkeley.edu/; WebLogo: A Sequence Logo Generator.
Received September 26, 2002;
accepted in revised format January 6, 2004.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
J. M. Miano
Deck of CArGs
Circ. Res.,
July 3, 2008;
103(1):
13 - 15.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Martinelli, P. Torreri, M. Tinti, L. Stella, G. Bocchinfuso, E. Flex, A. Grottesi, M. Ceccarini, A. Palleschi, G. Cesareni, et al.
Diverse driving forces underlie the invariant occurrence of the T42A, E139D, I282V and T468M SHP2 amino acid substitutions causing Noonan and LEOPARD syndromes
Hum. Mol. Genet.,
July 1, 2008;
17(13):
2018 - 2029.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Torkamani, N. Kannan, S. S. Taylor, and N. J. Schork
Congenital disease SNPs target lineage specific structural elements in protein kinases
PNAS,
July 1, 2008;
105(26):
9011 - 9016.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N.-O. Chimge, A. V. Makeyev, F. H. Ruddle, and D. Bayarsaihan
Identification of the TFII-I family target genes in the vertebrate genome
PNAS,
July 1, 2008;
105(26):
9006 - 9010.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T.-H. Chang, J.-T. Horng, and H.-D. Huang
RNALogo: a new approach to display structural RNA alignment
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W91 - W96.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. Gotea and I. Ovcharenko
DiRE: identifying distant regulatory elements of co-expressed genes
Nucleic Acids Res.,
July 1, 2008;
36(suppl_2):
W133 - W139.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
O. Gillor, J. A. C. Vriezen, and M. A. Riley
The role of SOS boxes in enteric bacteriocin regulation
Microbiology,
June 1, 2008;
154(6):
1783 - 1792.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. C. Roemer, J. Adelman, M. E. A. Churchill, and D. P. Edwards
Mechanism of high-mobility group protein B enhancement of progesterone receptor sequence-specific DNA binding
Nucleic Acids Res.,
June 1, 2008;
36(11):
3655 - 3666.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Ichiyanagi and N. Okada
Mobility Pathways for Vertebrate L1, L2, CR1, and RTE Clade Retrotransposons
Mol. Biol. Evol.,
June 1, 2008;
25(6):
1148 - 1157.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Elbaz, T. Salomon, and S. Schuldiner
Identification of a Glycine Motif Required for Packing in EmrE, a Multidrug Transporter from Escherichia coli
J. Biol. Chem.,
May 2, 2008;
283(18):
12276 - 12283.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. B. Noyes, X. Meng, A. Wakabayashi, S. Sinha, M. H. Brodsky, and S. A. Wolfe
A systematic characterization of factors that regulate Drosophila segmentation via a bacterial one-hybrid system
Nucleic Acids Res.,
May 1, 2008;
36(8):
2547 - 2560.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Shen, G. Ji, B. J. Haas, X. Wu, J. Zheng, G. J. Reese, and Q. Q. Li
Genome level analysis of rice mRNA 3'-end processing signals and alternative polyadenylation
Nucleic Acids Res.,
May 1, 2008;
36(9):
3150 - 3161.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Kalanon and G. I. McFadden
The Chloroplast Protein Translocation Complexes of Chlamydomonas reinhardtii: A Bioinformatic Comparison of Toc and Tic Components in Plants, Green Algae and Red Algae
Genetics,
May 1, 2008;
179(1):
95 - 112.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Shen, Y. Liu, L. Liu, C. Liang, and Q. Q. Li
Unique Features of Nuclear mRNA Poly(A) Signals and Alternative Polyadenylation in Chlamydomonas reinhardtii
Genetics,
May 1, 2008;
179(1):
167 - 176.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. L. Houtz, R. Magnani, N. R. Nayak, and L. M. A. Dirk
Co- and post-translational modifications in Rubisco: unanswered questions
J. Exp. Bot.,
May 1, 2008;
59(7):
1635 - 1645.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Hu, B. G. Zimmermann, H. Zhou, J. Wang, B. S. Henson, W. Yu, D. Elashoff, G. Krupp, and D. T. Wong
Exon-Level Expression Profiling: A Comprehensive Transcriptome Analysis of Oral Fluids
Clin. Chem.,
May 1, 2008;
54(5):
824 - 832.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. A. Glazov, S. McWilliam, W. C. Barris, and B. P. Dalrymple
Origin, Evolution, and Biological Role of miRNA Cluster in DLK-DIO3 Genomic Region in Placental Mammals
Mol. Biol. Evol.,
May 1, 2008;
25(5):
939 - 948.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. A. Smyth, D. Di Lorenzo, and B. N. Kennedy
A Novel, Evolutionarily Conserved Enhancer of Cone Photoreceptor-specific Expression
J. Biol. Chem.,
April 18, 2008;
283(16):
10881 - 10891.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. M. Caffrey, H. S. Park, J. Been, P. Gordon, C. W. Sensen, and G. Voordouw
Gene Expression by the Sulfate-Reducing Bacterium Desulfovibrio vulgaris Hildenborough Grown on an Iron Electrode under Cathodic Protection Conditions
Appl. Envir. Microbiol.,
April 15, 2008;
74(8):
2404 - 2413.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Szurmant, L. Bu, C. L. Brooks III/, and J. A. Hoch
An essential sensor histidine kinase controlled by transmembrane helix interactions with its auxiliary proteins
PNAS,
April 15, 2008;
105(15):
5891 - 5896.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. A. Zaini, A. C. Fogaca, F. G. N. Lupo, H. I. Nakaya, R. Z. N. Vencio, and A. M. da Silva
The Iron Stimulon of Xylella fastidiosa Includes Genes for Type IV Pilus and Colicin V-Like Bacteriocins
J. Bacteriol.,
April 1, 2008;
190(7):
2368 - 2378.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Graindorge, O. Le Tonqueze, R. Thuret, N. Pollet, H. B. Osborne, and Y. Audic
Identification of CUG-BP1/EDEN-BP target mRNAs in Xenopus tropicalis
Nucleic Acids Res.,
April 1, 2008;
36(6):
1861 - 1870.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. A. Rodionov, X. Li, I. A. Rodionova, C. Yang, L. Sorci, E. Dervyn, D. Martynowski, H. Zhang, M. S. Gelfand, and A. L. Osterman
Transcriptional regulation of NAD metabolism in bacteria: genomic reconstruction of NiaR (YrxA) regulon
Nucleic Acids Res.,
April 1, 2008;
36(6):
2032 - 2046.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. A. Rodionov, J. De Ingeniis, C. Mancini, F. Cimadamore, H. Zhang, A. L. Osterman, and N. Raffaelli
Transcriptional regulation of NAD metabolism in bacteria: NrtR family of Nudix-related regulators
Nucleic Acids Res.,
April 1, 2008;
36(6):
2047 - 2059.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Gao, A. Masuda, T. Matsuura, and K. Ohno
Human branch point consensus sequence is yUnAy
Nucleic Acids Res.,
April 1, 2008;
36(7):
2257 - 2267.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Huang, L. Li, C. Wu, D. Schibli, K. Colwill, S. Ma, C. Li, P. Roy, K. Ho, Z. Songyang, et al.
Defining the Specificity Space of the Human Src Homology 2 Domain
Mol. Cell. Proteomics,
April 1, 2008;
7(4):
768 - 784.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. He and J. Parkinson
SubSeqer: a graph-based approach for the detection and identification of repetitive elements in low-complexity sequences
Bioinformatics,
April 1, 2008;
24(7):
1016 - 1017.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. van Bakel, F. J. van Werven, M. Radonjic, M. O. Brok, D. van Leenen, F. C. P. Holstege, and H. T. M. Timmers
Improved genome-wide localization by ChIP-chip using double-round T7 RNA polymerase-based amplification
Nucleic Acids Res.,
March 27, 2008;
36(4):
e21 - e21.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Jerg and U. Gerischer
Relevance of nucleotides of the PcaU binding site from Acinetobacter baylyi
Microbiology,
March 1, 2008;
154(3):
756 - 766.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Li, R. L. Bass, and Y. Liang
fdrMotif: identifying cis-elements by an EM algorithm coupled with false discovery rate control
Bioinformatics,
March 1, 2008;
24(5):
629 - 636.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Kundu-Michalik, M.-A. Bisotti, E. Lipsius, A. Bauche, A. Kruppa, T. Klokow, G. Kammler, and J. Kruppa
Nucleolar Binding Sequences of the Ribosomal Protein S6e Family Reside in Evolutionary Highly Conserved Peptide Clusters
Mol. Biol. Evol.,
March 1, 2008;
25(3):
580 - 590.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Horvath, D. A. Romero, A.-C. Coute-Monvoisin, M. Richards, H. Deveau, S. Moineau, P. Boyaval, C. Fremaux, and R. Barrangou
Diversity, Activity, and Evolution of CRISPR Loci in Streptococcus thermophilus
J. Bacteriol.,
February 15, 2008;
190(4):
1401 - 1412.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. P. da Rocha, A. C. de Miranda Paquola, M. do Valle Marques, C. F. M. Menck, and R. S. Galhardo
Characterization of the SOS Regulon of Caulobacter crescentus
J. Bacteriol.,
February 15, 2008;
190(4):
1209 - 1218.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I. E. Sanchez, M. Dellarole, K. Gaston, and G. de Prat Gay
Comprehensive comparison of the interaction of the E2 master regulator with its cognate target DNA sites in 73 human papillomavirus types by sequence statistics
Nucleic Acids Res.,
February 11, 2008;
36(3):
756 - 769.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y.-L. Tzeng, C. M. Kahler, X. Zhang, and D. S. Stephens
MisR/MisS Two-Component Regulon in Neisseria meningitidis
Infect. Immun.,
February 1, 2008;
76(2):
704 - 716.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Reinke, C. Saini, F. Fleury-Olela, C. Dibner, I. J. Benjamin, and U. Schibler
Differential display of DNA-binding proteins reveals heat-shock factor 1 as a circadian transcription factor
Genes & Dev.,
February 1, 2008;
22(3):
331 - 345.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. H. Yeats and J. K.C. Rose
The biochemistry and biology of extracellular plant lipid-transfer proteins (LTPs)
Protein Sci.,
February 1, 2008;
17(2):
191 - 198.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
U. J. Pape, S. Rahmann, and M. Vingron
Natural similarity measures between position frequency matrices with an application to clustering
Bioinformatics,
February 1, 2008;
24(3):
350 - 357.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Occhino, F. Ghiotto, S. Soro, M. Mortarino, S. Bosi, M. Maffei, S. Bruno, M. Nardini, M. Figini, A. Tramontano, et al.
Dissecting the Structural Determinants of the Interaction between the Human Cytomegalovirus UL18 Protein and the CD85j Immune Receptor
J. Immunol.,
January 15, 2008;
180(2):
957 - 968.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G.-Q. Hu, X. Zheng, Y.-F. Yang, P. Ortet, Z.-S. She, and H. Zhu
ProTISA: a comprehensive resource for translation initiation site annotation in prokaryotic genomes
Nucleic Acids Res.,
January 11, 2008;
36(suppl_1):
D114 - D119.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. D. Rawlings, F. R. Morton, C. Y. Kok, J. Kong, and A. J. Barrett
MEROPS: the peptidase database
Nucleic Acids Res.,
January 11, 2008;
36(suppl_1):
D320 - D325.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|