Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wolf, Y. I.
Right arrow Articles by Koonin, E. V.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wolf, Y. I.
Right arrow Articles by Koonin, E. V.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 9, Issue 1, 17-26, January 1999

RESEARCH
Distribution of Protein Folds in the Three Superkingdoms of Life

Yuri I. Wolf,1,4 Steven E. Brenner,2 Paul A. Bash,3 and Eugene V. Koonin1,5

1 National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894 USA; 2 Department of Structural Biology, Stanford University, Stanford, California 94305-5126 USA; 3 Department of Molecular Pharmacology and Biological Chemistry, Northwestern University, Chicago, Illinois 60611 USA

A sensitive protein-fold recognition procedure was developed on the basis of iterative database search using the PSI-BLAST program. A collection of 1193 position-dependent weight matrices that can be used as fold identifiers was produced. In the completely sequenced genomes, folds could be automatically identified for 20%-30% of the proteins, with 3%-6% more detectable by additional analysis of conserved motifs. The distribution of the most common folds is very similar in bacteria and archaea but distinct in eukaryotes. Within the bacteria, this distribution differs between parasitic and free-living species. In all analyzed genomes, the P-loop NTPases are the most abundant fold. In bacteria and archaea, the next most common folds are ferredoxin-like domains, TIM-barrels, and methyltransferases, whereas in eukaryotes, the second to fourth places belong to protein kinases, beta -propellers and TIM-barrels. The observed diversity of protein folds in different proteomes is approximately twice as high as it would be expected from a simple stochastic model describing a proteome as a finite sample from an infinite pool of proteins with an exponential distribution of the fold fractions. Distribution of the number of domains with different folds in one protein fits the geometric model, which is compatible with the evolution of multidomain proteins by random combination of domains.

[Fold predictions for proteins from 14 proteomes are available on the World Wide Web at ftp://ncbi.nlm.nih.gov/pub/koonin/FOLDS/index.html. The FIDs are available by anonymous ftp at the same location.]


4   Permanent address: Institute of Cytology and Genetics, Russian Academy of Sciences, Novosibirsk 630090, Russia.
5   Corresponding author.


9:17-26 ©1999 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/99 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
EndocrinologyHome page
K. N. Jha, I. A. Shumilin, L. C. Digilio, O. Chertihin, H. Zheng, G. Schmitz, P. E. Visconti, C. J. Flickinger, W. Minor, and J. C. Herr
Biochemical and Structural Characterization of Apolipoprotein A-I Binding Protein, a Novel Phosphoprotein with a Potential Role in Sperm Capacitation
Endocrinology, May 1, 2008; 149(5): 2108 - 2120.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. K. Basu, L. Carmel, I. B. Rogozin, and E. V. Koonin
Evolution of protein domain promiscuity in eukaryotes
Genome Res., March 1, 2008; 18(3): 449 - 461.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
K. Fukami-Kobayashi, Y. Minezaki, Y. Tateno, and K. Nishikawa
A Tree of Life Based on Protein Domain Organizations
Mol. Biol. Evol., May 1, 2007; 24(5): 1181 - 1189.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Wang and G. Caetano-Anolles
Global Phylogeny Determined by the Combination of Protein Domains in Proteomes
Mol. Biol. Evol., December 1, 2006; 23(12): 2444 - 2454.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
P. K. Shah, P. Aloy, P. Bork, and R. B. Russell
Structural similarity to bridge sequence space: Finding new families on the bridges
Protein Sci., May 1, 2005; 14(5): 1305 - 1314.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Beyer and T. Wilhelm
Dynamic simulation of protein complex formation on a genomic scale
Bioinformatics, April 15, 2005; 21(8): 1610 - 1616.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
E. J. Deeds, H. Hennessey, and E. I. Shakhnovich
Prokaryotic phylogenies inferred from protein structural domains
Genome Res., March 1, 2005; 15(3): 393 - 402.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Yang, R. F. Doolittle, and P. E. Bourne
Phylogeny determined by protein domain content
PNAS, January 11, 2005; 102(2): 373 - 378.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
V. Veeramachaneni and W. Makalowski
Visualizing Sequence Similarity of Protein Families
Genome Res., June 1, 2004; 14(6): 1160 - 1169.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. Fernandez, R. Scott, and R. S. Berry
The nonconserved wrapping of conserved protein folds reveals a trend toward increasing connectivity in proteomic networks
PNAS, March 2, 2004; 101(9): 2823 - 2827.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
M. Gensheimer and A. Mushegian
Chalcone isomerase family and fold: No longer unique to plants
Protein Sci., February 1, 2004; 13(2): 540 - 544.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
G. Caetano-Anolles and D. Caetano-Anolles
An Evolutionarily Structured Universe of Protein Architecture
Genome Res., July 1, 2003; 13(7): 1563 - 1571.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
C. Chothia, J. Gough, C. Vogel, and S. A. Teichmann
Evolution of the Protein Repertoire
Science, June 13, 2003; 300(5626): 1701 - 1703.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
S. Selvaraj and M. M. Gromiha
Role of Hydrophobic Clusters and Long-Range Contact Networks in the Folding of ({alpha}/{beta})8 Barrel Proteins
Biophys. J., March 1, 2003; 84(3): 1919 - 1925.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Muller, R. M. MacCallum, and M. J.E. Sternberg
Structural Characterization of the Human Proteome
Genome Res., November 1, 2002; 12(11): 1625 - 1641.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Mallick, D. R. Boutz, D. Eisenberg, and T. O. Yeates
Genomic evidence that the intracellular proteins of archaeal microbes contain disulfide bonds
PNAS, July 23, 2002; 99(15): 9679 - 9684.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. Kihara, Y. Zhang, H. Lu, A. Kolinski, and J. Skolnick
Ab initio protein structure prediction on a genomic scale: Application to the Mycoplasma genitalium genome
PNAS, April 30, 2002; 99(9): 5993 - 5998.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
D. Frishman
Knowledge-based selection of targets for structural genomics
Protein Eng. Des. Sel., March 1, 2002; 15(3): 169 - 183.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Kawabata, S. Fukuchi, K. Homma, M. Ota, J. Araki, T. Ito, N. Ichiyoshi, and K. Nishikawa
GTOP: a database of protein structures predicted from genome sequences
Nucleic Acids Res., January 1, 2002; 30(1): 294 - 298.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
E. L. Braun and E. Grotewold
Fungal Zuotin Proteins Evolved from MIDA1-like Factors by Lineage-Specific Loss of MYB Domains
Mol. Biol. Evol., July 1, 2001; 18(7): 1401 - 1412.
[Abstract] [Full Text] [PDF]


Home page
Microbiol. Mol. Biol. Rev.Home page
K. S. Makarova, L. Aravind, Y. I. Wolf, R. L. Tatusov, K. W. Minton, E. V. Koonin, and M. J. Daly
Genome of the Extremely Radiation-Resistant Bacterium Deinococcus radiodurans Viewed from the Perspective of Comparative Genomics
Microbiol. Mol. Biol. Rev., March 1, 2001; 65(1): 44 - 79.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Smit and A. Mushegian
Biosynthesis of Isoprenoids via Mevalonate in Archaea: The Lost Pathway
Genome Res., October 1, 2000; 10(10): 1468 - 1484.
[Abstract] [Full Text]


Home page
Nucleic Acids ResHome page
S. Balasubramanian, T. Schneider, M. Gerstein, and L. Regan
Proteomics of Mycoplasma genitalium: identification and characterization of unannotated and atypical proteins in a small model genome
Nucleic Acids Res., August 15, 2000; 28(16): 3075 - 3082.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
J. Lin and M. Gerstein
Whole-genome Trees Based on the Occurrence of Folds and Orthologs: Implications for Comparing Genomes on Different Levels
Genome Res., June 1, 2000; 10(6): 808 - 818.
[Abstract] [Full Text]


Home page
Infect. Immun.Home page
G. Subramanian, E. V. Koonin, and L. Aravind
Comparative Genome Analysis of the Pathogenic Spirochetes Borrelia burgdorferi and Treponema pallidum
Infect. Immun., March 1, 2000; 68(3): 1633 - 1648.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. E. Brenner, P. Koehl, and M. Levitt
The ASTRAL compendium for protein structure and sequence analysis
Nucleic Acids Res., January 1, 2000; 28(1): 254 - 256.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
I. V. Grigoriev and S.-H. Kim
Detection of protein fold similarity based on correlation of amino acid properties
PNAS, December 7, 1999; 96(25): 14318 - 14323.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
D. Fischer
Rational structural genomics: affirmative action for ORFans and the growth in our structural knowledge
Protein Eng. Des. Sel., December 1, 1999; 12(12): 1029 - 1030.
[Full Text] [PDF]


Home page
ScienceHome page
O. White, J. A. Eisen, J. F. Heidelberg, E. K. Hickey, J. D. Peterson, R. J. Dodson, D. H. Haft, M. L. Gwinn, W. C. Nelson, D. L. Richardson, et al.
Genome Sequence of the Radioresistant Bacterium Deinococcus radiodurans R1
Science, November 19, 1999; 286(5444): 1571 - 1577.
[Abstract] [Full Text]


Home page
Genome Res.Home page
K. S. Makarova, L. Aravind, M. Y. Galperin, N. V. Grishin, R. L. Tatusov, Y. I. Wolf, and E. V. Koonin
Comparative Genomics of the Archaea (Euryarchaeota): Evolution of Conserved Protein Families, the Stable Core, and the Variable Shell
Genome Res., July 1, 1999; 9(7): 608 - 628.
[Abstract] [Full Text]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. Mallick, K. E. Goodwill, S. Fitz-Gibbon, J. H. Miller, and D. Eisenberg
Selecting protein targets for structural genomics of Pyrobaculum aerophilum: Validating automated fold assignment methods by using binary hypothesis testing
PNAS, March 14, 2000; 97(6): 2450 - 2455.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
E. Portugaly and M. Linial
Estimating the probability for a protein to have a new fold: A statistical computational model
PNAS, May 9, 2000; 97(10): 5161 - 5166.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. Kihara, Y. Zhang, H. Lu, A. Kolinski, and J. Skolnick
Ab initio protein structure prediction on a genomic scale: Application to the Mycoplasma genitalium genome
PNAS, April 30, 2002; 99(9): 5993 - 5998.
[Abstract] [Full Text] [PDF]




Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
Genes Dev. Learn. Mem.
Protein Science RNA Genome Res.