Genome Research


Vol. 10, Issue 6, 808-818, June 2000

LETTER
Whole-genome Trees Based on the Occurrence of Folds and Orthologs: Implications for Comparing Genomes on Different Levels

Jimmy Lin1 and Mark Gerstein1,2

1 Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520 USA

We built whole-genome trees based on the presence or absence of particular molecular features, either orthologs or folds, in the genomes of a number of recently sequenced microorganisms. To put these genomic trees into perspective, we compared them to the traditional ribosomal phylogeny and also to trees based on the sequence similarity of individual orthologous proteins. We found that our genomic trees based on the overall occurrence of orthologs did not agree well with the traditional tree. This discrepancy, however, vanished when we restricted the tree to proteins involved in transcription and translation, excluding the problematic proteins involved in metabolism. Protein folds unite superficially unrelated sequence families and represent a most fundamental molecular unit described by genomes. We found that our genomic occurrence tree based on folds agreed fairly well with the traditional ribosomal phylogeny. Surprisingly, despite this overall agreement, certain classes of folds, particularly all-beta ones, had a somewhat different phylogenetic distribution. We also compared our occurrence trees to whole-genome clusters based on the composition of amino acids and dinucleotides. Finally, we analyzed some technical aspects of genomic trees, for example, comparing parsimony with distance-based approaches and examining the effect of increasing the number of organisms. Additional information (e.g., clickable trees) is available from http://bioinfo.mbb.yale.edu/genome/trees.
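
To make the idea of a distance-based occurrence tree concrete, the following is a minimal sketch in Python. It is not the authors' procedure: the abstract does not state which distance measure or tree-building algorithm was used, so the Jaccard distance and average-linkage (UPGMA-style) clustering here are assumptions chosen for illustration, and the genome names and fold labels are made-up toy data.

```python
from itertools import combinations

# Hypothetical toy data: the set of folds (made-up labels) found in each genome.
occurrence = {
    "genomeA": {"f1", "f2", "f3", "f4"},
    "genomeB": {"f1", "f2", "f3"},
    "genomeC": {"f1", "f5", "f6"},
    "genomeD": {"f5", "f6", "f7"},
}

def jaccard_distance(a, b):
    """Return 1 - |shared| / |union|: 0 for identical sets, 1 for disjoint sets."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)

def distance_matrix(occ):
    """Pairwise distances between genomes, keyed by the unordered pair of names."""
    return {
        frozenset((x, y)): jaccard_distance(occ[x], occ[y])
        for x, y in combinations(sorted(occ), 2)
    }

def average_distance(dist, members_a, members_b):
    """Average-linkage distance: mean over all leaf pairs across two clusters."""
    pairs = [(x, y) for x in members_a for y in members_b]
    return sum(dist[frozenset(p)] for p in pairs) / len(pairs)

def upgma(occ):
    """Repeatedly merge the two closest clusters; return the tree as nested tuples."""
    dist = distance_matrix(occ)
    # Each cluster is (subtree, list of leaf names in that subtree).
    clusters = [(name, [name]) for name in sorted(occ)]
    while len(clusters) > 1:
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: average_distance(
                dist, clusters[ij[0]][1], clusters[ij[1]][1]
            ),
        )
        merged = (
            (clusters[i][0], clusters[j][0]),
            clusters[i][1] + clusters[j][1],
        )
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters[0][0]

tree = upgma(occurrence)
print(tree)  # (('genomeA', 'genomeB'), ('genomeC', 'genomeD'))
```

In this toy example the genomes that share the most folds are grouped first. A parsimony-based approach, which the abstract also mentions, would instead search for the tree that requires the fewest gains and losses of features and is not shown here.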


2 Corresponding author.


10:808-818 ©2000 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/00 $5.00


