Vol 13, Issue 3, 422-427, March 2003
LETTER
The Phylogenetic Extent of Metabolic Enzymes and Pathways
José Manuel Peregrin-Alvarez,
Sophia Tsoka and
Christos A. Ouzounis1
Computational Genomics Group, The European Bioinformatics Institute,
EMBL Cambridge Outstation, Cambridge CB10 1SD, UK
 |
ABSTRACT
|
|---|
The evolution of metabolic enzymes and pathways has been a subject
of intense study for more than half a century. Yet, so far, previous
studies have focused on a small number of enzyme families or
biochemical pathways. Here, we examine the phylogenetic distribution of
the full-known metabolic complement of Escherichia coli, using
sequence comparison against taxa-specific databases. Half of the
metabolic enzymes have homologs in all domains of life, representing
families involved in some of the most fundamental cellular processes.
We thus show for the first time and in a comprehensive way that
metabolism is conserved at the enzyme level. In addition, our analysis
suggests that despite the sequence conservation and the extensive
phylogenetic distribution of metabolic enzymes, their groupings into
biochemical pathways are much more variable than previously thought.
One of the fundamental tenets in molecular biology
was expressed by Monod, in his famous phrase "What is true for
Escherichia coli is true for the elephant" (Jacob 1988 ). For
a long time, this statement has inspired generations of molecular
biologists, who have used Bacteria as model organisms to understand the
basic principles of life. The discovery of the three domains of life
(Woese and Fox 1977 ) testified that there exist some pronounced
differences between organisms, for example in transcription regulation
(Struhl 1999 ). Paradoxically, metabolism has always been considered as
one of the most conserved cellular processes (Lehninger 1979 ), that
remains invariable from Bacteria to Eucarya, but no quantification of
this view has been provided.
Instead, the phylogenetic extent of metabolism has been assessed by
experimental case studies of individual biochemical pathways (Crawford
1989 ) and, more recently, by comparative genomics. Entire genome
sequences from a wide variety of species offered the possibility of
performing metabolic reconstruction, based on known metabolic pathways
and genome sequence comparison (Karp et al. 1996 ). Case studies have
suggested that even some of the most central pathways in biochemistry
such as the citric acid cycle (Huynen et al. 1999 ), glycolysis
(Dandekar et al. 1999 ), and amino acid biosynthetic pathways (Forst and
Schulten 2001 ) may vary significantly over large phylogenetic
distances.
A comprehensive analysis of metabolism has not been performed until
now, possibly due to the scarcity of systematically collected
information on genome sequences and metabolic pathways. Metabolic
enzyme families are considered to be highly conserved and have been
used to reconstruct the deep branching patterns of the tree of life
(Doolittle et al. 1996 ). Yet, it remains unclear which enzymes are
represented in all major taxa, what pathways they participate in, and
which ones are most conserved at the sequence level.
We set out to address the phylogenetic extent and conservation of
enzymes and pathways by using a highly curated, reliable source of
metabolic information. The EcoCyc database holds information about the
full genome and all known metabolic pathways of Escherichia
coli (Karp et al. 2000 ). Recently, the database has been used to
represent computational predictions of other organisms (Karp 2001 ).
 |
RESULTS
|
|---|
We have searched the nonredundant protein sequence database,
previously partitioned in seven major taxonomic groups, with all 548
enzymes from the known metabolic complement of Escherichia
coli. Whenever a homolog of each query enzyme is found in the
corresponding taxonomic group, this is recorded into a binary vector
(see Methods). Conceptually, this approach is similar to a
low-resolution version of the phylogenetic profile method (Pellegrini
et al. 1999 ). Instead of searching individual species, however, we
focus on major taxonomic groups and we seek enzymes that "travel
together" within or across these groups. No assumptions about
functional roles or associations are made (Pellegrini et al. 1999 ): We
only examine the phylogenetic extent of the query set, in this case the
entire known metabolic complement of E. coli. The end result
is a matrix of 548 genes across all the taxonomic combinations; in all,
37 (out of 128 possible) such combinations can be observed. Genes that
have the same distribution pattern (i.e., identical binary vectors) in
each taxonomic category are collected accordingly.
The majority of E. coli enzymes (274 of them, or 50%) have
homologs in all domains of life, Bacteria, Archaea, and Eucarya,
covering six taxonomic combinations (Fig.
1). Furthermore, there are an additional 13
enzymes (2%) which are universally present in all seven taxa,
including the viruses (Table 1). This
universal set represents enzyme families involved in various
biochemical processes, including amino acid, cofactor, and nucleotide
biosynthesis (Kyrpides et al. 1999 ). It is worth noting that the
presence of all these enzymes in viruses is not fully understood yet
(Kaiser et al. 1999 ). Enzymes present in Bacteria and (1)
Eucarya are 57, covering four taxonomic combinations (10%; e.g.,
glucosamine-6-phosphate isomerase) or (2) Archaea are 52 (9%) (e.g.,
cytochrome D ubiquinol oxidase). It is interesting that 71 enzymes are
present only in Bacteria (13%; e.g., L-fucose isomerase), possibly
representing unique metabolic capabilities of this taxon. Notably, we
have not observed any metabolic enzymes that are species-specific to
E. coli. Finally, the remaining 81 enzymes (15%) have
homologs in various taxonomic combinations (24 in total), with very low
counts that are not statistically significant (see below). Overall,
52% of known metabolic enzymes from E. coli are found to be
present in all three domains, a fact indicating that metabolism is
highly conserved during evolution (Fig. 1).

View larger version (30K):
[in this window]
[in a new window]
|
Figure 1. Distribution of the 548 known E. coli metabolic enzymes into
37 taxonomic combinations (see Methods). The seven taxonomic groups
correspond to domains of life in the case of Archaea and Bacteria or
the four major eukaryotic groups (Fungi, Metazoa, Protista,
Viridiplantae), while Viruses are considered as an additional group
(see Methods). Universal enzymes are colored in gray (dark gray for
those with viral homologs), those with homologs in Eucarya in blue,
with homologs in Archaea in red, and in Bacteria only in green. Other
combinations of the seven taxonomic groups are shown in white (see
Methods).
|
|
To assess whether the observed patterns of phylogenetic distribution
for metabolic enzymes are different from any other proteins, we have
performed simulations of this analysis using protein sets of equal
size, randomly selected from the E. coli genome (see Methods).
Because some of the eukaryotic taxa (e.g., Protista or Fungi) may be
significantly underrepresented in terms of the amount of available
sequence data, we have assessed the gross pattern of taxonomic
distributions by combining all four eukaryotic taxa (Protista, Fungi,
Plants, Metazoa) into one group. It is striking that the 71
Bacteria-specific enzymes are significantly underrepresented compared
to the control set (with an average of 195 proteins that are
Bacteria-specific; Fig. 2A). The 52 enzymes
with homologs in Archaea but not Eucarya are slightly underrepresented
(with an average of 63 proteins in the control sets for this taxonomic
grouping; Fig. 2B). Finally, if we take enzymes with homologs in
Archaea, Bacteria, all four eukaryotic taxa, and viruses (170 in total,
or 31%), it is evident that this set of proteins stands out as being
overrepresented in this universal taxonomic pattern compared with
random (Fig. 2C). It is thus evident that randomly selected proteins
from a bacterial genome tend to be confined within the corresponding
taxonomic group, while metabolic enzymes are expected to be present
across a much wider phylogenetic spectrum (Fig. 2). Enzymes with
homologs in other taxonomic group combinations exhibit similarly strong
deviations from a random background being either over- or
underrepresented in the corresponding combination (Fig.
3).

View larger version (38K):
[in this window]
[in a new window]
|
Figure 2. Frequency distributions of E. coli metabolic enzymes (green
bars) versus 30 control sets of equal size with randomly selected
E. coli proteins (blue bars, see Methods) in (A)
Bacteria, (B) Bacteria and Archaea or (C) Bacteria,
Archaea and all eukaryotic groups (including viruses). Counts are shown
on the x-axis and the frequency of the sets on the
y-axis. The set of E. coli enzymes are less likely to
be present only in Bacteria than the control sets (A);
conversely, the set of E. coli enzymes are more likely to be
present in all phylogenetic groups than the control sets
(C).
|
|

View larger version (46K):
[in this window]
[in a new window]
|
Figure 3. Statistics of phylogenetic distribution of E. coli enzymes.
The 13 statistically significant taxonomic combinations are shown on
the x-axis (color coding as in Fig. 1; for taxonomic
abbreviations see Methods). Frequency of enzymes belonging to the
corresponding taxonomic combination (left bars, colored),
compared to the mean value of 30 control runs (right bars,
hatched), is shown on the left y-axis. Variance
estimates for the mean values are not shown, for clarity. Black
diamonds represent Z-score values (see Methods for details); scale is
shown on the right y-axis.
|
|
To examine which enzymes are actually most conserved at the sequence
level, we recorded all pairwise sequence identity values between the
E. coli enzymes and their homologs from Homo sapiens
(Table 2), as an indicative measure of
protein sequence conservation. There are 11 E. coli metabolic
enzymes which have similar lengths (±10%) and have sequence identity
55% to the corresponding human proteins (Table 2). The most
conserved known metabolic enzyme appears to be guanosine 5'
monophosphate oxidoreductase, with 68% identity shared between the
E. coli and H. sapiens sequences, followed by
glyceraldehyde 3-phosphate dehydrogenase and succinyl-CoA synthetase
chain (Table 2). This is the first time, to our knowledge, that
such a simple comparison has been performed.
View this table:
[in this window]
[in a new window]
|
Table 2. The 11 Most Conserved Metabolic Enzymes in E. coli
According to Their Sequence Identity to Homologs in
Homo sapiens
|
|
Finally, we have examined the conservation of pathways using the
detection of enzyme homologs in the corresponding taxonomic partitions.
Interestingly, only five of 87 pathways (less than 6%) whose enzymes
have homologs in all seven taxa are completely conserved. In fact,
these correspond only to short interconversion reactions defined as
pathways (not shown). If we relax this strict criterion and admit
enzymes with homologs in all three domains of life, that is, in any
eukaryotic group, 23 pathways with more than four enzymes and 70%
coverage are detected (Table 3). The three
invariant pathways correspond to biosynthetic pathways for tryptophan,
leucine, and arginine (Table 3). Other pathways in this list include
metabolic reactions for cofactor biosynthesis and central metabolism
(Table 3). In general, conserved pathways appear to be involved in
energy metabolism, central intermediary metabolism, sugar degradation,
cofactor biosynthesis, and the processing of amino acids and
nucleotides. This observation lends support to the hypothesis that this
set of processes is one of the most ancient aspects of cellular
physiology, possibly present in the universal ancestor (Woese 1998 ).
View this table:
[in this window]
[in a new window]
|
Table 3. The Most Conserved Metabolic Pathways in E. coli Whose
Enzymes Are Universally Present in the Three Domains of Life, Archaea
(A), Bacteria (B) and Eucarya (E)
|
|
In conclusion, it appears that although metabolic enzymes are widely
distributed and highly conserved during evolution, their corresponding
participation as groupings in pathways might vary significantly. In
that sense, pathways appear to be more variable than previously
thought, and pathway evolution exhibits a primarily "mosaic"
pattern (Teichmann et al. 2001 ; Tsoka and Ouzounis 2001 ), with
homologous enzymes being reused in different cellular roles (Jensen
1976 ).
 |
DISCUSSION
|
|---|
We have shown that the known metabolic complement of E.
coli is highly conserved, with half of the enzymes being present
in at least one species from the three domains of life, thus considered
universal. With statistical simulations, we have also shown that the
universal enzymes are much more conserved than the ones absent from
Eucarya. The high coverage of the database and the presence of homologs
in multiple species eliminates to a significant extent the possibility
of detecting lateral gene transfers, also supported by recent,
independent analyses (Salzberg et al. 2001 ; Stanhope et al. 2001 ).
Thus, the observed taxonomic patterns of enzyme distribution are most
likely to reflect genuine divergent relationships, tracing the origins
of metabolic enzymes and pathways back in time. In particular, the set
of universal enzymes may be used for deep phylogeny studies to assess
robustness of phylogenetic trees (Huelsenbeck et al. 2001 ), the
determination of divergence times (Feng et al. 1997 ), and hypotheses
for lateral gene transfer (Brown et al. 2001 ). Finally, we have
examined the conservation of biochemical pathways across species and
have found that the pathways in which the universal enzymes participate
correspond to reactions metabolizing small molecules, such as sugars,
amino acids, and nucleotides, as previously suggested (Kyrpides et al.
1999 ).
The topology and functional diversification of biochemical pathways is
yet to be explored, given that the experimentally determined
information available is confined to only a few model species, such as
E. coli, and processes, such as metabolism. It remains to be
seen how representative the metabolism of E. coli is and
whether our conclusions will hold for other species, or indeed for
other cellular processes. Much more work is necessary to validate the
metabolic reconstructions currently based on sequence homology (Tsoka
and Ouzounis 2000b ).
 |
METHODS
|
|---|
We have extracted the sequences of all enzymes (548 in total) from
EcoCyc (Karp et al. 2000 ), both the monomeric enzymes (208 in total)
and the components of enzyme complexes (348 in total; some monomeric
enzymes can also be found as enzyme complex components; Tsoka and
Ouzounis 2000a ). To address the generic phylogenetic distribution of
enzymes, we have subsequently divided the nonredundant protein sequence
database SwAll (SwissProt+TrEMBL; Bairoch and Apweiler 2000 )(602,333
entries in total) into seven major taxonomic groups: Archaea (A),
Bacteria (B), Fungi (F), Metazoa (M), Protista (P), Viridiplantae (Vp),
and Viruses (Vr). The section of the database containing these groups
covers 582,638 sequences or 97% of the total number of entries (the
3% remainder represents vector data, artificial sequences, insertion
sequences, transposons, and other unclassified entries). The
partitioning was greatly facilitated by SRS (Etzold and Argos 1993 ),
using the taxonomic records from the SwAll database (SwissProt+TrEMBL;
Bairoch and Apweiler 2000 ). All E. coli entries were excluded
from this target database.
The E. coli enzymes were subsequently used as queries to
search against the seven taxonomic partitions using BLAST (Altschul et
al. 1997 ) at an E-value threshold 1006, as previously
(Tsoka and Ouzounis 2001 ), filtered for compositional bias with CAST
(Promponas et al. 2000 ; score threshold 40). The detected homologs were
used to classify the query enzyme dataset into the seven taxonomic
groups. For each query enzyme, we recorded the distribution pattern of
all homologs in a binary vector, representing its taxonomic
distribution.
To assess the statistical significance of these measurements, 30
samples of equal size were taken from the E. coli genome as
control sets and were subjected to an identical analysisas previously
described (Tsoka and Ouzounis 2000a ). In total, 16,988 runs (548
queries * 31 runs) were performed against the nonredundant protein
sequence database. Total computation time was approximately 120 h on a
4-CPU Sun E450 with 2GB of RAM. Of the observed (62, out of 128
possible) combinations of taxonomic partitions, only 16 sets contain
more than ten counts either in enzymes or in the control sets and were
further tested for significance. We have used a 2-tailed t-test, which
is robust for skewed nonnormal distributions, at 99% confidence level.
Of the 16 sets, only 13 show a highly significant t-test value and were
considered as meaningful (Figs. 1,3).
 |
Acknowledgements
|
|---|
We thank Peter D. Karp (SRI International) for comments and members
of the Computational Genomics Group for discussions. This work was
supported by the European Molecular Biology Laboratory, the TMR
Programme of the European Commission (DGXIIScience, Research and
Development) and the Ministry of Science and Technology, Spain. S.T.
acknowledges support from the UK Medical Research Council. C.A.O.
thanks IBM Research for additional support.
The publication costs of this article were defrayed in part by payment
of page charges. This article must therefore be hereby marked
"advertisement" in accordance with 18 USC section 1734 solely to
indicate this fact.
 |
Footnotes
|
|---|
1 Corresponding author. 
E-MAIL ouzounis{at}ebi.ac.uk; FAX 44-1223-494471.
Article and publication are at
http://www.genome.org/cgi/doi/10.1101/gr.246903.
 |
REFERENCES
|
|---|
Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389-3402.[Abstract/Free Full Text]
Bairoch, A. and Apweiler, R. 2000. The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 28: 45-48.[Abstract/Free Full Text]
Brown, J.R., Douady, C.J., Italia, M.J., Marshall, W.E., and Stanhope, M.J. 2001. Universal trees based on large combined protein sequence data sets. Nat. Genet. 28: 281-285.[CrossRef][Medline]
Crawford, I.P. 1989. Evolution of a biosynthetic pathway: The tryptophan paradigm. Annu. Rev. Microbiol. 43: 567-600.[CrossRef][Medline]
Dandekar, T., Schuster, S., Snel, B., Huynen, M., and Bork, P. 1999. Pathway alignment: Application to the comparative analysis of glycolytic enzymes. Biochem. J. 343: 115-124.
Doolittle, R.F., Feng, D.F., Tsang, S., Cho, G., and Little, E. 1996. Determining divergence times of the major kingdoms of living organisms with a protein clock. Science 271: 470-477.[Abstract]
Etzold, T. and Argos, P. 1993. SRSan indexing and retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9: 49-57.[Abstract/Free Full Text]
Feng, D.F., Cho, G., and Doolittle, R.F. 1997. Determining divergence times with a protein clock: Update and reevaluation. Proc. Natl. Acad. Sci. 94: 13028-13033.[Abstract/Free Full Text]
Forst, C.V. and Schulten, K. 2001. Phylogenetic analysis of metabolic pathways. J. Mol. Evol. 52: 471-489.[Medline]
Huelsenbeck, J.P., Ronquist, F., Nielsen, R., and Bollback, J.P. 2001. Bayesian inference of phylogeny and its impact on evolutionary biology. Science 294: 2310-2314.[Abstract/Free Full Text]
Huynen, M.A., Dandekar, T., and Bork, P. 1999. Variation and evolution of the citric-acid cycle: A genomic perspective. Trends Microbiol. 7: 281-291.[CrossRef][Medline]
Jacob, F., 1988. A statue within. Basic Books, New York, NY.
Jensen, R.A. 1976. Enzyme recruitment in evolution of new function. Annu. Rev. Microbiol. 30: 409-425.[CrossRef][Medline]
Kaiser, A., Vollmert, M., Tholl, D., Graves, M.V., Gurnon, J.R., Xing, W., Lisec, A.D., Nickerson, K.W., and Van Etten, J.L. 1999. Chlorella virus PBCV-1 encodes a functional homospermidine synthase. Virology 263: 254-262.[CrossRef][Medline]
Karp, P.D. 2001. Pathway databases: A case study in computational symbolic theories. Science 293: 2040-2044.[Abstract/Free Full Text]
Karp, P.D., Ouzounis, C., and Paley, S. 1996. HinCyc: A knowledge base of the complete genome and metabolic pathways of H. influenzae. Proc. Int. Conf. Intell. Syst. Mol. Biol. 4: 116-124.[Medline]
Karp, P.D., Riley, M., Saier, M., Paulsen, I.T., Paley, S.M., and Pellegrini-Toole, A. 2000. The EcoCyc and MetaCyc databases. Nucleic Acids Res. 28: 56-59.[Abstract/Free Full Text]
Kyrpides, N., Overbeek, R., and Ouzounis, C. 1999. Universal protein families and the functional content of the last universal common ancestor. J. Mol. Evol. 49: 413-423.[CrossRef][Medline]
Lehninger, A.L., 1979. Biochemistry, pp. 363 Worth Publishers, Inc., New York, NY.
Ouzounis, C.A. and Karp, P.D. 2000. Global properties of the metabolic map of Escherichia coli. Genome Res. 10: 568-576.[Abstract/Free Full Text]
Pellegrini, M., Marcotte, E.M., Thompson, M.J., Eisenberg, D., and Yeates, T.O. 1999. Assigning protein functions by comparative genome analysis: Protein phylogenetic profiles. Proc. Natl. Acad. Sci. 96: 4285-4288.[Abstract/Free Full Text]
Promponas, V.J., Enright, A.J., Tsoka, S., Kreil, D.P., Leroy, C., Hamodrakas, S., Sander, C., and Ouzounis, C.A. 2000. CAST: An iterative algorithm for the complexity analysis of sequence tracts. Bioinformatics 16: 915-922.[Abstract/Free Full Text]
Salzberg, S.L., White, O., Peterson, J., and Eisen, J.A. 2001. Microbial genes in the human genome: Lateral transfer or gene loss? Science 292: 1903-1906.[Abstract/Free Full Text]
Stanhope, M.J., Lupas, A., Italia, M.J., Koretke, K.K., Volker, C., and Brown, J.R. 2001. Phylogenetic analyses do not support horizontal gene transfers from bacteria to vertebrates. Nature 411: 940-944.[CrossRef][Medline]
Struhl, K. 1999. Fundamentally different logic of gene regulation in eukaryotes and prokaryotes. Cell 98: 1-4.[CrossRef][Medline]
Teichmann, S.A., Rison, S.C., Thornton, J.M., Riley, M., Gough, J., and Chothia, C. 2001. The evolution and structural anatomy of the small molecule metabolic pathways in Escherichia coli. J. Mol. Biol. 311: 693-708.[CrossRef][Medline]
Tsoka, S. and Ouzounis, C.A. 2000a. Prediction of protein interactions: Metabolic enzymes are frequently involved in gene fusion. Nat. Genet. 26: 141-142.[CrossRef][Medline]
Tsoka, S. and Ouzounis, C.A. 2000b. Recent developments and future directions in computational genomics. FEBS Lett. 480: 42-48.[CrossRef][Medline]
Tsoka, S. and Ouzounis, C.A. 2001. Functional versatility and molecular diversity of the metabolic map of Escherichia coli. Genome Res. 11: 1503-1510.[Abstract/Free Full Text]
Woese, C. 1998. The universal ancestor. Proc. Natl. Acad. Sci. 95: 6854-6859.[Abstract/Free Full Text]
Woese, C.R. and Fox, G.E. 1977. Phylogenetic structure of the prokaryotic domain: The primary kingdoms. Proc. Natl. Acad. Sci. 74: 5088-5090.[Abstract/Free Full Text]
Received March 6, 2002;
accepted in revised format December 11, 2002.
13:422-427 © by 2003 Cold Spring Harbor Laboratory Press ISSN 1088-9051/03 $5.00

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
M. Campillos, C. von Mering, L. J. Jensen, and P. Bork
Identification and analysis of evolutionarily cohesive functional modules in protein networks
Genome Res.,
March 1, 2006;
16(3):
374 - 382.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. D. Karp, C. A. Ouzounis, C. Moore-Kochlacs, L. Goldovsky, P. Kaipa, D. Ahren, S. Tsoka, N. Darzentas, V. Kunin, and N. Lopez-Bigas
Expansion of the BioCyc collection of pathway/genome databases to 160 genomes
Nucleic Acids Res.,
October 24, 2005;
33(19):
6083 - 6089.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. B. Pereira-Leal, B. Audit, J. M. Peregrin-Alvarez, and C. A. Ouzounis
An Exponential Core in the Heart of the Yeast Protein Interaction Network
Mol. Biol. Evol.,
March 1, 2005;
22(3):
421 - 425.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. S. Mikkelsen, J. E. Galagan, and J. P. Mesirov
Improving genome annotations using phylogenetic profile anomaly detection
Bioinformatics,
February 15, 2005;
21(4):
464 - 470.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. M.R. Coulson, N. Hall, and C. A. Ouzounis
Comparative Genomics of Transcriptional Control in the Human Malaria Parasite Plasmodium falciparum
Genome Res.,
August 1, 2004;
14(8):
1548 - 1554.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. A. Gutierrez, M. D. Larson, and C. Wilkerson
The Plant-Specific Database. Classification of Arabidopsis Proteins Based on Their Phylogenetic Profile
Plant Physiology,
August 1, 2004;
135(4):
1888 - 1892.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
V. Kunin, J. B. Pereira-Leal, and C. A. Ouzounis
Functional Evolution of the Yeast Protein Interaction Network
Mol. Biol. Evol.,
July 1, 2004;
21(7):
1171 - 1176.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
N. Lopez-Bigas and C. A. Ouzounis
Genome-wide identification of genes likely to be involved in human genetic disease
Nucleic Acids Res.,
June 4, 2004;
32(10):
3108 - 3114.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Snel and M. A. Huynen
Quantifying Modularity in the Evolution of Biomolecular Systems
Genome Res.,
March 1, 2004;
14(3):
391 - 397.
[Abstract]
[Full Text]
[PDF]
|
 |
|
|
|