Genome Research

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Arnesano, F.
Right arrow Articles by O'Halloran, T. V.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Arnesano, F.
Right arrow Articles by O'Halloran, T. V.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 12, Issue 2, 255-271, February 2002

Metallochaperones and Metal-Transporting ATPases: A Comparative Analysis of Sequences and Structures

Fabio Arnesano,1 Lucia Banci,1 Ivano Bertini,1,3 Simone Ciofi-Baffoni,1 Elena Molteni,1 David L. Huffman,2 and Thomas V. O'Halloran2

1 Magnetic Resonance Center CERM and Department of Chemistry, University of Florence, Via Luigi Sacconi 6, 50019, Sesto Fiorentino, Florence, Italy; 2 Department of Chemistry and the Department of Biochemistry, Molecular Biology, and Cell Biology, Northwestern University, Evanston, Illinois 60208, USA

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
CONCLUSIONS
METHODS
WEB SITE REFERENCES
REFERENCES

A comparative structural genomic analysis of a new class of metal-trafficking proteins can provide insights into the intracellular chemistry of reactive cofactors such as copper and zinc. Starting from the sequences of the metallochaperone Atx1 and from the first soluble domain of the copper-transporting ATPase Ccc2, both from yeast, a search on the available genomes was performed using a homology criterion and a metal-binding motif x`-x"-C-x'''-x⁗-C. By limiting ourselves to 20% identity with any of the proteins found, several soluble copper-transport proteins were identified, as well as soluble domains of membrane-bound ATPases. Structural models were calculated using high-resolution solution structures as templates, and the models were validated using statistical and energy criteria. Residue conservation and substitution have been interpreted and discussed in terms of structure-function relationship. The potential energy surfaces have been analyzed in terms of protein-protein interactions. We find that metallochaperones and their physiological partner ATPases from several phylogenetic kingdoms recognize one another, via an interplay of electrostatics, hydrogen bonding, and hydrophobic interactions, in a manner that precisely orients the metal-binding side chains for rapid metal transfer between otherwise tight binding sites. Finally, other putative metal-transport proteins are mentioned that have low homology and/or a different metal-binding consensus motif and that appear to use similar structures for recognition and transfer. This analysis highlights the wealth and the complexity of the field.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
CONCLUSIONS
METHODS
WEB SITE REFERENCES
REFERENCES

Copper, an essential trace metal, is used as a cofactor in a variety of redox proteins. In eukaryotes, copper-dependent metalloenzymes are found in multiple cellular locations including the cytosol, mitochondria, and cell surface (Linder 1991). Excess copper, however, is highly toxic to most organisms (Linder 1991; Vulpe and Packman 1995). For this reason, the transport and sequestration of copper must be tightly controlled (Rosenzweig 2001; Pena et al. 1999; Huffman and O'Halloran 2001).

A class of metal ion receptor proteins, called metallochaperones, have been identified, which deliver copper to specific intracellular targets (O'Halloran and Culotta 2000). Recent studies indicate that a main difficulty is faced by metallochaperones as they deliver copper to the correct intracellular destination: The cytoplasm has a significant thermodynamic overcapacity for copper chelation (Rae et al. 1999). This has led to the suggestion that metallochaperones act like enzymes to lower the activation barrier for copper transfer to specific partners (Huffman and O'Halloran 2000).

Copper chaperones were first isolated in the baker's yeast Saccharomyces cerevisiae (Lin and Culotta 1995; Pufahl et al. 1997), and functional homologs have been noted in humans (Klomp et al. 1997), sheep (Lockhart and Mercer 2000), mice (Nishihara et al. 1998), Arabidopsis thaliana (Himelblau et al. 1998), and Caenorhabditis elegans (Wakabayashi et al. 1998). A prototypical copper chaperone, Atx1, conducts Cu(I) in the cytoplasm and transfers this cargo directly to a specific partner protein (Pufahl et al. 1997; Huffman and O'Halloran 2000). Both the yeast Atx1 and the human Atx1 homolog (HAH1 or ATOX1) specifically traffic copper to the secretory pathway for incorporation into copper enzymes destined for the cell surface or extracellular milieu (Klomp et al. 1997; Pufahl et al. 1997; Hung et al. 1998). A homologous protein CopZ also binds copper, and variations are known in Enterococcus hirae (Odermatt and Solioz 1995) and Bacillus subtilis (Banci et al. 2001a).

A class of integral membrane proteins that transport heavy metals across cellular membranes provided the earliest insights into the biology of copper trafficking (Vulpe et al. 1993; Bull and Cox 1994; Solioz and Vulpe 1996). Members of this family, referred to as P-type or CPx-type ATPases, have been described, mainly by gene isolation, in a variety of bacteria, yeasts, nematodes, and mammals, including humans (Pena et al. 1999; Rensing et al. 2000; Fu et al. 1995; Vulpe et al. 1993; Bull et al. 1993; Lockhart et al. 2000; Tanzi et al. 1993; Yamaguchi et al. 1993; Chelly et al. 1993; Mercer et al. 1993). P-type ATPases are ubiquitous membrane proteins and have been classified into five groups according to ion specificity (Axelsen and Palmgren 1998). A growing subfamily of metal-transporting P-type ATPases is involved in homeostasis of heavy metal ions. This subfamily can be divided into two subgroups. The first is involved in the transfer of monovalent metal ions and includes Cu(I)- and Ag(I)-transporting proteins, such as the human Menkes (Vulpe et al. 1993; Chelly et al. 1993; Mercer et al. 1993) and Wilson (Bull et al. 1993; Lockhart et al. 2000; Tanzi et al. 1993; Yamaguchi et al. 1993) disease-related proteins and homologs from S. cerevisiae (Yuan et al. 1995; Fu et al. 1995), E. hirae (Odermatt et al. 1993), Synechococcus (Phung et al. 1994), and Helicobacter pylori (Melchers et al. 1998). Members of the second subgroup of soft metal P-type ATPases transport divalent metal ions including Zn(II), Cd(II), and Pb(II). This subgroup includes ZntA from Escherichia coli and CadA from Staphylococcus aureus plasmid pI258 (Rensing et al. 1999). Recently, a related gene conferring Co(II) resistance was identified in Synechocystis PCC 6803 (Rutherford et al. 1999).

The cytoplasmic N-terminus of the copper P-type ATPases contains one or more metal-binding motifs (Bull and Cox 1994). These N-terminal domains and the copper chaperones and copper-transporting proteins are mainly characterized by M-x"-C-x'''-x⁗-C motif, also found in other proteins proposed to bind or transport the heavy metals (Solioz and Vulpe 1996; Silver et al. 1989), such as the bacterial periplasmic mercury-binding protein MerP (Sahlman and Jonsson 1992). Sequence similarities with metallochaperones indicate that each heavy-metal binding domain of ATPases comprises ~70 residues (Pufahl et al. 1997).

In humans, the Menkes ATPase containing six N-terminal metal-binding motifs (Vulpe et al. 1993; Lutsenko et al. 1997; Chelly et al. 1993; Mercer et al. 1993) is located in the trans-Golgi network (Yamaguchi et al. 1996) and is believed to translocate copper, donated by HAH1 copper chaperone, across intracellular membranes into the secretory pathway. Similarly, in the yeast S. cerevisiae, the target of copper delivery by the chaperone Atx1 is a P-type copper-transporting ATPase, called Ccc2, confined to a late Golgi compartment (Pufahl et al. 1997; Yuan et al. 1997; Huffman and O'Halloran 2000).

Solution structures of the Cu(I)-bound and reduced apo forms of both Atx1 (Arnesano et al. 2001b) and the first soluble domain of Ccc2 (Ccc2a hereafter) (Banci et al. 2001b) have been recently solved. The solution structures of MerP (Steele and Opella 1997), of the fourth domain of the Menkes transporter (MNK4 hereafter) (Gitschier et al. 1998), and CopZ, in the reduced apo form from E. hirae (Wimmer et al. 1999) and the Cu(I)-bound form from B. subtilis (Banci et al. 2001a), are also available. The crystal structures were solved for the oxidized apo and Hg(II) forms of Atx1 (Rosenzweig et al. 1999). Both the MerP solution structure and the Atx1 crystal structure contained a Hg(II) ion with a linear coordination geometry, whereas the solution structure of MNK4 contained the Ag(I) ion. All these structures share a classical "ferredoxin-like" beta 1-alpha 1-beta 2-beta 3-alpha 2-beta 4 fold (Hubbard et al. 1997) in which the secondary structure elements, four beta -strands and two alpha -helices, are connected by loop regions.

The aim of the present study is to browse the available genomes of all types of organisms and analyze them according to the specific consensus sequence for metal binding, as well as sequence homologies, with the ultimate goal of locating metallochaperones, metal-transporting proteins, and ATPases. The location of these sequences, their alignment, and the structural modeling allow us to identify the conserved amino acid segments and their role in the structural properties. Then, from this comparison, hints for the relation between structural properties and biological function are discussed.

The proteins analyzed here are all proposed to be metal-transport proteins. Some of them transport the metal through the membrane (P-type ATPases); others are small soluble proteins. Of the latter, those that carry the copper ion to a specific target protein are called copper chaperones, such as Atx1. The others, such as CopZ and MerP, are soluble metal-transport proteins involved in detoxification and, as shown below, fall into distinct structural classes.

    RESULTS
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
CONCLUSIONS
METHODS
WEB SITE REFERENCES
REFERENCES

Pattern-Based Sequence Analysis of Metallochaperones and Other Metal Transporters

Starting from the 73 amino acid sequence of Atx1 from yeast S. cerevisiae and using the consensus motif x`-x"-C-x'''-x⁗-C and the amino acid chain length to restrict the search, and through a few cycles of the procedure described in the Methods section, the sequences of 57 proteins (including the template Atx1) were selected whose sequence alignment is shown in Figure 1. Sequences from 12 eukaryotic organisms are grouped together in a green box. Pairwise residue identity within this subclass is 46% ± 19%. The most evolved organisms (Homo sapiens, Rattus norvegicus, Mus musculus, Ovis aries, and Canis familiaris) show a higher sequence similarity. All the eukaryotic organisms possess the metal-binding motif M-x"-C-x⁗-G-C. The proteins of this subclass contain a large number of Lys residues. The alignment highlights four positions (25, 29, 70, 73) where positively charged residues (Lys and Arg) are conserved. In particular, a Lys residue is always present at positions 70 and 73. The NMR structure of the Cu(I) form of yeast S. cerevisiae Atx1 (sequence 1) is available (Arnesano et al. 2001b).


View larger version (116K):
[in this window]
[in a new window]
 
Figure 1   Sequence alignment of the Atx1 amino acid sequence from yeast S. cerevisiae with the metallochaperone homologs located in the present research. At the top, amino acid numbering is reported including gaps (first line) and according to the sequence of Atx1 (second line). The two metal-binding cysteines are shaded in blue. Positions where hydrophobic residues are conserved are shaded in green. Key positive (Arg and Lys) and negative (Glu and Asp) residues are indicated in blue and in red, respectively. The boxes include eukaryotic metallochaperone sequences (green), and bacterial, CopZ-like and MerP-like, sequences (blue). The red box includes the subgroup of MerP-like sequences. The secondary structure elements reported above the alignment are referred to Atx1. In column a we report sequence identity to Atx1 (sequence 1); in column b we report identity to sequence 13 and sequence 36 for CopZ-like and MerP-like sequences, respectively. Eukaryotic metallochaperones: 1 gi|6730164 Saccharomyces cerevisiae (Atx1); 2 gi|7492713 Schizosaccharomyces pombe; 3 gi|11290108 Oryza sativa; 4 gi|11290106 Glycine max; 5 gi|15228869 Arabidopsis thaliana; 6 gi|1945365 Homo sapiens; 7 gi|6013208 Canis familiaris; 8 gi|7531050 Ovis aries; 9 gi|7531046 Mus musculus; 10 gi|7531044 Rattus norvegicus; 11 gi|4165309 Caenorhabditis elegans; 12 gi|7296474 Drosophila melanogaster; Bacterial and archaeal soluble metal transporters: CopZ-like: 13 gi|5107576 Enterococcus hirae (CopZ); 14 gi|7429061 Bacillus subtilis (CopZ); 15 gi|1477773 Helicobacter pylori, strain A68; 16 gi|9789743 H. pylori, strain J99; 17 gi|3121871 Helicobacter felis; 18 gi|7471711 Deinococcus radiodurans; 19 gi|7462405 Thermotoga maritima; 20 gi|11346720 Campylobacter jejuni; 21 gi|13357760 Ureaplasma urealyticum; 22 gi|8894834 Streptomyces coelicolor; 23 gi|9965436 Streptococcus mutans; 24 gi|13622769 Streptococcus pyogenes; 25 gi|11353981 Neisseria meningitidis, serogroup A strain Z2491; 26 gi|11353792 N. meningitidis, serogroup B strain MC58; 27 gi|14600385 Aeropyrum pernix; 28 gi|14195314 Haemophilus influenzae; 29 gi|12722322 Pasteurella multocida; 30 gi|12723757 Lactococcus lactis; 31 gi|12725117 L. lactis II; 32 gi|10580283 Halobacterium sp.; 33 gi|10173169 Bacillus halodurans; 34 gi|11349934 Pseudomonas aeruginosa; 35 gi|8388756 Pseudomonas syringae; MerP-like: 36 gi|127010 Shigella flexneri (MerP); 37 gi|2944141 Pseudomonas stutzeri I; 38 gi|2947088 P. stutzeri II; 39 gi|2498542 Alcaligenes sp.; 40 gi|127008 P. aeruginosa, plasmid pVS1; 41 gi|2498544 Pseudomonas fluorescens; 42 gi|4572444 Sphingomonas paucimobilis; 43 gi|127009 Serratia marcescens; 44 gi|4572382 E. coli; 45 gi|2498541 Acinetobacter calcoaceticus; 46 gi|2498543 Enterobacter cloacae; 47 gi|2052180 Pseudomonas sp., strain KHP41; 48 gi|2159997 Pseudomonas sp., strain K-62; 49 gi|6689527 Xanthomonas campestris; 50 gi|2498545 Shewanella putrefaciens; 51 gi|2935549 Pseudomonas alcaligenes; 52 gi|2765117 Thiobacillus sp.; 53 gi|14195504 H. influenzae; 54 gi|12721567 P. multocida; 55 gi|10640685 Thermoplasma acidophilum; 56 gi|14324466 Thermoplasma volcanium; 57 gi|11496143 Pseudoalteromonas haloplanktis.

A slightly larger sequence variability (34% ± 18% pairwise residue identity) is found within the subclass of bacterial proteins (grouped in a blue box in Fig. 1), comprising 45 sequences. In this case, only the pattern C-x'''-x⁗-C is conserved, with position x` being occupied by a Met, an Ile, a Leu, or a Val residue. However, within this class some proteins were grouped (red sub-box in Fig. 1) whose primary sequence is highly conserved, with pairwise residue identity of 56% ± 25%. As the first member of this subclass of organisms, the sequence of MerP from Shigella flexneri (sequence 28) was considered. This is a mercury detoxification protein (Foster 1987; O'Halloran 1993; Lund and Brown 1987; Sahlman and Jonsson 1992) whose Hg(II)-bound NMR structure is available (Steele and Opella 1997). There are 21 sequences belonging to this subclass. All of them are members of the widespread mercury detoxification operons (Liebert et al. 2000).

The sequence variability within the remaining bacterial metallochaperone homologs is larger (27% ± 11% pairwise residue identity). This subclass comprises 23 sequences. Fifteen of them contain a His residue at positions x" or x⁗ of the metal-binding motif. In two cases (sequences 20 and 25), His is replaced by Asn, which, however, can be considered His-like. The representative sequence of this subclass is that of CopZ, a protein found in E. hirae and B. subtilis, which is proposed to act as a copper transporter (Odermatt et al. 1992; Odermatt et al. 1993; Odermatt and Solioz 1995; Cobine et al. 1999; Banci et al. 2001a). The NMR structures of the Cu(I)-bound form of CopZ from B. subtilis (sequence 14) (Banci et al. 2001a) and the apo form of CopZ from E. hirae (sequence 13) (Wimmer et al. 1999) are available.

Figure 2A schematically shows amino acid variability among metal ion transporters in the protein frame of yeast Atx1. It can be noticed that sequence conservation increases when approaching the metal-binding site.


View larger version (29K):
[in this window]
[in a new window]
 
Figure 2   Amino acid variability among metallochaperone homolog sequences (A) and among metal-transporting ATPases (B). Color code is reported in the bottom panel. Positions with gaps in more than 50% of the aligned sequences are colored in white. The structures of yeast Cu-Atx1 (Arnesano et al. 2001b) (PDB ID 1FD8) and of the 1st domain of Cu-Ccc2 (Banci et al. 2001b) (PDB ID 1FVS) are used.

Gram-positive bacteria show a different organization of the cell, which in particular lacks periplasm. This indicates that periplasmic metal resistance proteins, such as MerP, are absent. Although CopZ is in the cytoplasm, only a relatively small number of organisms show proteins having sequence similarity with the CopZ sequence. It can be inferred that other regulation mechanisms are effective in copper homeostasis (O'Halloran and Culotta 2000; Rosen 1999; Silver and Ji 1994).

To date, the complete genomes of 48 organisms (9 archaea, 35 bacteria, and 4 eukaryotes) have been sequenced. (Protein sequences reported in the present paper are those available in the GenBank in March 2001. Complete genomes are available at the following Internet site: http://www.ncbi.nlm.nih.gov/PMGifs/Genomes/org.html.) Under the present search conditions, small soluble metal-transport proteins were found only in 21 fully sequenced genomes of bacteria and archaea. For the other organisms with known complete genomes, it is probable that different proteins with low sequence homology and different metal-binding motifs are involved in metal homeostasis. On the other hand, Atx1-like metallochaperone proteins were found in all eukaryotes with complete genomes.

Pattern Analysis of Metal-Transporting ATPases

Metal-binding domains of large membrane proteins were searched, starting from the 72 amino acid sequence of the first soluble domain of the copper-transporting ATPase Ccc2 from yeast S. cerevisiae (Ccc2a) and using the x`-x"-C-x'''-x⁗-C consensus motif. The search detected 116 sequences corresponding to single metal-binding domains. The sequence alignment is shown in Figure 3. A total of 63 domains belonging to 16 different proteins was found in 12 eukaryotic organisms and 53 domains of 43 proteins in 36 bacteria and archaea. Copper-transporting ATPases were found in all fully sequenced eukaryotic genomes and in 24 bacterial and archaeal complete genomes. The number of metal-binding motifs in metal-transporting ATPases increases with evolution, ranging from one to two in bacteria and up to six in eukaryotes. In humans, two copper ATPases are present. They are called the Menkes and Wilson proteins and each contains six soluble domains. On the basis of sequence homology, both proteins were identified in the mouse and the rat, whereas only one copper-transporting ATPase was found in the sheep (Wilson-like) and the chinese hamster (Menkes-like). They all possess six domains, with the exception of the Wilson proteins of the mouse and the rat, in which only five are present. When six domains are present, a higher homology is found between corresponding domains of different organisms, with respect to homology within domains of the same organism. This also holds for proteins with only five metal-binding domains, taking into account that, on the basis of sequence homology, the fourth is missing.


View larger version (156K):
[in this window]
[in a new window]
 
Figure 3   Sequence alignment of the Ccc2a amino acid sequence from yeast S. cerevisiae with the heavy metal-binding domains located in the present research. At the top, amino acid numbering is reported including gaps (first line) and according to the sequence of Ccc2a (second line). The two metal-binding cysteines are shaded in blue. Key negative (Glu and Asp) and positive (Arg and Lys) residues are indicated in red and blue, respectively. Positions where hydrophobic residues are conserved are highlighted in green. The two boxes include eukaryotic (green) and bacterial (blue) sequences. The secondary structure elements reported above the alignment are referred to Ccc2a. In column a we report sequence identity to Ccc2a; in column b the length of interdomain linkers for multidomain proteins. 1 gi|6320475 S. cerevisiae, 1st domain (Ccc2a); 2 S. cerevisiae, 2nd domain (Ccc2b); 3 gi|7492267 Schizosaccharomyces pombe; 4 gi|8745541 Candida albicans, 1st domain; 5 C. albicans, 2nd domain; 6 gi|6633848 A. thaliana I, 1st domain; 7 A. thaliana I, 2nd domain; 8 gi|12229667 A. thaliana II, 1st domain; 9 A. thaliana II, 2nd domain; 10 gi|1351993 human Menkes protein, 1st domain (MNK1); 11 human Menkes protein, 2nd domain (MNK2); 12 human Menkes protein, 3rd domain (MNK3); 13 human Menkes protein, 4th domain (MNK4); 14 human Menkes protein, 5th domain (MNK5); 15 human Menkes protein, 6th domain (MNK6); 16 gi|14758520 human Wilson protein, 1st domain; 17 human Wilson protein, 2nd domain; 18 human Wilson protein, 3rd domain; 19 human Wilson protein, 4th domain; 20 human Wilson protein, 5th domain; 21 human Wilson protein, 6th domain; 22 gi|2739170 O. aries, 1st domain; 23 O. aries, 2nd domain; 24 O. aries, 3rd domain; 25 O. aries, 4th domain; 26 O. aries, 5th domain; 27 O. aries, 6th domain; 28 gi|2440287 M. musculus I, 1st domain; 29 M. musculus I, 2nd domain; 30 M. musculus I, 3rd domain; 31 M. musculus I, 4th domain; 32 M. musculus I, 5th domain; 33 M. musculus I, 6th domain; 34 gi|12229577 M. musculus II, 1st domain; 35 M. musculus II, 2nd domain; 36 M. musculus II, 3rd domain; 37 M. musculus II, 4th domain; 38 M. musculus II, 5th domain; 39 gi|1351992 Cricetulus griseus, 1st domain; 40 C. griseus, 2nd domain; 41 C. griseus, 3rd domain; 42 C. griseus, 4th domain; 43 C. griseus, 5th domain; 44 C. griseus, 6th domain; 45 gi|12229551 R. norvegicus I, 1st domain; 46 R. norvegicus I, 2nd domain; 47 R. norvegicus I, 3rd domain; 48 R. norvegicus I, 4th domain; 49 R. norvegicus I, 5th domain; 50 R. norvegicus I, 6th domain; 51 gi|6006293 R. norvegicus II, 1st domain; 52 R. norvegicus II, 2nd domain; 53 R. norvegicus II, 3rd domain; 54 R. norvegicus II, 4th domain; 55 R. norvegicus II, 5th domain; 56 gi|7428301 C. elegans, 1st domain; 57 C. elegans, 2nd domain; 58 C. elegans, 3rd domain; 59 gi|7292707 D. melanogaster, 1st domain; 60 D. melanogaster, 2nd domain; 61 D. melanogaster, 3rd domain; 62 D. melanogaster, 4th domain; 63 gi|10122140 Cryptosporidium parvum; 64 gi|7531047 B. subtilis, 1st domain (CopAa); 65 B. subtilis, 2nd domain (CopAb); 66 gi|15283450 Aquifex aeolicus; 67 gi|1354935 Escherichia coli, 1st domain; 68 E. coli, 2nd domain; 69 gi|2493017 Haemophilus influenzae; 70 gi|8894835 Streptomyces coelicolor I; 71 gi|6689151 S. coelicolor II; 72 gi|6714779 S. coelicolor III; 73 gi|2493001 Synechocystis I; 74 gi|7428304 Synechocystis II; 75 gi|11267419 Campylobacter jejuni; 76 gi|7471220 Deinococcus radiodurans, 1st domain; 77 D. radiodurans, 2nd domain; 78 gi|416665 E. hirae (CopA); 79 gi|2493003 H. pylori, strain 26695; 80 gi|3121870 H. felis; 81 gi|13432119 Mycobacterium leprae I; 82 gi|13637888 M. leprae II; 83 gi|1706183 Mycobacterium tuberculosis I; 84 gi|1706184 M. tuberculosis II; 85 gi|11267401 Neisseria meningitidis, serogroup A strain Z2491; 86 gi|11267405 N. meningitidis, serogroup B strain MC58; 87 gi|1353678 Proteus mirabilis, 1st domain; 88 P. mirabilis, 2nd domain; 89 gi|7531049 Sinorhizobium meliloti, 1st domain; 90 S. meliloti, 2nd domain; 91 gi|584792 Synechococcus I; 92 gi|584820 Synechococcus II; 93 gi|7436397 Thermotoga maritima; 94 gi|14601418 Aeropyrum pernix; 95 gi|11498084 Archaeoglobus fulgidus; 96 gi|7428308 Methanobacterium thermoautotrophicum, 1st domain; 97 M. thermoautotrophicum, 2nd domain; 98 gi|8388793 Pseudomonas syringae, 1st domain; 99 P. syringae, 2nd domain; 100 gi|9965435 S. mutans; 101 gi|13622770 S. pyogenes; 102 gi|10173170 B. halodurans, 1st domain; 103 B. halodurans, 2nd domain; 104 gi|11351437 P. aeruginosa, 1st domain; 105 P. aeruginosa, 2nd domain; 106 gi|12723759 L. lactis; 107 gi|10580282 Halobacterium sp., strain NRC-1, 1st domain; 108 Halobacterium sp., strain NRC-1, 2nd domain; 109 gi|12722323 P. multocida; 110 gi|120199 Sinorhizobium meliloti (fixI); 111 gi|3122077 Rhizobium leguminosarum; 112 gi|3122094 Bradyrhizobium japonicum; 113 gi|13475529 Mesorhizobium loti; 114 gi|2338745 Rhodobacter capsulatus; 115 gi|7271788 Rhodobacter sphaeroides; 116 gi|13422766 Caulobacter crescentus.

The number of residues in the interdomain stretches is reported in column b of Figure 3. The interdomain sequences show a large variability in length. For instance, the two domains of yeast Ccc2 are joined immediately by a very short linker, as are the last two domains of the Wilson and Menkes proteins. In contrast, the other interdomain sequences of the latter proteins possess 10 to 90 residues. The exceptions are the mouse and rat Wilson ATPases in which, after the third domain, a stretch of approximately 150 residues is found that lacks the metal-binding motif. These interdomain regions may provide other structural elements or modulate motions of the individual metal-binding domains in concert with other elements of the ATPase.

If the six domains of the Menkes protein are aligned for the same organism, invariably domain 3 has the lowest homology with the other domains of the same protein. However, they are very similar among themselves in the various organisms (65% average identity). In particular, three key positions that discriminate domain 3 from the other domains can be identified from inspection of the sequence alignment shown in Figure 3. Position 13 (corresponding to x" in the consensus pattern) is always His in domain 3, whereas the other domains have a Thr. Positions 76 and 77 are two Pro or a Ser and a Pro in domain 3, whereas the other domains have two adjacent Gly and Phe. The calculated instability index (II) (Guruprasad et al. 1990) predicts a lower stability for domain 3 (II = 75 ± 8) compared with the others (II = 50 ± 8).

The Met residue present in the consensus motif is almost always conserved. In addition, a Gly residue precedes this Met in all eukaryotes and archaea and in almost all bacteria. A Phe (mainly in eukaryotes) or a Tyr (mainly in bacteria and archaea) residue is often present at position 77. Some negatively charged residues are partially conserved: A Glu is often found at position 22 in both bacteria and eukaryotes, with an Asp or a Glu at positions 72, 74, and 78, mainly in eukaryotes.

Almost all sequences aligned in Figure 3 have residue identity higher than 20% with respect to the Ccc2a sequence (Fig. 3, column a) with an average value of 29% for eukaryotes and 25% for bacteria and archaea.

In bacteria, the first domain of a metal-transporting ATPase from B. subtilis (sequence 64) has the highest percentage of residue identity to the sequence of Ccc2a (35%). The average residue identity of all other bacterial sequences to sequence 64 from B. subtilis is 38% ± 8%. This sequence has 44% identity with CopA (sequence 78), a protein from E. hirae that is known to act as a copper-transporting ATPase (Odermatt et al. 1993). The protein CopZ has been postulated to act as an activator of gene transcription of the cop operon. Its action depends on the availability of copper, thus controlling the expression of CopA (Odermatt and Solioz 1995; Cobine et al. 1999). An interaction between CopZ and the ATPase, CopA, has not been shown so far.

Within the class of bacterial ATPases, some proteins can be identified (sequences 110-116 in Fig. 3) whose primary sequence is highly conserved one with the others, with pairwise residue identity of 41% ± 8%. These ATPases are found in organisms belonging to the group of Rhizobiaceae, Rhodobacter, and Caulobacter; all of them are members of the Fix operon (Nellen-Anthamatten et al. 1998; Cabanes et al. 2000; Delgado et al. 1998). This operon also includes cytochrome c oxidase (CCO), which requires copper to function, and the ATPases called FixI, which serve as copper pumps in the activation of CCO. FixI ATPases have an average residue identity of 20% with Ccc2a, and their soluble part is predicted to have the same beta alpha beta beta alpha beta fold.

The residue variability along the whole family of ATPase soluble domains is shown in Figure 2B.

Homology Modeling of Metallochaperones and Other Metal Transporters

First, we compared the structure of Cu(I)-Atx1 from S. cerevisiae (Arnesano et al. 2001b) (sequence 1 in Fig. 1) with the other available structures of soluble metal transporters, the apo-CopZ (Wimmer et al. 1999) from E. hirae (sequence 13), the Cu(I)-CopZ (Banci et al. 2001a) from B. subtilis (sequence 14), and the Hg(II)-MerP (Steele and Opella 1997) from Sh. flexneri (sequence 36). The protein fold is similar in all cases. Overall backbone RMSD values of 2.3 Å and 2.2 Å were found between Cu-Atx1 and Cu-CopZ and between Cu-Atx1 and apo-CopZ, respectively. The RMSD between Cu-Atx1 and Hg-MerP is 2.5 Å. The bacterial proteins apo-CopZ and Hg-MerP are quite similar to each other with a backbone RMSD value of 1.6 Å. When comparing Cu-CopZ from B. subtilis and the apo-CopZ from E. hirae, an RMSD value of 2.2 Å is found, which drops to 1.7 Å when residues close to the metal site (loop 1 and helix alpha 1) are excluded, that is, the ones most affected by metal binding. The largest differences between Cu-Atx1 and bacterial metal transporters involve loops 2 and 4, which are the regions experiencing the largest variations in residue type, as well as insertions and deletions. Other differences involve helix alpha 1, in particular at its N-terminus close to the metal site, and strand beta 4, which is shorter in MerP and CopZ than in Atx1.

For the sequences whose structures are not available, the PHD method (Rost and Sander 1993, 1994) has been used to predict secondary structural elements. Variations in the primary structure produce small changes in length and position of secondary structural elements, whereas the global fold is conserved. The proteins of unknown structure have then been modeled using Cu(I)-Atx1, apo- and Cu(I)-CopZ, and Hg(II)-MerP as templates.

An overlay of the structures for the superfamily of 57 metal transporters is reported in Figure 4A, in which three colors were used according to sequence classification. It can be clearly seen that structural differences between the three subgroups are correlated with those between the experimentally determined structures used as templates and are in agreement with sequence homology. The pairwise backbone RMSD between the modeled structures (calculated for the common stretches 7-12, 14-31, 37-44, 46-54, 59-67, 71-76) is 1.5 ± 0.7 Å. Higher deviations are found for regions in which residues are not conserved and insertions/deletions are found in the various sequences (Fig. 2A). Loop 2 has a variable length in the various sequences, as indicated by gaps in the alignment and, for this reason, different loop conformations are obtained from the modeling. Large changes are also observed in strand beta 4 and helix alpha 1, as expected on the basis of the above considerations concerning the template structures. The metal ion position is well maintained among eukaryotic metallochaperone models, whereas it is more variable in bacterial metal transporters, reflecting a larger sequence variability in these latter sequences for residues in the metal-binding motif and its vicinity.


View larger version (51K):
[in this window]
[in a new window]
 
Figure 4   (A) Overlay of the super family of 57 metallochaperone homolog structures; three colors are used according to sequence classification of Fig. 1: Atx1-like (green), CopZ-like (blue), and MerP-like (red). The copper ion is shown as a green or a blue sphere for Atx1-like and CopZ-like structures, respectively; for MerP-like structures the mercury ion is shown in red. (B) Overlay of the super family of 116 structures, each corresponding to one soluble domain of a metal-transporting ATPase; two colors are used to distinguish between eukaryotes (green) and prokaryotes, including archaea (blue). The copper ion is shown as a sphere.

Homology Modeling of Metal-Transporting ATPases

The only available structures for a subdomain of metal-transporting ATPases are solution structures of the copper-bound 1st domain of Ccc2 (Banci et al. 2001b) from S. cerevisiae (sequence 1 in Fig. 3) and of the silver-bound 4th domain of the human Menkes protein (Gitschier et al. 1998) (sequence 13), both belonging to eukaryotes. The residue identity between the two is 30% and the backbone RMSD between the two structures is 0.8 Å. The solution structures were used as templates for the modeling. All the eukaryote sequences show a large sequence homology with Ccc2a (29% ± 4% average identity). The third domains of Menkes and Wilson proteins show the lowest amino acid identity (22% ± 2%) with Ccc2a, but they have higher sequence identity with MNK4 (39% ± 2%). Concerning bacteria and archaea sequences, the use of these two templates is justified by a relatively large amino acid identity, as discussed above.

An overlay of the superfamily of 116 structures, each corresponding to one soluble domain of a metal-transporting ATPase, is reported in Figure 4B, in which two colors were used to distinguish between eukaryotes (green) and prokaryotes (blue), including archaea. All the structures are well superimposed with an average pairwise backbone RMSD of 0.4 ± 0.2 Å (calculated for the common stretches 12-32, 40-52, 67-72, 74-79). The most variable region is loop 4, which can be traced to the amino acid variability among ATPases reported in Figure 2B in the protein frame of yeast Ccc2a. The third domain of the Menkes and Wilson proteins is well modeled, with an average RMSD of 0.8 Å to Ccc2a and 0.3 Å to MNK4, in agreement with residue identity.

Model Assessment: Energetics, Secondary Structure, and Hydrophobic Core Analysis

In the reported alignments (Figs. 1, 3), we included sequences having low residue identity with the members used as templates for the modeling but higher than 20% with at least one member of the family. A significant sequence homology is not necessary for two proteins to have a common fold (Orengo et al. 1994). Indeed, the sequences we have considered are correlated with each other, as there exists a path in sequence space such that every sequence can be reached from every other sequence. In this sense, a common fold requires a significant level of conservation of amino acid classes rather than of individual amino acids (Babajide et al. 1997). Also in low homology sequences reported here, key residues are conservatively substituted.

Even when the overall fold is the same, if sequence identity with template structures is lower than 30%, the following errors can affect the model: 1) the region is aligned correctly but is distorted or shifted as a rigid body relative to the correct structure (e.g., loops, helices); and 2) the region is modeled incorrectly because it does not have an equivalent segment in any of the templates (e.g., loops). RMSD to the template structure does not reflect these errors. In fact, modeling is strictly dependent on the structures chosen as templates. There are several criteria to evaluate a structural model and back-check for the choice of the templates (Sali 1995). A good model should have a low energy, according to a molecular mechanics force field, even if this is not sufficient (Novotny et al. 1984; Novotny et al. 1988). Thus, structural features have been analyzed from high-resolution protein structures, and related parameters have been used as strong indicators of errors in the model. Such features include packing, creation of a hydrophobic core, residue and atomic solvent accessibility, spatial distribution of charged groups, distribution of atom-atom distances, and main chain hydrogen bonding (Sali 1995 and references therein). In addition, programs based on the analysis of known protein structures provide stereochemical tests.

The quality of the structures was evaluated through Ramachandran plots obtained using the program PROCHECK (Laskowski et al. 1993). The latter program also analyzes the secondary structure elements. In addition, all models were inspected with the program PROSA (Sippl 1993), which allows the identification of poor models or regions in models with unsatisfactory interactions with the rest of the protein. The program checks if poor packing of the structure is present.

The results of the Ramachandran plot analysis are reported in Table 1 for metallochaperones and soluble metal transporters, and in Table 2 for ATPase domain structures. The secondary structure elements, as analyzed by PROCHECK, particularly the two helices, are well conserved both in position and in length (Fig. 5), all structures sharing the beta 1-alpha 1-beta 2-beta 3-alpha 2-beta 4 fold. The length of the last strand beta 4, however, shows a large variability. In particular, this strand is generally shorter in bacterial metal transporters than in eukaryotic metallochaperones and, in some cases, is totally absent. This reflects differences also observed in the template structures. In fact, in CopZ from E. hirae and B. subtilis and in MerP from Sh. flexneri, strand beta 4 has a reduced length with respect to Atx1 from S. cerevisiae. Hydrogen-bonded turns are often found close to the N- or C-termini of regular structural elements.

                              
View this table:
[in this window]
[in a new window]
 
Table 1.   Structural Analysis of the Family of Metallochaperone and Metal Transporter Modeled Structures as it Results from the Ramachandran Plots



                              
View this table:
[in this window]
[in a new window]
 
Table 2.   Structural Analysis of the Family of ATPase Metal Binding Domain Modeled Structures as it Results from the Ramachandran Plots


View larger version (59K):
[in this window]
[in a new window]
 
Figure 5   Secondary structure of structural models of metallochaperones and soluble metal transporters (A) and of ATPase domains (B). beta -strands are indicated in red, alpha -helices in green, turns in blue. For each secondary structure element the pair/surface combined energy is reported in brackets as obtained from the PROSA analysis.

PROSA-combined energy (Methods section) has been reported for each secondary structure element on the top of plots of Figure 5. It can be noted that, generally, regular secondary structure elements, that is, alpha  helices and beta  strands, have lower energy than loop regions. An exception is represented by strand beta 4 that, however, is less conserved as discussed above. On the other hand, loop 4 in chaperone and soluble metal-transport protein models and loop 2 in ATPase models have low energy. These energies were converted into Z-scores (Sippl 1993; Babajide et al. 1997), which can be considered as a measure for the quality of the modeling: A lower Z-score corresponds to a more favorable potential energy associated with the structure under examination. The Z-scores obtained for chaperone and soluble metal-transport protein structures are in the range -4.9 to -8.9, with a mean value of -6.6. The values for ATPase structures are in the range of -3.2 to -7.6, with a mean value of -6.1. These Z-scores are in a range expected on the basis of the amino acid sequence and of its length (~70 a.a.) (Sippl 1993). In summary, the PROSA analysis for the family of models confirms their consistency, as the range of observed Z-scores is quite narrow over all the structures. Inasmuch as this set includes experimental structures, this assessment implies that the overall quality of the models is comparable to that of the available experimental structures.

The hydrophobic core of chaperones, soluble metal-transport proteins, and ATPases are shown in Figure 6. In metallochaperones and other soluble metal transporters, 12 residues are buried in the majority of the structures (>70%), and 10 additional residues are buried in 30%-70%. In ATPases, 12 residues are buried in more than 70% structures and 10 additional residues are buried in 30%-70%. Buried residues are generally highly conserved in metallochaperones, soluble metal-transport proteins, and ATPase sequences (marked with * in Fig. 6). It is interesting to note that all these classes of proteins have a buried residue in corresponding positions of the amino acid sequences, consistent with a common beta alpha beta beta alpha beta fold. The most buried regions are strands beta 1 and beta 3 and part of helices alpha 1 and alpha 2. Residues in loops 2 and 4, in which amino acid variability is very high, are the most exposed. The second metal-binding cysteine of the motif (in helix alpha 1) is partially exposed in metallochaperones and small soluble metal transporters, whereas it is buried in ATPase domains. At position x` of the consensus pattern, a Met often occurs, which is sometimes replaced by hydrophobic residues like Ile, Val, or Leu. These occupy a unique site in loop 1 that has a low solvent accessibility. In ATPases, the side chain of this Met is completely buried in almost all structures and penetrates into the core. Loop 1 and loop 5 regions are considerably more polar in the metallochaperones.


View larger version (41K):
[in this window]
[in a new window]
 
Figure 6   Hydrophobic core of structural models of metallochaperones and soluble metal transporters (A) and of ATPase soluble domains (B), defined as the van der Waals surface formed by atoms of residues that are buried in more than 70% structures. These residues are indicated in the figure. Conserved hydrophobic amino acids are marked with *.

Tables 3 and 4 report the contacts conserved in more than 70% of the modeled proteins and involving buried residues. Nearly all amino acids involved in these contacts are conserved or conservatively substituted in the majority of the protein sequences that we have analyzed.

                              
View this table:
[in this window]
[in a new window]
 
Table 3.   Contacts Involving Buried Residues (boldface) Conserved in More Than 70% of Metallochaperone and Metal Transporter Model Structures



                              
View this table:
[in this window]
[in a new window]
 
Table 4.   Contacts Involving Buried Residues (in bold) Conserved in More Than 70% of Metal-transporting ATPase Model Structures

Other Potential Metal-Binding Domains

The search in the GenBank by using the x`-x"-C-x'''-x⁗-C consensus motif led us to other proteins of different sequence length and/or low homology with the input sequences Atx1 and Ccc2. These significant differences prevented reliable structure modeling and are summarized below.

Some organisms possess other putative copper-transporting ATPases, in addition to those already reported in the alignment of Figure 3. In Pseudomonas aeruginosa, Neisseria meningitidis, Campylobacter jejuni, H. pylori, and Vibrio cholerae the N-terminal soluble part of these ATPases encompasses ~150 residues and contains three metal-binding motifs: CxHC, CxGC, and CxAC. Similarly, the archaeon Halobacterium sp. has an ATPase with three metal-binding motifs, CTLC, CRGC, and CATC.

The N-terminal domain of CCS, the copper chaperone for superoxide dismutase (SOD) (Culotta et al. 1997), is a structural homolog of Atx1 (Lamb et al. 1999), and it shares the same consensus motif for metal binding.

Two proteins of the mercury resistance (mer) operon, the mercuric reductase MerA and the inner membrane protein MerC, also contain a CxxC metal-binding motif (Miller 1999; Liebert et al. 2000). The N-terminal domain of MerA is aligned with the Atx1 sequence, with low homology (17% ± 2% residue identity), and with MerP (33% ± 3% residue identity).

Finally, we found two subgroups of ATPase sequences showing similarity to the E. coli zinc-transporting ZntA (Rensing et al. 1997) (sequences 1-12 in Fig. 7) or to the S. aureus cadmium-transporting CadA (Nucifora et al. 1989) (sequences 13-19 in Fig. 7). It was shown that ZntA and CadA also display Pb(II) translocating activity (Rensing et al. 1998). The average residue identity with respect to yeast Ccc2a and MNK4 is 19% and 20%, respectively, and the secondary structure prediction for the soluble domains of these Zn/Cd/Pb ATPases indicates the beta alpha beta beta alpha beta fold common to the copper transporters. It can be inferred that the metal-binding specificity for divalent cations does not reside in the overall fold but in sequence variation in the proximity of the metal-binding motif. Given the lack of experimental structures loaded with one of the divalent cations for which these proteins are specific, modeling was not pursued; however, some observations can be made on the basis of their sequence alignment (Fig 7).


View larger version (44K):
[in this window]
[in a new window]
 
Figure 7   Sequence alignment of zinc- and cadmium-transporting ATPases. At the top, amino acid numbering is reported including gaps. The two metal-binding cysteines are shaded in blue. Some conserved negative (Glu and Asp) and positive (Arg and Lys) residues are indicated in red and blue, respectively. Positions where hydrophobic residues are conserved are highlighted in green. In column a we report sequence identity to Ccc2a from S. cerevisiae. 1 gi|586655 E. coli, strain K-12 (ZntA); gi|15803981 E. coli, strain EDL-933; 3 gi|15641046 Vibrio cholerae; 4 gi|2624376 P. mirabilis; 5 gi|3123078 Synechocystis; 6 gi|15807741 D. radiodurans; 7 gi|7436386 B. subtilis; 8 gi|14521140 Pyrococcus abyssi; 9 gi|15789464 Halobacterium sp., strain NRC-1; 10 gi|10720043 H. felis; 11 gi|15611794 H. pylori, strain J99; 12 gi|2493007 H. pylori, strain 26695; 13 gi|79893 Staphylococcus aureus (CadA); 14 gi|14020985 S. aureus II, 1st domain; 15 S. aureus II, 2nd domain; 16 gi|231677 Bacillus firmus; 17 gi|15616598 Bacillus halodurans; 18 gi|3121832 Listeria monocytogenes; 19 gi|9789448 L. lactis.

These putative zinc- and cadmium-transporting ATPases display a different pattern of amino acid residues at the variable positions in the metal-binding motif. In particular, the presence of negatively charged residues near the metal-binding site may facilitate the binding of a divalent cation. For instance, the conserved Asp residue of the ZntA-like sequences is found in the metal-binding consensus motif x`DCx'''x⁗C (x` = M, L) and can provide a metal-binding carboxylate group. In CadA-like sequences, the conserved residues are x⁗ = N with x` = F, L. Cadmium has a preference for nitrogen and oxygen donor atoms and tetrahedral or octahedral geometries (Cotton and Wilkinson 1990). Among the aligned CadA-like sequences there are no conserved His residues. On the contrary, Asn residues are conserved in various positions and in the consensus motif and may provide a donor atom. Conserved Glu residues may also coordinate the metal.

    DISCUSSION
TOP
ABSTRACT
INTRODUCTION
RESULTS
DISCUSSION
CONCLUSIONS
METHODS
WEB SITE REFERENCES
REFERENCES

Metallochaperones, soluble metal-transport proteins, and the P-type ATPases share a common structural fold, yet salient differences in their structures correlate with their roles in the mechanism of metal trafficking. The conservation of key residues among the families provides a strong basis for modeling using as templates the available high-resolution solution structures. The solution structures of Atx1 and Ccc2a in both Cu-bound and Cu-free forms (Arnesano et al. 2001b; Banci et al. 2001b) reveal features important for rapid and reversible transfer between physiological partners. A structural model for the Atx1-Ccc2a complex has also been determined on the basis of experimental data (Arnesano et al. 2001a). This model provides information on the factors determining the specificity of the interaction. A structural genomics approach applied to two classes of proteins that are the partners in the copper transfer process allows us to explore and model features relevant to metal trafficking across different phyla.

Hydrophobic Core

The clustering of buried residues is structurally important for stabilizing the tertiary fold of these proteins, and it represents a support for the metal-binding region, that is, loops 1 and 5, which are partially exposed to the solvent. Conserved contacts involving buried residues are listed in Tables 3 and 4.

An important difference in the classes of proteins is found for residue at position 73 of the global alignment (residue 65 in loop 5 of yeast Atx1) in eukaryotic metallochaperones, with respect to the corresponding amino acid in the ATPase domains and the small soluble metal-transport proteins. In eukaryotic metallochaperones, this residue is invariably a Lys whose positively charged side chain is very close to the copper site and points toward the solvent (Fig. 8A). In ATPase domains, as well as in bacterial metal-transport proteins, the corresponding position in loop 5 (number 77 and 73, respectively) is occupied by a Phe or a Tyr, with the exception of the third domain of Menkes and Wilson proteins and few other cases. Conserved contacts are found between the aromatic ring of this Phe/Tyr, the side chain of the conserved Met in loop 1, and the second metal binding Cys (in helix alpha 1; Fig. 8B). Compared with Ccc2a (Banci et al. 2001b) and MNK4 (Gitschier et al. 1998), larger structural changes are observed in the Atx1 metal-binding region (loop 1 and loop 5) upon copper release (Arnesano et al. 2001b). Those studies indicate that key structure and functional roles of the conserved Lys in metallochaperones, relative to the Phe/Tyr, may account for this difference.


View larger version (39K):
[in this window]
[in a new window]
 
Figure 8   Conserved contacts between residues in the vicinity of the metal-binding site in eukaryotic metallochaperones (A) and ATPases (B).

The stabilization of the metal-binding site in Cu-Atx1, with respect to the apo form, is mainly attributable to Cu(I) coordination by the sulfur atoms of the cysteines. The positive side chain of Lys 65 in loop 5 leads to neutralization of the overall -1 charge of the Cu(I) bis-thiolate center, thus stabilizing the loop after metal binding and protecting the sulfur atom of the second Cys from solvent (Arnesano et al. 2001b). In the apo form, the first turn at the N-terminus of helix alpha 1 folds back and loop 5 moves away from the copper site as Lys 65 is no longer attracted by the neutral protonated cysteines. The peptide dipoles of this turn also create a more positive potential that stabilizes the negative charge in the Cu(I)-loaded protein.

This trigger mechanism is absent in Ccc2a, in which a Phe replaces the Lys in loop 5. The apo form of the protein in the metal site is preorganized to receive the copper ion (Banci et al. 2001b).

The apparent flexibility of the peptide fold in the metal-binding region, observed in the metallochaperone (i.e., yeast apoAtx1) but not in the ATPase domain (i.e., yeast apoCcc2a), is supported by model analysis for other eukaryotic proteins (Fig. 8A,B). This feature of Atx1 may facilitate metal release to the target. These differences in the proximity of the metal-binding site between metallochaperones and ATPases can be related to their different functional roles in the copper-trafficking pathway.

The picture is somehow different in bacteria, wherein neither soluble metal transporters nor ATPase metal-binding domains contain a Lys in loop 5, and a Tyr or a Phe is found instead. In particular, in almost all bacterial ATPase domains this key position is occupied by a Tyr, at variance with eukaryotes, where a Phe is frequently found.

In the third metal-binding domain of Menkes and Wilson proteins, a Pro is invariably found in loop 5 in place of Phe/Tyr. This does not affect the conformation of the loop in modeled structures but may be relevant to the dynamic properties of these domains. Furthermore, the amino acid sequences of the third domain display less similarity to the other domains, and the computed instability index of the third domain, based on its primary structure, indicates lower stability. This can be either important for a cooperative metal binding of different domains or for relocation of the ATPases to the plasma membrane (Cobine et al. 2000), probably induced by conformational changes.

The Metal-Binding Site

The metal-binding motif conserved in metallochaperones, soluble metal-transport proteins, and metal-transporting ATPases is formed by the sequence x`-x"-C-x'''-x⁗-C that has been used as a search criterion. Residue x` is highly conserved, being a Met in 91% of the proteins found. This side chain is not directly involved in metal ion coordination because it points toward the hydrophobic core of the protein and is involved in hydrophobic interactions, which may stabilize the folding of the metal-binding loop. Position x" shows only a few types of amino acids, usually Ser, Thr, or His. In ATPase domains from eukaryotes, a Thr residue is always at this position except for the third domain of the Menkes and Wilson proteins, where a His residue is found. Residue x''' denotes a truly variable position, whereas x⁗ is quite conserved only within some subgroups. Residue x⁗ is always a Gly in eukaryotic metallochaperones. It is often a His or a Gly in CopZ-like proteins or an Ala or Thr in MerP-like proteins. It is Ser or Ala in eukaryotic ATPases. The large variability observed at position x''' indicates that, in the extreme case, this residue may be deleted or an additional residue may be inserted, producing different consensus motifs for metal binding, CxC and CxxxC, respectively, as observed in other metal tranporters such as SilP, a silver ATPase from Salmonella typhimurium (Gupta et al. 1999) containing CxxxC, or in CopY from E. hirae containing two conserved CxC motifs (Odermatt and Solioz 1995).

Other factors, in addition to the identity of residues in the metal-binding motif, could determine metal selectivity and its influence on protein-protein interactions, as also addressed by experimental works (Lutsenko et al. 1997; Larin et al. 1999; Veglia et al. 2000). One of them could be the size of the binding pocket, which on its turn can be controlled by the tightness of the turn of the binding loop or by electrostatics. Regarding the latter, differences in charge distribution in the vicinity of the metal-binding site among the various subfamilies of structural models are observed. The different pattern of charged residues in MerP-like structures with respect to eukaryotic metallochaperones and CopZ-like metal tranporters may be related to the different charge and type of coordination, for example, of Hg(II) with respect to Cu(I).

The nature of the residues in the turn of the metal-binding loop affects their interactions with the solvent and with other protein residues and can contribute to the turn tightness. Highly conserved residues are present in loop 3, which in almost all the structures are in contact with the residues following the second Cys of the metal-binding motif. A conformational change upon Hg(II) binding has been observed for a Phe residue in loop 3 in MerP from Sh. flexneri (Steele and Opella 1997).

Another important aspect to be considered is the surface interactions that determine the tightness of the turn of the metal-binding loop. Position 42 in metallochaperones and soluble metal transporters and position 44 in soluble domains of ATPases, both located in loop 3, are occupied by a Leu or a Phe in 86% of the sequences and are exposed to the solvent. In almost all structures, these residues are in contact with the amino acid immediately following the second Cys in the metal-binding motif (position 20 in metallochaperones and metal transporters; position 18 in ATPases). It can be proposed that these corresponding positions may be involved in controlling the stability of the metal-binding loop.

Surface Potentials and Protein-Protein Interactions

Residues on the surface of the structural models are less conserved than those forming the hydrophobic core. This is related to the specific function and interaction partner of the protein.

In metallochaperones, soluble metal-transport proteins, and ATPase domains we found different surface charge distributions, even within eukaryotic and bacterial subgroups. Figure 9 shows the surface potentials of a metallochaperone (Atx1 from S. cerevisiae), a soluble metal-transport protein (CopZ from B. subtilis), and two ATPase-soluble domains (Ccc2a from S. cerevisiae and CopAa from B. subtilis).


View larger version (47K):
[in this window]
[in a new window]
 
Figure 9   Surface potentials of S. cerevisiae Atx1 and B. subtilis CopZ structures (A). Surface potentials of S. cerevisiae Ccc2a and B. subtilis CopAa structures (B).

In metallochaperones there are two conserved negative regions, corresponding to strand beta 2 and the N-terminal end of helix alpha 2, and two conserved positive regions, helix alpha 1 and the C-terminal part of helix alpha 2 with loop 5. In particular, positions 25 and 29 (helix alpha 1) and positions 67 and 73 (helix alpha 2 and loop5) are occupied by Lys or Arg residues. On the contrary, in eukaryotic ATPases, several Glu and Asp residues, conserved in positions 22, 72, 74, and 78 with few exceptions, generate a negatively charged face on the protein surface in the proximity of the copper-binding region (Fig. 9). With respect to MerP-like proteins, little is known about the structure of putative partner proteins, but it is interesting to note that the negative regions are more scattered on the surface with respect to Atx1-like structures, whereas positive patches are present on helices alpha 1 and alpha 2, in analogy with metallochaperone sequences.

At variance with eukaryotic metallochaperones, CopZ-like models (bacteria) show a large variability of charged residues. Interestingly, the calculated isoelectric points (pI's) show that the CopZ-like sequences (pI's of 4-5) are more acidic than the Atx1-like chaperones (pI's of 7-8).

It has been suggested that electrostatic forces play a crucial role in the interaction of the two yeast proteins Atx1 and Ccc2a (Portnoy et al. 1999; Wernimont et al. 2000). A model for the complex has been proposed based on experimental data (Arnesano et al. 2001a). Interactions occur on a protein-protein interface that includes the C-x-x-C metal-binding motif. In the model of the complex between Atx1 and Ccc2a, loop 1 and the N-terminus of helix alpha 1 of one protein are in contact with loop 5 and the C-terminus of helix alpha 2 of the partner, and the two helices alpha 1 are tilted ~45° to allow close contact between the two metal-binding regions of the two proteins (Arnesano et al. 2001a). The two pairs of cysteines are facing each other to facilitate a low barrier metal transfer through a series of two- and three-coordinate metal-bridged intermediates (Pufahl et al. 1997; Wernimont et al. 2000). The regions at the interf