|
|
|
|
Vol. 12, Issue 1, 47-56, January 2002
LETTER
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
A systematic computational analysis of protein sequences containing known nuclear domains led to the identification of 28 novel domain families. This represents a 26% increase in the starting set of 107 known nuclear domain families used for the analysis. Most of the novel domains are present in all major eukaryotic lineages, but 3 are species specific. For about 500 of the 1200 proteins that contain these new domains, nuclear localization could be inferred, and for 700, additional features could be predicted. For example, we identified a new domain, likely to have a role downstream of the unfolded protein response; a nematode-specific signalling domain; and a widespread domain, likely to be a noncatalytic homolog of ubiquitin-conjugating enzymes.
| |
INTRODUCTION |
|---|
|
|
|---|
Large proteins are often composed of domains. These are polypeptide
regions that adopt compact three-dimensional (3D)
structures and are often found in diverse molecular contexts (Janin and
Chothia 1985
). The independent evolutionary histories of
domains found within the same protein lead to an assumption that the
domain is the fundamental unit of protein structure and function
(Doolittle 1995
). Domains are most readily observable in
known 3D structures, but because of the relative paucity of available
structural data, the majority of protein domain families have been
identified first by sequence analysis. Many domains are `genetically
mobile', meaning that they can be found associated with different
domain combinations in different proteins. The term `module' is
sometimes used to distinguish between mobile domains and those that are
invariably found in identical molecular contexts.
Sequence characterization of domain families represents a first step
toward the determination of their 3D structures and molecular functions. Domain identification from sequence is usually performed on
a case-by-case basis, by applying a variety of automatic methods supplemented with careful manual analysis. The number of protein domain
families characterized from sequence has been increasing steadily over
the years and has led to the development of Web-based resources such as
SMART and Pfam (Schultz et al. 1998
, Bateman et al. 2000
) for effective
and reliable domain identification.
We have systematically searched for new domain families, using proteins
annotated by the SMART (Simple Modular Architecture Research Tool)
database of domains as our starting point. We have targeted our
strategy to all proteins that contain at least one of 107 types of
predominantly nuclear domains in the SMART collection. Crucial to our
technique is the accurate knowledge of known domain boundaries provided
by databases such as SMART and Pfam (Schultz et al. 1998
, Bateman et
al. 2000
). Using sequence regions not covered by previously
characterized domains, we have searched for homologs in nonredundant
sequence databases and used previously computed domain architectures to
determine which of the initial search regions could correspond to new
domain families. A manual analysis of the various candidate families
led to the final characterization of novel domain types and their
sequence borders.
| |
RESULTS |
|---|
|
|
|---|
Classification of the Novel Domains
The protocol described earlier revealed a variety of novel domains
that could be classified into four broad categories:
| 1. | Fifteen novel domain families with representatives in diverse molecular
contexts in different species (Table 1, Part
A). Of
these, three have recently been described on separate occasions (Table
1, Part A, Callebaut et al. 2001 |
| 2. | Three domain families were found to be specific to single or closely related species (Table 1, Part B). |
| 3. | Seven further domain families are likely to be divergent members of
previously recognized families, with weak (but not statistically significant) similarity to previously described domains. (One of these,
the BED domain, has been recently published independently (Aravind 2000 |
| 4. | Three additional families were recognized as representing family-specific N or C-terminal extensions of previously known domains (Table 1, Part D). These regions always co-occur with a particular neighboring domain, although their domain context within the protein as a whole may vary. Because of their size, they are likely to have well-defined structures, but might only exist in the context of the domain that they extend. In three of these cases, the extension is only found in closely related species. We do not consider these sequence families to be modules, and we do not discuss them further. |
|
Alignments of the novel domains, the proteins they are found in, and their phyletic distribution are publicly available in the SMART database (http://smart.embl-heidelberg.de/).
Of the total 28 regions discovered, 8 were found by simple single-pass
BLAST searches. For the remaining 20, PSI-BLAST searches were necessary to provide statistically
significant links between proteins with different
domain architectures. This is broadly consistent with the reported
three-fold sensitivity of PSI-BLAST over
BLAST (Park et al. 1998
).
Conserved protein domains are most useful when they can be used to make predictions of likely function. For the domains presented here, this was possible to varying degrees. We provide three examples to illustrate the more important categories described earlier, and show the types of (necessarily conjectural) functional information that can be inferred from the present identifications.
A Widespread Module in Diverse Species: A Novel Domain in Peptide N-glycanases and Other Putative Nuclear Proteins
The majority of our novel domains are found in diverse species and in different protein contexts without significant sequence similarity to other domains. A particularly interesting example is described here.
A hypothetical Arabidopsis protein (SpTREMBL accession:
Q9MAT3) is predicted to contain two N-terminal zinc finger motifs (ZnF_C2H2), followed by a UBA domain (Hofmann and Bucher
1996
). A predicted coiled-coil region links this to a
C-terminal half that contains no currently described domains.
PSI-BLAST searches initiated with this C-terminal region
show significant sequence similarity (E-value <10-5) to UBX
domain-containing proteins and metazoan homologs of
peptide:N-glycanases (PNGases).
Searching of preliminary protein predictions from the Plasmodium falciparum genome, with the conserved region identified in an Arabidopsis sequence (SpTREMBL accession no. Q9FKI1), revealed an additional association with a UBCc domain-containing protein (E-value 3 × 10-4).
We refer to these homologous regions as PUG domains, after the
Peptide:N-Glycanases and other putative nuclear
UBA or UBX domain-containing proteins. PNGases
are believed to have a role in the unfolded protein response (UPR)
(Suzuki et al. 2000
). The UPR results in increased levels of
transcription of endoplasmic reticulum (ER)-resident protein-coding
genes, following accumulation of unfolded proteins in the ER. The PUG
domain is found to co-occur in proteins with three domains that are
central to ubiquitin-mediated proteolysis: UBA, in
Arabidopsis, UBCc in Plasmodium, and UBX in mammals
and Arabidopsis. This indicates that PUG domain-containing proteins might link the UPR to ubiquitin-mediated protein degradation. Other links between the UPR and UBQ-mediated proteolysis have been
shown previously (Travers et al. 2000
).
The candidate orthologs of PNGases in Saccharaomyces
cerevisiae, Saccharomyces pombe, and Arabidopsis
do not appear to encode this domain, indicating its presence in these
proteins is a metazoan innovation. Of these putative PNGases, only the
S. cerevisiae protein has been directly characterized; it
appears to be localized to the nucleus, with a lower level occurring in
the cytosol (Suzuki et al. 2000
). As the apparent orthologs in metazoan
genomes appear to be present singly, rather than as multiple paralogs
(which might imply functional variation), it seems likely that the
proteins encoded by them will have a similar cellular localization.
Additional HMMer2 searches, using an HMM derived from these PUG domain
sequences, showed marginal similarity to IRE1p-like kinases (SpTREMBL
accession: Q9SHL6) (E-value: 0.21) within a region
known to be homologous to the C-terminal tail of 2`-5' oligo
(A)-dependent ribonuclease (Zhou et al. 1993
) (see Fig. 1). Although of only marginal significance,
the similarity also extends to cellular function because IRE1p-like
kinases are known to initiate the UPR (Shamu and Walter
1996
). The C-terminal tail of IRE1p is required for
induction of the UPR (Shamu and Walter 1996
), and has been
shown to possess site-specific endoribonuclease activity (Sidrauski and
Walter 1997
). This activity is consistent with the
C-terminal location for RNase activity found in its homolog, 2`-5'
oligo (A)-dependent ribonuclease (Bork and Sander 1993
). Consequently, we tentatively suggest the presence of divergent PUG
domains in the C termini of IRE1p-like kinases.
|
Further analysis of the metazoan PNGase sequences revealed a conserved region that is also present in multiple copies in hypothetical Caenorhabditis elegans proteins (e.g., four copies in C17B7.5). This domain was not found in the initial rounds of searching because it does not occur with any of our starting set of nuclear domains. We have included this domain in the SMART collection and have named it PAW (domain present in PNGases and other worm proteins).
Novel Modules Found in Narrow Phyletic Ranges: A Nematode-Specific Putative Signaling Domain in C. elegans
Lineage-specific expansions of protein domain families (i.e., a
large increase in the number of a particular domain in one genome
compared with other genomes) are a widespread phenomenon (e.g.,
International Human Genome Sequencing Consortium 2001
). In
extreme cases, it may not be possible to establish links between a
domain that is widespread in one organism and known domains seen in
other species. Such cases may represent genuine `invention' of new
domains, or, perhaps more likely, instances where the tempo of
molecular evolution has risen to the extent that sequence similarity with known domains is no longer detectable. Alternative scenarios of
massive loss from other lineages are less parsimonious. Three (i.e.,
~11%) of our new domains appear to occur in very restricted phylogenetic lineages; these exclude species-specific N- or C-terminal extensions of known domains (see Table 1, Part B).
PSI-BLAST searching with the region C-terminal to a SET
domain (Cui et al. 1998
) of the hypothetical protein Y43F11A.5 (SpTREMBL accession: Q9U2G8) detected a novel domain found in many
different predicted proteins from C. elegans but thus far in
no other species. The domain is ~120 residues in length, and found
associated with the catalytic domain of caspases (CASc), protein
kinases of undetermined specificity (STYKc), and the SET methyltransferase domain. Multiple tandem copies of the domain may be
present in the same sequence (Fig. 2). We
named this domain SPK [associated with SET,
PHD (Aasland et al. 1995
), protein Kinase]. The alignment is provided on the Web (see
http://www.embl-heidelberg.de/~doerks/alignment_fig3.html/)
|
Further analysis of nucleic acid sequence databases revealed SPK domains in the Caenorhabditis briggsae sequence, in regions for which no proteins have been predicted (e.g., NCBI GI:11095060, data not shown). No other species were found to contain the domain. It is possible that the domain exists in nematode lineages other than Caenorhabditis, but is simply not found due to insufficient sequence coverage of these species.
The association of SPK with SET, PHD, catalytic protein kinases or caspase domains (see Fig. 2) hints at an important role in metabolic, developmental, or evolutionary processes that are unique to Caenorhabditis. However, none of the putative proteins in which the domain has been found have been characterized by any experimental technique other than RNAi screening. All homologs tested by RNAi are wild type according to wormbase (http://www.wormbase.org/). This technique would not be expected to reveal more subtle phenotypes associated with later developmental stages.
Modules in New Contexts: A Noncatalytic Subfamily of Ubiquitin-Conjugating Enzyme Homologs
The protocol presented here detects regions of homology between
sequences where no domains have previously been assigned. Some of our
newly identified regions appear to be distantly related to known
domains, but correspond to new molecular contexts. Such cases indicate
potential changes of domain function or add new insights to the
function of the proteins in which the domain has been newly identified.
An increasing number of known domains are being realized as members of
wider superfamilies because of the availability of 3D structures. For
example, the UBX domain has recently been reclassified as a subfamily
of the ubiquitin fold superfamily (Buchberger et al. 2001
). In addition
to protein structure determination, carefully applied sensitive
sequence searching methods can also provide such insights. This is
exemplified by the following example detected in this study.
The mouse GCN2 eIF2
kinase and histidyl-tRNA synthetase (SpTREMBL
accession: Q9QZ05) is an essential component of translation control
(Jentsch et al. 1991
; Sattleger et al. 1998
). A PSI-BLAST search initiated with the region N-terminal to an inactive protein kinase domain (see Fig. 3) in the GCN2
protein revealed significant similarity to presumed orthologs in other
eukaryotic species from yeast to vertebrates. Further
PSI-BLAST iterations and additional HMM searches reveal
significant similarity to WD-repeat-containing proteins; yeast DEAD
(DEXD)-like helicases; UPF0029, an uncharacterized protein family from
the Pfam database (accession no. PF01205); a range of hypothetical
proteins; and many RING finger-containing proteins. We called the newly
defined region RWD after the better characterized RING
finger and WD-domain-containing-proteins and DEAD-like helicases. PSI-BLAST searches
initiated with different seeds also revealed homology with
ubiquitin-conjugating enzymes (UBCc) domain, (e.g. SpTREMBL acc: Q94721
hits Q9SDY5 on iteration 3, E
value = 9 × 10-4), although the catalytic cysteine
critical for ubiquitin-conjugating activity is not conserved in most
members of the novel subfamily (see
http://www.embl-heidelberg.de/~doerks/aligment_fig4.html/)). This
observation is particularly interesting in light of previous experimental studies on A07 (SpTREMBL accession: Q9QZR0), a protein that includes both an RWD and a RING finger domain, that have shown
that a region between 85 and 363 amino acids in A07 (including the RING
finger) binds ubiquitin-conjugating enzyme E2 and acts as a substrate
for E2-dependent ubiquitination (Lorick et al. 1999
).
|
Predictions of Function
On the basis of reports in the literature and/or co-occurrence with previously identified domains, some functional features can be predicted for 78.6% of our newly identified set of 28 domain families. This represents an increase in the state of functional prediction for ~700 proteins (i.e., the total number of distinct proteins that are covered by novel domains with a putative function; see Table 1, Parts A-D). The predicted functions represent a variety of different cellular processes and molecular functions such as DNA/RNA- or metal-binding protein-protein interactions.
Five further cases of function prediction are outlined as follows.
Chromatin-Binding Domains
The CSZ domain-containing protein SPT6 and orthologs regulate transcription through establishment or maintenance of chromatin structure (Chiang et al. 1996Protein Interaction Domains
Recent studies reveal the interaction of the RPR domain in protein pcf11 with the C-terminal domain of the largest subunit of RNA polymerase II (Yuryev et al. 1996Predicted Localization of the Novel Domains
Context can also be used to predict whether a novel domain is associated with a certain cellular localization. For example, some of our novel domains are only found with representatives from our initial set of predominantly nuclear domains (i.e., those used to seed the searching procedure). This logic indicates a putative nuclear function and role for 10 of the domain families presented here, representing ~500 proteins. Others among the novel domain families are likely to have roles in both nucleus and cytoplasm.Novel Domains Related to Human Diseases
Four (14%) of the newly discovered domain families and one of the family-specific domain extensions occur in proteins whose deficiencies are implicated in severe human diseases. The respective genes or chromosomal regions are known to be responsible for cancer, neurodegenerative processes, or chromosomal aberrations (Table 2).
|
| |
DISCUSSION |
|---|
|
|
|---|
Some well-characterized signaling domains, such as SH2 or PH, are present in a huge number of proteins and occur in combination with a large number of other domains. The fact that they are so widespread no doubt facilitated their early detection and characterization. Perhaps unsurprisingly, the domains found in the present analysis have more limited distributions than examples such as those. Even so, each new domain is found, on average, in 4.0 different architectures in ~30 proteins. More widespread domains have been detected by our approach [e.g., the BRK domain occurs in more than seven different settings and a total of 30 proteins (Table 1, Part A)].
Only three (11%) of the newly discovered domains are species specific; of these, two are limited to plants and one is nematode specific (Table 1, Part B). This could simply reflect the fact that even when species-specific pathways exist, proteins involved in them are likely to be recruited from preexisting components. Alternatively, species-specific domains may more likely be found only with other species-specific domains, rather than with domains found in a large phyletic range, and so would be underrepresented in the results of the search methods applied here.
In general, we cannot answer the question of whether the domains presented here have distant homologs that are not detectable using present methods (in common with any other new domain discovery report). The general evolutionary principle of reuse of preexisting components indicates that this is likely. However, we believe that, even if this is the case, the domains presented here, by dint of considerable sequence variation, are likely to have acquired new biological functions that are worthy of independent investigation.
In conclusion, we have identified a total of 28 novel domain families,
4 of which have been independently reported in the recent literature.
Some of the domains are likely to be found in proteins localized to the
nucleus. The predicted functions range from enzymatic activities to
nucleotide binding. The systematic search for novel domains led to a
26% increase over the known nuclear domains that have been discovered
in the last 15 yr, when the C2H2 zinc finger was first described
(Miller et al. 1985
).
The novel domains were all detectable using standard search methods (i.e., PSI-BLAST), within default E-value thresholds. The novelty of our approach has been to search using all candidate sequences that could contain a new domain of interest. In contrast, it would appear from our results that only using well-characterized sequences to search prevents the detection of some domains.
Although the majority of domains reported here are present in a wide variety of species, indicating that they have crucial biological roles, they are, on average, present in fewer proteins than previously reported domains. Taken together with the increasing volumes of data being produced by genome projects, targeted approaches to domain detection, such as those presented here, must have a role in enumerating the evolutionarily conserved components required for life.
| |
METHODS |
|---|
|
|
|---|
Definition of Nuclear Domains
A subset of SMART database families represents domains often found
in nuclear proteins, as defined by annotation in sequence databases
(Schultz et al. 2000
). The computer program,
Meta-A(nnotator) (Eisenhaber and Bork 1998
),
which assigns protein localizations based on Swiss-Prot
annotations, was used to predict the most likely localization for a
domain family. A domain family was included in this analysis if more
than 80% of Swiss-Prot entries of proteins containing the
domain were annotated by Meta-A as nuclear. By this
method, 86 domains were assigned a nuclear location. Eleven suspected
false positives were removed following literature searches, and an
additional 32 signaling domains with partial nuclear localization were
added when literature searches could confirm this assignment.
Thus, a set of 107 predominantly nuclear domain families was derived (see http://www.embl-heidelberg.de/~doerks/nuclear_subset.html/). Many domains, such as those with RNA-binding functions, are found in proteins that translocate between the cytoplasm and the nucleus or are found in both cytoplasmic and nuclear proteins. Consequently, some of these `nuclear' domain families may contain cytoplasmic protein representatives. However, according to our protocol, based on Swiss-Prot annotations, the majority of proteins containing these domains will possess a significant population in the nucleus.
Automatic Screening for New Domains
All proteins containing one or more domains represented in the
nuclear subset were extracted from public sequence databases, and their
complete domain structure characterized using SMART. Regions not
annotated using known SMART domain models were extracted, along with
their domain context (i.e., position in the protein relative to other
domains). Interdomain sequences shorter than 30 amino acids were
regarded as less likely to represent novel globular domains (although
such short domains do exist) and discarded. Noncontiguous regions of
the same sequence were analyzed independently of each other. All of
these sequence regions were then clustered into groups using the
grouper program of the SEALS package with a
default single linkage clustering threshold of 50 bits (Walker and
Koonin 1997
). The longest member of each of these groups
was filtered for coiled-coil and low complexity sequences (Lupas et al.
1991
; Wootton and Federhen 1996
) and then used to search a
nonredundant sequence database, using the iterative search algorithm
PSI-BLAST (Altschul et al. 1997
), with an E-value inclusion threshold of E<0.001. Eight search rounds were
performed, unless the database searching procedure converged in a prior
iteration (see Altschul et al. 1997
for details of the
PSI-BLAST procedure). The domain organizations of all
homologs identified by PSI-BLAST searches were retrieved
from the precalculated SMART database. The homologous regions
identified in the searches were considered as the candidate domain
family. Candidate regions that were found in different domain contexts
(see following) in different proteins indicated a possible novel module
family. These families were analyzed further using the methods
described as follows.
Manual Confirmation and Refinement of Predicted Domains
To be considered as a module (i.e., a genetically mobile domain), homologous sequences were required to be present in at least two diverse molecular contexts (`domain architectures'). Domain architectures (i.e., the linear arrangement of domains within a protein) were predicted using the SMART and Pfam databases. When a sequence contained no predicted domain other than that of the candidate family, this, too, was regarded as a distinct architecture. When a sequence invariably occurred either N- or C-terminal to a single known domain, it was regarded as an extension of the known domain.
Inaccurate prediction of gene structure (i.e., artificial fusion of adjacent genes) might lead to new domain architectures being counted spuriously. Domain architectures were inspected manually for such apparently erroneous fusions; for example, protein sequences containing both nuclear and extracellular domains were excluded. Similarly, a sequence was discarded if it had no homologs of similar domain architecture, but instead was similar to several pairs of nonhomologous proteins and each pair corresponded to the presumed erroneously fused gene.
At this stage, multiple alignments were generated (Thompson et al 1994
)
for all candidate domains. In conjunction with known locations of
domains and other sequence features (e.g., N and C termini,
transmembrane regions), these were used to define the borders of the
putative new domains. In 10 cases, HMM-based searches of databases
using HMMer2 (Eddy 1998
) were needed to detect additional family
members. The results were checked manually for consistency, with
respect to amino acid conservation and phyletic distribution, to
exclude false positives, which would be expected from our 10 HMM
searches, given the E-value threshold of 0.1. Newly detected sequences were incorporated into the alignment, and the search procedure iterated. When these further analyses led to the
identification of distant, but significant, similarity to annotated
Pfam or SMART domains, the candidate domain was not pursued further. In
cases in which we were unable to connect a family to a known domain with significant sequence similarity, but in which hits with marginal similarity were present, we recorded the family as representing possible divergent members of previously known protein domain families.
| |
ACKNOWLEDGMENTS |
|---|
We thank the scientists and funding agencies comprising the international Malaria Genome Project for making sequence data from the genome of Plasmedium falciparum (3D7) public prior to publication of the completed sequence. The Sanger Centre (UK) provided sequence for chromosomes 1, 3-9, and 13, with financial support from the Wellcome Trust. A consortium composed of The Institute for Genome Research, along with the Naval Medical Research Center (USA), sequenced chromosomes 2, 10, 11, and 14, with support from NIAID/NIH, the Burroughs Wellcome Fund, and the Department of Defense. The Stanford Genome Technology Center (USA) sequenced chromosome 12, with support from the Burroughs Wellcome Fund. The Plasmodium Genome Database is a collaborative effort of investigators at the University of Pennsylvania (USA) and Monash University (Melbourne, Australia), supported by the Burroughs Wellcome Fund
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
4 These authors contributed equally to this work.
5 Corresponding author.
E-MAIL doerks{at}embl-heidelberg.de; FAX 49 622 1517.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.203201.
Abbreviations in bold refer to domains that can be found in the SMART database: http://smart.embl-heidelberg.de/
| |
REFERENCES |
|---|
|
|
|---|
.
Trends Biochem. Sci.
26:
7-9[CrossRef][Medline].
An automatic mail server for protein secondary structure prediction.
Comput. Appl. Biosci.
10:
53-60
kinase and histidyl-tRNA synthetase-related domains required for general amino acid control.
J. Biol. Chem.
273:
20404-20416Received June 29, 2001; accepted in revised form October 16, 2001.
This article has been cited by other articles:
![]() |
A. Auger, L. Galarneau, M. Altaf, A. Nourani, Y. Doyon, R. T. Utley, D. Cronier, S. Allard, and J. Cote Eaf1 Is the Platform for NuA4 Molecular Assembly That Evolutionarily Links Chromatin Acetylation to ATP-Dependent Exchange of Histone H2A Variants Mol. Cell. Biol., April 1, 2008; 28(7): 2257 - 2270. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. W. Trotter, H.-Y. Fan, M. L. Ivey, R. E. Kingston, and T. K. Archer The HSA Domain of BRG1 Mediates Critical Interactions Required for Glucocorticoid Receptor-Dependent Transcriptional Activation In Vivo Mol. Cell. Biol., February 15, 2008; 28(4): 1413 - 1426. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Commisso and G. L. Boulianne The NHR1 Domain of Neuralized Binds Delta and Mediates Delta Trafficking and Notch Signaling Mol. Biol. Cell, January 1, 2007; 18(1): 1 - 13. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Allen, A. Buchberger, and M. Bycroft The PUB Domain Functions as a p97 Binding Module in Human Peptide N-Glycanase J. Biol. Chem., September 1, 2006; 281(35): 25502 - 25508. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Katiyar, S. Joshi, and W. J. Lennarz The Retrotranslocation Protein Derlin-1 Binds Peptide:N-Glycanase to the Endoplasmic Reticulum Mol. Biol. Cell, October 1, 2005; 16(10): 4584 - 4594. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Balaji, M. M. Babu, L. M. Iyer, and L. Aravind Discovery of the principal specific transcription factors of Apicomplexa and their implication for the evolution of the AP2-integrase DNA binding domains Nucleic Acids Res., July 21, 2005; 33(13): 3994 - 4006. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Alpha-Bazin, A. Lorphelin, N. Nozerand, G. Charier, C. Marchetti, F. Berenguer, J. Couprie, B. Gilquin, S. Zinn-Justin, and E. Quemeneur Boundaries and physical characterization of a new domain shared between mammalian 53BP1 and yeast Rad9 checkpoint proteins Protein Sci., July 1, 2005; 14(7): 1827 - 1839. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ishikawa, S. Azuma, S. Ikawa, K. Semba, and J.-i. Inoue Identification of DRG family regulatory proteins (DFRPs): specific regulation of DRG1 and DRG2 Genes Cells, February 1, 2005; 10(2): 139 - 150. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Stone, H. Hauksdottir, A. Troy, J. Herschleb, E. Kraft, and J. Callis Functional Analysis of the RING-Type Ubiquitin Ligase Family of Arabidopsis Plant Physiology, January 1, 2005; 137(1): 13 - 30. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Katiyar, G. Li, and W. J. Lennarz A complex between peptide:N-glycanase and two proteasome-linked proteins suggests a mechanism for the degradation of misfolded glycoproteins PNAS, September 21, 2004; 101(38): 13774 - 13779. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nameki, M. Yoneyama, S. Koshiba, N. Tochio, M. Inoue, E. Seki, T. Matsuda, Y. Tomo, T. Harada, K. Saito, et al. Solution structure of the RWD domain of the mouse GCN2 protein Protein Sci., August 1, 2004; 13(8): 2089 - 2100. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Marin, J. I. Lucas, A.-C. Gradilla, and A. Ferrus Parkin and relatives: the RBR family of ubiquitin ligases Physiol Genomics, May 19, 2004; 17(3): 253 - 263. [Abstract] [Full Text] [PDF] |
||||
|
|