|
|
|
Published online before print
November 21, 2000, 10.1101/gr.GR-1463R
Vol. 10, Issue 12, 1996-2005, December 2000
LETTER
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
A random sequence survey of the genome of Trypanosoma cruzi, the agent of Chagas disease, was performed and 11,459 genomic sequences were obtained, resulting in ~4.3 Mb of readable sequences or ~10% of the parasite haploid genome. The estimated total GC content was 50.9%, with a high representation of A and T di- and trinucleotide repeats. Out of the estimated 5000 parasite genes, 947 putative new genes were identified. Another 1723 sequences corresponded to genes detected previously in T. cruzi through expression sequence tag analysis. 7735 sequences had no matches in the database, but the presence of open reading frames that passed Fickett's test suggests that some might contain coding DNA. The survey was highly redundant, with ~35% of the sequences included in a few large sequence families. Some of them code for protein families present in dozens of copies, including proteins essential for parasite survival and retrotransposons. Other sequence families include repetitive DNA present in thousands of copies per haploid genome. Some families in the latter group are new, parasite-specific, repetitive DNAs. These results suggest that T. cruzi could constitute an interesting model to analyze gene and genome evolution due to its plasticity in terms of sequence amplification and divergence. Additional information can be found at http://www.iib.unsam.edu.ar/tcruzi.gss.html.
[The sequence data described in this paper have been submitted to the dbGSS database under the following GenBank accession nos.: AQ443439-AQ443513, AQ443743-AQ445667, AQ902981-AQ911366, AZ049857-AZ051184, and AZ302116-AZ302563.]
| |
INTRODUCTION |
|---|
|
|
|---|
Protozoan parasites from the order Kinetoplastida
are the causative agents of widespread diseases in humans as well as of considerable economic loss through infection of domestic animals and
wildlife. Among them are Trypanosoma cruzi and
Trypanosoma brucei, the agents of Chagas disease in the
Americas and sleeping sickness in Africa, respectively, and
Leishmania spp., causing a variety of pathologies in humans.
In addition to their medical importance, these parasites were the
source of discoveries of fundamental cellular and molecular phenomena
such as RNA editing (Stuart et al. 1997
; Estevez and Simpson 1999
),
mRNA trans-splicing (Nilsen 1995
; Lee and Van der Ploeg 1997
),
glycosylphosphatidyl-inositol anchoring of proteins (Krakow et al.
1986
), and antigenic variation (Rudenko et al. 1998
), among others.
Kinetoplastids are also interesting evolutionarily because they
represent one of the earliest eukaryotic organisms that diverged from
the ancestor of the main eukaryotic branch.
Initial studies on the genome structure of these organisms were
hampered by the difficulty of applying classical genetic tools. As is
the case with yeast, chromosomes do not condense during mitosis and
cannot be visualized directly with color stains. In spite of these
drawbacks several recent studies have revealed a highly plastic genome
with an unusual gene organization (Zingales et al. 1997
; Network 1998
;
Ersfeld et al. 1999
). In the case of T. cruzi it has been
shown that there is a large chromosomal size variation between strains
that can also be observed between pairs of homologous chromosomes
(Henriksson et al. 1995
).
Many genes in kinetoplastids, including housekeeping genes, are present
in multiple copies, either clustered in tandems or distributed in
different chromosomes (Campetella et al. 1992b
; El-Sayed and Donelson
1997
). Recently, the complete sequence of chromosome 1 from L. major Friedlin (Myler et al. 1999
) and a partial sequence of
chromosome 3 from T. cruzi (Andersson et al. 1998
) were
obtained, showing in both cases a similar organization with two
clusters of genes per chromosome transcribed in opposite directions.
These studies also showed that the gene density is high in the regions
where genes are clustered, with ~1 gene every 3.6 Kb in both cases.
These and other unusual characteristics in genome structure and
organization make these parasites an interesting field of study to
further understand eukaryote gene and genome evolution.
As part of the parasite genome projects launched by the Tropical
Disease Program of the World Health Organization, we and others have
performed expressed sequence tag (EST) analysis of the T. cruzi genome as a means for rapid gene finding (Verdún et al.
1998
; Porcel et al. 2000
). As a first step toward obtaining the
complete sequence of the clone selected for the T. cruzi
genome project (CL-Brener clone) we have now made a random genome
survey of
10% (4.3 Mb) of the haploid genome of the parasite. The
results obtained allowed us to make a general characterization of the T. cruzi genome, identify putative new genes, and define large gene families and repetitive sequence families present in the parasite
genome, including a novel repetitive element and uncharacterized abundant sequences.
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
Overall Structure of the T. cruzi Genome
A random genomic library of T. cruzi DNA was constructed
and used to produce 11,459 reads with an average length of 374 bp after
vector removal and quality clipping. These GSS (Genomic Sequence
Survey) sequences represent 4.3 Mb of readable sequences or ~10% of
the parasite haploid genome, which is ~40 Mb (Frohme et al. 1998
).
The total GC content of the sequence produced was 50.9%, a value that
is slightly larger than the one obtained from estimations made on 8796 T. cruzi ESTs (49.6 %), and the one obtained for 93.4 Kb from
chromosome 3 (48.5%) (Andersson et al. 1998
). The fractional GC
content for all the GSSs varied from 0.18-0.71, with ~69% of the
sequences falling in the 0.47-0.57 range.
The frequency of di- and trinucleotide repeats calculated for all the
GSS sequences (11,459 GSS) is shown in Figure
1. T. cruzi has levels of CpG,
TpG, and CpA dinucleotides which are near their expected values, unlike
mammalian or vertebrate genomes that show a suppression of the CpG
dinucleotide and increased levels of the TpG and CpA dinucleotides
(Regev et al. 1998
). The TpA dinucleotide, however, is suppressed in
T. cruzi, with an observed/expected ratio of 0.54. This
dinucleotide is also suppressed in mammalian genomes (Smith and
Waterman 1983
), and in Fugu the pufferfish (Elgar et al.
1999
), although the reason for this is not known. Also, the abundance
of (A)n and (T)n di- and trinucleotides is remarkable. The frequency of
these di- and trinucleotides is greater than the observed frequency for
the CpA dinucleotide, the most common dinucleotide in vertebrates. When
the same analysis was done for the partial sequence of chromosome 3, we
obtained similar values, except for the (A)n and (T)n di- and
tri-nucleotides. These also represent the most frequent species, but
their observed/expected ratios are higher. In our survey the (A)n
dinucleotide had a 0.13 obs/exp ratio, whereas the partial sequence of
chromosome 3 had a 0.4 ratio. For the (A)n trinucleotide these values
are higher: a 0.67 ratio in our survey and a 1.43 ratio in the partial
sequence of chromosome 3.
|
Coding Content of the GSS Sequences
We compared the sequences in our data set to a protein nonredundant
database using BLASTX (Altschul et al. 1997
). From this
analysis, 3724 GSS (32.5%) showed significant (E<1e-5) similarities
to sequences present in the public database (GenBank release 115)
(Table 1); 2778 GSS (24.2%) matched
T. cruzi sequences while 947 GSS (8.3%) matched non-T.
cruzi sequences. 67.5% did not match any sequence in the database.
When we compared each GSS to the dbEST database, only 1723 (15%) of
the GSSs gave significant matches and 96% of these were matches to
T. cruzi ESTs. The low hit rate against ESTs could be
explained in part by the low sequence coverage attained in the EST
sequencing, which only covered one of the four main life cycle stages
of the parasite. Thus, the coding content of our survey could be
considered to be 15% based on the number of matches against dbEST or
32.5% based on the number of matches against proteins in nonredundant
databases. However, this does not include sequences absent from the
databases.
|
To detect putative coding sequences using a different criteria, we
first searched our sequences for open reading frames (ORFs) > 300
bp (defined as sequences without a stop codon and no requirement for
start codons). We then evaluated which of these resulting sequences
could be regarded as coding using the testcode algorithm developed by
Fickett (1982)
, which measures the positional randomness of a sequence,
and is independent of the reading frame. About 83% of the sequences
(9520 GSSs) had ORFs with the mentioned requirements, and 23% of these
(2222 GSSs) were found to be potentially coding by the testcode
algorithm. Using this figure as the minimum number of coding sequences
in our survey, we can estimate the number of genes for T. cruzi to be about 5000 per haploid genome (considering an average
gene size of 1500 bp and a haploid genome size of 40 Mbp).
Identification of Putative New Genes
Based on the BLASTX analysis, which is summarized in
Table 1, we detected 947 putative new genes, which are those positive
matches against a non-T. cruzi protein in the nonredundant database. The best 50 matches are shown in Table
2 (detailed information on T. cruzi
GSS sequences can be found at
http://www.iib.unsam.edu.ar/tcruzi/gss.html).
|
Among the new genes found there is a GSS (GSSTc12036) with similarity
to N-myristoyl transferases (Nmt). Nmt is only found in eukaryotic
cells and transfers fatty acid myristate from myristoyl-CoA to the
amino-terminal glycine of substrate proteins (Russell Johnson et al.
1994
). Genetic and biochemical studies have established Nmt as a target
for the development of a new class of fungicidal drugs, and the
structure of Nmt from two lower eukaryotes, namely Saccharomyces and Candida has been solved (Weston et
al. 1998
; Bhatnagar et al. 1999
). In trypanosomes protein
N-myristoylation has not yet been demonstrated, whereas it has been
shown that these parasites can do S-myristoylation of proteins (Armah
and Mensa-Wilmot 1999
). Other interesting findings were two GSS
homologous to proteins involved in chromatin remodeling. GSSTc788 is
homologous to the Drosophila ISWI protein, which is part of
several ATP-dependent chromatin remodeling complexes such as NURF
(Nucleosome remodeling factor),
CHRAC (chromatin accessibility
complex) and ACF (ATP-utilizing chromatin assembly and remodeling factor)
(Muchardt and Yaniv 1999
), whereas GSSTc11568 is homologous to several
histone deacetylases. Chromatin remodeling is a mechanism of
transcriptional regulation that has been demonstrated in many
eukaryotes including yeast, which as trypanosomes does not show
chromosome condensation during its cell cycle. Another GSS (GSSTc12012)
had homology with a T. brucei VSG expression site-associated
protein precursor (ESAG-2), which is a member of a large gene family
that includes nonfunctional genes (Kooter et al. 1988
). Also, several
GSSs with homology to proteins having RNA binding domains were
identified, including one clone (GSSTc11533) that showed homology to
the RNA binding domain present in the developmentally regulated
proteins p37 and p34 from T. brucei (Zhang et al. 1998
).
Identification of Large Gene Families
To identify large gene families, GSS sequences were clustered using
the PHRAP program to assemble contigs. Using this method
we were able to group 7883 reads in 2091 contigs; the other 3576 were
singlets (reads having no nonvector match to any other read). This
means that our survey contains 5667 unique sequences, according to our
clustering method, and thus a redundancy of at least ~50%. Further
clustering is possible, however, because the results from
BLASTX contained several GSS that belong to different
contigs showing matches to the same sequences in the database. To
estimate the total number of GSS that belong to a given gene family, we
used the consensus sequence from each contig to search a local T. cruzi GSS database using BLASTN. Based on this
analysis we were able to delineate abundant sequences in our survey,
which are the main contributors to the observed redundancy. The most
abundant gene families in the T. cruzi genome are summarized
in Table 3.
|
Among the largest families identified is the superfamily of T. cruzi antigens, also known as trans-sialidase-like molecules (632 copies per haploid genome). Their members have a number of different
activities, most of them involved in the host-parasite interaction
(Frasch 2000
). Other sequences already known to conform large gene
families in T. cruzi identified through this screening were
the cysteine proteinase cruzipain (Campetella et al. 1992a
), dgf-1, a
large protein of unknown function (Wincker et al. 1992
), and the
parasite mucins (Di Noia et al. 1998
). In all of these cases, the
estimated number of copies agreed well with experimental data, showing
that our sampling can be considered a good representation of the whole
genome. A considerable number of GSS gave matches to proteins known to
be part of retrotransposons: reverse transcriptase, apurinic/apyrimidinic endonuclease (AP-nuclease) and homologs of the
gag protein present in retroviral and LTR-retrotransposons. These proteins most probably correspond to the L1Tc non-LTR
retrotransposon (Martín et al. 1995
; Olivares et al. 1997
), the
most abundant retrotransposable element found in our survey (250 GSS).
However, they could also come from other elements since 40 GSS with
homology to the T. brucei INGI retrotransposon and 11 GSS with
homology to an Anopheles retrotransposon were detected when we
performed a search in a database of repetitive elements (see below).
Some GSS contigs failed to show similarity to any known protein in the nonredundant databases (BLASTX), and also failed to show homology to any known DNA sequence (BLASTN) in the nonredundant database. To see if these unknown sequences were part of uncharacterized large gene families, we used the consensus sequence from each of these contigs to search a database made from all the 2091 consensus sequences of the contigs. We then grouped together contigs showing significant (E < 1e-10) homology as reported by BLASTN. None of the homologous contigs found, nor any GSS that formed those contigs, had significant matches in the databases. Out of 1050 contigs analyzed (2562 GSSs, 22.3% of the survey), the first 10 representing the most abundant unknown sequences in the T. cruzi genome are included in Table 3B.
Identification of Repetitive DNA Families
The program REPEATMASKER was used to search for simple repeats (1-6 bp), low-complexity regions, and known repetitive DNA. Using the database of simple repeats from RepBase (Genetic Information Research Institute, http://www.girinst.org/, Release 5.02) the program found 1556 GSS (13.5%) containing 1091 matches to 62 different simple repeats comprising 46,965 bp (0.74%) and 1001 matches to low-complexity regions comprising 56,455 bp (0.89%). The most abundant simple repeats are shown in Figure 2A. As was observed in the analysis of di- and trinucleotide repeats, the most abundant simple repeat in T. cruzi is (T)n and its complement. Interestingly, (TA)n and (TAA)n appear to be the second two most abundant repeats in our survey. In agreement, the most abundant low-complexity regions, as reported by REPEATMASKER are AT-rich, T-rich and A-rich regions, which together represent >85% of the low complexity regions in Figure 2B. Taken together, this analysis and the data on frequencies of di- and tri-nucleotide repeats suggest that, although suppressed in the whole survey, (TA)n and (TAA)n are abundant in a subset of the GSS. This subset most probably includes GSS corresponding to intergenic regions that are rich in A and T in T. cruzi.
|
To search for repetitive elements, we used the invertebrate database of
RepBase that, although not specific for trypanosomatids, contains
repetitive elements from these organisms. The program found 3523 matches to 13 different elements comprising 629,831 bp (14.7%) in 3290 GSS (29%). The great majority of these elements were from T. cruzi or T. brucei with the exception of a repetition from
T. borreli (TBRP1) which had five matches and a
retrotransposon from Anopheles gambiae (RT1) which had 11 matches. As shown in Table 3C, the most abundant element in the T. cruzi genome is the 195-bp minichromosomal repetitive element
(González et al. 1984
). Also included in Table 3C are repetitive
elements that were described in T. cruzi but were absent from
RepBase. This is the case of telomere-associated sequences, TRBSEQA
(Requena et al. 1992
), HCR6 (de Mendonça-Lima and Traub-Cseko
1991
), and the spliced leader gene for which we detected 27, 31, 9, and
12 GSS, respectively, using BLASTN (E < 1e-40). We
did not find any match to primate or mammalian repetitive elements when we searched the respective RepBase libraries.
TcIRE, A New T. cruzi Repetitive Element
A new repetitive element apparently dispersed in the T. cruzi genome was found during the analysis of the GSS contigs. This element, which was named TcIRE (for T. cruzi Interspersed Repeated Element) was found in 44 contigs (266 GSS) representing almost 3% of the genome with 1664 copies per haploid genome. The structure of TcIRE, depicted in Figure 3, shows a central conserved core flanked by less conserved regions. The BLASTN analysis showed that the 3' region of TcIRE is similar in some cases to the last 70 bp of the 3'-UTRs of some mucin genes. This results in two different groups of TcIRE elements that differ in their 3' regions. As can be seen in the alignment shown in Figure 3B, the two groups start to differ after the conserved region. The 5' region, although less conserved, appears to be shared by both groups.
|
One copy of TcIRE is present in an intergenic region of a recently
sequenced T. cruzi cosmid containing enzymes for the synthesis of pyrimidines (Gao et al. 1999
). TcIRE is located between the OMPDCase-OPRTase gene and a surface antigen gene from the
transsialidase superfamily. This intergenic region is ~4.5 Kb long,
and the TcIRE copy is located past the center (~13,480-13,960 bp)
of this region. For means of comparison, we found only three GSS
showing significant homology with the OMPDCase-OPRTase gene. Southern
blots from several strains and clones of T. cruzi and from
Leishmania were probed with a 300-bp fragment amplified from
clone GSSTc11311 corresponding to the most conserved region of TcIRE.
This analysis showed that TcIRE is present in high copy numbers in all
the strains and clones tested (Fig. 4).
Conversely, it seems to be absent in Leishmania, as we did not
get any hybridization signal even under low-stringency conditions (data
not shown). Hybridization of chromosomes separated by PFGE using the
same probe showed that TcIRE might be present in several chromosomes,
although the region of hybridization corresponds to poorly resolved
chromosomes of high molecular weight. When the TcIRE consensus sequence
was used to search the T. cruzi EST database using
BLASTN, it showed significant (E < 1e-40)
homology with eight ESTs. None of these ESTs, however, had homologs in
nonredundant databases. Furthermore, no significant matches were
obtained when these ESTs or the TcIRE consensus sequence were used to
search the Pfam database of multiple alignments using HMMER.
|
When the same fragment from clone GSSTc11311 was used to probe a Northern blot made from epimastigote total RNA, no signal was found (data not shown), which suggests that the ESTs detected might come from a genomic contamination of the original library used to sequence the ESTs. This data suggest that TcIRE does not code for a protein product and can be considered part of the T. cruzi repetitive DNA.
| |
METHODS |
|---|
|
|
|---|
Genomic Library
A random genomic library from T. cruzi strain CL Brener
was constructed in the plasmid vector pBS(
) (Stratagene). DNA was prepared by using the proteinase K-phenol extraction method (Sambrook et al. 1989
) and mechanically sheared by using a nebulizer. After treatment with Bal31 nuclease, phenol extraction, and ethanol precipitation, the DNA was blunt-ended with T4 DNA polymerase. Fragments were size-fractionated by agarose gel electrophoresis and the
range between 1.4-2.1 Kb were recovered and cloned into the
dephosphorylated HincII site of the vector.
Nucleotide Sequencing
Fresh plated colonies were grown at 37°C in Terrific broth
containing 100 µg/mL ampicillin in 96 deep-well plates at 350 rpm. The bacterial cultures were grown in two steps, first in 0.2 mL for
14-16 h; afterward 0.8 mL of medium was added and grown overnight. The
template DNA for the sequencing reaction was prepared by a modified
alkaline lysis method (Sambrook et al. 1989
) using a 96-well format,
followed by a purification step with Wizard Midipreps DNA Purification
Resin (Promega) and 96-well MultiScreen plates with glass fiber
membrane (Millipore).
The amount of isolated DNA template was estimated on 1.0% agarose gel by comparison to serial dilutions of pBluescript II KS(+) (Stratagene). Sequencing reactions were performed in a Genius (Techne) or an Uno II (Biometra) thermal cycler by using BigDye Terminator Cycle Sequencing Ready Reaction kits, with AmpliTaq DNA Polymerase (FS enzyme) (PE Biosystems) following the protocols supplied by the manufacturer, and analyzed in an ABI prism 377 (PE Biosystems). Single-pass sequencing was performed on each template using T7 or T3 primer. Bases were called by PHRED, and edited to remove vector sequence (as detected by CROSSMATCH) from the 5' end and unreliable data from the 3' end of sequences. Sequences longer than 100 bases were further analyzed.
Sequence Analysis
The sequences were compared against the NCBI nonredundant protein
or nucleotide databases by using the program BLASTX or
BLASTN respectively (Altschul et al. 1997
) on the BLAST network service at the National Center for
Biotechnology Information (NCBI). Blast searches against in-house
databases were run locally using the BLAST suite of
programs as distributed by the NCBI in a PC computer running Linux.
Sequences used to search the databases were either consensus sequences
or individual GSS sequences as explained in the text.
For contig assembly, sequences were assembled with PHRAP;
the assemblies were visually inspected with CONSED. PHRED, PHRAP,
CROSS-MATCH, and CONSED are courtesy
of B. Ewing, P. Green, and D. Gordon (University of Washington,
Seattle). The T. cruzi GSS database used to run local
BLAST searches was compiled from the 11,459 GSS after
base-calling and vector masking with the mentioned programs. The
frequency of di- and trinucleotides was calculated using
COMPSEQ (EMBOSS, European Molecular Biology Open Software
Suite; The Sanger Centre, UK) and searching for all possible words of
length 2 and 3, respectively. The expected values are calculated on the
assumption that all words have the same probability of occurrence. The
fractional GC content of sequences was calculated with the program
GEECEE (EMBOSS). The number of ORFs was calculated
using GETORF (EMBOSS). Fickett's test (testcode) was
implemented as a Perl program, based on the original algorithm (Fickett
1982
), and used to calculate a testcode value for each sequence.
Microsatellite repeats, low-complexity regions and repetitive elements
were searched for using the program REPEATMASKER (courtesy
of A. Smit and P. Green, University of Washington, Seattle) over the
entire GSS database using different repeat libraries that are part of
RepBase (release 5.02). The Pfam database (Bateman et al. 2000
) was
searched using HMMPFAM, from the HMMER package (courtesy of S. Eddy, Washington University, St. Louis).
Estimation of the Copy Number and the Relative Abundance
Copy number (CN) was estimated directly from the number of GSS
(GSS) belonging to a given gene family. The estimation also included
the size (GS) of the gene/sequence element to account for the fact that
bigger sequences should be more represented in a random sequence survey
than smaller ones at the same copy number. For this calculation the
haploid genome size (HGS) used was 4 × 107 bp (Frohme et
al. 1998
) and the total number of GSS (TGSS) was 11,459:
|
|
Southern Blot and Pulse Field Gel Electrophoresis
DNA was purified using the conventional proteinase K-phenol
extraction method. 4 µg from each sample was digested with PstI (New England BioLabs) and separated on a 0.7% agarose gel. Southern blot transfer and analysis was performed following standard procedures (Sambrook et al. 1989
). For the PFGE blots, DNA was prepared and processed as described (Henriksson et al. 1995
). The probe was radioactively labeled with [
-32P]dCTP (NEN Life Science
Products, Inc.) by a PCR-based method using DNA from clone GSSTc11311
as template and the following oligonucleotides: TcIRE-rev,
5'-cgccaacacaactgatacg-3', and TcIRE-fwd, 5'-gcaaaagagagcaccaac-3'. After hybridization the nylon filters were washed under stringent conditions (0.1× SSC, 0.1 % SDS, at 62°C).
| |
ACKNOWLEDGMENTS |
|---|
We thank J.J. Cazzulo for critical reading of the manuscript. We thank Diego Rey Serantes, Fernanda Peri and Rodrigo Pavón for technical assistance. This work was funded by grants from the UNDP/World Bank/WHO Special Program for Research and Training in Tropical Diseases; SAREC, the Research Department of the Swedish International Development Agency (SIDA); Agencia Nacional de Promoción Científica y Tecnológica, Argentina and Consejo Nacional de Investigaciones Científicas y Técnicas, Argentina. The work of A.C.C.F. was partially supported by an International Research Scholar award from the Howard Hughes Medical Institute.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
1 These authors contributed equally to this work.
2 Corresponding author.
E-MAIL dsanchez{at}iib.unsam.edu.ar; FAX 54-11-4752-9639.
Article published online before print: Genome Res., 10.1101/gr.146300.
Article and publication are at www.genome.org/cgi/doi/10.1101/gr.146300.
| |
REFERENCES |
|---|
|
|
|---|
Received May 2, 2000; accepted in revised form September 20, 2000.
This article has been cited by other articles:
![]() |
R. T. Souza, M. R. M. Santos, F. M. Lima, N. M. El-Sayed, P. J. Myler, J. C. Ruiz, and J. F. da Silveira New Trypanosoma cruzi Repeated Element That Shows Site Specificity for Insertion Eukaryot. Cell, July 1, 2007; 6(7): 1228 - 1238. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. O. Obado, M. C. Taylor, S. R. Wilkinson, E. V. Bromley, and J. M. Kelly Functional mapping of a trypanosome centromere by chromosome fragmentation identifies a 16-kb GC-rich transcriptional "strand-switch" domain as a major feature Genome Res., January 1, 2005; 15(1): 36 - 43. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Buscaglia, V. A. Campo, J. M. Di Noia, A. C. T. Torrecilhas, C. R. De Marchi, M. A. J. Ferguson, A. C. C. Frasch, and I. C. Almeida The Surface Coat of the Mammal-dwelling Infective Trypomastigote Stage of Trypanosoma cruzi Is Formed by Highly Diverse Immunogenic Mucins J. Biol. Chem., April 16, 2004; 279(16): 15860 - 15869. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Luchtan, C. Warade, D. B. Weatherly, W. M. Degrave, R. L. Tarleton, and J. C. Kissinger TcruziDB: an integrated Trypanosoma cruzi genome resource Nucleic Acids Res., January 1, 2004; 32(90001): D344 - 346. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. C. Cuevas, J. J. Cazzulo, and D. O. Sanchez gp63 Homologues in Trypanosoma cruzi: Surface Antigens with Metalloprotease Activity and a Possible Role in Host Cell Infection Infect. Immun., October 1, 2003; 71(10): 5739 - 5749. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Wickstead, K. Ersfeld, and K. Gull Repetitive Elements in Genomes of Parasitic Protozoa Microbiol. Mol. Biol. Rev., September 1, 2003; 67(3): 360 - 375. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. VIRREIRA, F. TORRICO, C. TRUYENS, C. ALONSO-VEGA, M. SOLANO, Y. CARLIER, and M. SVOBODA COMPARISON OF POLYMERASE CHAIN REACTION METHODS FOR RELIABLE AND EASY DETECTION OF CONGENITAL TRYPANOSOMA CRUZI INFECTION Am J Trop Med Hyg, May 1, 2003; 68(5): 574 - 582. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Aguero, V. Campo, L. Cremona, A. Jager, J. M. Di Noia, P. Overath, D. O. Sanchez, and A. C. Frasch Gene Discovery in the Freshwater Fish Parasite Trypanosoma carassii: Identification of trans-Sialidase-Like and Mucin-Like Genes Infect. Immun., December 1, 2002; 70(12): 7140 - 7144. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Sperling, P. Dessen, M. Zagulski, R. E. Pearlman, A. Migdalski, R. Gromadka, M. Froissard, A.-M. Keller, and J. Cohen Random Sequencing of Paramecium Somatic DNA Eukaryot. Cell, June 1, 2002; 1(3): 341 - 352. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Di Noia, C. A. Buscaglia, C. R. De Marchi, I. C. Almeida, and A. C.C. Frasch A Trypanosoma cruzi Small Surface Molecule Provides the First Immunological Evidence that Chagas' Disease Is Due to a Single Parasite Lineage J. Exp. Med., February 11, 2002; 195(4): 401 - 413. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Bringaud, N. Biteau, S. E. Melville, S. Hez, N. M. El-Sayed, V. Leech, M. Berriman, N. Hall, J. E. Donelson, and T. Baltz A New, Expressed Multigene Family Containing a Hot Spot for Insertion of Retroelements Is Associated with Polymorphic Subtelomeric Regions of Trypanosoma brucei Eukaryot. Cell, February 1, 2002; 1(1): 137 - 151. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. Rabinowicz Genomics in Latin America: Reaching the Frontiers Genome Res., March 1, 2001; 11(3): 319 - 322. [Full Text] |
||||
![]() |
I. D'Orso and A. C. C. Frasch TcUBP-1, a Developmentally Regulated U-rich RNA-binding Protein Involved in Selective mRNA Destabilization in Trypanosomes J. Biol. Chem., September 7, 2001; 276(37): 34801 - 34809. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||