Genome Research scroll

Home Help [Feedback] [For Subscribers] [Archive] [Search] [Contents]
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wang, M. L.
Right arrow Articles by Goodman, H. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wang, M. L.
Right arrow Articles by Goodman, H. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Vol. 9, Issue 4, 325-333, April 1999

A Cluster of ABA-Regulated Genes on Arabidopsis thaliana BAC T07M07

Ming Li Wang,1,2,3 Stephen Belmonte,2 Ulandt Kim,2,4 Maureen Dolan,2,5 John W. Morris,2,6 and Howard M. Goodman1,2,7

1 Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115 USA; 2 Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts 02114 USA

    ABSTRACT
Top
Abstract
Introduction
RESULTS
DISCUSSION
METHODS
References

Arabidopsis thaliana BAC T07M07 encoding the abscisic acid-insensitive 4 (ABI4) locus has been sequenced completely. It contains a 95,713-bp insert and 24 predicted genes. Most putative genes were confirmed by gel-based RNA profiling and a cluster of ABA-regulated genes was identified. One of the 24 genes, designated PP2C5, encodes a putative protein phosphatase 2C. The encoded protein was expressed in Escherichia coli, and its enzyme activity in vitro was confirmed.

[The sequence data described in this paper have been submitted to GenBank under accession no. AF085279.]

    INTRODUCTION
Top
Abstract
Introduction
RESULTS
DISCUSSION
METHODS
References

Sequencing the entire genomes of model organisms is fundamentally shifting the way we study gene expression and function. Traditionally, the search for gene function started with a phenotypic mutant and proceeded to gene cloning and functional analysis---"from phenotype to gene". Now, as genome sequencing is revealing the whole genotype of an organism, in principle, a reverse approach can be taken---"from gene to phenotype" (reverse genetics). Rather than studying a single gene's function and expression, with the use of sequence arrays one can view the genome as a whole to study multiple genes' function and expression (functional genomics) (DeRisi et al. 1997; Rowen et al. 1997).

Arabidopsis thaliana is an excellent plant organism for both genome and biological studies because of its small genome size, small physical size, and short life cycle (Meyerowitz 1994; Goodman et al. 1995). In addition, an international effort has been established for sequencing the A. thaliana genome (Kaiser 1996). We have been concentrating our efforts on Arabidopsis chromosome 2. We constructed a yeast artificial chromosome (YAC) physical map for chromosome 2 (Zachgo et al. 1996) and in a 2-Mb region in the vicinity of 80 cM, a higher resolution physical map composed primarily of bacterial artificial chromosome (BAC) clones (Wang et al. 1997). This 2-Mb region is being sequenced in collaboration with The Institute for Genome Research (TIGR). As part of this effort, we have sequenced several BACs on chromosome 2, and have begun to examine the encoded genes. Different software and database search tools (Xu et al. 1994; Burge and Karlin 1997) were used for gene prediction. Prior to functional studies, a gel-based RNA profiling method was used to confirm whether these putative genes were indeed expressed. In general, the model that emerged by comparison of all the predictions was in good agreement with the experimental data. We report here the sequence of BAC T07M07, prediction of putative genes, confirmation by gel-based RNA profiling, and identification of a cluster of ABA-regulated genes on this BAC.

    RESULTS
Top
Abstract
Introduction
RESULTS
DISCUSSION
METHODS
References

Subcloning and Sequencing of BAC DNA

Random fragmentation of high molecular weight (HMW) BAC DNA and size selection were critical for obtaining a complete set of sequencing clones. An initial T07M07 library created from a nebulized DNA sample with a broad range of size fragments (0.5-4.0 kb) did not appear to be random, as sequence assembly from 1269 reads formed deep, but not extended contigs. A second successful library used less DNA (4 vs. 6 µg) and nebulization produced a more narrow range of fragment sizes (primarily 0.5-1.6 kb). From this second library, three contigs were assembled from 2211 reads prior to gap closure by directed sequencing. Bacterial contamination as judged by blast matches was ~1% to 2%, a number lower than that reported for the yeast genomic DNA contamination (~9%) when sequencing libraries were constructed from Caenorhabditis elegans YAC DNA (Vaudin et al. 1995).

Structural Features of the Sequenced BAC T07M07

BAC T07M07 contained a 95,713-bp insert (GenBank accession no. AF085279). The generally high A+T content of A. thaliana was reflected in this BAC that had a level of 64.9%. Overall, this BAC was devoid of repetitive sequences and did not contain any CpG islands (CG-rich regions) commonly found in human sequences. This was consistent with our fingerprint and Southern hybridization results (M.L. Wang, unpubl.). Working primarily from the Grail predictions, genes were spaced relatively evenly every ~4.0 kb (average 3988 bp). In one exceptional region, between basepair positions 2300 and 11,400, there were no predicted genes and no significant blast sequence similarities. Most genes (17 of 24 predicted genes) contained introns, with an average intron size of 141 nucleotides (range 24-723). Of the genes with introns, the average number of introns in a gene was 4 (range 1-9). The average exon size for the 7 predicted genes without introns was 1091 bp (range 528-1794), whereas for the 17 predicted genes with introns, it was 217 bp (range 35-1352). For genes with introns, the average size of the gene was 1759 bp (range 376-3505), and the average predicted amino acid coding region was 1149 bp (range 209-2397). The genes were distributed evenly on the upper and lower strands.

Identifying Gene Function by Sequence Similarity

A summary of the results of blast analysis of the BAC sequence is shown in Table 1. These putative genes can be classified into five groups: (1) identical to known Arabidopsis genes [2 genes, gene 5 identical to AtEm6 and gene 10 identical to ABI4 (abscisic acid insensitive 4)]; (2) similar to an Arabidopsis EST or an Arabidopsis predicted protein (11 genes, 3, 4, 9, 12, 13, 14, 16, 19, 22, 23, and 24); (3) similar to other plant sequences (6 genes, 2, 6, 8, 11, 15, and 18); (4) similar to nonplant sequences but with no significant match to plant sequences (4 genes, 1, 7, 17, and 21); (5) not similar to any database sequence (1 gene, 20).

                              
View this table:
[in this window]
[in a new window]
 
Table 1.   Predicted Genes Encoded by BAC T07M07

Of the 24 predicted genes, 15 genes had matches with a level of significance (P < 1.0 e-20) that suggested that they might have a function related to their matched sequences. We would not normally consider a BLASTP P_value in the range of 0.001-0.0001 as significant. However, it is worth noting that the best two matches to gene 21 (mouse cation-dependent mannose-6-phosphate receptor and a Schizosaccharomyces pombe hypothetical protein) were to the same region of the gene, and many of the residues that were conserved between sequences were conserved in all three proteins. The match region included part of the mouse receptor's transmembrane domain and cytoplasmic tail which (in the bovine homolog) is important to proper trafficking of the protein (Rohrer et al. 1995). It is interesting that two small gene families seem to be present on the BAC. The first, genes, 4 and 22, produced significant matches to the same set of hypothetical proteins. The second, represented by ABI4 and gene 24, both contain the AP2 DNA-binding motif. Finally, one predicted gene had no significant match to any known gene. Some putative genes (3, 5, and 15) are probably highly expressed, because several ESTs were identified (data not shown).

ABI4 and AtEm6 are both regulated by ABA. AtEm6 (gene 5) is ABA-inducible and highly expressed in immature and dry seeds (Finkelstein 1993; Gaubier et al. 1993). Interestingly, database searching identified another potential ABA-related gene, 6, that has homology to ABI1 and ABI2 and is tentatively designated as PP2C5. Both ABI1 and ABI2 are protein phosphatase 2C (PP2C) proteins and have a high degree of sequence similarity to one another. Mutants in ABI1 and ABI2 have identical amino acid substitutions (Gly right-arrow Asp) in the same relative positions, residues 168 and 180, respectively (Leung et al. 1997). The amino acids in the ABI1 and ABI2 mutated region, as well as in the other metal and the phosphate-binding sites, are also conserved in PP2C5 (Fig. 5, below). The closest BLAST match (GenBank release 107) to the predicted PP2C5 amino acid sequence was to MSMP2C, a PP2C from alfalfa shown to be active in the stress-activated MAPK pathway (Meskiene et al. 1998). The other significant Arabidopsis matches to PP2C5 identified through database searching included the recently described AtP2C-HA (Rodriguez 1998; GenBank accession no. 3242077) and AthPP2Ca, a PP2C gene identified through yeast complementation studies (Kuromori and Yamamoto 1994; GenBank accession no. D38109).

Confirmation of Putative Genes by Reverse Northern Analysis

The positions of the putative genes relative to a minimal set of plasmid sequencing subclones of BAC T07M07 are schematically shown in Figure 1. Most of the putative genes spanned two to four plasmid clones, and individual plasmid clones represented only two putative genes (5 and 20). DNA fragments amplified from the plasmid clones by PCR using primers designed from the pBluescript II KS+ vector sequences were checked on a 1% agarose gel (Fig. 2). As shown in Figure 2, PCR was highly specific. A single major band was amplified from most plasmid clones (a minor second band was observed for only three clones that may have come from amplification of contaminating DNA templates). After gel electrophoresis, the amplified DNA was transferred to a nylon membrane and hybridized with a single-stranded cDNA probe synthesized from whole Arabidopsis seedling poly(A) RNA. The expression signals seen in the reverse Northern analysis (gel-based RNA profiling, Fig. 3) of nine putative genes (1, 5, 6, 11, 17, 18, 20, 21, and 23) were as predicted. For 15 of the predicted genes, the expression signals were in the expected region but either extended into slightly different plasmid clones or showed uneven signals across the plasmid clones. These differences may be explained, at least in part, by the presence in the probe of the 5'- and 3'-untranslated regions of the corresponding mRNA that could hybridize and extend the region of hybridization beyond that predicated by the computer programs and/or the existence of nonhybridizing nonpredicted introns within the genomic fragments. However, in addition to the expected regions of expression, there were expression signals in some regions of the BAC (N1-N5) that were not predicted to encode genes on the basis of the gene finding software. For example, there was no putative gene prediction in the region between 2300-11,400 bp, but there were clear hybridization signals (N1) in this region.


View larger version (13K):
[in this window]
[in a new window]
 
Figure 1   Relative positions of putative genes and plasmid clones sequenced on BAC T07M07. (Brown arrows) Plasmid clones; (blue arrows) predicted genes with arrowheads indicating the direction of transcription. Bent lines between the blue exons indicate predicted introns. The scale is in base pairs.


View larger version (70K):
[in this window]
[in a new window]
 
Figure 2   Amplified DNA fragments and putative genes on BAC T07M07. A minimal set of 104 sequencing clone inserts were amplified with primers designed from the pBluescript II KS+ vector sequences. The DNA samples were electrophoresed for 30 min at 60 V in a 1% agarose gel and stained with ethidium bromide. A 1-kb ladder is in each corner lane of the first row. The DNA samples for each set were loaded on the gel in the order corresponding to their positions on the BAC (Fig. 1).


View larger version (187K):
[in this window]
[in a new window]
 
Figure 3   Gene expression profile of BAC T07M07 as determined by reverse Northern (gel-based RNA profiling). Following electrophoresis, DNA samples (see Fig. 2) were transferred onto a nylon membrane and the blot was hybridized to a single-stranded cDNA probe synthesized from whole Arabidopsis seedling poly(A) RNA. Hybridization signals indicate the presence of a transcript from the cloned region or gene region (with the caveat that transcripts from genes of similar sequence may also cross-hybridize). Dashed lines indicate the positions of predicted genes numbered above the line. Arrows indicate that the predicted gene extends to the next row. Small numbers mark the DNA sample in every tenth lane. New (N) indicates expression from regions not predicted to encode a gene. A Actin gene (A) used as an internal control for gene expression.

Because there were two characterized ABA-regulated genes identified on this BAC, we examined the entire BAC for genes whose expression was regulated by ABA. Using probes developed from RNA isolated from ABA treated or water (control) treated plants, we found by reverse Northern analysis that gene 3, a zinc finger protein, was also ABA responsive (data not shown, but see Northern analysis below). We have thus identified a small cluster of three ABA-regulated genes, located within 25 kb on BAC T07M07.

Confirmation of Putative Genes by Northern Analysis

To verify the results of reverse Northern analysis, three ABA-regulated genes (3, 5, and 10) and one unregulated gene (15, as a control) were chosen for standard Northern analysis (Fig. 4). Transcripts (2.0, 0.7, 1.8, and 2.5 kb) were detected for genes 3, 5, 10, and 15, respectively, confirming the results of the reverse Northern analysis. Relative to the control water treatment, gene 3 (zinc finger protein) expression was repressed (approximately threefold) by ABA at the seedling stage (Fig. 4, lane 2) but not at the flowering or silique stages (Fig. 4, lanes 4 and 6, respectively). Gene 5 (AtEm6, late embryogenesis abundant protein) expression was induced (~3- to 10-fold) by ABA at the seedling and flowering stages; however, at the silique stage, it is highly expressed in both control and ABA-treated plants. Gene 10 (ABI4, regulatory protein) expression was induced (approximately two- to threefold) by ABA at the flowering and silique stages but not at the seedling stage, at which there was a high level of expression in both the control and treated plants. Gene 15 was not responsive to ABA treatment (only seedling stage data shown).


View larger version (62K):
[in this window]
[in a new window]
 
Figure 4   Level of RNA regulated by ABA treatment in selected putative genes. Total RNAs were isolated from tissues of 2-, 4-, and 6-week-old plants (seedling, flowering, and silique stages, respectively) and probed with gene-specific DNAs. (Lane M) RNA marker; (-) water-treated plants; (+) ABA-treated plants. The main hybridization bands corresponded to the transcripts predicted for each putative gene. The18S rDNA hybridization band serves as a sample loading control.

Functional Analysis of PP2C5 in Vitro

The coding region of PP2C5 (Fig. 5) was amplified by PCR from a cDNA library with primers designed from the predicted gene sequence. The amplified DNA fragment was cloned and sequenced. Comparison of the cDNA to the genomic sequence exactly confirmed the two predicted splicing sites (data not shown). Recombinant protein encoded by PP2C5 was expressed in E. coli and purified from cell culture by affinity chromatography. PP2C5 encodes an ~35-kD protein (Fig. 6A). Because the enzymatic activity of a PP2C protein is phosphate cleavage, the recombinant protein was assayed by measuring its ability to release 32P into solution from labeled casein. Column-purified labeled casein used as the substrate contained only 1% unincorporated 32P in solution (data not shown). Within 140 min of reaction initiation, a plateau was reached and the enzyme had dephosphorylated close to 30% of the labeled casein when 10 mM Mg2+ and 10 mM Mn2+ were provided in the reaction (Fig. 6B). The enzyme activity was abolished when 20 mM EDTA was added to the solution (Fig. 6B). These results were reproducible in two different experiments and consistent with the activity of other known PP2C enzymes (Stone et al. 1994; Bertauche et al. 1996; Leube et al. 1998; McGowan et al. 1988).


View larger version (91K):
[in this window]
[in a new window]
 
Figure 5   Peptide sequence comparison of PP2C genes. Multiple sequence alignment of PP2C5 to Medicago sativa MSMP2C (Y11607) and A. thaliana PP2C proteins; ABI1 (GenBank accession no. X77116), ABI2 (Y08965), AtP2C-HA (ATAJ3119), and ATHPP2Ca (D38109). Note the high level of conservation at the putative phosphate-binding site (open circle ), putative metal binding sites () and the position of the ABI1 and ABI2 mutation (). Sites were determined from homology to previously characterized PP2C proteins (Leung et al. 1994, 1997; Meyer et al. 1994; Das et al. 1996). Shading, on the basis of the PP2C5 sequence, shows conserved residues with a black background and similar residues with a gray background.



View larger version (87K):
[in this window]
[in a new window]
 
Figure 6   Purification of recombinant protein and enzyme activity assay. (A) Gene expression and protein purification of PP2C5. E. coli containing the recombinant expression plasmid was grown for 3 hr either under inducing or noninducing conditions. Following protein isolation, samples were analyzed on a 12% SDS-polyacrylamide gel. From left to right, the samples were as follows: protein marker (prestained SDS-PAGE standards, Bio-Rad, 25 µg in 20 µl); PP2C5-noninduced total protein (40 µg in 40 µl); PP2C5-induced total protein (40 µg in 40 µl); purified PP2C5 protein (5 µg in 5 µl); and purified PP2C5 protein (2.5 µg in 2.5 µl). (B) Enzymatic activity assay for PP2C. One microgram of purified PP2C5 was used in each reaction. Each experiment was repeated twice. Each value represents the average of the two different experiments. Labeled protein was dephosphorylated at 30°C in a 10-µl solution [containing 50 mM Tris-HCl at pH 7.0, 0.1% (vol/vol) 2-mercaptoethanol, ~3 × 105 cpm 32P-labeled casein, ~1 µg of recombinant protein, and either 10 mM Mg2+ and 10 mM Mn2+ or the same concentration of cations plus 10 mM EDTA]. At each time point (min), 100 µl of 20% trichloroacetic acid (TCA) was added, incubated on ice for 5 min, and microcentrifuged at room temperature for 5 min to precipitate the protein. Radioactivity in 90 µl of supernatant and 20 µl of pellet was measured by scintillation counting. Dephosphorylation was expressed as a percentage of 32P released into the supernatant. () -EDTA; (open circle ) +EDTA.

    DISCUSSION
Top
Abstract
Introduction
RESULTS
DISCUSSION
METHODS
References

Although gene prediction programs continue to improve, it will always be necessary to confirm the existence of the putative genes prior to gene functional studies. For BAC T07M07, the software prediction programs had a stringent bias, all of the predicted genes were confirmed (no false predictions), but several genes were missed. In a strict sense, expression has only been confirmed from the cloned regions, but not the exact exon sequences of the predicted genes. Where genes span an entire clone, there is of course, no concern that the hybridization signal might arise from a neighboring gene. However, we cannot tell from the reverse Northern blots which region of the amplified clone produced the positive hybridization signal. A traditional method to confirm the existence of the computer-predicted genes is to amplify them by PCR, clone, and sequence the clones to compare with genomic sequences. If the putative genes have introns, after cloning and sequencing, the splicing sites are accurately determined (PP2C5 as an example); if the putative genes have no introns, they need to be amplified from total mRNA by RT-PCR (e.g., 3' rapid amplification of cDNA ends) and then cloned. After sequencing, the presence of a poly(A) tail on the cDNA clone confirms gene expression and eliminates the possibility of genomic DNA contamination as a possible template source. This approach can only confirm gene existence one by one, with a low efficiency, but with high fidelity. The development of DNA-expression chip technology provides a powerful new tool for confirmation of the existence of putative genes. The method we describe here, gel-based RNA profiling (reverse Northern blot) is an alternative to DNA microarray technology. The only difference is spotting expressed sequence tags (ESTs) on glass slides versus transferring amplified genomic DNA fragments from plasmid clones onto nylon membranes. One limitation of the technique relative to DNA microarray technology is lower spotting density, resulting in low rates of data accumulation. Another more fundamental limitation with this technique, as well as microarray technology, is that gene identification relies on mRNA isolated from a limited set of treatment time points and physiological growth stages. mRNAs that are expressed in a small number of cells, for short time periods or at low levels, may be missed in the assay. It is therefore possible that the five genes that were identified on the hybridization but missed by the gene prediction software actually underestimate the number of missed genes. This is a tempting speculation given the large gap between the predicted genes 1 and 2. Despite the limitations, our results demonstrate that gel-based RNA profiling is an efficient method for gene confirmation and is especially useful and practical for conducting experiments on a small scale in laboratories without DNA microarray facilities.

We have made profitable use of the technique to examine changes in mRNA levels following Abscisic acid (ABA) treatment. ABA, a plant phytohormone, regulates many aspects of plant growth and development (such as seed development, dormancy, and germination) and adaptation to environmental stimuli (such as cold, wounding, and water or salt stress). Regulation of ABA-related gene expression in crop plants has the potential to be of agricultural importance. Interestingly, there is a cluster of ABA-regulated genes on this BAC. AtEm6 (gene 5) encodes one member of the late embryogenesis-abundant protein (lea) family that is ABA inducible and highly expressed in immature and dry seeds (for review, see Skriver and Mundy 1990). Its expression level is also regulated by some ABI genes (especially ABI3, ABI4, and ABI5) (Finkelstein 1993, 1994; Parcy et al. 1994). While positional cloning ABI4 (Finkelstein et al. 1998), we found that both ABI4 and AtEm6 were located on the same BAC, 17 kb apart from each other. ABI4 (gene 10), a new member of the AP2 domain family, encodes a regulatory protein. Intriguingly, we identified another ABA-related gene in the vicinity of the locus AtEm6. Gene 3 (Cys-3-His zinc finger protein) is 500 bp downstream from AtEm6. Its expression is repressed by ABA at the seedling stage. Gene 6, PP2C5, which has high sequence similarity to ABI1 and ABI2 in the metal and phosphate-binding domains, is also located within the ABA-related cluster (2 kb upstream of AtEm6). However, unlike ABI1 and ABI2 that are up-regulated by ABA treatment, we did not detect a significant PP2C5 response to ABA in the Northern analysis experiments (data not shown). Plant tissue samples for mRNA isolation were collected 24 hr after treatment with ABA; therefore, the gene expression changes identified are probably an indirect response (secondary message) to ABA application. Because the reverse Northern profiling relied on mRNA samples from whole plants, and examined only a limited portion of the life cycle, it is an open question as to whether there are other ABA-regulated genes around the AtEm6 locus. It is an interesting, but as yet unanswered question, whether the spacial proximity of these genes and other genes reflects a functional interaction. One possible approach to answer this question would be to introduce a genomic fragment encompassing all of these genes into transgenic plants. Phenotypic changes might be observed, and changes in other gene expression patterns might be observed on a cDNA microarray (Schena et al. 1995; Lashkari et al. 1997). Such an analysis might help discern the transgene function(s) in vivo. The binary bacterial artificial chromosome (BIBAC) system that has been developed recently for the delivery of HMW DNA into plants (Hamilton et al. 1996) should be useful in this regard.

The reversible phosphorylation of proteins, catalyzed by protein kinases and phosphatases, is a major mechanism for the regulation of many cellular functions from metabolism to signal transduction. PP2C is a Mg2+-dependent enzyme (Ingebritsen and Cohen 1983). There are at least five genes for protein phosphatase 2Cs experimentally characterized in Arabidopsis and several more putative PP2C proteins identified on sequenced Arabidopsis BACs. Although these five gene paralogs for protein phosphatases 2C are all expressed in Arabidopsis, they may not serve a single function, spacial and temporal expression patterns have yet to be completely determined and may be expected to vary between the genes. As shown in Figure 5, the regions of similarity among characterized PP2C proteins is primarily in the carboxy-terminal end and this also appears to be true of the putative PP2C proteins from sequenced BACs (F17I5, F28M20, F21O9, T01B08, T20B5, T29F13; data not shown). It is inviting to speculate that amino-terminal domains will be shown to determine substrate specificity. ABI1, ABI2, ATP2CHA, and ATHPP2C are actually more closely related to each other than alfalfa MsMP2c and PP2C5, suggesting a divergence of functional partners. The three PP2C proteins ABI1, ABI2, and AthPP2Ca have been studied in vivo. In our case, the enzyme activity of PP2C5 was only tested in vitro. Recently, it has been demonstrated that in plants, ABI1, ATHPP2C, and MsMP2c function as negative regulators in ABA or stress-activated MAPK pathways (Meskiene et al. 1998; Sheen 1998). A single cell system (Sheen 1996) is a rapid way to characterize in vivo activity of PP2C5 genes. Expression of PP2C5, perhaps in combination with known kinases or phosphatases in vivo, may provide additional insights into the ABA signal-transduction pathway.

    METHODS
Top
Abstract
Introduction
RESULTS
DISCUSSION
METHODS
References

DNA Nebulization and Library Construction

BAC T07M07 from the TAMU Arabidopsis library (Choi et al. 1995) was isolated from a 500-ml culture by a modified alkaline lysis method. BAC DNA (4-6 µg) was resuspended in 2 ml of 50 mM Tris (pH 8.0), 15 mM MgCl2, 25% glycerol and transferred into a prepared nebulizer (IPI Medical Products). DNA was nebulized for 150 sec at 30 psi at the tank outlet. The fragmented DNA sample (1800 µl) was collected and concentrated to 400 µl by five to six butanol extractions, precipitated with 2.5 volumes of ethanol (2 hr at -20°C) and pelleted by microcentrifugation at 4°C for 15 min. The DNA pellet was washed with 70% cold ethanol, dried at room temperature and dissolved in ~100 µl of 1× TE. Ends of the fragmented DNA (~1.5 µg) were filled in by Pfu DNA polymerase (Stratagene). The reaction solution was loaded onto a 1%, 8-cm-long low melting point (LMP) agarose (Kodak) gel in 1× TBE and subjected to electrophoresis. Gel slices containing DNA fragments ranging from 500 bp to 3.0 kb were excised. The gel was melted at 65°C for 10 min and the agarose digested with beta -agarase I (New England Biolabs) at 40°C for 2 hr. The digested agarose solution was chilled on ice for 15 min and microcentrifuged at 4°C for 15 min to pellet any undigested agarose. The supernatant was extracted with an equal volume of phenol/chloroform, then chloroform, and the DNA was precipitated with two volumes of isopropanol. Following centrifugation, the DNA pellet was washed with 70% cold ethanol, dried at room temperature for 15 min, dissolved in 20 µl of 1× TE, and stored at 4°C for later ligation. Bluescript II KS+ vector (Stratagene) DNA was digested with EcoRV at 37°C for 3 hr and dephosphorylated with calf intestinal alkaline phosphatase (CIP) (NEB). The dephosphorylated vector DNA (~100 ng) was mixed with the size-selected BAC DNA (~100 ng) and ligated with T4 DNA ligase (NEB) at 16°C overnight. After ligation, the ligase was heat inhibited at 65°C for 10 min. DNA in the ligation solution was electrotransformed into E. coli DH5alpha -competent cells by use of a Bio-Rad gene pulser. Transformants were recovered in 1 ml of SOC (2% Bacto-tryptone, 0.5% Bacto-yeast extract, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4, 10 mM glucose) at 37°C with shaking at 300 rpm for 1 hr. Transformants in the recovery solution were mixed with an equal volume of 30% glycerol, divided into aliquots (100 µl), and stored at -80°C for later use.

DNA Sequencing

The frozen transformant stock was thawed briefly and diluted 50 times with LB liquid medium and then plated onto selective LB agar plates containing 50 µg/ml ampicillin. Recombinants were identified with a blue-white (X-gal and IPTG indicator) screening system. White colonies were inoculated into 96-well titer plates (each well containing 1.3 ml of liquid LB medium) and cultured at 37°C with shaking at 300 rpm for 22 hr. Plasmid DNA was isolated by rapid extraction alkaline plasmid kits (QIAGEN) and dissolved in 50 µl of double-distilled water for later use as DNA-sequencing templates. DNA was sequenced on an ABIprism 377 DNA sequencer by dye terminator chemistry as described by the manufacturer (Perkin Elmer). The Phred, Phrap, and Consed sequence assembly and viewing programs (Ewing and Green 1998; Ewing et al. 1998; Gordon et al. 1998) were used to remove BAC vector sequences and to assemble contigs. After large sequence contigs were assembled and only two gaps remained, the gaps were closed by amplifying the intervening DNA by PCR from the original BAC DNA with primers on the basis of adjacent sequences.

Gene Prediction

Putative genes were identified by a combination of three gene-prediction programs. Grail (Xu et al. 1994) was run with client server software and selecting for Arabidopsis as the DNA source. Gene assembly for the Grail program requires the investigator to define a region of the DNA sequence to assemble. This was done somewhat subjectively on the basis of the clustering of the predicted exons. Genscan (Burge and Karlin 1997) was run with the Arabidopsis organism option and an exon cutoff of 1.0. Genefinder (P. Green, pers. comm.) was run with the Arabidopsis tables developed by ESSA (European Scientists Sequencing Arabidopsis). As partial confirmation of the predicted genes, the BAC sequence was compared with sequences deposited in GenBank by use of blastn and blastx, and translations of the predicted genes were compared by use of blastp (Altschul et al. 1990; Gish et al. 1993).

Amplification from the BAC Sequences; Northern and Reverse Northern Analysis

Minimum-overlapping plasmid clones were selected and insert DNAs were amplified by use of primers designed from pBluescript II KS+ vector sequences. The DNA concentration in PCR was quantified, 1 µg of DNA was electrophoresed on a 1% agarose gel for 30 min, and DNA fragments were transferred onto a nylon membrane (Biodyne, Pal Biosupport, East Hills, NY; 0.45 µm) by alkaline transfer. These blots were used for reverse Northern analysis. Tissue was collected from 2-, 4-, and 6-week-old glass-house-grown whole plants 24 hr following spraying with water (control) or ABA (100 µM), immediately frozen in liquid nitrogen, and stored at -80°C. Stage-specific, total or poly(A) RNA were isolated with Trizol (GIBCO-BRL) and the Oligotex mRNA mini kit (Qiagen). Total RNA for each sample (10 µg) was separated on a 1% agarose gel and transferred onto a nylon membrane. For reverse Northern analysis, a labeled single-stranded cDNA probe was synthesized from whole Arabidopsis seedling poly(A) RNA and used as probes on Southern blots containing DNA fragments covering the whole BAC T07M07. After hybridization, the blots were washed four times at 65°C for 15 min with 40 mM Na2HPO4 (pH 7.2), 1% SDS, and 1 mM EDTA (pH 8.0). For Northern analysis, DNA fragments encompassing putative genes were amplified from BAC T07M07, labeled by [alpha -32P]dATP and used as probes for hybridization. Primers were designed on the basis of the sequences of putative genes from start codons to stop codons and synthesized in the Department of Molecular Biology core facility. The blots were washed with 1× SSC, 0.1% SDS, at 65°C for 15 min twice; 0.1× SSC, 0.1% SDS, at 65°C for 15 min twice, and then exposed to X-ray film.

Amplification and Cloning of PCR Fragments from a cDNA Library

DNAs prepared from a cDNA library (Minet et al. 1992) were used as DNA templates for PCR amplification of putative genes. High-fidelity enzyme mix (Boehringer Mannheim) and Taq extender (Stratagene) were used in the PCR. PCR-amplified fragments corresponding to the ABA regulated genes on T07M07 were extracted with phenol/chloroform/isoamylalcohol (25:24:1), then chloroform/isoamylalcohol (24:1), precipitated with ethanol, dried at room temperature, dissolved in 1× TE and cloned into the pET-19b expression vector (Novagen) that encodes 10 histidine residues at the amino terminus of the expressed protein. PCR fragments (~1000 ng) and pET-19b vector (500 ng) DNA were digested with XhoI and dephosphorylated with CIP. The digested PCR fragment DNA was electrophoresed on a 1% LMP agarose gel. The gel slice containing the DNA band was excised under long UV light and melted at 65°C. Digested vector DNA and PCR-amplified DNA were ligated at 16°C for 16 hr and then electroporated into E. coli DH5alpha -competent cells. Recombinants were identified by outgrowth on LB agar plates containing ampicilin (50 µg/ml). Four clones from each putative gene were sequenced and aligned with genomic sequences. Plasmid DNA from a sense clone without mutations and pET-19b vector DNA (as a control) were transformed into E. coli BL21pLysS-competent cells (Novagen) (expression host) by heat-shock for later gene expression.

Expression and Purification of Recombinant Protein

A single colony was picked from LB agar plates containing 50 µg/ml ampicillin and 34 µg/ml chloramphenical and inoculated into 5 ml of LB solution with the same antibiotics and cultured at 37°C with shaking at 300 rpm for 16 hr. Five milliliters of cultured cell solution was inoculated into 500 ml of LB solution with the same antibiotics. When the OD600 reached 0.6-0.7, isopropyl beta -D-thiogalactoside (IPTG, 1 mM) was added to the culture for induction of expression from the T7 promoter. The solution was cultured for another 3 hr to allow accumulation of recombinant protein. Cells were harvested by centrifugation and suspended in 5 mM imidazole, 500 mM NaCl, 20 mM Tris-HCl (pH 7.9), and lysed by sonication at the following settings (time, hold; duty cycle, 80; output control, 9) on a Sonifier 250 (Branson) at 4°C for 1 min. Protein was purified by nickel affinity chromatography Novagen kits (His-bind resin; His-bind buffer) and dialyzed at 4°C overnight. Protein was concentrated by use of centricon concentrators (Amicon) and its concentration determined by a Bio-Rad protein assay.

Casein Labeling and Protein Enzyme Activity Assay

J.M. Stone (pers. comm.) provided the casein-labeling method. Before labeling, 5× incubation buffer (250 mM Tris at pH 7.4, 0.5 mM EGTA, 50% glycerol, 0.5% beta -mercaptoethanol) and stop buffer (100 mM EDTA, 500 mM NaF) were prepared, and 250 units of the catalytic subunit of protein kinase from bovine heart (Sigma) were dissolved in 25 µl of 40 mM dithiothreitol (DTT). Partially dephosphorylated casein from bovine milk (Sigma) was labeled by [gamma -32P]ATP (NEN/DuPont) in a total volume of 500 µl (containing 1× incubation buffer, 10 mM Mg-acetate, 1 U/µl protein kinase, 0.4 µCi/µl [32P]ATP, 11.2 µg/µl casein) at room temperature for 16 hr. The reaction was terminated by adding 55 µl of stop buffer and left at room temperature for 10 min. Denatured protein was removed by brief centrifugation. To remove unincorporated [32P]ATP, the reaction solution was chromatographed on a 10-ml size Sephadex G-50 column pre-equilibrated with 1× incubation buffer. The labeled protein solution was loaded onto the column, 32 fractions (500 µl each) were eluted by addition of 1× incubation buffer, collected into 1.5-ml Eppendorf tubes and stored at 4°C. Incorporated label in each fraction was determined by scintillation counting and the initial highly radioactive fraction (4 × 105 cpm/µl) was used as a substrate for the protein assay. Phosphatase activity was determined as described (Stone et al. 1994).

    ACKNOWLEDGMENTS

We thank Drs. Jen Sheen and Gabor Lazar for suggestions and stimulating discussions regarding the experiments, Dr. Julie M. Stone for instructions for the casein labeling and the recombinant protein assay, Dr. Hawk-Bin Kwon for suggestions in RNA isolation, and Dr. Julie Nardone for critical reading of the manuscript. We acknowledge Dr. Michèle Minet for providing the Arabidopsis pFL61 cDNA library. This work was supported by a grant from Hoechst to H.M.G.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.

    FOOTNOTES

Present addresses: 3Cereon Genomics, LLC., Cambridge, Massachusetts 02139 USA; 4Marine Biological Laboratory, Woods Hole, Massachusetts 02543 USA; 5DuPont Agricultural Products, Newark, Delaware 19711 USA; 6Millennium Pharmaceuticals Inc., Cambridge, Massachusetts 02139 USA.

7 Corresponding author.

E-MAIL Howard.Goodman{at}molbio.mgh.harvard.edu; FAX (617) 726-3535.

    REFERENCES
Top
Abstract
Introduction
RESULTS
DISCUSSION
METHODS
References

  • Altschul, S.F., W. Gish, W. Miller, E.W. Myers, and D.J. Lipman. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403-410[CrossRef][Medline].
  • Bertauche, N., J. Leung, and J. Giraudat. 1996. Protein phosphatase activity of abscisic acid insensitive 1 (ABI1) protein from Arabidopsis thaliana. Eur. J. Biochem. 241: 193-200[Medline].
  • Burge, C. and S. Karlin. 1997. Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268: 78-94[CrossRef][Medline].
  • Choi, S., R.A. Creelman, J.E. Mullet, and R.A. Wing. 1995. Construction and characterization of a bacterial artificial chromosome library of Arabidopsis thaliana. Plant Mol. Biol. Rep. 13: 124-128.
  • Das, A.K., N.R. Helps, P.T.W. Cohen, and D. Barford. 1996. Crystal structure of the protein serine/threonine phosphatase 2C at 2.0 Å resolution. EMBO J. 15: 6798-6809[Medline].
  • DeRisi, J.L., V.R. Iyer, and P.O. Brown. 1997. Exploring the metabolic and genetic control of gene expression on a genome scale. Science 278: 680-686[Abstract/Free Full Text].
  • Ewing, B. and P. Green. 1998. Base-calling of automated sequencer traces using Phred. II. Error probabilties. Genome Res. 8: 186-194[Abstract/Free Full Text].
  • Ewing, B., L. Hillier, M.C. Wendl, and P. Green. 1998. Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res. 8: 175-185[Abstract/Free Full Text].
  • Finkelstein, R. 1993. Abscisic acid-insensitive mutations provide evidence for stage-specific signal pathways regulating expression of an Arabidopsis late embryogenesis-abundant (lea) gene. Mol. & Gen. Genet. 238: 401-408[CrossRef][Medline].
  • -----. 1994. Mutations at two new Arabidopsis ABA response loci are similar to the abi3 mutations. Plant J. 5: 765-771[CrossRef].
  • Finkelstein, R., M.L. Wang, T.J. Lynch, S. Rao, and H.M. Goodman. 1998. The Arabidopsis abscisic acid response locus ABI4 encodes an APETALA2 domain protein. Plant Cell 10: 1043-1054[Abstract/Free Full Text].
  • Gaubier, P., M. Raynal, G. Hull, G.M. Huestis, F. Grellet, C. Arenas, M. Pagès, and M. Delseny. 1993. Two different Em-like genes are expressed in Arabidopsis thaliana seeds during mutation. Mol. & Gen. Genet. 238: 409-418[CrossRef][Medline].
  • Gish, W. and D.J. States. 1993. Identification of protein coding regions by database similarity search. Nat. Genet. 3: 266-272[CrossRef][Medline].
  • Goodman, H.M., J.R. Ecker, and C. Dean. 1995. The genome of Arabidopsis thaliana. Proc. Natl. Acad. Sci. 92: 10831-10835[Abstract/Free Full Text].
  • Gordon, D., C. Abajian, and P. Green. 1998. Consed: A graphical tool for sequence finishing. Genome Res. 8: 195-202[Abstract/Free Full Text].
  • Hamilton, C.M., A. Frary, C. Lewis, and S.D. Tanksley. 1996. Stable transfer of intact high molecular weight DNA into plant chromosomes. Proc. Natl. Acad. Sci. 93: 9975-9979[Abstract/Free Full Text].
  • Ingebritsen, T.S. and P. Cohen. 1983. Protein phosphatases: Properties and role in cellular regulation. Science 221: 331-338[Abstract/Free Full Text].
  • Kaiser, J. 1996. First global sequencing effort begins. Science 274: 30[CrossRef][Medline].
  • Kuromori, T. and M. Yamamoto. 1994. Cloning of cDNAs from Arabidopsis thaliana that encode putative protein phosphatase 2C and a human Dr1-like protein by transformation of a fission yeast mutant. Nucleic Acids Res. 22: 5296-5301[Abstract/Free Full Text].
  • Lashkari, D.A., J.H. McCusker, and R.W. Davis. 1997. Whole genome analysis: Experimental access to all genome sequenced segments through larger-scale efficient oligonucleotide synthesis and PCR. Proc. Natl. Acad. Sci. 94: 8945-8947[Abstract/Free Full Text].
  • Leube, M.P., E. Grill, and N. Amrhein. 1998. ABI1 of Arabidopsis is a protein serine/threonine phosphatase highly regulated by the proton and magnesium ion concentration. FEBS Lett. 424: 100-104[CrossRef][Medline].
  • Leung, J., M. Bouvier-Durand, P. Morris, D. Guerrier, F. Chefdor, and J. Giraudat. 1994. Arabidopsis ABA response gene ABI1: Features of a calcium-modulated protein phosphatase. Science 264: 1448-1452[Abstract/Free Full Text].
  • Leung, J., S. Merlot, and J. Giraudat. 1997. The Arabidopsis ABSCISIC ACID-INSENSITIVE2 (ABI2) and ABI1 gene encode homologous protein phosphatases 2C involved in abscisic acid signal transduction. Plant Cell 9: 759-771[Abstract].
  • McGowan, C.H. and P. Cohen. 1988. Protein phosphatase-2C from rabbit skeletal muscle and liver: An Mg2+-dependent enzyme Methods Enzymol. 159: 416-427[Medline].
  • Meskiene, I., L. Bogre, W. Glaser, J. Balog, M. Brandstotter, K. Zwerger, G. Ammerer, and H. Hirt. 1998. MP2C, a plant protein phosphatase 2C, functions as a negative regulator of mitogen-activated protein kinase pathways in yeast and plants. Proc. Natl. Acad. Sci. 95: 1938-1943[Abstract/Free Full Text].
  • Meyer, K., M.P. Leube, and E. Grill. 1994. A protein phosphatase 2C involved in ABA signal transduction in Arabidopsis thaliana. Science 264: 1452-1455[Abstract/Free Full Text].
  • Meyerowitz, E.M. 1994. Arabidopsis: Structure and organization of the Arabidopsis thaliana nuclear genome (ed. E.M. Meyerowitz and C.R. Somerville), pp. 21-36. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
  • Minet, M., M.E. Dufour, and F. Lacroute. 1992. Complementation of Saccharomyces cerevisiae auxotrophic mutants by Arabidopsis thaliana cDNAs. Plant J. 2: 417-422[Medline].
  • Parcy, F., C. Valon, M. Raynal, P. Gaubier-Comella, M. Delseny, and J. Giraudat. 1994. Regulation of gene expression programs during Arabidopsis seed development: Roles of the ABI3 locus and of endogenous abscisic acid. Plant Cell 6: 1567-1582[Abstract].
  • Rodriguez, P.L., M.P. Leube, and E. Grill. 1998. Molecular cloning in Arabidopsis thaliana of a new protein phosphotase 2C (PP2C) with homology to ABI1 and ABI2. Plant Mol. Biol. 38: 879-883[CrossRef][Medline].
  • Rohrer, J., A. Schweizer, K.F. Johnson, and S. Kornfeld. 1995. A determinant in the cytoplasmic tail of the cation-dependent mannose 6-phosphate receptor prevents trafficking to lysosomes. J. Cell. Biol. 130: 1297-1306[Abstract/Free Full Text].
  • Rowen, L., G. Mahairas, and L. Hood. 1997. Sequencing the human genome. Science 278: 605-607[Abstract/Free Full Text].
  • Schena, M., D. Shalon, R.W. Davis, and P.O. Brown. 1995. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 270: 467-470[Abstract/Free Full Text].
  • Sheen, J. 1996. Ca2+-dependent protein kinases and stress signal transduction in plant. Science 274: 1900-1902[Abstract/Free Full Text].
  • -----. 1998. Mutational analysis of protein phosphatase 2C involved in abscisic acid signal transduction in higher plants. Proc. Natl. Acad. Sci. 95: 975-980[Abstract/Free Full Text].
  • Skriver, K. and J. Mundy. 1990. Gene expression in response to abscisic acid and osmotic stress. Plant Cell 2: 503-512[Free Full Text].
  • Stone, J.M., M.A. Collinge, R.D. Smith, M.A. Horn, and J.C. Walker. 1994. Interaction of a protein phosphatase with an Arabidopsis serine-threonine receptor kinase. Science 266: 793-795[Abstract/Free Full Text].
  • Vaudin, M., A. Roopra, L. Hillier, R. Brinkman, J. Sulston, R.K. Wilson, and R.H. Waterson. 1995. The construction and analysis of M13 libraries prepared from YAC DNA. Nucleic Acids Res. 23: 670-674[Abstract/Free Full Text].
  • Wang, M.L., L. Huang, D.K. Bongard-Pierce, S. Belmonte, E.A. Zachgo, J.W. Morris, M. Dolan, and H.M. Goodman. 1997. Construction of an ~2 Mb contig in the region around 80 cM of Arabidopsis thaliana chromosome 2. Plant J. 12: 711-730[CrossRef][Medline].
  • Xu, Y., J.R. Einstein, R.J. Mural, M. Shah, and E.C. Uberbacher. 1994. An improved system for exon recognition and gene modeling in human DNA sequences. Intell. Systems Mol. Biol. 2: 376-384.
  • Zachgo, E.A., M.L. Wang, J. Dewdney, D. Bouchez, C. Camilleri, S. Belmonte, L. Huang, M. Dolan, and H.M. Goodman. 1995. A physical map of chromosome 2 of Arabidopsis thaliana. Genome Res. 6: 19-25[Abstract/Free Full Text].

Received November 5, 1998; accepted in revised form January 29, 1999.


9:325-333 ©1999 by Cold Spring Harbor Laboratory Press  ISSN 1088-9051/99 $5.00

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Plant Physiol.Home page
S. E. Keates, T. A. Kostman, J. D. Anderson, and B. A. Bailey
Altered Gene Expression in Three Plant Species in Response to Treatment with Nep1, a Fungal Protein That Causes Necrosis
Plant Physiology, July 1, 2003; 132(3): 1610 - 1622.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. Kerk, J. Bulgrien, D. W. Smith, B. Barsam, S. Veretnik, and M. Gribskov
The Complement of Protein Phosphatase Catalytic Subunits Encoded in the Genome of Arabidopsis
Plant Physiology, June 1, 2002; 129(2): 908 - 925.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
E. M. Söderman, I. M. Brocard, T. J. Lynch, and R. R. Finkelstein
Regulation and Function of the Arabidopsis ABA-insensitive4 Gene in Seed and Abscisic Acid Response Signaling Networks
Plant Physiology, December 1, 2000; 124(4): 1752 - 1765.
[Abstract] [Full Text]


Home page
Genes Dev.Home page
F. Arenas-Huertero, A. Arroyo, L. Zhou, J. Sheen, and P. León
Analysis of Arabidopsis glucose insensitive mutants, gin5 and gin6, reveals a central role of the plant hormone ABA in the regulation of plant vegetative development by sugar
Genes & Dev., August 15, 2000; 14(16): 2085 - 2096.
[Abstract] [Full Text]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wang, M. L.
Right arrow Articles by Goodman, H. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wang, M. L.
Right arrow Articles by Goodman, H. M.
Social Bookmarking
 Add to CiteULike