|
|
|
|
Published online before print
February 12, 2004, 10.1101/gr.1687304 Genome Res. 14:478-490, 2004 ©2004 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/04 $5.00
Resources Development and Application of a Salmonid EST Database and cDNA Microarray: Data Mining and Interspecific Hybridization Characteristics1 Centre for Biomedical Research, University of Victoria, Victoria, British Columbia V8W 3N5 Canada 2 Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6 Canada 3 Aquaculture Division, Fisheries and Oceans Canada, West Vancouver, British Columbia V7V 1N6 Canada 4 Array Facility, Prostate Centre, Vancouver General Hospital, Vancouver, British Columbia V6H 3Z6 Canada 5 Pacific Biological Station, Fisheries and Oceans Canada, Nanaimo, British Columbia V9T 6N7 Canada 6 Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4E6 Canada
We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3' sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids.
Gene and genome duplications are thought to be primary mechanisms of increasing the number of coding sequences subject to selection, leading to new proteins, morphogenic variations, and phenotypes (Ohno 1970
In addition to their scientific importance as recent tetraploids, salmonids also serve as prominent models for studies involving environmental toxicology (Katchamart et al. 2002
A previously reported S. salar EST project surveyed 1152 ESTs from six cDNA libraries, with 510BLAST-identified sequences representing 178 salmon genes (Davey et al. 2001
EST Survey This report describes ESTs obtained from high-complexity normalized and non-normalized, directionally cloned cDNA libraries, as well as subtracted cDNA libraries, from the following species: S. salar (23 libraries/library groups representing 16 adult tissues and whole juvenile), O. mykiss (six libraries/library groups from three adult tissues, and whole embryo and juvenile), O. tshawytscha (two libraries from adult mixed tissue), O. nerka (two libraries from adult brain and whole juvenile), and C. clupeaformis (two libraries from adult brain; see Table 2 below). The set of S. salar cDNA libraries represents most principal tissues in adult fish. EST clones are available from the corresponding author.
The 95,320 clones from these cDNA libraries (71,144 S. salar, 19,093 O. mykiss, 1824 O. tshawytscha, 1051 O. nerka, and 2208 C. clupeaformis) were M13 forward-sequenced and quality checked. For all libraries except SSH (suppression subtractive hybridization), M13 forward sequences of properly oriented inserts should include 3' UTR. Because of low conservation in 3' UTRs and the pseudotetraploidy of salmonid genomes, we focused on 3' sequencing to allow differentiation between paralogs arising from the recent salmonid genome duplication. 5' (reverse) sequencing was attempted on 7487 of the 71,144 S. salar clones. The 80,388 high-quality ESTs (55,082 forward and 6737 reverse S. salar, 14,544 forward O. mykiss, 1317 forward O. tshawytscha, 1243 forward O. nerka, and 1465 forward C. clupeaformis) were assembled by using PHRAP under high stringency to identify EST clusters (contiguous sequences, or contigs) representing redundant transcripts (Tables 1, 2). The average trimmed PHRED20 length of these ESTs is 546 bases. The 61,819 S. salar ESTs were assembled into 11,560 contigs (with 17,150 singletons remaining), 14,544 O. mykiss ESTs formed 2370 contigs (6611 singletons), 1317 O. tshawytscha ESTs formed 136 contigs (949 singletons), 1243 O. nerka ESTs formed 291 contigs (229 singletons), and 1465 C. clupeaformis ESTs formed 138 contigs (1038 singletons; Table 1). There are 28,710 assembled S. salar sequences (putative transcripts), 8981 O. mykiss putative transcripts, 1085 O. tshawytscha putative transcripts, 520 O. nerka putative transcripts, and 1176 C. clupeaformis putative transcripts (Table 1). Results of alternate assemblies (CAP3 and stackPACK) of this EST collection are available at http://web.uvic.ca/cbr/grasp. The largest S. salar contig contains 252 ESTs (prolactin); the largest O. mykiss contig is size 93 (parvalbumen ); the largest O. tshawytscha contig is size 10 (cytochrome c oxidase subunit II); the largest O. nerka contig is size 21 (similar to ribosomal protein L41); and the largest C. clupeaformis contig is size 28 (ependymin; Table 1). BLAST alignments of ESTs against combined ribosomal and mitochondrial sequence databases (see Methods) identified 1052 S. salar, 396 O. mykiss, 103 O. tshawytscha, 40 O. nerka, and 157 C. clupeaformis reads.
Preliminary analysis of aligned S. salar and O. mykiss assembled ESTs identifies 1892 sequence pairs with >80% identity (see Methods). Of these, 1429 ( 76%) were contained within a distinct peak from 90%97% identity (average 94%) at the nucleotide level. As it is difficult to distinguish orthologs from sequence pairs related by paralogy resulting from gene or genome duplications, a more focused study is underway.
REPuter (Kurtz et. al. 2001
Library Complexity and Gene Discovery
Insert orientation in various types of cDNA library was analyzed to determine its potential influence on gene discovery rates. All libraries in this database were classified by type (i.e. normalized, subtracted), and insert orientations in two libraries from each class were determined (see Supplemental table at http://web.uvic.ca/cbr/grasp for data, method, and discussion of bias). Incidences of reverse-oriented inserts were as follows: 4.5% in non-normalized, nonfractionated libraries (average of two analyzed libraries' values); 29% in non-normalized, higher-MW libraries; 20% in non-normalized, lower-MW libraries; 10% in normalized libraries; and 71.5% in subtracted (randomly cloned) libraries. The weighted average across all four directionally cloned library types (contributing 84.9% of the ESTs in the database) is 9.1% reverse orientation. M13 forward-read ESTs from reverse-oriented inserts give 5' sequence. The somewhat higher incidence of reverse-oriented inserts in higher-MW fraction libraries might contribute to the higher "% new" and "% singleton" values of these libraries over their lower-MW counterparts in our database (Table 2). However, insert orientation differences between library classes do not explain the dramatically higher "% no BLAST hit" values seen in higher-MW libraries (Table 2). Because most EST projects contributing to GenBank databases are biased toward 5' sequencing, the higher "% no BLAST hit" values of our higher-MW libraries are likely conservative indices of the elevated rates of gene discovery associated with these libraries. Assembled S. salar and O. mykiss ESTs were checked for open reading frames (ORFs) >200 bp (Fig. 1A). The chance of a random 66 codon (198 bp) ORF is (61/64)66 = 0.04206 (P < 0.05). Most of our ESTs are 3' reads. The average observed 3' UTR in this database is 264 bases (603' ESTs considered; range, 59 to 592 bases), and average trimmed EST lengths are 484 to 563 bases (Table 1). Therefore, we believe that screening for 200-bp ORFs allows for adequate evaluation of the coding portion of the ESTs without excessive bias against genes with longer 3' UTRs.
Of the 28,710 assembled S. salar sequences, 22,622 (79%) have ORFs >200 bp (Fig. 1A). Of these, 10,123 (45%) have significant (E < 105) BLASTX hits, and 9822 (43%) have significant (E < 105) BLASTN hits (Fig. 1A). Novel salmonid genes may be included in the 12,499 assembled ESTs containing 200-bp ORFs but without BLASTX matches (Fig. 1A). Of the 6088 assembled S. salar ESTs without 200-bp ORFs, 388 (6%) have significant BLASTX hits (likely representing cDNAs coding for short proteins) and 1664 (27%) have significant BLASTN hits (likely representing cDNAs for short proteins as well as previously identified salmonid intronic and untranslated sequences; Fig. 1A). The 4424 assembled S. salar ESTs having neither 200-bp ORFs nor BLASTN hits (Fig. 1A) probably include novel salmonid cDNAs with long 3' UTRs. The ORF and BLAST results for O. mykiss assembled sequences are very similar to those for S. salar (Fig. 1B).
Using Functional Annotation to Infer Putative Organ Functions To illustrate ways in which this salmonid EST database may be mined for information on putative organ functions, we focused on a selection of S. salar organ-specific library groups: gill, mixed gut (stomach + mid-gut + hind-gut, not including pyloric caecum), ovary, pyloric caecum, and pituitary gland (Table 4). Overall, 26% of S. salar assembled ESTs matched sequences in the GO database (Table 4). For organ-specific libraries or library groups, the percentage of assembled ESTs hitting GO sequences ranged from 25% (ovary) to 40% (pyloric caecum; Table 4). Z-statistics were used to determine if, for a given GO classification, the proportion of assembled ESTs in an organ-specific S. salar library group differed significantly from the proportion of assembled ESTs from remaining S. salar library groups (see Methods, Table 4, and Supplemental data at http://web.uvic.ca/cbr/ grasp). Because putative organ functions are sought, only those GO categories with disproportionately high numbers of assembled ESTs will be discussed. The gill library has disproportionately high numbers of assembled ESTs in the "iron binding," "oxidoreductase, acting on heme group of donors," and "transporter" GO categories (Table 4). Disproportionately high numbers of ovary assembled ESTs are seen in GO categories related to heavy metal (copper, iron, and zinc) binding and enzyme inhibition (Table 4). The pituitary gland library has disproportionately high numbers of assembled ESTs in "iron binding," "hormone binding," and "oxidoreductase, acting on heme group of donors" categories (Table 4).
This approach was used to acquire putative functional information on a poorly characterized organ, the pyloric caecum. The teleost pyloric caecum, a large elaborate set of finger-like extensions off the gut, is known to play an important role in nutrient uptake (Buddington and Diamond 1986
Both mixed gut and pyloric caecum library groups have disproportionately high numbers of assembled ESTs in the "enzyme," "oxidoreducatase, acting on heme group of donors," and "transporter" GO categories (Table 4). These may point to general functions along the digestive tract. "Iron binding" and "hydrolase" GO categories have disproportionately high numbers of assembled ESTs in mixed gut but not pyloric caecum (Table 4). There are disproportionately high numbers of assembled ESTs in the "cytochrome P450," "selenium binding," "oxidoreductase, acting on NADH or NADPH," "oxidoreductase, acting on peroxide as acceptor," and "transferring sulfur-containing groups" categories in pyloric caecum but not mixed gut library groups (Table 4), indicating putative specialized roles for the pyloric caecum. Selenium is a component of selenoprotein P and glutathione peroxidases, antioxidant enzymes that protect cells from oxidative injury (Deplancke and Gaskins 2002
Additional hypothetical functions of the pyloric caecum may be proposed by examining the largest EST clusters (representing highly expressed genes) in the pyloric caecum library group, and locating other members of these clusters across all S. salar library groups (Table 5). Several defense-relevant EST clusters, including CC chemokine macrophage inflammatory protein (MIP)-3a, galectin, and GDP-D-mannose-4,6-dehydratase (GMD), derive most of their ESTs from pyloric caecum libraries (Table 5). Galectins serve as master regulators of immune cell homeostasis during innate immune responses (Rabinovich et al. 2002
Application of a Salmonid cDNA Microarray to Different Species
To evaluate the effect of element (cDNA spotted onto the microarray slide) and target (labeled cDNA hybridized to the slide) species affiliations on hybridization characteristics, data and statistics for AS and RT microarray elements were compiled separately. On AS probes, AS target gave the highest signal (mean of three slides: 2.01E7, SEM 4.99E5), followed by RT (mean of three slides: 1.88E7, SEM 8.16E4), LW (mean of three slides: 1.54E7, SEM 3.31E5), and rainbow smelt (mean of three slides: 6.61E6, SEM 5.37E5; Table 6). On RT probes, RT target gave the highest signal (mean of three slides: 2.53E6, SEM 4.19E4), followed by AS (mean of three slides: 1.93E6, SEM 5.25E4), LW (mean of three slides: 1.88E6, SEM 1.64E4), and rainbow smelt (mean of three slides: 8.24E5, SEM 7.56E4; Table 6).
The ranking of hybridization performances conformed to expectations, given the evolutionary relationships of the species tested (Fig. 2A,B). AS and RT, members of the subfamily Salmoninae, diverged in the Miocene 8 to 20 million years ago (Stearley 1992
Identification of Candidate Duplicated Genes
Aquaculture and Sampling S. salar (McConnell strain) juveniles were obtained from Heritage Aquaculture (British Columbia, Canada), and cultured throughout their life history. Subadult S. salar were sampled from various tissues at 2.75 years of age (Fisheries and Oceans Canada, West Vancouver, British Columbia) and used for generating all adult cDNA libraries and labeled targets for microarray hybridizations. For juvenile cDNA libraries, S. salar (McConnell strain) and O. mykiss (Tzenzaicut Lake strain) were obtained from SeaSpring Hatchery (Duncan, British Columbia) and Vancouver Island Trout Hatchery (Duncan, British Columbia), respectively. For labeled targets used in microarray hybridizations, embryonic stages of O. mykiss were derived from a domesticated strain (Spring Valley Trout Farm, Langley, British Columbia) and cultured to 80g before sampling. O. mykiss gonadal tissues ( 1.5 years; Spring Valley Strain), used to generate subtractive cDNA libraries, were obtained from Mountain Trout Sales (Sooke, British Columbia). O. tshawytscha tissues were obtained from 4-year-old females (Robertson Creek, British Columbia); O. nerka tissues were obtained from whole juvenile fish (Dr. L.J. Albright, Simon Fraser University); C. clupeaformis brain and liver were obtained from 3-year-old animals (Laboratoire Bernatchez, Université Laval, Quebec), and Osmerus mordax livers were obtained from adult smelt (NRC Institute for Marine Biosciences). Fish were raised in fiberglass tanks with natural lighting and at densities <10kg/m3 with water input rate >1 L min1 kg1. S. salar and O. tshawytscha were reared in fresh 10°C well water until smolt stage (1.5 years) and then transferred to sea water until sexual maturation. O. mykiss were cultured only in fresh 10°C well water. Most fish were fed to satiation three times per day with commercial salmon diets (Pacific Apollo 1000, Moore Clarke, Vancouver, British Columbia) comprised of 40% protein and 25% lipid. Fish were killed by a blow to the head, followed by rapid dissection. Tissues were flash-frozen in liquid nitrogen and stored at 80°C until RNA extraction. For gut tissues, discrete sections were excised and the lumen gently rinsed free of food and feces with a stream of ice-cold phosphate-buffered saline (10 mM PO4, 138 mM NaCl, and 27 mM KCl at pH 7.4).
cDNA Libraries
Sequencing, Sequence Analysis, and Contig Assembly Vectors were screened by using cross_match (part of the PHRAP package, version 0.990329), with minscore = 18. This is more sensitive than Consed, allowing detection of adaptor sequences in subtractive libraries. All vector was trimmed from the ends of the sequence. If there was remaining vector in the middle, it was removed and the shorter of the two remaining fragments trimmed with it. To trim poly-A tails, sequences were scanned from their ends forward to the beginning of the last run of consecutive As. If the tail of the sequence up to that point was at least 60% A, then it was considered part of the tail. This test was repeated from that point forward until it failed. The portion of the sequence that passed was considered poly-A tail. If this test found nothing, then the last 100 bases of the sequence were scanned for a run of at least 15 consecutive As. If found, then the trailing sequence was assumed to be bad or vector, and all sequence up to and including the run of As was trimmed. To scan for poly-T tails, the same tests were performed on reverse-complemented sequences. Sequences were not considered poly-A or poly-T tails if they were <10 bases in length.
PHRAP (http://www.genome.washington.edu/UWGC), under stringent clustering parameters (minimum score, 100; repeat stringency, 0.99), was used to assemble ESTs into contigs. Contig consensus sequences and singleton sequences were aligned with nonredundant GenBank nucleotide and amino acid sequence databases by using BLASTN and BLASTX, respectively (Altschul et al. 1990
Assembled EST contigs were scanned for repeats by using REPuter (Kurtz et al. 2001 O. mykiss orthologs to S. salar contigs were detected by semiglobal (end-gaps-free) pairwise alignment of forward and reverse-complement contigs. Alignments with overlaps of <100 nucleotides were discarded. O. mykiss contigs were considered orthologous to an S. salar contig if either the forward- or reverse-complement alignment showed at least 80% identity.
Functional Characterization of EST Contigs
Microarray Fabrication and Quality Control
All cDNAs were printed as double, side-by-side spots on Telechem Superamine slides (Arrayit) with the Biorobotics Microgrid II microarray printer (Apogent Discoveries). Microspot 10K quill pins (Biorobotics) in a 48-pin tool were used to deposit
Microarray Hybridization and Analysis The fluorescent images of hybridized arrays were acquired by using ScanArray Express (PerkinElmer). The Cy3 cyanine fluor was excited at 543 nm, and the same laser power (90%) and photomultiplier tube (PMT) setting (75) were used for all slides in the study. Fluorescent intensity data was extracted by using Imagene 5.5 software (Biodiscovery). To avoid transformations associated with background correction (i.e., setting negative background corrected median signal values to zero), raw median signal values were analyzed. No normalization was applied to the data. From the raw Imagene fluorescence intensity report files, the gene lists were sorted, and median signal values from 1356 control elements (204 buffer alone, 912 bare glass, and 240 GFP cDNA) were analyzed. For each slide, threshold was calculated as the mean intensity for these 1356 controls plus 2 SD. For data analyses, the 6440 S. salar (AS) chip elements and 916 O. mykiss (RT) chip elements were considered separately. The mean numbers of AS and RT elements passing threshold, mean total slide signal (salmonid elements only) and SEM, mean total slide background (local background fluorescence intensities associated with salmonid elements) and SEM, and average signal and background per salmonid element were calculated by slide and by species. To assess array-wide performance, signal-to-background ratio was calculated as raw total signal divided by raw total background.
This research was supported by Genome Canada, Genome BC, and the Province of BC and, additionally, by the Natural Sciences and Engineering Research Council of Canada (B.K., W.D.). We would like to thank Carlo Biagi, Steve Dann, and Shelby Temple for their assistance in obtaining tissues for cDNA library construction; Bento Soares and Brian Berger for providing methods and advice on normalizing cDNA libraries, and all those at the BCCA Genome Sciences Centre who contributed to this work. The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.1687304. Article published online before print in February 2004.
7 Corresponding author. [Supplemental material is available online at http://web.uvic.ca/cbr/grasp. The sequence data from this study have been submitted to GenBank dbEST under accession nos.: Salmo salar, BU965588 [GenBank] BU965906, CA036414 [GenBank] CA039704, CA039711 [GenBank] CA064598, CA767613 [GenBank] CA770910, and CB498694 [GenBank] CB518126; Oncorhynchus mykiss, CB485850 [GenBank] CB498693; Oncorhynchus tshawytscha, CB484816 [GenBank] CB485849; Oncorhynchus nerka, CD510521 [GenBank] CD511184; and Coregonus clupeaformis, CB483540 [GenBank] CB484653. The following individuals kindly provided reagents, samples, or unpublished information as indicated in the paper: C. Biagi, S. Dann, S. Temple, and R. Roper stimulated the S. salar head kidney cells used to create one cDNA library group.]
Allendorf, F.W. and Danzmann, R.G. 1997. Secondary tetrasomic segregation of MDH-B and preferential pairing of homologues in rainbow trout. Genetics 145: 10831092.[Abstract] Allendorf, F.W. and Thorgaard, G.H. 1984. Tetraploidy and the evolution of salmonid fishes. In Evolutionary genetics of fishes (ed. B.J. Turner), pp. 153. Plenum Press, New York. Altschul, S.F., Gish, W., Miller, W., Myers, E.W., and Lipman, D.J. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403410.[CrossRef][Medline]
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 33893402. Anderson, T.W. and Finn, J.D. 1996. The new statistical analysis of data. Springer-Verlag, New York.
Bailey, G.S., Poulter, R.T., and Stockwell, P.A. 1978. Gene duplication in tetraploid fish: Model for gene silencing at unlinked duplicated loci. Proc. Natl. Acad. Sci. 75: 55755579. Bailey, G.S., Williams, D.E., and Hendricks, J.D. 1996. Fish models for environmental carcinogenesis: The rainbow trout. Environ. Health Perspect. 104: 521. Basu, N., Todgham, A.E., Ackerman, P.A., Bibeau, M.R., Nakano, K., Schulte, P.M., and Iwama, G.K. 2002. Heat shock protein genes and their functional significance in fish. Gene 295: 173183.[CrossRef][Medline]
Bonaldo, M.F., Lennon, G., and Soares, M.B. 1996. Normalization and subtraction: Two approaches to facilitate gene discovery. Genome Res. 6: 791806. Booke, H.E. 1968. Cytotaxonomic studies of the coregonine fishes of the Great Lakes, USA: DNA and karyotype analysis. J. Fish. Res. Board Can. 25: 16671687. Brazma, A., Hingamp, P., Quackenbush, J., Sherlock, G., Spellman, P., Stoeckert, C., Aach, J., Ansorge, W., Ball, C.A., Causton, H.C., et al. 2001. Minimum information about a microarray experiment (MIAME): Toward standards for microarray data. Nat. Genet. 29: 365371.[CrossRef][Medline]
Buddington, R.K. and Diamond, J.M. 1986. Aristotle revisited: The function of pyloric caeca in fish. Proc. Natl. Acad. Sci. 83: 80128014.
Burk, R.F., Hill, K.E., and Motley, A.K. 2003. Selenoprotein metabolism and function: Evidence for more than one function for selenoprotein P. J. Nutr. 133: 1517S1520S. Coe, I.R., von Schalburg, K.R., and Sherwood, N.M. 1995. Characterization of the Pacific salmon gonadotropin-releasing hormone gene, copy number, and transcription start site. Mol. Cell. Endocrinol. 115: 113122.[CrossRef][Medline] Danielson, P.B. 2002. The cytochrome p450 superfamily: Biochemistry, evolution and drug metabolism in humans. Curr. Drug Metab. 3: 561597.[CrossRef][Medline] Davey, G.C., Caplice, N.C., Martin, S.A., and Powell, R. 2001. A survey of genes in the Atlantic salmon (Salmo salar) as identified by expressed sequence tags. Gene 263: 121130.[Medline] Deplancke, B. and Gaskins, H.R. 2002. Redox control of the transsulfuration and glutathione biosynthesis pathways. Curr. Opin. Clin. Nutr. Metab. Care 5: 8592.[CrossRef][Medline] Devlin, R.H. 1993. Sequence of sockeye salmon type 1 and type 2 growth hormone genes and the relationship of rainbow trout with Atlantic and Pacific salmon. Can. J. Fish. Aquat. Sci. 50: 17381748. Devlin, R.H., Biagi, C.A., Yesaki, T.Y., Smailus, D.E., and Byatt, J.C. 2001. Growth of domesticated transgenic fish. Nature 409: 781782.[CrossRef][Medline] Douglas, S.E. and Gallant, J.W. 1998. Isolation of cDNAs for trypsinogen from the winter flounder. Pleuronectes americanus. J. Mar. Biotechnol. 6: 214219. Douglas, S.E., Bullerwell, C.E., and Gallant, J.W. 1999a. Molecular investigation of aminopeptidase N expression in the winter flounder. Pleuronectes americanus. J. Appl. Ichthyol. 15: 8086.[CrossRef] Douglas, S.E., Gallant, J.W., Bullerwell, C.E., Wolff, C., Munholland, J., and Reith, M.E. 1999b. Winter flounder expressed sequence tags: Establishment of an EST database and identification of novel fish genes. Mar. Biotechnol. 1: 458464.[Medline] Eshel, R., Besser, M., Zanin, A., Sagi-Assif, O., and Witz, I.P. 2001. The FX enzyme is a functional component of lymphocyte activation. Cell. Immunol. 213: 141148.[CrossRef][Medline]
Ewing, B. and Green, P. 1998. Base-calling of automated sequencer traces using PHRED, II: Error probabilities. Genome Res. 8: 186194.
Ewing, B., Hillier, L., Wendl, M.C., and Green, P. 1998. Base-calling of automated sequencer traces using PHRED, I: Accuracy assessment. Genome Res. 8: 175185. Faillace, M.P., Julian, D., and Korenbrot, J.I. 2002. Mitotic activation of proliferative cells in the inner nuclear layer of the mature fish retina: Regulatory signals and molecular markers. J. Comp. Neurol. 451: 127141.[CrossRef][Medline] Gregory, T.R. 2002. Animal genome size database. http://www.genomesize.com. Holland, P.W., Garcia-Fernandez, J., Williams, N.A., and Sidow, A. 1994. Gene duplications and the origins of vertebrate development. Development (Suppl.) 1994: 125133. Katchamart, S., Miranda, C.L., Henderson, M.C., Pereira, C.B., and Buhler, D.R. 2002. Effect of xenoestrogen exposure on the expression of cytochrome P450 isoforms in rainbow trout liver. Environ. Toxicol. Chem. 11: 24452451.
Kido, Y., Aono, M., Yamaki, T., Matsumoto, K., Murata, S., Saneyoshi, M., and Okada, N. 1991. Shaping and reshaping of salmonid genomes by amplification of tRNA-derived retroposons during evolution. Proc. Natl. Acad. Sci. 88: 23262330.
Kurtz, S., Choudhuri, J.V., Ohlebusch, E., Schleiermacher, C., Stoye, J., and Giegerich, R. 2001. REPuter: The manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res. 29: 46334642. Madigou, T., Uzbekova, S., Lareyre, J.J., and Kah, O. 2002. Two messenger RNA isoforms of the gonadotrophin-releasing hormone receptor, generated by alternative splicing and/or promoter usage, are differentially expressed in rainbow trout gonads during gametogenesis. Mol. Reprod. Dev. 63: 151160.[CrossRef][Medline] Nelson, J.S. 1994. Fishes of the world, 3rd ed. John Wiley & Sons, New York. Ohno, S. 1970. Evolution by gene duplication. Springer-Verlang, Heidelberg, Germany. Ohno, S., Stenius, C., Faisst, E., and Zenzes, M.T. 1965. Post-zygotic chromosomal rearrangements in rainbow trout (Salmo irideus GIBBONS) Cytogenetics 4: 117129. Ohno, S., Wolf, U., and Atkin, N.B. 1968. Evolution from fish to mammals by gene duplication. Hereditas 59: 169187.[Medline]
Ohyama, C., Smith, P.L., Angata, K., Fukuda, M.N., Lowe, J.B., and Fukuda, M. 1998. Molecular cloning and expression of GDP-D-mannose-4,6-dehydratase, a key enzyme for fucose metabolism defective in Lec13 cells. J. Biol. Chem. 273: 1458214587. Phillips, R.B. and Oakley, T.H. 1997. Phylogenetic relationships among the Salmoninae based on nuclear and mitochondrial DNA sequences. In Molecular systematics of fishes (eds. T.D. Kocher and C.A. Stepien), pp. 145162. Academic Press, San Diego, CA. Phillips, R. and Ráb, P. 2001. Chromosome evolution in the Salmonidae (Pisces): An update. Biol. Rev. 76: 125.[Medline] Rabinovich, G.A., Rubinstein, N., and Toscano, M.A. 2002. Role of galectins in inflammatory and immunomodulatory processes. Biochim. Biophys. Acta 1572: 274284.[Medline] Schomburg, L., Schweizer, U., Holtmann, B., Flohe, L., Sendtner, M., and Kohrle, J. 2003. Gene disruption discloses role of selenoprotein P in selenium delivery to target tissues. Biochem. J. 370: 397402.[CrossRef][Medline]
Shum, B.P., Guethlein, L., Flodin, L.R., Adkison, M.A., Hedrick, R.P., Nehring, R.B., Stet, R.J.M., Secombes, C., and Parham, P. 2001. Modes of salmonid MHC class I and II evolution differ from the primate paradigm. J. Immunol. 166: 32973308. Sidow, A. 1996. Gen(om)e duplications in the evolution of early vertebrates. Curr. Opin. Genet. Dev. 6: 715722.[CrossRef][Medline] Smith, G.R. and Stearley, R.F. 1989. The classification and scientific names of rainbow and cutthroat trouts. Fisheries 14: 410.
Soares, M.B., Bonaldo, M.F., Jelene, P., Su, L., Lawton, L., and Efstratiadis, A. 1994. Construction and characterization of a normalized cDNA library. Proc. Natl. Acad. Sci. 91: 92289232. Stearley, R.F. 1992. Historical ecology of the Salmoninae, with special reference to Oncorhynchus. In Systematic historical ecology and North American freshwater fishes (ed. R.L. Mayden), pp. 622658. Stanford University Press, Stanford, CA. Tipsmarck, C.K., Madsen, S.S., Seidelin, M., Christensen, A.S., Cutler, C.P., and Cramb, G. 2002. Dynamics of Na+,K+,2Cl cotransporter and Na+,K+-ATPase expression in the branchial epithelium of brown trout (Salmo trutta) and Atlantic salmon (Salmo salar). J. Exp. Zool. 293: 106118.[CrossRef][Medline] Vinogradov, A.E. 1998. Genome size and GC-percent in vertebrates as determined by flow cytometry: The triangular relationship. Cytometry 31: 100109.[CrossRef][Medline] Wolfe, K.H. 2001. Yesterday's polyploids and the mystery of diploidization. Nat. Rev. Genet. 2: 333341.[CrossRef][Medline] Zhang, C., Brown, S.B., and Hara, T.J. 2001. Biochemical and physiological evidence that bile acids produced and released by lake char (Salvelinus namaycush) function as chemical signals. J. Comp. Physiol. B 171: 161171.[CrossRef][Medline]
http://www.geneontology.org; the Gene Ontology Consortium (2001). http://web.uvic.ca/cbr/grasp; University of Victoria Centre for Biomedical Research. http://woodstock.ceh.uvic.ca/nkuipers/public_html/; Web interface for querying a database containing BLAST-identified candidate repeats in the GRASP EST database. http://www.genome.washington.edu/UWGC; University of Washington Genome Centre (PHRED version 0.990722.j; PHRAP version 0.990329).
Received June 25, 2003;
accepted in revised format December 12, 2003.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||