Genome Res. 6:807-828, 1996
©1996 by Cold Spring Harbor Laboratory Press; ISSN 1088-9051
Generation and analysis of 280,000 human expressed sequence tags.
L D Hillier,
G Lennon,
M Becker,
M F Bonaldo,
B Chiapelli,
S Chissoe,
N Dietrich,
T DuBuque,
A Favello,
W Gish,
M Hawkins,
M Hultman,
T Kucaba,
M Lacy,
M Le,
N Le,
E Mardis,
B Moore,
M Morris,
J Parsons,
C Prange,
L Rifkin,
T Rohlfing,
K Schellenberg, and
M Marra
Genome Sequencing Center, Washington University School of Medicine, St. Louis, Missouri 63108, USA. lhillier@watson.wustl.edu
Abstract
We report the generation of 319,311 single-pass sequencing reactions (known as expressed sequence tags, or ESTs) obtained from the 5' and 3' ends of 194,031 human cDNA clones. Our goal has been to obtain tag sequences from many different genes and to deposit these in the publicly accessible Data Base for Expressed Sequence Tags. Highly efficient automatic screening of the data allows deposition of the annotated sequences without delay. Sequences have been generated from 26 oligo(dT) primed directionally cloned libraries, of which 18 were normalized. The libraries were constructed using mRNA isolated from 17 different tissues representing three developmental states. Comparisons of a subset of our data with nonredundant human mRNA and protein data bases show that the ESTs represent many known sequences and contain many that are novel. Analysis of protein families using Hidden Markov Models confirms this observation and supports the contention that although normalization reduces significantly the relative abundance of redundant cDNA clones, it does not result in the complete removal of members of gene families.

CiteULike Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
A. Siepel, M. Diekhans, B. Brejova, L. Langton, M. Stevens, C. L.G. Comstock, C. Davis, B. Ewing, S. Oommen, C. Lau, et al.
Targeted discovery of novel human exons by comparative genomics
Genome Res.,
December 1, 2007;
17(12):
1763 - 1773.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Carninci
Constructing the landscape of the mammalian transcriptome
J. Exp. Biol.,
May 1, 2007;
210(9):
1497 - 1506.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Brulliard, D. Lorphelin, O. Collignon, W. Lorphelin, B. Thouvenot, E. Gothie, S. Jacquenet, V. Ogier, O. Roitel, J.-M. Monnez, et al.
Nonrandom variations in human cancer ESTs indicate that mRNA heterogeneity increases during carcinogenesis
PNAS,
May 1, 2007;
104(18):
7522 - 7527.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. C. Cheng, K. M. Sakamoto, E. M. Horwitz, S. L. Karsten, L. Shoemaker, H. I. Kornblumc, and P. Malik
Report on the Workshop "New Technologies in Stem Cell Research," Society for Pediatric Research, San Francisco, California, April 29, 2006
Stem Cells,
April 1, 2007;
25(4):
1070 - 1088.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. Masoudi-Nejad, K. Tonomura, S. Kawashima, Y. Moriya, M. Suzuki, M. Itoh, M. Kanehisa, T. Endo, and S. Goto
EGassembler: online bioinformatics service for large-scale processing, clustering and assembling ESTs and genomic DNA fragments.
Nucleic Acids Res.,
July 1, 2006;
34(Web Server issue):
W459 - W462.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Zhang, L. Zhang, and K. R. Coombes
Gene sequence signatures revealed by mining the UniGene affiliation network
Bioinformatics,
February 15, 2006;
22(4):
385 - 391.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. R. Brent
Genome annotation past, present, and future: How to define an ORF at each locus
Genome Res.,
December 1, 2005;
15(12):
1777 - 1786.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. Jiang, K. M. Whitworth, N. J. Bivens, J. E. Ries, R. J. Woods, L. J. Forrester, G. K. Springer, N. Mathialagan, C. Agca, R. S. Prather, et al.
Large-Scale Generation and Analysis of Expressed Sequence Tags from Porcine Ovary
Biol Reprod,
December 1, 2004;
71(6):
1991 - 2002.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. H. Peng, H. Zadeh, G. R. Lazo, J. P. Gustafson, S. Chao, O. D. Anderson, L. L. Qi, B. Echalier, B. S. Gill, M. Dilbirligi, et al.
Chromosome Bin Map of Expressed Sequence Tags in Homoeologous Group 1 of Hexaploid Wheat and Homoeology With Rice and Arabidopsis
Genetics,
October 1, 2004;
168(2):
609 - 623.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. G. Hossain, V. Kalavacharla, G. R. Lazo, J. Hegstad, M. J. Wentz, P. M. A. Kianian, K. Simons, S. Gehlhar, J. L. Rust, R. R. Syamala, et al.
A Chromosome Bin Map of 2148 Expressed Sequence Tag Loci of Wheat Homoeologous Group 7
Genetics,
October 1, 2004;
168(2):
687 - 699.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. L. Qi, B. Echalier, S. Chao, G. R. Lazo, G. E. Butler, O. D. Anderson, E. D. Akhunov, J. Dvorak, A. M. Linkiewicz, A. Ratnasiri, et al.
A Chromosome Bin Map of 16,000 Expressed Sequence Tag Loci and Distribution of Genes Among the Three Genomes of Polyploid Wheat
Genetics,
October 1, 2004;
168(2):
701 - 712.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. J. Robinson, D. J. Cram, C. T. Lewis, and I. A.P. Parkin
Maximizing the Efficacy of SAGE Analysis Identifies Novel Transcripts in Arabidopsis
Plant Physiology,
October 1, 2004;
136(2):
3223 - 3233.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. E. Scheetz, J. J. Laffin, B. Berger, S. Holte, S. A. Baumes, R. Brown II, S. Chang, J. Coco, J. Conklin, K. Crouch, et al.
High-Throughput Gene Discovery in the Rat
Genome Res.,
April 1, 2004;
14(4):
733 - 741.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
T. E. Scheetz, J. Zabner, M. J. Welsh, J. Coco, M. Eyestone, M. de Fatima Bonaldo, T. Kucaba, T. L. Casavant, M. B. Soares, and P. B. McCray Jr.
Large-scale gene discovery in human airway epithelia reveals novel transcripts
Physiol Genomics,
March 12, 2004;
17(1):
69 - 77.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Mitreva, J. P. McCarter, J. Martin, M. Dante, T. Wylie, B. Chiapelli, D. Pape, S. W. Clifton, T. B. Nutman, and R. H. Waterston
Comparative Genomics of Gene Expression in the Parasitic and Free-Living Nematodes Strongyloides stercoralis and Caenorhabditis elegans
Genome Res.,
February 1, 2004;
14(2):
209 - 220.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. S. Clark, Y. J.K. Edwards, D. Peterson, S. W. Clifton, A. J. Thompson, M. Sasaki, Y. Suzuki, K. Kikuchi, S. Watabe, K. Kawakami, et al.
Fugu ESTs: New Resources for Transcription Analysis and Genome Annotation
Genome Res.,
December 1, 2003;
13(12):
2747 - 2753.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Gissi and G. Pesole
Transcript Mapping and Genome Annotation of Ascidian mtDNA Using EST Data
Genome Res.,
September 1, 2003;
13(9):
2203 - 2212.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Balasenthil and R. K. Vadlamudi
Functional Interactions between the Estrogen Receptor Coactivator PELP1/MNAR and Retinoblastoma Protein
J. Biol. Chem.,
June 6, 2003;
278(24):
22119 - 22127.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Carninci, K. Waki, T. Shiraki, H. Konno, K. Shibata, M. Itoh, K. Aizawa, T. Arakawa, Y. Ishii, D. Sasaki, et al.
Targeting a Complex Transcriptome: The Construction of the Mouse Full-Length cDNA Encyclopedia
Genome Res.,
June 1, 2003;
13(6):
1273 - 1289.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
F. Yokoi, H. Hiraishi, and K. Izuhara
Molecular Cloning of a cDNA for the Human Phospholysine Phosphohistidine Inorganic Pyrophosphate Phosphatase
J. Biochem.,
May 1, 2003;
133(5):
607 - 614.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Li, B. P. Brunk, J. C. Kissinger, D. Pape, K. Tang, R. H. Cole, J. Martin, T. Wylie, M. Dante, S. J. Fogarty, et al.
Gene Discovery in the Apicomplexa as Revealed by EST Sequencing and Assembly of a Comparative Gene Database
Genome Res.,
March 1, 2003;
13(3):
443 - 454.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
R. Sorek and H. M. Safer
A novel algorithm for computational identification of contaminated EST libraries
Nucleic Acids Res.,
February 1, 2003;
31(3):
1067 - 1074.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
H. G. R. Thompson, J. W. Harris, B. J. Wold, S. R. Quake, and J. P. Brody
Identification and Confirmation of a Module of Coexpressed Genes
Genome Res.,
October 1, 2002;
12(10):
1517 - 1522.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. Chen, M. Sun, S. Lee, G. Zhou, J. D. Rowley, and S. M. Wang
Identifying novel transcripts and novel genes in the human genome by using novel SAGE tags
PNAS,
September 17, 2002;
99(19):
12257 - 12262.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Ye and J. M. Parry
The discovery and confirmation of single nucleotide polymorphisms in the human p53R2 gene by EST database analysis
Mutagenesis,
September 1, 2002;
17(5):
361 - 364.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
F. Diehl, B. Beckmann, N. Kellner, N. C. Hauser, S. Diehl, and J. D. Hoheisel
Manufacturing DNA microarrays from unpurified PCR products
Nucleic Acids Res.,
August 15, 2002;
30(16):
e79 - e79.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. Stapleton, G. Liao, P. Brokstein, L. Hong, P. Carninci, T. Shiraki, Y. Hayashizaki, M. Champe, J. Pacleb, K. Wan, et al.
The Drosophila Gene Collection: Identification of Putative Full-Length cDNAs for 70% of D. melanogaster Genes
Genome Res.,
August 1, 2002;
12(8):
1294 - 1300.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
C. Hartmann, L. Johnk, G. Kitange, Y. Wu, L. K. Ashworth, R. B. Jenkins, and D. N. Louis
Transcript Map of the 3.7-Mb D19S112-D19S246 Candidate Tumor Suppressor Region on the Long Arm of Chromosome 19
Cancer Res.,
July 15, 2002;
62(14):
4100 - 4108.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
J. S. Stanley, D. M. Mock, J. B. Griffin, and J. Zempleni
Biotin Uptake into Human Peripheral Blood Mononuclear Cells Increases Early in the Cell Cycle, Increasing Carboxylase Activities
J. Nutr.,
July 1, 2002;
132(7):
1854 - 1859.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Y. Shevchenko, G. G. Bouffard, Y. S. N. Butterfield, R. W. Blakesley, J. L. Hartley, A. C. Young, M. A. Marra, S. J. M. Jones, J. W. Touchman, and E. D. Green
Systematic sequencing of cDNA clones using the transposon Tn5
Nucleic Acids Res.,
June 1, 2002;
30(11):
2469 - 2477.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. K. Nam, S. Lee, G. Zhou, X. Cao, C. Wang, T. Clark, J. Chen, J. D. Rowley, and S. M. Wang
Oligo(dT) primer generates a high frequency of truncated cDNAs through internal poly(A) priming during reverse transcription
PNAS,
April 30, 2002;
99(9):
6152 - 6156.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. D. Eley, J. L. Reiter, A. Pandita, S. Park, R. B. Jenkins, N. J. Maihle, and C. D. James
A chromosomal region 7p11.2 transcript map: Its development and application to the study of EGFR amplicons in glioblastoma
Neuro-oncol,
April 1, 2002;
4(2):
86 - 94.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Skrabanek and F. Campagne
TissueInfo: high-throughput identification of tissue expression profiles and specificity
Nucleic Acids Res.,
November 1, 2001;
29(21):
e102 - e102.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Ehrt, D. Schnappinger, S. Bekiranov, J. Drenkow, S. Shi, T. R. Gingeras, T. Gaasterland, G. Schoolnik, and C. Nathan
Reprogramming of the Macrophage Transcriptome in Response to Interferon-{gamma} and Mycobacterium tuberculosis: Signaling Roles of Nitric Oxide Synthase-2 and Phagocyte Oxidase
J. Exp. Med.,
October 15, 2001;
194(8):
1123 - 1140.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
A. A. Camargo, H. P. B. Samaia, E. Dias-Neto, D. F. Simao, I. A. Migotto, M. R. S. Briones, F. F. Costa, M. Aparecida Nagai, S. Verjovski-Almeida, M. A. Zago, et al.
From the Cover: The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome
PNAS,
October 9, 2001;
98(21):
12103 - 12108.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. Hayashida-Hibino, H. Watanabe, K. Nishida, M. Tsujikawa, T. Tanaka, Y. Hori, Y. Saishin, and Y. Tano
The Effect of TGF-{beta}1 on Differential Gene Expression Profiles in Human Corneal Epithelium Studied by cDNA Expression Array
Invest. Ophthalmol. Vis. Sci.,
July 1, 2001;
42(8):
1691 - 1697.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Z. Kan, E. C. Rouchka, W. R. Gish, and D. J. States
Gene Structure Prediction and Alternative Splicing Analysis Using Genomically Aligned ESTs
Genome Res.,
May 1, 2001;
11(5):
889 - 900.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
O. Ohara and G. Temple
Directional cDNA library construction assisted by the in vitro recombination reaction
Nucleic Acids Res.,
February 15, 2001;
29(4):
e22 - e22.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. H. Buetow, M. Edmonson, R. MacDonald, R. Clifford, P. Yip, J. Kelley, D. P. Little, R. Strausberg, H. Koester, C. R. Cantor, et al.
High-throughput development and characterization of a genomewide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry
PNAS,
December 28, 2000;
(2000)
21506298.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
J. Andrews, G. G. Bouffard, C. Cheadle, J. Lü, K. G. Becker, and B. Oliver
Gene Discovery Using Computational and Microarray Analysis of Transcription in the Drosophila melanogaster Testis
Genome Res.,
December 1, 2000;
10(12):
2030 - 2043.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
G. K.-S. Wong, D. A. Passey, Y.-z. Huang, Z. Yang, and J. Yu
Is "Junk" DNA Mostly Intron DNA?
Genome Res.,
November 1, 2000;
10(11):
1672 - 1678.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
S. Kawamoto, J. Yoshii, K. Mizuno, K. Ito, Y. Miyamoto, T. Ohnishi, R. Matoba, N. Hori, Y. Matsumoto, T. Okumura, et al.
BodyMap: A Collection of 3' ESTs for Analysis of Human Gene Expression Information
Genome Res.,
November 1, 2000;
10(11):
1817 - 1827.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
D. R. Bentley
Decoding the human genome sequence
Hum. Mol. Genet.,
October 1, 2000;
9(16):
2353 - 2358.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. Carninci, Y. Shibata, N. Hayatsu, Y. Sugahara, K. Shibata, M. Itoh, H. Konno, Y. Okazaki, M. Muramatsu, and Y. Hayashizaki
Normalization and Subtraction of Cap-Trapper-Selected cDNAs to Prepare Full-Length cDNA Libraries for Rapid Discovery of New Genes
Genome Res.,
October 1, 2000;
10(10):
1617 - 1630.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
M. Hirosawa, K.-i. Ishikawa, T. Nagase, and O. Ohara
Detection of Spurious Interruptions of Protein-Coding Regions in Cloned cDNA Sequences by GeneMark Analysis
Genome Res.,
September 1, 2000;
10(9):
1333 - 1341.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
J. Stollberg, J. Urschitz, Z. Urban, and C. D. Boyd
A Quantitative Evaluation of SAGE
Genome Res.,
August 1, 2000;
10(8):
1241 - 1248.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
A. E. Lash, C. M. Tolstoshev, L. Wagner, G. D. Schuler, R. L. Strausberg, G. J. Riggins, and S. F. Altschul
SAGEmap: A Public Gene Expression Resource
Genome Res.,
July 1, 2000;
10(7):
1051 - 1060.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
L. Eckmann, J. R. Smith, M. P. Housley, M. B. Dwinell, and M. F. Kagnoff
Analysis by High Density cDNA Arrays of Altered Gene Expression in Human Intestinal Epithelial Cells in Response to Infection with the Invasive Enteric Bacteria Salmonella
J. Biol. Chem.,
May 5, 2000;
275(19):
14084 - 14094.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. R. Emmert-Buck, R. L. Strausberg, D. B. Krizman, M. F. Bonaldo, R. F. Bonner, D. G. Bostwick, M. R. Brown, K. H. Buetow, R. F. Chuaqui, K. A. Cole, et al.
Molecular Profiling of Clinical Tissue Specimens: Feasibility and Applications
J. Mol. Diagn.,
May 1, 2000;
2(2):
60 - 66.
[Full Text]
|
 |
|

|
 |

|
 |
 
B. A. Rikke, S. Murakami, and T. E. Johnson
Paralogy and Orthology of Tyrosine Kinases that Can Extend the Life Span of Caenorhabditis elegans
Mol. Biol. Evol.,
May 1, 2000;
17(5):
671 - 683.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
M. R. Emmert-Buck, R. L. Strausberg, D. B. Krizman, M. F. Bonaldo, R. F. Bonner, D. G. Bostwick, M. R. Brown, K. H. Buetow, R. F. Chuaqui, K. A. Cole, et al.
Molecular Profiling of Clinical Tissue Specimens : Feasibility and Applications
Am. J. Pathol.,
April 1, 2000;
156(4):
1109 - 1115.
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. Dias Neto, R. Garcia Correa, S. Verjovski-Almeida, M. R. S. Briones, M. A. Nagai, W. da Silva Jr., M. A. Zago, S. Bordin, F. F. Costa, G. H. Goldman, et al.
Shotgun sequencing of the human transcriptome with ORF expressed sequence tags
PNAS,
March 28, 2000;
97(7):
3491 - 3496.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
G. M. Rubin, L. Hong, P. Brokstein, M. Evans-Holm, E. Frise, M. Stapleton, and D. A. Harvey
A Drosophila Complementary DNA Resource
Science,
March 24, 2000;
287(5461):
2222 - 2224.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
C. R. Englert, G. V. Baibakov, and M. R. Emmert-Buck
Layered Expression Scanning: Rapid Molecular Profiling of Tumor Samples
Cancer Res.,
March 1, 2000;
60(6):
1526 - 1530.
[Abstract]
[Full Text]
|
 |
|

|
 |

|
 |
 
M. Ko, J. Kitchen, X Wang, T. Threat, X Wang, A Hasegawa, T Sun, M. Grahovac, G. Kargul, M. Lim, et al.
Large-scale cDNA analysis reveals phased gene expression patterns during preimplantation mouse development
Development,
January 4, 2000;
127(8):
1737 - 1749.
[Abstract]
[PDF]
|
 |
|

|
 |

|
 |
 
L. Duret and D. Mouchiroud
Determinants of Substitution Rates in Mammalian Genes: Expression Pattern Affects Selection Intensity but Not Mutation Rate
Mol. Biol. Evol.,
January 1, 2000;
17(1):
68 - 70.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Gawin, A. Niederführ, N. Schumacher, H. Hummerich, P. F.R. Little, and M. Gessler
A 7.5 Mb Sequence-Ready PAC Contig and Gene Expression Map of Human Chromosome 11p13-p14.1
Genome Res.,
November 1, 1999;
9(11):
1074 - 1086.
[Abstract]
[Full Text]
|
 |
|
|