|
|
|
|
Vol. 12, Issue 2, 292-297, February 2002
LETTER
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Control genes, commonly defined as genes that are ubiquitously expressed at stable levels in different biological contexts, have been used to standardize quantitative expression studies for more than 25 yr. We analyzed a group of large mammalian microarray datasets including the NCI60 cancer cell line panel, a leukemia tumor panel, and a phorbol ester induction time course as well as human and mouse tissue panels. Twelve housekeeping genes commonly used as controls in classical expression studies (including GAPD, ACTB, B2M, TUBA, G6PD, LDHA, and HPRT) show considerable variability of expression both within and across microarray datasets. Although we can identify genes with lower variability within individual datasets by heuristic filtering, such genes invariably show different expression levels when compared across other microarray datasets. We confirm these results with an analysis of variance in a controlled mouse dataset, showing the extent of variability in gene expression across tissues. The results show the problems inherent in the classical use of control genes in estimating gene expression levels in different mammalian cell contexts, and highlight the importance of controlled study design in the construction of microarray experiments.
[Supplemental material available online at http://genome.mcgill.ca/~pdlee/control_genes and and http://www.genome.org.]
| |
INTRODUCTION |
|---|
|
|
|---|
Although DNA microarrays open the door to large-scale expression
experiments (Lander 1999
; Young 2000
), a major
challenge facing these studies is the design of experimental controls
that will permit comparison of quantitative expression profiles
obtained from diverse biological contexts. In traditional assays,
standardization of mRNA levels has been achieved by comparison to the
level of a control gene, commonly defined as one that is ubiquitously
expressed at stable levels across many biological contexts. Methods of
standardization based on control genes have furthermore been used in
microarray and genomic studies (Khan et al. 1998
; Beger et al. 2001
).
We reexamine the traditional concepts of controls in expression
experiments in the aim of determining appropriate measures for the
control of microarray experiments.
In an attempt to identify genes that are expressed at constant levels
across a wide range of biological contexts, we analyzed four published
datasets that were prepared following similar methods based on a single
microarray technology (Affymetrix oligonucleotide microarrays). The
NCI60 dataset (Butte et al. 2000
) consists of microarray measurements
of gene expression in 60 cancer cell lines originating from nine tissue
types. A dataset obtained from patients with hematologic malignancies
(Golub et al. 1999
) includes expression profiles for multiple
homogeneous acute lymphoblastic leukemia (ALL) and acute myeloid
leukemia (AML) tumor samples. Temporal and developmental
fluctuations in control gene expression were assessed using a dataset
obtained from four cell lines following treatment with phorbol
12-myristate 13-acetate (TPA) (Tamayo et al. 1999
). Finally, the Huge
Index dataset provides in vivo gene expression data for six human
tissues (Warrington et al. 2000
).
| |
RESULTS AND DISCUSSION |
|---|
|
|
|---|
We initially studied the expression levels of 12 genes commonly used to normalize RNA levels measured by Northern blots or RT-PCR. The expression levels for many of these genes fluctuates dramatically both within and across datasets (Fig. 1). Within datasets, the maximum fold change (MFC, the ratio of the maximum and minimum values observed within a dataset) ranges from 1.3 for ACTB within the TPA induction dataset to >300 for VIM in the NCI60 dataset (Table 1). All commonly used control genes have an MFC of >2.0 in at least one dataset. In addition, the observed coefficients of variation (CVs) are frequently >0.5, reflecting the highly variable levels of expression of these genes within datasets.
|
|
We next used a simple heuristic filter to identify sets of genes
showing lower variability. After excluding genes with signal intensities below threshold and with an MFC <2, genes were sorted and
ranked according to their CVs. We use this measure of variability because it compensates for the apparent dependence of dispersion on
signal (Novak et al. 2002
). Similar results were obtained using alternate methods of estimating dispersion (data not shown). Of the
housekeeping genes analyzed, only GAPD and ACTB rank
among the 100 genes with the lowest variability; however, no
traditional control genes display consistently low variability across
the four datasets. Nine genes identified by filtering have CVs <0.7 across all four datasets (Table 2). These
are not genes that have commonly been used as controls, but include
several ribosomal protein (RP) genes (including RPS27A, RPL19, RPL11,
RPS29, and RPS3).
Even this set of genes shows differing amounts of variability across
datasets (Supplementary Fig. 1, available online at
http://genome.mcgill.ca/~pdlee/control_genes and
http://www.genome.org). For example, although RPS27A has the lowest CV in the NCI60 and leukemia datasets, its MFC ranges from 2.2 in the NCI60 dataset to 5.6 in the TPA induction dataset.
|
Our failure to identify control genes in the four expression datasets studied might occur if the microarray measurements were associated with high levels of technical variability. To assess whether the observed variation could be due to technical variability rather than biological context, we examined expression levels in triplicate for RNA samples obtained from liver, heart, lung, and brain of three male C57BL/6 mice reared under identical conditions using MU11KA and B arrays (Affymetrix) containing probe-sets for 11,000 mouse genes and ESTs. The expression levels of traditional control genes show greater variability among RNA samples obtained from different tissues than among RNA obtained from the same tissue harvested from different mice, or among identical RNA samples hybridized to replicate microarrays (Fig. 2). To determine whether other genes displayed similar behavior, we performed analysis of variance (ANOVA) on a per-gene basis to determine the amount of observed variability that could be attributed to differences among replicates, mice, or tissues. Technical replicates using identical RNA samples hybridized to three distinct arrays show the least amount of variability: only 3% of genes display significant differences across replicates (P < 0.05). Among biological replicates using RNA from three individual mice, 5%-10% of genes show significant differences (P < 0.05) after adjusting for variation between tissues, and between technical replicates. In contrast, 81%-99% of genes show significant variability (P < 0.05) among different tissues after adjusting for the variability between technical and biological replicates. This trend remains consistent regardless of the filtering criteria or procedure used to select genes (Table 3, Supplementary Fig. 3, available online at http://genome.mcgill.ca/~pdlee/control_genes and http://www.genome.org). ANOVA performed on the TPA induction and NCI60 datasets similarly reveals greater variability in gene expression across different tissues than across different time points, cell lines, or datasets (Supplementary Fig. 2 and Supplementary Tables 1, 2, and 3, available online at http://genome.mcgill.ca/~pdlee/control_genes and http://www.genome.org). Performing our analysis using multiple normalization methods did not impact our findings (Supplementary Fig. 2 and Supplementary Table 4, available online at http://genome.mcgill.ca/~pdlee/control_genes and http://www.genome.org). These results indicate that the variability in gene expression detected in this experiment is not due to technical or intermouse variability, but rather due to the inherent differences in individual RNA levels present among different tissue types.
|
|
It is possible that our failure to identify control genes may result from data-filtering techniques that excluded RNA species expressed at low copy number across a wide range of tissues, or genes that are simply not present on the microarrays used in these studies. These issues may be addressed by the future development of more sensitive complete genome arrays. Despite this, our results clearly show that the expression levels of genes that have been commonly used as controls in classical experiments vary significantly among different cellular and experimental contexts. Furthermore, we fail to identify mammalian genes that qualify as "control genes" on the basis of a definition of ubiquitous and stable expression. Although some genes do appear quite stable in expression level within any one experiment, there do not appear to be any genes expressed at stable levels across all four datasets studied in this paper. Hence, the traditional use of individual genes as normalization controls in experiments that compare diverse biological tissues would lead to substantial errors in the derived estimates of fold change in gene expression levels. From inspection of the data, it is apparent that some transcripts may serve as control genes for studies performed in a single tissue context; however, these conclusions are limited by a study design that does not address the effects of physiologic regulation on the expression of these genes.
The unproven existence of control genes seems to have achieved
acceptance in part because of its conceptual simplicity and practical
limitations of the past. Recent studies have expressed concern that
individual genes or groups of genes may serve as inadequate internal
standards for measuring RNA expression levels (Souaze et al. 1996
;
Savonet et al. 1997
; Ivell 1998
; Serazin-Leroy et al. 1998
; Oliveira et
al. 1999
; Thellin et al. 1999
; Suzuki et al. 2000
; Wu and Rees 2000
);
Measures for data standardization and quality control in microarray
databases are currently being reviewed by the MGED working group on
Microarray Data Annotations (www.mged.org). The establishment of
common frames of reference requires a reexamination of assumptions
inherent in the design of biological experiments. From these findings,
we propose that all genes are differentially expressed in at least one
biological context and that the expression of every gene is therefore
context dependent. Given the absence of ubiquitous control genes,
variation in microarray expression studies must instead be interpreted
using statistical characteristics of the data without preconceptions arising from the traditional notions of internal control genes.
| |
METHODS |
|---|
|
|
|---|
Public Microarray Datasets
Microarray datasets for the NCI60 cancer cell line panel, the ALL/AML tumors, and the TPA treatment in HL60, U937, NB4, and Jurkat cell lines are available at http://www.genome.wi.mit.edu/MPR/datasets. The human tissue expression profiles contained in the Huge Index dataset were obtained at http://www.hugeindex.org/.
Mouse Microarray Dataset
Mouse tissues were obtained from three adult male C57BL/6
littermates. Mice were killed by cervical dislocation and the tissues rapidly dissected and homogenized in Trizol reagent (Life
Technologies). Total cellular RNA was prepared according to the
manufacturer's instructions and analyzed by nondenaturing (1%
agarose-1 × TBE) gel electrophoresis. Probes for the microarray
studies were prepared by priming 20 µg of total RNA with 100 pmole of
T7- (T) 24 primer (Genosys). The RNA-primer mixture was denatured for
10 min at 70°C, and then chilled on ice. First-strand cDNA was
synthesized using Superscript II reverse transcriptase (Life
Technologies). Second-strand synthesis was performed using RNAse H, DNA
polymerase I, and Escherichia coli DNA ligase (Life
Technologies). Biotinylated riboprobes were prepared from the entire
cDNA reaction using the ENZO Bioarray High Yield RNA Transcript
Labeling Kit (ENZO Diagnostics). The average probe length was reduced
by incubating the probe in 1X Fragmentation Buffer for 35 min at
95°C. Hybridization was performed at 45°C for 16-20 h using 15 µg of biotinylated probe. Following hybridization, the arrays were
subjected to 10 low-stringency washes and 4 high-stringency washes
using a GeneChip Fluidics Station 400 (Affymetrix). Bound probe was
detected by incubating arrays with SAPE (streptavidin phycoerthryin,
Molecular Probes) and scanning the chips using a GeneArray Scanner
(Agilent). Scanned images were analyzed using the GeneChip Analysis
Suite 3.3 (Affymetrix). Full details of the microarray methods have
been described previously (Novak et al. 2002
).
Data Analysis
Traditional control genes analyzed in human datasets included:
-actin (ACTB),
-2-microglobulin (B2M),
phosphofructokinase (PFKP), phosphoglycerate kinase
(PGK1), aldolase A (ALDOA), phosphoglycerate mutase
(PGAM),
-tubulin (TUBA), glyceraldehyde-3
phosphate dehydrogenase (GAPD), glucose-6 phosphate
dehydrogenase (G6PD), lactate dehydrogenase A (LDHA),
hypoxanthine phosphoribosyltransferase (HPRT), and vimentin (VIM). Traditional control genes analyzed in mouse datasets
included asparagine synthetase (Asns), phosphofructokinase
(Pfkp), lactate dehydrogenase A (Ldh1), vimentin
(Vim), phosphoglycerate kinase (Pgk1), ubiquitin
(Ubc), glucose-6 phosphate dehydrogenase (G6pd), phosphoglycerate mutase (Pgam1),
-2-microglobulin
(B2m), glutamate dehydrogenase (Glud), hypoxanthine
phosphoribosyltransferase (Hprt), and
-tubulin (Tuba1).
For accession numbers, see Supplementary Table 5 (available online at
http://genome.mcgill.ca/~pdlee/control_genes and http://www.genome.org).
Regression scaling was performed only on datapoints assigned a `P'
absolute call by the Affymetrix GeneChip software: the absolute call
estimates the hybridization quality for an individual probe set on the
basis of measures of background and signal dispersion. The regression
scaling algorithm has been described previously (Novak et al. 2002
): it
uses normalization to the regression coefficient of the first sample in
each dataset. We rescaled datasets on the basis of mean overall
intensity per scan. Mean intensity was calculated on the genes with a
minimum average difference of 50 and an absolute call of `P' by the
GeneChip algorithm.
Data manipulation and analysis was accomplished using a variety of Perl and VBScripts in Microsoft Excel. Graphs were created using R (http://www.r-project.org). ANOVA was performed using SAS (SAS Institute Inc), testing the amount of observed variability in expression of each gene resulting from replicate (repeat hybridizations of the same RNA sample), mouse (samples from three individual mice), or tissue (samples from four different tissues); a general linear model was used on a per-gene basis (PROC GLM). P values considered were calculated for each variable individually, having adjusted for the variation resulting from remaining variables (added-last test / SAS Type III F-Test). We conducted ANOVA separately on subsets of the data meeting initial filtering criteria of minimum expression levels of greater than 20, 50, 100, or 200 units across all 12 experiments. ANOVA results must be interpreted with caution because the small sample size makes assessments of normality and homoscedasticity difficult. P values considered were for the added-last F-test (testing each variable individually, having adjusted for all other variables). Datasets, supplementary figures, tables, and analytical scripts are available at http://genome.mcgill.ca/~pdlee/control_genes.
| |
WEB SITE REFERENCES |
|---|
|
|
|---|
http://www-genome.wi.mit.edu/MPR, Whitehead/MIT Center for Genome Research.
http://www.hugeindex.org, The Human Gene Expression Index.
http://www.mged.org, Microarray Gene Expression Data Group.
http://www.r-project.org, The R Project.
http://genome.mcgill.ca/~pdlee/control_genes, Supplementary Data and Figures (Montreal Genome Centre).
| |
ACKNOWLEDGMENTS |
|---|
We thank J. Novak, B. Ge, J. Engert, C. Loredo-Osti, T. Golub, J. Staunton, D. DeGraaf, P. Tamayo, and T. Maniatis for their useful discussions. This research was supported by grants from the Canadian Institutes for Health Research (Grant number GOP 36056), the Canadian Genetics Diseases Network and the Mathematics of Information Technology and Complex Systems (Networks of Centres of Excellence Program), and a research contract from Bristol Myers Squibb, Millennium Pharmaceuticals Inc., and Affymetrix. R.S. and T.J.H. are respectively the recipients of a fellowship and a Clinician-Scientist award from the Canadian Institutes of Health Research. C.G. is a Chercheur-Boursier of Fonds de la Recherche en Santé du Québéc.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
3 Corresponding author.
E-MAIL tjhudson{at}med.mcgill.ca; FAX (514) 933-7146.
Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.217802.
| |
REFERENCES |
|---|
|
|
|---|
or the philosophy of RNA controls.
J. Endocrinol.
159:
197-200[CrossRef][Medline].Received October 5, 2001; accepted in revised form November 29, 2001.
This article has been cited by other articles:
![]() |
M. Libault, S. Thibivilliers, D. D. Bilgin, O. Radwan, M. Benitez, S. J. Clough, and G. Stacey Identification of Four Soybean Reference Genes for Gene Expression Normalization The Plant Genome, July 1, 2008; 1(1): 44 - 54. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. K.T. Tan, L. K. Tan, K. Yu, P. H. Tan, M. Lee, L. H. Sii, C. Y. Wong, G. H. Ho, A. W.Y. Yeo, P. K.H. Chow, et al. Clinical Validation of a Customized Multiple Signature Microarray for Breast Cancer Clin. Cancer Res., January 15, 2008; 14(2): 461 - 469. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J Agulleiro, M. Andre, S. Morais, J. Cerda, and P. J Babin High Transcript Level of Fatty Acid-Binding Protein 11 but Not of Very Low-Density Lipoprotein Receptor Is Correlated to Ovarian Follicle Atresia in a Teleost Fish (Solea senegalensis) Biol Reprod, September 1, 2007; 77(3): 504 - 516. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Synnergren, T. L. Giesler, S. Adak, R. Tandon, K. Noaksson, A. Lindahl, P. Nilsson, D. Nelson, B. Olsson, M. C.O. Englund, et al. Differentiating Human Embryonic Stem Cells Express a Unique Housekeeping Gene Signature Stem Cells, February 1, 2007; 25(2): 473 - 480. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Etschmann, B. Wilcken, K. Stoevesand, A. von der Schulenburg, and A. Sterner-Kock Selection of Reference Genes for Quantitative Real-time PCR Analysis in Canine Mammary Tumors Using the GeNorm Algorithm. Vet. Pathol., November 1, 2006; 43(6): 934 - 942. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Agca, J. E Ries, S. J Kolath, J.-H. Kim, L. J Forrester, E. Antoniou, K. M Whitworth, N. Mathialagan, G. K Springer, R. S Prather, et al. Luteinization of porcine preovulatory follicles leads to systematic changes in follicular gene expression Reproduction, July 1, 2006; 132(1): 133 - 145. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Bernat, G. E. Crawford, A. Y. Ogurtsov, F. S. Collins, D. Ginsburg, and A. S. Kondrashov Distant conserved sequences flanking endothelial-specific promoters contain tissue-specific DNase-hypersensitive sites and over-represented motifs Hum. Mol. Genet., July 1, 2006; 15(13): 2098 - 2105. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Smirnov, B. W. Foulk, G. V. Doyle, M. C. Connelly, L. W.M.M. Terstappen, and S. M. O'Hara Global gene expression profiling of circulating endothelial cells in patients with metastatic carcinomas. Cancer Res., March 15, 2006; 66(6): 2918 - 2922. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-W. Chua, P. Vijayakumar, P. M. Nissom, C.-Y. Yam, V. V.T. Wong, and H. Yang A novel normalization method for effective removal of systematic variation in microarray data Nucleic Acids Res., March 9, 2006; 34(5): e38 - e38. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Yamashita, K. Wakazono, T. Nomoto, Y. Tsujino, T. Kuramoto, and T. Ushijima Expression Quantitative Trait Loci Analysis of 13 Genes in the Rat Prostate Genetics, November 1, 2005; 171(3): 1231 - 1238. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Smirnov, D. R. Zweitzig, B. W. Foulk, M. C. Miller, G. V. Doyle, K. J. Pienta, N. J. Meropol, L. M. Weiner, S. J. Cohen, J. G. Moreno, et al. Global Gene Expression Profiling of Circulating Tumor Cells Cancer Res., June 15, 2005; 65(12): 4993 - 4997. [Abstract] [Full Text] [PDF] |
||||
![]() |
K.M. Whitworth, C. Agca, J.-G. Kim, R.V. Patel, G.K. Springer, N.J. Bivens, L.J. Forrester, N. Mathialagan, J.A. Green, and R.S. Prather Transcriptional Profiling of Pig Embryogenesis by Using a 15-K Member Unigene Set Specific for Pig Reproductive Tissues and Embryos Biol Reprod, June 1, 2005; 72(6): 1437 - 1451. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Li, M. L. Spletter, and J. A. Johnson Dissecting tBHQ induced ARE-driven gene expression through long and short oligonucleotide arrays Physiol Genomics, March 21, 2005; 21(1): 43 - 58. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Boorman, R. D. Irwin, M. K. Vallant, D. K. Gerken, E. K. Lobenhofer, M. R. Hejtmancik, P. Hurban, A. M. Brys, G. S. Travlos, J. S. Parker, et al. Variation in the Hepatic Gene Expression in Individual Male Fischer Rats Toxicol Pathol, January 1, 2005; 33(1): 102 - 110. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Takagi, M. Shibutani, N. Kato, H. Fujita, K.-Y. Lee, S. Takigami, K. Mitsumori, and M. Hirose Microdissected Region-specific Gene Expression Analysis with Methacarn-fixed, Paraffin-embedded Tissues by Real-time RT-PCR J. Histochem. Cytochem., July 1, 2004; 52(7): 903 - 913. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. K. Lee, A. K. Hsu, J. Sajdak, J. Qin, and P. Pavlidis Coexpression Analysis of Human Genes Across Many Microarray Data Sets Genome Res., June 1, 2004; 14(6): 1085 - 1094. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Lalonde, P. E. D. Lachance, and A. Chaudhuri Monocular Enucleation Induces Nuclear Localization of Calcium/Calmodulin-Dependent Protein Kinase IV in Cortical Interneurons of Adult Monkey Area V1 J. Neurosci., January 14, 2004; 24(2): 554 - 564. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S Chugh, S. Whitesel, M. Turner, C. T Roberts Jr., and S. R Nagalla Genetic basis for chamber-specific ventricular phenotypes in the rat infarct model Cardiovasc Res, February 1, 2003; 57(2): 477 - 485. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Murphy, K. K. O. Watt, D. Cameron-Smith, C. J. Gibbons, and R. J. Snow Effects of creatine supplementation on housekeeping genes in human skeletal muscle using real-time RT-PCR Physiol Genomics, January 15, 2003; 12(2): 163 - 174. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Boeuf, J. Keijer, N. L. W. Franssen-Van Hal, and S. Klaus Individual variation of adipose gene expression and identification of covariated genes by cDNA microarrays Physiol Genomics, October 2, 2002; 11(1): 31 - 36. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||