|
|
|
|
Vol. 11, Issue 6, 1043-1052, June 2001
LETTER
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |
ABSTRACT |
|---|
|
|
|---|
Elevated plasma lipoprotein levels play a crucial role in the development of coronary artery disease. Genetic factors strongly influence the levels of plasma lipoproteins, but the genes and sequence variations contributing to the most common forms of dyslipidemias are not known. We used GeneChip probe arrays to resequence the coding regions of 10 key genes of lipid metabolism. The sequences of these genes were analyzed in 80 dyslipidemic individuals. Fourteen nonsynonymous and twenty-two synonymous single nucleotide changes were identified that could be confirmed by conventional sequencing. Seven of the fourteen nonsynonymous sequence variants were polymorphisms with allele frequency >1% in the general population. The remaining seven were not found in normolipidemic controls (25 Caucasians and 25 African-Americans). The relationship between nonsynonymous sequence variations and various dyslipidemias was explored in association and family studies. No evidence was found for coding sequence variations in any of the 10 genes contributing to dyslipidemia. Only a single sequence variation, a missense mutation in the low density lipoprotein receptor gene, co-segregated with hyperlipidemia in the proband's family. This study illustrates some of the difficulties associated with identifying sequence variations contributing to complex traits.
| |
INTRODUCTION |
|---|
|
|
|---|
Genetic factors contribute importantly to many of the common
diseases afflicting Western societies, such as
coronary artery disease, hypertension, diabetes, and cancer. Although
these disorders cluster in families, they are not inherited in a
Mendelian fashion; thus it has been difficult to identify the sequence
variations that confer susceptibility. Most sequence variations within
coding regions of genes result in synonymous substitutions or
conservative amino acid replacements that have little or no apparent
effect on protein function (Cargill et al. 1999
). Only 10% to 20% of single nucleotide polymorphisms (SNPs) lead to a nonconservative amino
acid replacement (Cargill et al. 1999
). It has been proposed that a
subset of common sequence variants contribute to common diseases in the
population (Lander 1996
; Risch and Merikangas 1996
). The common
disease-common variant (CD-CV) hypothesis is supported by a few
examples of well-documented associations between sequence variants and
medical conditions: F5 1691G
A (Factor V Leiden) and
thrombophilia (Dahlback 1997
), APOE*E4 and late-onset Alzheimer disease (Strittmatter et al. 1993
), and C-C chemokine receptor 5 (CCR5)
32 and resistance to the human
immunodeficiency virus (Liu et al. 1996
).
Considerable effort is now focused on identifying genetic polymorphisms
that contribute to other common diseases, including coronary artery
disease (Cambien et al. 1999
). Elevated plasma levels of cholesterol
play a central role in the development of atherosclerosis. Most of the
cholesterol circulating in plasma is transported as a constituent of
low density lipoproteins (LDLs) or high density lipoproteins (HDLs).
The risk of coronary artery disease is directly related to the plasma
level of LDL-cholesterol and inversely related to plasma
HDL-cholesterol levels (Kannel et al. 1979
). Twin and family studies
are consistent with ~50% of the interindividual variation in the
levels of these plasma lipoproteins being a result of genetic factors
(Austin et al. 1987
; Heller et al. 1993
; Perusse et al. 1989
).
Mutations in the LDL receptor (LDLR) and apolipoprotein (apo) B-100
genes are responsible for the two most common Mendelian forms of
hypercholesterolemia, familial hypercholesterolemia (FH; MIM143890) and
familial defective apoB-100 (MIM 144010; Innerarity et al. 1990
;
Goldstein et al. 1995
); however, sequence variations in these two genes
do not contribute significantly to the moderate elevations in plasma lipoproteins that are most frequent in subjects with coronary artery
disease (Wang et al. 1998
).
Many of the genes encoding proteins involved in cholesterol metabolism
have not been screened systematically for sequence variants. The
present study was undertaken to determine whether sequence variations
in the coding regions of 10 key genes in lipid metabolism contribute to
variations in plasma levels of cholesterol and triglycerides. The genes
analyzed in this study include those encoding sterol regulatory element
binding proteins (SREBPs) 1 and 2 (SREBF1 and SREBF2;
Hua et al. 1995
; Brown and Goldstein 1997
; Miserez et al. 1997
), SREBP
cleavage-activating protein (SCAP; Hua et al. 1996
; Nakajima
et al. 1999
), 3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) reductase
(HMGCR; Reynolds et al. 1984
), acyl coenzyme A
acyltransferase2 (ACAT-2; Chang et al. 1997
; Cases et al.
1998
), cholesterol 25-hydroxylase (CH25H; Lund et al. 1998
), LDLR (LDLR; Sudhof et al. 1985
), scavenger receptor, type B,
class 1 (SR-BI; human gene name CLA-1; Calvo and Vega 1993
;
Acton et al. 1996
; Cao et al. 1997
), site-1 protease (S1P)
(S1P; Sakai et al. 1998a
,b
; Nakajima et al. 2000
), LXR
(NR1H3; Willy et al. 1995
) and 40 bp encoding the LDLR binding
domain of apolipoproteinB-100 (APOB; Yang et al. 1986
). We
also screened 29 bp of the known regulatory sequences of the LDLR.
These genes were selected because of the crucial role they play in the
regulation, biosynthesis, and clearance of cholesterol. SREBP-1 and
SREBP-2 are structurally related transcription factors that regulate
many genes involved in lipid metabolism (for review, see Brown and
Goldstein 1997
). These two transcription factors are oriented in a
hairpin configuration in the membrane of the endoplasmic reticulum in
association with SCAP, the putative cholesterol sensor. Sterol
depletion is associated with the transport of the SREBP-SCAP complex
to the Golgi (DeBose-Boyd et al. 1999
; Nohturfft et al. 1999). S1P
resides in the Golgi and mediates the cleavage of SREBP within its
luminal loop, which ultimately results in the release of a fragment
from the amino terminus that contains a basic helix-loop-helix zipper
domain. This ~68 kD fragment enters the nucleus and activates
transcription of multiple genes involved in cholesterol and fatty acid
metabolism (Brown and Goldstein 1997
), including the genes encoding the
LDLR, which is the major vehicle for the import of lipoprotein-derived
cholesterol into cells (Brown and Goldstein 1986
) and HMGCR,
which encodes a rate-limiting enzyme in cholesterol biosynthesis. A
potent down-regulator of SREBP cleavage is 25-hydroxycholesterol, which
is generated by the hydroxylation of cholesterol at position 25 by
cholesterol 25-hydroxylase (CH25H; Lund et al. 1998
). Excess
intracellular cholesterol is converted to cholesteryl esters by two
other membrane-associated proteins of the endoplasmic reticulum, ACAT-1
and ACAT-2 (Chang et al. 1997
; Cases et al. 1998
). The major route of
clearance of cholesterol from the body is via the biliary system
(Russell and Setchell 1992
). SR-BI has been implicated as the major
conduit by which cholesterol is transported from HDL to the bile (Ji et al. 1999
). Hepatic cholesterol can be secreted directly into the bile
or first converted to bile acids prior to secretion (Russell and
Setchell 1992
). The rate-limiting enzyme for bile acid synthesis, cholesterol 7
hydroxylase, is regulated by LXR
(gene name,
NR1H3), an orphan nuclear hormone receptor (Peet et al. 1998
).
We used high density GeneChip probe arrays (Chee et al. 1996
; Wang et
al. 1998
) to screen for sequence variations in ~25 kb of sequence
from these 10 genes. GeneChip probe arrays allow for the simultaneous
determination of sequences in multiple genes in a rapid and
comprehensive fashion, which makes the method ideally suited for
well-characterized metabolic pathways. Because we cannot predict the
specific phenotype that would result from sequence variations in these genes,
we screened 80 unrelated individuals with a wide array of lipid disorders.
| |
RESULTS |
|---|
|
|
|---|
Characterization of Subjects
GeneChip probe arrays (Affymetrix) were used to screen 25,298 bp of genomic sequence (24,675 bp of coding sequence, 29 bp of LDLR promoter sequence, and 594 bp of intron sequence) from 10 candidate genes in 80 unrelated individuals with various disorders of lipid metabolism and/or fat distribution (see Methods). Subjects selected for the study had a high likelihood of having a major genetic component to their lipid disorder. The major inclusion criteria were: (1) a clinical picture consistent with homozygous FH without a reduction in LDLR function in cultured fibroblasts; (2) markedly abnormal plasma lipid levels or abnormal response to lipid-lowering agents or diet; or (3) a lipoprotein disorder and at least two first-degree relatives with a similar phenotype.
The Sensitivity of the Genechip Probe Array Method
The GeneChip probe array method detects sequence variants by
hybridizing labeled PCR products to chips containing arrays of corresponding oligonucleotides (for review, see Chee et al. 1996
; Wang
et al. 1998
). For each nucleotide position being queried, four
oligonucleotides that differ only in the base at the central position
are arrayed in rows on the chip. For our study, these oligonucleotide
arrays were designed to detect all exon sequences and 2-3 bp of
adjacent intron sequences.
Putative SNPs were reported when the hybridization pattern of the sample DNA did not match the canonical sequence. Each individual GeneChip probe array-detected SNP was assigned a graded level of certainty: certain (C), likely (L), or possible (P). In some instances in which the same SNP was identified in multiple samples, some samples were scored as C whereas others were scored as L; in this paper these SNPs were classified as CL (or LP for SNPs that were scored as likely or possible).
A total of 8, 13, 12, and 96 SNPs were classified as CL, L, LP and P,
respectively (Table 1). No SNP site in this
study was classified as C in all samples containing the polymorphic
sequence. Because subjects with rare disorders as well as common
phenotypes were included in the sample population, we anticipated
identifying so-called private, unique sequence variations. Therefore,
SNPs classified as P (n = 96) were included in our analysis.
|
Each sequence polymorphism detected by GeneChip probe arrays was subjected to verification by use of a combination of SSCP analysis and sequencing. Some SNPs detected on the chip in multiple individuals could be confirmed only in a subset of samples. Nevertheless, we classified the SNP site as confirmed even if the nucleotide change was verified in only a single individual. Most of the eight SNPs classified as CL were confirmed (75%). The rate of confirmation of SNPs in the other categories was lower: 38% for L, 67% for LP, and 18% for the SNPs classified as P. Although the false positive rate was very high for the possible SNPs, the absolute number of confirmed SNPs in the P category (n = 17) was nearly as high as in all the other categories combined (n = 19).
GeneChip probe array analysis also classified each sample as homozygous, heterozygous or indeterminate for each SNP. These designations were tested for true SNP frequency by SSCP and sequencing for all SNPs identified in SREBF2; the confirmation rates for these categories were 48% for the sites designated as homozygous, and 31% and 14% for those called heterozygous and indeterminate, respectively (data not shown).
We did not determine systematically the false negative rate for
GeneChip probe array analysis, but several SNPs, as well as a deletion
and an insertion, were not detected by this method. Four sequence
variants that resulted in changes in the amino acid sequence were
missed by GeneChip probe arrays: (1) An insertion in exon 2 of
SREBF2 in an expanded repeat found by E. Forgacs (pers.
comm.), which was present in 1 of the 80 subjects; (2) an LDLR mutation
(Asp147His; Hobbs et al. 1992
; Leitersdorf et al. 1993
); (3) a
sequencing error in the original sequence of SREBF2 (resulting
in Gly1045Ala); and (4) a nonsynonymous SNP detected in SREBF2
(Table 2) that was not identified in three of the eight samples heterozygous for the polymorphism.
|
We also did not determine systematically the accuracy of the allele genotypes in the sample for each SNP. Only one SNP in exon 10 of SREBF2 that was classified as LP was reanalyzed in the entire sample by use of SSCP. Reanalysis revealed that 25 samples reported as wild type were heterozygous, and 16 samples were homozygous for the SNP.
A total of 36 SNP sites was confirmed in at least 1 of the 1600 different alleles assayed. No sequence variants were identified in the
40-bp segment of APOB that was screened. An average of 1 site
per 705 bp varied in the 24,675 bp coding sequences. The 10 genes
differed more than 10-fold in their overall frequency of sequence
variations (Table 3). The LDLR
gene had the highest nucleotide diversity and SREBF1 the
lowest. The average value for the normalized number of variant sites
(
) was 2.51 × 10
4.
|
Over 75% of the SNPs were transitions (28 of 36) and 36% of these SNPs were cytosine to thymidine transitions at CpG dimers.
SNPs Detected in 80 Dyslipidemic Subjects and Normal Controls
Of the 36 confirmed SNPs, 14 were nonsynonymous substitutions (Table 3). Seven of the nonsynonymous changes had an allele frequency >1% in the general population. All of these common SNPs resulted in conservative amino acid changes, with the exception of an arginine to serine substitution at amino acid 860 in SREBF-2. However, this residue is also serine in hamster SREBF2. The most common SNP identified in this study was Ala595Gly in SREBF2. The allele frequency of this SNP differed significantly between African-Americans and Caucasians. Another relatively common SNP found in SREBF2 was a valine to methionine substitution at amino acid 623, which was detected in an African-American with lipodystrophy. This sequence variant comprised 4% of the alleles in the normolipemic African-Americans and was not found in the Caucasian samples tested.
Mutations Detected in Dyslipidemic Subjects
Seven mutations were identified exclusively in the group of 80 probands and not in the 50 control subjects (Table 2). Only one of
these mutations was found in more than one proband. Valine was
substituted for isoleucine at amino acid 638 of the HMGCR in
four individuals. This residue is located in the catalytic domain of
HMG CoA reductase, but is not conserved across species (Istvan et al.
2000
). The corresponding residues in hamster and rat are methionine and
leucine, respectively, and some bacterial forms of the enzyme also have
valine at this position (E. Istvan, pers. comm.). The SNP did not
cosegregate with hyperlipidemia in the four families with this sequence
variant (data not shown).
The other six missense mutations were each found in only a single
subject. Three of these mutations were in LDLR, and of these mutations, one was previously known (Hobbs et al. 1989
). The second LDLR mutation was identified in a 41-year-old Caucasian man whose plasma cholesterol level was extremely responsive to changes in dietary
cholesterol content; the mutation had been identified previously in a
patient with classic FH (Ekstrom et al. 1995
), but was not associated
with defective LDLR function in cultured fibroblasts (Ekstrom et a.
2000
). The third LDLR mutation was identified in a 35-year-old
Caucasian man who had a myocardial infarction, type III hyperlipidemia,
and an elevated plasma LDL-cholesterol level. Analysis of the family
revealed that the other first-degree relatives who inherited the LDLR
mutation had markedly elevated plasma LDL-cholesterol levels (Fig.
1). This LDLR mutation (Asp79Val) had not
been described previously in any other FH patients.
|
The two nonsynonymous mutations identified in S1P were each found in only a single subject. Both mutations were reproduced by site-directed mutagenesis and the cDNA was expressed in cultured cells to assess the effect of the sequence change on enzyme activity (data not shown). Expression of S1P with the Asn544Ser substitution was associated with normal processing of SREBP, whereas no processing of SREBP was seen with expression of S1P containing the Gln868 nonsense mutation (J. Zhang, unpubl.). The proband heterozygous for the nonsense mutation had moderately elevated plasma LDL-cholesterol, but the sequence variant failed to cosegregate with the hyperlipidemia in the family (Fig. 1B).
An Arg1064Gln substitution in SREBF1 was identified in a
39-year-old Caucasian woman with noninsulin-dependent diabetes mellitus and type V hyperlipidemia. The missense mutation is located in the
region of SREBP-1 that interacts with SCAP and participates in
cholesterol sensing (Sakai et al. 1997
). The arginine residue at this
position is also present in hamster and fruit fly, suggesting that this
residue may be functionally important. However, when this mutant allele
was expressed in tissue culture, the SREBP-1 protein was processed
normally in response to cholesterol depletion (R. DeBose-Boyd, unpubl.).
Finally, we screened 50 unrelated noninsulin-dependent diabetic, 50 hypercholesterolemic, and 50 hypertriglyceridemic subjects for each of these seven mutations. Only the Ile638Val substitution in HMGCR was found in one hypercholesterolemic and two diabetic subjects, giving on overall allele frequency of 1%.
Silent SNPs Detected in 10 Candidate Genes
Of the 25 silent polymorphisms identified in this study (Table
4), 22 were detected by GeneChip probe
array sequencing, and all but one were in exons. Over 85% of the SNPs
located in the coding regions were in the third position of the codon,
and the remainder were in the first position. The only intronic SNP
detected by GeneChip probe array was at the
3 position of the splice
acceptor site of intron 18c (C
T) of SREBF1. This
polymorphism occurs with a frequency of ~50% in the population. Two
other intronic SNPs were identified by SSCP and sequencing in
nucleotides not queried by the GeneChip probe arrays method. Neither
polymorphism segregated with the hyperlipidemia in the family.
|
| |
DISCUSSION |
|---|
|
|
|---|
In this study, we sequenced 25,298 bp from 160 independent alleles of 10 genes encoding key proteins in the regulatory, biosynthetic, and clearance pathways of lipid metabolism. A total of 36 SNPs was identified by GeneChip probe arrays; 1 was in an intron, and 35 were in coding sequences. Of the 35 SNPs identified in the coding sequences, 21 were synonymous, and 14 were nonsynonymous. One-half of the 14 nonsynonymous sequences were polymorphisms with a frequency of >1% in the population. All but one of the remaining seven nonsynonymous sequence variants were found in only a single individual and not in 50 unrelated controls or 100 hyperlipidemic (50 hypercholesterolemic and 50 hypertriglyceridemic) subjects. Only one of these sequence variants (LDLR: D79V) could be shown to cosegregate with hyperlipidemia in the family of the proband.
Common forms of dyslipidemia should constitute a model system for the
genetic analysis of complex traits for two reasons. First, there is
considerable evidence from family and twin studies that genetic
variation is an important determinant of plasma lipoprotein levels
(Austin et al. 1987
; Perusse et al. 1989
; Heller et al. 1993
). Second,
the metabolism of lipids and lipoproteins has been studied in
considerable detail, and many of the key genes involved have been
cloned and extensively characterized. Because each of the 10 genes
examined in this study has a clearly defined role in lipid metabolism,
and because all of the 80 individuals screened had a familial lipid
disorder, it is surprising that the only DNA sequence variants
identified that were clearly associated with altered plasma lipoprotein
levels were rare mutations in LDLR.
The simplest interpretation of this finding is that sequence polymorphisms in coding regions of these genes are not a common cause of dyslipidemia. This interpretation is tempered by the fact that dyslipidemia is a heterogeneous entity, and that each specific type of dyslipidemia (e.g., familial combined hyperlipidemia) was present in only a minor subset of the individuals in the study. Nonetheless, at least 25 of the individuals in the study had increased plasma LDL concentrations, and at least 26 others had increased plasma triglyceride concentrations. Therefore, it is very likely (>90 %) that any allele present in 10 % or more of individuals with these characteristics would have been detected in this study.
A second possibility is that common polymorphisms conferring
susceptibility to dyslipidemia are present in these genes but were
either not detected by the GeneChip probe array method or were in
regions of the genes (promoter or introns) not screened in this study.
Few promoter polymorphisms have been associated with Mendelian
dyslipidemias, but common polymorphisms in the 5' flanking sequences of
the genes encoding hepatic lipase (LIPC) and cholesterol
7
-hydroxylase have been associated with variation in the plasma
concentrations of HDL-cholesterol and LDL-cholesterol, respectively
(Guerra et al. 1997
; Wang et al. 1998
). Therefore, further screening of
the noncoding regions of the candidate genes may reveal sequence
polymorphisms that are associated with dyslipidemia.
A third possibility is that the polymorphisms identified in this study have little influence on plasma lipoprotein levels when considered individually, but that the inheritance of a combination of deleterious alleles confers susceptibility to dyslipidemia. Analysis of the joint effects of DNA sequence variants requires very large sample sizes and was not possible in this study. However, because conditions such as hypercholesterolemia and hypertriglyceridemia are relatively common and clinically well recognized, it is quite feasible to assemble large collections of individuals with homogeneous lipoprotein phenotypes. The application of high throughput genotyping methods to such cohorts may help to define the combinations of polymorphisms that act together to create the phenotype of complex diseases such as hyperlipidemia and atherosclerosis.
The least polymorphic of the coding sequences screened was
SREBF1, a transcription factor that plays a central role in
the hepatic regulation of cholesterol and triglyceride metabolism (Brown and Goldstein 1997
) and adipocyte differentiation (Tontonoz et
al. 1993
; Shimomura et al. 1999
). The gene with the second lowest
frequency of sequence variation was HMGCR, which encodes a
rate-limiting enzyme of cholesterol biosynthesis. The low frequency of
sequence polymorphism in these two genes may be related to selection
pressure, although we cannot exclude the possibility that regional differences
in sequence variation and/or genetic drift are contributing factors.
The most polymorphic coding sequence was found in LDLR.
Selection bias may have contributed to the high frequency of sequence variation in LDLR as mutations abolishing LDLR
function almost invariably cause hypercholesterolemia (Goldstein et al.
1995
). Accordingly, it is likely that mutations causing more subtle
impairment of LDLR function may confer susceptibility to
increased plasma LDL concentrations. Although linkage studies have
indicated that polymorphisms in LDLR account for little of the
variation in LDL-cholesterol concentrations in the general population
(Wang et al. 1998
), such polymorphisms may be more prevalent in a
sample selected for high plasma cholesterol levels, like the one
analyzed in this study.
Two new subjects with FH were identified here. In one subject, previous screening of the LDLR gene by SSCP did not reveal mutations. After the mutation was identified by GeneChip probe arrays and confirmed by conventional sequencing, it was reassessed by SSCP, and, again, the sequence variant was not detected. Although it was not uncommon for a sequence variant to be detected under one SSCP condition and not the other, this particular LDLR mutation was the only variant identified in the GeneChip probe array analysis that did not generate an abnormally migrating band in either of the two SSCP conditions employed in this study.
The high density GeneChip probe array method is very efficient at the
detection of more common sequence variations. These polymorphisms,
which were often classified as C or L, had confirmation rates that were
similar to the two other larger studies that used high density GeneChip
probe arrays for SNP detection (Cargill et al. 1999
; Halushka et al.
1999
). A major difference between earlier studies and the current study
was that we included the SNPs classified as P in our analysis. Sequence
variations classified by GeneChip probe arrays as P were much more
common than those classified as C or L, but were also much less likely
to be confirmed. The SNPs classified as P comprised almost half of the
confirmed SNPs in this study, including all of the rare nonsynonymous
mutations. Therefore, SNP detection by GeneChip probe arrays may be
less reliable for detecting rare mutations in individual samples than for detecting common polymorphic sites.
| |
METHODS |
|---|
|
|
|---|
Subjects
A primary population that consisted of 80 unrelated subjects was
screened. Then, the sequence variants identified in these subjects were
analyzed in genomic DNA from 50 unrelated normolipidemic controls (25 Caucasians and 25 African-Americans), and a subset was screened in 50 individuals with hypercholesterolemia (plasma cholesterol
>90th percentile compared with age- and sex-matched
controls), in 50 individuals with hypertriglyceridemia (plasma
triglyceride level >350 mg/dL), and in 50 individuals with
noninsulin-dependent diabetes. The 80 primary subjects included
probands from 61 families identified in the Lipid Clinic at Parkland
Memorial Hospital or as part of a large family study of dyslipidemias.
All available relatives of the probands completed a detailed medical
questionnaire and were examined for the presence of tendon or cutaneous
xanthomas. After obtaining of informed consent, fasting venous blood
was collected and kept at 4°C until the plasma was isolated. The
plasma cholesterol and triglyceride levels were measured by use of
enzymatic assays (Guerra et al. 1997
) and the lipoprotein levels were
determined by
-quantification (S. Grundy, Univ. of Texas
Southwestern Medical Center). Age- and sex- specific percentile values
for plasma lipid and lipoprotein concentrations were calculated with data from the National Health and Nutrition Education Survey (NHANES) III survey
(http://www.cdc.gov/nchs/about/major/nhanes/resrchact.htm). The
dyslipidemias were classified into categories on the basis of the
lipoprotein profile of the proband and, when available, family members.
The criteria for diagnosis of familial combined hyperlipidemia
(n = 13), familial hypertriglyceridemia (n = 8), polygenic hypercholesterolemia (n = 10), and familial type V
(n = 5) are as described (Goldstein et al. 1972
). The
descriptive term familial was used in cases where at least three
first-degree relatives had similar lipoprotein patterns. The terms
hypoalphalipoproteinemia (n = 4), hyperalphalipoproteinemia
(n = 5), and hypobetalipoproteinemia (n = 5) were
used to describe lipoprotein levels that were less than the
10th or greater than the 90th percentile when
compared with age- and sex-matched controls. Genomic DNA was isolated
from either the white blood cells or transformed lymphocytes (Louie and
King 1991).
GeneChip probe array analysis was also performed on 19 genomic DNA
samples isolated from cultured skin fibroblasts donated to the
Department of Molecular Genetics (J. Goldstein, M. Brown, and H.H.
Hobbs). Twelve of these samples were from probands with presumed
autosomal recessive hypercholesterolemia, who had marked elevations in
plasma cholesterol, tendon xanthomas, and normolipidemic parents
(Khachadurian and Uthman 1973
; Sirtori et al. 1991
; Zuliani et al.
1995
). In addition, subjects with each of the following disorders were
analyzed: (1) severe hypercholesterolemia (~2000 mg/dL), bony
lesions, and liver disease of unknown etiology (n = 1); (2)
hypolipidemia (cholesterol level of 100 mg/dL, triglyceride level of 45 mg/dL), and xanthomatosis (n = 1); (3) type III
hyperlipidemia that does not cosegregate with apoE (Giroux et al. 1997
)
or that has an atypical lipoprotein profile (n = 7); (4)
nonalcoholic steatohepatitis (n = 1); (5) FH that does not
respond to HMG-CoA reductase inhibitors (n = 1); (6)
congenital generalized lipodystrophy (n = 1); and (7)
multiple symmetric lipomatosis (n = 1; Enzi 1984
). A
41-yr-old man with diet-responsive hypercholesterolemia was also
included in the study. His plasma LDL-cholesterol level fell dramatically after eating a low-fat diet for 1 month (from 596 mg/dL to
247 mg/dL). His LDL-cholesterol level continued to fall on a low-fat
diet, and then stabilized between 86 and 114 mg/dL. Upon challenge with
a high-fat diet for 5 days per week for 4 weeks, his LDL-cholesterol
rose to 267 mg/dL. The patient was homozygous for the ApoE3 allele and
had no detectable circulating
-sitosterol (P. Barton Duell, pers. comm.).
The ethnic composition of the probands was as follows: 10 Hispanics, 1 Asian Indian, 1 American Indian, 1 African-American, 1 Ashkenazi Jews, 1 Sephardic Jew, 1 Arab, and the remainder of the sample were non-Jewish Caucasians.
PCR Amplification of Coding Regions in Candidate Genes
The sequences of most genes included were available from GenBank. If the intron sequences were not available, we obtained plasmid or bacterial artificial chromosome clones and used PCR to amplify the introns. Sufficient intron sequence was obtained to design primers for the amplification of the exons with flanking dinucleotide consensus sequences. Oppositely oriented 25-base amplification primers complementary to the flanking intron sequences of the candidate genes were used to amplify the exons and adjacent splice junction sites of the candidate genes. Oligonucleotide sequences are available on request. After optimization of the PCR conditions for each amplicon, the exons of each gene were amplified for all 80 genomic DNA samples. Each amplification product was subjected to agarose gel electrophoresis and visualized after ethidium bromide staining to assess the integrity and concentration of the fragment. Aliquots containing ~0.3 pmoles of each fragment from a given subject were pooled and sent to Affymetrix for GeneChip probe array analysis.
Hybridization to 80 GeneChip Probe Arrays
For sequence analysis by GeneChip probe arrays, 1-2 µg of the
pooled PCR products from each subject was purified by use of Qiaquick
Purification kits (Qiagen) and then fragmented by use of DNase
(Promega). The fragments were biotinylated and denatured prior to
hybridization to the chip for 15 h at 44°C in a buffer containing
tetramethylammonium chloride as described (Wang et al. 1998
).
Hybridization patterns were analyzed by Affymetrix software to detect
SNPs (Chee et al. 1996
).
Each detected SNP was assigned a graded level of certainty: C, L, or P. When the same SNP scored as C in some samples but L in other samples, the SNP was classified as CL (or LP for SNPs that were scored as likely or possible).
Verification of SNPs Detected by GeneChip Probe Array Analysis
Each SNP identified by GeneChip probe array sequencing was examined
by SSCP (Orita et al. 1989
). 32P-labeled PCR products
containing the putative SNPs were denatured, and the single-stranded
fragments were electrophoresed for 20 h at 350 V on two different
nondenaturing gels: (1) 7% polyacrylamide in 2× Tris Borate EDTA
buffer (TBE) and 10% (vol/vol) glycerol, and (2) 0.5× MDE
(BioWhittaker), 0.6× TBE. If an abnormally migrating band was
identified by SSCP, the corresponding PCR product was sequenced by use
of an ABI 377 automated sequencer (Applied Biosystems). If no
aberrantly migrating band was detected using SSCP, a fragment containing the putative SNP was PCR amplified from a representative individual and sequenced. SSCP or restriction endonuclease analysis was
used to evaluate the segregation of the polymorphism in the families of
the probands and to screen normolipidemic and hyperlipidemic subjects.
Calculations
To obtain an estimate of nucleotide diversity, the normalized
numbers of variant sites (
) was calculated by dividing the number of
observed nucleotide changes by the length (basepairs) of the sequence
and correcting for sample size (n) as described (Li 1997
).
Electronic Database Information
The following URLs were important for this study: NHANES III data, http://www.cdc.gov/nchs/about/major/nhanes/resrchact.htm; Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/omim.htm.
| |
ACKNOWLEDGMENTS |
|---|
We thank the following physicians who contributed patient samples for this study: Drs. Giovanni Zuliani, Renato Fellin, Marcello Arca, A.K. Khachadurian, C. Sertori, James Crockett, Vera Rose, William Lee, Paul Benke, J.L. de Gennes, F. Dariou, P. Barton Duell, Jan Breslow, Eran Leitersdorf, Jean Davignon, and Abimanyu Garg. We especially thank Michael S. Brown and Joseph L. Goldstein for providing patient samples and helpful discussions. We also thank David W. Russell and Aravinda Chakravarti for helpful discussions. This work was supported by National Institutes of Health (NIH) HL20948 (H.H.H.), the Perot Family Fund, NIH-HL53917 (J.C.C.), and the D.W. Reynolds Foundation.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| |
FOOTNOTES |
|---|
4 Corresponding author.
E-MAIL helen.hobbs{at}emailswmed.edu; FAX (214) 648-7539.
Article and publication are at www.genome.org/cgi/doi/10.1101/gr.172301.
| |
REFERENCES |
|---|
|
|
|---|
making sense out of sequence.
Nature Genet.
21:
56-60[CrossRef][Medline].Received November 27, 2000; accepted in revised form February 16, 2001.
This article has been cited by other articles:
![]() |
J. Meletiadis, S. Chanock, and T. J. Walsh Human Pharmacogenomic Variations and Their Implications for Antifungal Efficacy Clin. Microbiol. Rev., October 1, 2006; 19(4): 763 - 787. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. L. Welch, S. Bretschger, P.-Z. Wen, M. Mehrabian, N. Latib, J. Fruchart-Najib, J. C. Fruchart, C. Myrick, and A. J. Lusis Novel QTLs for HDL levels identified in mice by controlling for Apoa2 allelic effects: confirmation of a chromosome 6 locus in a congenic strain Physiol Genomics, March 12, 2004; 17(1): 48 - 59. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Knoblauch, A. Bauerfeind, C. Krahenbuhl, A. Daury, K. Rohde, S. Bejanin, L. Essioux, H. Schuster, F. C. Luft, and J. Georg Reich Common haplotypes in five genes influence genetic variance of LDL and HDL cholesterol in the general population Hum. Mol. Genet., June 1, 2002; 11(12): 1477 - 1485. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||