XPC Lys939Gln — Your DNA Repair Scanner and Exercise Recovery Capacity
Every day your cells experience thousands of DNA lesions from sunlight, environmental chemicals,
and the byproducts of normal metabolism. The protein encoded by XPC is the first responder in
the global genome nucleotide excision repair pathway11 global genome nucleotide excision repair pathway
GG-NER scans the entire genome for
bulky DNA lesions such as UV-induced cyclobutane pyrimidine dimers and oxidative adducts,
initiating repair before damage becomes heritable mutations.
XPC acts as the damage sensor: it recognizes the structural distortion created when DNA strands
are bent or separated by a lesion, recruits the repair machinery, and initiates the removal of
a 25–30 nucleotide damaged oligonucleotide. Without functional XPC, cells in your muscles, skin,
and internal organs accumulate DNA lesions that would otherwise be corrected within hours.
The rs2228001 variant (Lys939Gln, also written 939A>C in the literature) is a common missense substitution in XPC codon 939. The ancestral Lys939 allele (T on the plus strand, found in ~60% of the global population) maintains efficient damage recognition. The derived Gln939 allele (G on the plus strand, ~40% globally) subtly alters the protein's conformation at a functionally important region, reducing how efficiently XPC identifies and responds to DNA damage.
The Mechanism
XPC protein contains three major domains: a transglutaminase-like domain, a beta-hairpin domain
that inserts into damaged DNA, and a C-terminal domain that recruits TFIIH for strand unwinding.
Codon 939 sits within the C-terminal interaction domain22 Codon 939 sits within the C-terminal interaction domain
The C-terminal region of XPC
directly contacts TFIIH subunit p62/GTF2H1 and is required for full assembly of the pre-incision
NER complex. The Lys→Gln substitution replaces
a positively charged lysine with an uncharged glutamine, altering local electrostatic interactions.
Functional studies of XPC carriers show that individuals with the Gln939 allele have
measurably reduced DNA repair capacity33 measurably reduced DNA repair capacity
Ex vivo studies using host-cell reactivation assays
show Gln939 carriers repair UV-damaged reporter plasmids less efficiently than Lys939 homozygotes
compared to Lys939 homozygotes. The effect is dose-dependent: one Gln allele causes moderate
reduction; two copies cause the most pronounced deficit.
In the context of exercise, this matters because intense physical activity generates
reactive oxygen species that damage DNA44 reactive oxygen species that damage DNA
Muscle contraction substantially increases mitochondrial
ROS production; these oxidants generate 8-oxoguanine and other bulky adducts in both nuclear and
mitochondrial DNA of muscle fibers. The NER pathway
— including XPC — is required to process bulky oxidative adducts that base excision repair
cannot handle alone. Carriers of the Gln939 allele have a slower initial damage recognition
step, meaning exercise-generated DNA lesions persist longer in muscle cells before repair
initiates. Over repeated training cycles, this difference in repair kinetics may contribute
to delayed recovery and greater susceptibility to cumulative genotoxic stress.
The Evidence
The most comprehensive evidence comes from a meta-analysis of 62 case-control studies55 meta-analysis of 62 case-control studies
He et al. International Journal of Cancer 2013 — 25,708 cancer cases and 30,432 controls
from published literature. Gln/Gln homozygotes
(plus-strand GG) had 16% higher overall cancer risk (OR=1.16, 95% CI 1.07–1.25, p<0.001)
compared to Lys/Lys carriers. The effect was stronger under a recessive model (OR=1.14,
95% CI 1.06–1.22) than a dominant model (OR=1.06), suggesting the highest risk accumulates
in homozygous carriers. Cancer-specific analysis revealed elevated risk for bladder cancer,
lung cancer, and colorectal cancer, with larger effects in Asian populations than Caucasians.
For bladder cancer specifically, a Tunisian case-control study66 Tunisian case-control study
Rouissi et al. BMC Cancer
2011 — 193 newly diagnosed bladder tumor cases
found that homozygous carriers of the Gln939 (risk) genotype had more than double the bladder
cancer risk (OR=2.09, 95% CI 1.09–3.99). Bladder cancer is a NER-relevant malignancy because
the urothelium is exposed to carcinogens excreted in urine, and efficient NER is essential
for processing the resulting bulky DNA adducts.
For UV-related skin damage, the variant interacts with other XPC haplotype elements.
A Polish case-control study in melanoma77 Polish case-control study in melanoma
Paszkowska-Szczur K et al. International Journal of Cancer
2013 — 714 melanoma cases and 1,841 healthy controls
found that XPC and XPD polymorphisms were associated with melanoma susceptibility in a Polish cohort,
with certain XPC haplotypes showing decreased melanoma risk, underscoring that Lys939 is the
protective allele. (Note: the specific OR=0.26 figure cited previously could not be confirmed from
the published abstract and has been removed pending full-text verification.)
Environmental carcinogen interactions have also been documented. In a study of 958 oral cancer
patients from Taiwan88 study of 958 oral cancer
patients from Taiwan
Wu CN et al. Cancer Genomics Proteomics 2021,
carriers of the Gln939 allele (C in paper notation, G on plus strand) showed significantly
higher frequencies among cancer cases than controls, and the Gln939 allele was explicitly
associated with decreased DNA repair capacity. The interaction with tobacco smoking and
betel quid chewing (major sources of bulky DNA adducts in Asian populations) was significant —
meaning the repair deficit is amplified when exposure to genotoxins is high.
In the context of radiotherapy, a Polish study of 79 head and neck cancer patients99 a Polish study of 79 head and neck cancer patients
Kaminska et al. Reports of Practical Oncology and Radiotherapy 2024
found that rs2228001 genotype correlated with early skin reaction severity and elevated
C-reactive protein during treatment, consistent with differential DNA damage processing
kinetics in tissues receiving therapeutic ionizing radiation.
Practical Implications
For most carriers, the practical consequence of the Gln939 allele is not dramatic on its own — it is a low-to-moderate penetrance variant that modulates cancer susceptibility over decades of cumulative exposure, not a switch that determines whether cancer develops. The most actionable implications operate in three domains:
Sun and UV exposure: Reduced NER efficiency means UV-induced cyclobutane pyrimidine dimers persist longer in skin cells. This is directly relevant to skin cancer risk and makes consistent sun protection more important for Gln939 carriers.
Exercise recovery: Gln939 carriers have slower initial DNA damage recognition in response to exercise-generated oxidative stress. While no clinical exercise trials have directly measured recovery differences by rs2228001 genotype, the biological mechanism is plausible and supported by the broader literature on NER capacity and genotoxic stress tolerance. Adequate rest between high-intensity sessions and antioxidant micronutrient support (vitamins C, E, and selenium as NER cofactors) may be particularly relevant.
Environmental carcinogen exposure: Smokers and those with significant occupational or dietary carcinogen exposures derive greater absolute benefit from reducing those exposures compared to high-repair-capacity individuals, because Gln939 carriers have less buffer to handle the additional damage burden.
Interactions
XPC operates in a molecular sequence with XPA (rs1800975): XPC performs the initial damage recognition in global-genome NER, then hands off to XPA for damage verification and repair complex assembly. Carriers of impaired alleles at both rs2228001 (XPC recognition) and rs1800975 (XPA verification) may have compounded NER deficiency — damage that XPC detects less efficiently will also be verified less efficiently by XPA, potentially resulting in greater repair delays than either variant produces alone. This XPC–XPA interaction has not been formally quantified in large-scale studies but is biologically predicted from their sequential roles in the NER pathway.
ERCC2/XPD (rs13181) unwinds the DNA duplex around the lesion after XPC recognition and XPA verification. Individuals carrying impaired alleles at all three positions (XPC + XPA + XPD) would theoretically have the most compromised NER. The second common XPC variant rs2228000 (Ala499Val) operates in the same damage recognition domain; combined analysis of rs2228001 and rs2228000 genotypes provides a more complete picture of XPC function than either alone.
APOC3 -641A>C — The Longevity Promoter Variant
Apolipoprotein C-III (APOC3) is one of the most potent brakes in the human
triglyceride clearance system. The liver secretes this small 79-amino-acid
protein onto triglyceride-rich lipoproteins11 triglyceride-rich lipoproteins
VLDL and chylomicrons, the
particles that carry fat through the bloodstream,
where it inhibits lipoprotein lipase22 lipoprotein lipase
the enzyme anchored to blood vessel
walls that breaks down circulating triglycerides.
The more APOC3 you produce, the slower your body clears fat from the
bloodstream. The rs2542052 variant in the APOC3 promoter sits at position -641
relative to the transcription start site — a region that controls how much of
this protein the liver makes.
The C allele at this position reduces APOC3 transcription. Individuals homozygous for this allele produce approximately 30% less APOC3 than those carrying the common A allele, resulting in more efficient triglyceride clearance, smaller and denser VLDL particles, larger LDL particle sizes, and higher HDL cholesterol levels. These are precisely the lipoprotein characteristics associated with reduced cardiovascular and metabolic disease.
The Mechanism
The APOC3 promoter contains several regulatory elements, including an
insulin-responsive element33 insulin-responsive element
a DNA sequence that normally allows insulin to
suppress APOC3 transcription after meals, reducing APOC3 production when
glucose is high. rs2542052 falls
within a cluster of four promoter SNPs (rs2542052, rs10892037, rs11568823,
rs2854116) that are in complete linkage disequilibrium with each other — they
almost always co-occur on the same haplotype. The -641C allele tags a haplotype
associated with reduced APOC3 promoter activity.
APOC3 raises triglycerides through three coordinated mechanisms: it displaces lipoprotein lipase from lipid droplets44 displaces lipoprotein lipase from lipid droplets, blocks hepatic uptake of remnant particles55 blocks hepatic uptake of remnant particles, and promotes hepatic VLDL assembly and secretion. Lower APOC3 production from the C allele simultaneously relieves all three brakes, producing a comprehensively favorable lipid phenotype. Hepatic APOC3 expression is further induced by dietary carbohydrates (especially fructose) and saturated fat, and suppressed by insulin, PPAR-alpha activators, and omega-3 fatty acids — meaning diet and lifestyle directly modulate how much the genotype matters.
The Evidence
The landmark finding came from a 2006 PLOS Biology study by Atzmon and
colleagues66 2006 PLOS Biology study by Atzmon and
colleagues
Atzmon et al. "Lipoprotein Genotype and Conserved Pathway for
Exceptional Longevity in Humans." PLoS Biology, 2006.
Genotyping 213 Ashkenazi Jewish centenarians (mean age 98.2 years), 216 of
their offspring, and 258 age-matched controls revealed that CC homozygosity was
found in 25% of centenarians, 20% of their offspring, and only 10% of controls
(p = 0.0001 and p = 0.001, respectively). CC homozygotes had approximately 30%
lower serum APOC3 (10.1 vs 13.2 mg/dL), significantly lower triglycerides in
females, lower hypertension prevalence (28.6% vs 44%, p = 0.026), greater
insulin sensitivity, and a prospectively confirmed survival advantage
(log-rank p = 0.0008).
These population genetics findings align with mechanistic studies in two major NEJM reports. Jørgensen et al. 201477 Jørgensen et al. 2014 followed 75,725 participants and found that individuals carrying loss-of-function APOC3 mutations had 44% lower nonfasting triglycerides and a 41% reduced risk of ischemic vascular disease (HR 0.59). A complementary study by Crosby et al.88 study by Crosby et al. confirmed a 40% lower coronary heart disease risk in APOC3 loss-of-function carriers. While rs2542052 is a common regulatory variant rather than a loss-of-function mutation, it works through the same pathway of reduced APOC3 expression — and the centenarian data suggest lifelong partial reductions in APOC3 confer meaningful longevity advantages.
The longevity association in Ashkenazi centenarians was specific to rs2542052 (and its linked haplotype), not to the other classic APOC3 variants (rs2854117, rs2854116, rs4520, rs5128, rs4225), which showed no association with triglyceride levels, insulin sensitivity, or blood pressure in the same population. This specificity supports a causal role for the -641 promoter region rather than a broader LD sweep.
Practical Actions
The C allele reduces APOC3 production — but diet determines whether this advantage is realized. Hepatic APOC3 expression is powerfully induced by saturated fat, refined carbohydrates, and fructose. Even CC homozygotes can develop elevated triglycerides with a diet high in these inducers. Conversely, omega-3 fatty acids (EPA and DHA) suppress APOC3 expression through PPAR-alpha activation, lowering ApoC-III concentrations. For AA and AC individuals, omega-3s provide a dietary mechanism to partially compensate for the genetically higher APOC3 set point.
Monitoring fasting triglycerides is especially relevant for AA homozygotes, who lack the promoter-reducing C allele and produce the most APOC3. A fasting triglyceride level above 150 mg/dL warrants dietary intervention; above 500 mg/dL increases acute pancreatitis risk. Fibrate medications (fenofibrate, gemfibrozil) work partly by activating PPAR-alpha, which reduces APOC3 expression — making them mechanistically targeted for A allele carriers with persistently elevated triglycerides.
Interactions
rs2542052 is in complete LD with rs2854116 (T-455C) and rs10892037, forming a promoter haplotype block. The closely related rs5128 variant in the APOC3 3'UTR influences APOC3 translation through a different mechanism (microRNA binding) and may compound effects when co-inherited.
APOC3 sits in the apolipoprotein gene cluster (APOA1-APOC3-APOA4-APOA5) on
chromosome 11q2399 apolipoprotein gene cluster (APOA1-APOC3-APOA4-APOA5) on
chromosome 11q23
this cluster plays coordinated roles in triglyceride
metabolism. Variants in APOA5
(rs662799, rs3135506) also powerfully raise triglycerides. Carrying A alleles
at both rs2542052 (higher APOC3 expression) and risk alleles at rs662799
(lower APOA5 expression) would compound triglyceride burden and amplify the
dietary importance of omega-3s and carbohydrate restriction.
Insulin strongly suppresses APOC3 transcription in the liver. Insulin resistance, metabolic syndrome, and type 2 diabetes therefore tend to elevate APOC3 even in individuals without risk genotypes. For AA homozygotes with any degree of insulin resistance, the combined effect on APOC3 production and triglyceride accumulation is especially pronounced.
CYP1B1 G61E — The Glaucoma-Causing Enzyme Variant
The CYP1B1 enzyme plays two distinct biological roles: it is both a steroid-hormone
metabolizer (converting estradiol to catechol estrogens) and a first-line activator
of environmental procarcinogens including polycyclic aromatic hydrocarbons11 polycyclic aromatic hydrocarbons
PAHs:
carcinogenic compounds found in tobacco smoke and grilled meat.
This dual function makes CYP1B1 important across several health domains — but the
p.Gly61Glu variant (rs28936700) is best known as the most common cause of primary
congenital glaucoma (PCG) in Middle Eastern and North African populations, where it
accounts for up to 63% of genetic glaucoma cases.
The Mechanism
Glycine at position 61 sits near the N-terminal membrane anchor of the CYP1B1 protein,
within a structurally critical hinge region. Substituting glutamic acid at this position —
a charged amino acid replacing a small neutral one — disrupts protein folding and
significantly destabilizes the enzyme. Functional assays show the G61E variant retains
only about 25% of wild-type CYP1B1 activity22 the G61E variant retains
only about 25% of wild-type CYP1B1 activity
ClinVar VCV000007730 — functional studies
demonstrating reduced 17β-estradiol hydroxylation and reduced benzo[a]pyrene
metabolism, with markedly reduced
protein expression in cell-based models.
CYP1B1 is expressed in the trabecular meshwork — the drainage tissue of the eye — and appears to metabolize an as-yet-unidentified signaling molecule essential for normal anterior eye development. When both copies of CYP1B1 are non-functional (homozygous or compound heterozygous), the trabecular meshwork fails to develop normally, leading to elevated intraocular pressure from birth and, without prompt surgical treatment, irreversible optic nerve damage.
The Evidence
Badeeb et al. (2014)33 Badeeb et al. (2014)
CYP1B1 mutations in patients with primary congenital glaucoma
from Saudi Arabia. BMC Med Genet. screened
34 Saudi PCG patients and found that p.Gly61Glu was present in 63% of solved cases
(17/27), predominantly homozygous, making it by far the most frequent pathogenic variant
in this population. Severe phenotypes (corneal diameter >14mm, IOP >30 mmHg) were found
in 50% of severely affected patients carrying this mutation.
Hilal et al. (2010)44 Hilal et al. (2010)
Screening of CYP1B1 and MYOC in Moroccan PCG families.
Mol Vis. found G61E in 7.77% of Moroccan
PCG probands (second most common variant after g.4339delG), suggesting a shared
founder-effect haplotype across Middle East/North Africa.
Reis et al. (2016)55 Reis et al. (2016)
Analysis of CYP1B1 in pediatric and adult glaucoma.
Mol Vis. confirmed that G61E is recurrent
across multiple PCG phenotypes and anterior segment dysgenesis, and noted that
heterozygous carriers of CYP1B1 loss-of-function variants may have elevated adult-onset
primary open-angle glaucoma risk.
Alghamdi et al. (2020)66 Alghamdi et al. (2020)
Loss of microglia activities facilitates glaucoma progression.
PLoS One. specifically studied p.Gly61Glu
and found that cells carrying this mutation show 60% microglia hypoactivation by 72
hours, 71% pre-apoptotic microglia (vs 13% in controls), and differential expression of
27 cytokines — suggesting that progressive neuroinflammation and impaired optic nerve
immune surveillance compound the structural trabecular meshwork defect.
Globally, the T allele frequency is approximately 0.02% (gnomAD). In Middle Eastern populations it reaches ~0.6%, consistent with a population founder effect. At the gnomAD Middle Eastern frequency, roughly 1 in 170 people carry one copy.
Practical Actions
For heterozygous carriers (CT genotype): there is no childhood eye disease, but emerging evidence suggests heterozygous CYP1B1 loss-of-function variants may modestly increase adult-onset open-angle glaucoma risk. Annual intraocular pressure checks with an ophthalmologist from age 40 are prudent, particularly given family history context.
For homozygous or compound heterozygous individuals (two pathogenic CYP1B1 alleles): primary congenital glaucoma typically manifests in infancy with tearing, light sensitivity, and corneal haze. This is a pediatric surgical emergency — timely goniotomy or trabeculotomy is the primary treatment. Neonatal ophthalmologic screening and early specialist referral are critical.
CYP1B1 also metabolizes estrogens; carriers have somewhat reduced 17β-estradiol hydroxylation capacity, though this is less clinically significant for single-copy carriers than the glaucoma risk.
Interactions
CYP1B1 interacts with CYP1A1 and COMT in the catechol estrogen pathway. rs1056836 (CYP1B1 Leu432Val) is a separate functional variant in the same enzyme that modulates estrogen and procarcinogen metabolism — it is common and codominant, unlike this rare pathogenic variant. In carriers of both rs28936700 (one copy) and rs1056836 risk alleles, the overall CYP1B1 enzyme output is further reduced, potentially affecting catechol estrogen balance. This interaction warrants a compound action if both variants are identified together. Separately, CYP1B1 interacts with SULT1E1 (rs1238574) in estrogen inactivation pathways.
ALDOB Y204X — A Pathogenic Stop-Codon Variant Causing Hereditary Fructose Intolerance
Most people process dietary fructose without a second thought — the sugar
passes through the intestinal wall, enters the liver, and is phosphorylated
by fructokinase into fructose-1-phosphate (Fru1P). One enzyme then stands
between Fru1P and the rest of metabolism:
aldolase B11 aldolase B
encoded by the ALDOB gene on chromosome 9q22.3; the liver-specific
isoform that cleaves fructose-1-phosphate into glyceraldehyde and
dihydroxyacetone phosphate for entry into glycolysis and gluconeogenesis.
When aldolase B fails, Fru1P cannot be cleared. It accumulates rapidly in
liver and kidney cells every time fructose is consumed, setting off a cascade
of metabolic disruption. The condition that results is
hereditary fructose intolerance22 hereditary fructose intolerance
an autosomal recessive inborn error of
metabolism caused by biallelic pathogenic variants in ALDOB; distinct from
benign fructosuria and dietary fructose sensitivity.
The Y204X variant (rs370793608) introduces a premature stop codon at tyrosine
204 of the aldolase B protein, producing a severely truncated, non-functional
enzyme. It is listed as
Pathogenic in ClinVar33 Pathogenic in ClinVar
VCV000632627, 2-star classification from multiple
independent laboratory submitters including LabCorp Genetics and Baylor
Genetics.
This variant is extremely rare globally (alternate allele frequency ~1 in
100,000 alleles in population databases), consistent with a severe
disease-causing allele subject to strong purifying selection.
The Mechanism
Fru1P accumulation is the central pathological event in HFI. Cellular Fru1P
acts as a competitive inhibitor of the remaining glycolytic aldolase isoforms
(aldolase A and C), amplifying its own toxic effect. Two parallel consequences
follow:
first, intracellular phosphate is sequestered in Fru1P, depleting free
phosphate and ATP, which impairs glycogenolysis, gluconeogenesis, and energy
production — causing the characteristic post-fructose hypoglycemia44 first, intracellular phosphate is sequestered in Fru1P, depleting free
phosphate and ATP, which impairs glycogenolysis, gluconeogenesis, and energy
production — causing the characteristic post-fructose hypoglycemia
Buziau
AM et al. Recent advances in the pathogenesis of HFI. Cell Mol Life Sci,
2020. Second, elevated Fru1P
disrupts N-linked glycosylation pathways in the hepatocyte, contributing to
chronic liver injury and fat accumulation.
The premature stop at position 204 (of 364 amino acids) results in either a severely truncated protein or, more likely, nonsense-mediated mRNA decay — meaning no functional enzyme is produced from this allele at all. The Y204X truncation removes the entire C-terminal catalytic domain of aldolase B, making this a complete loss-of-function allele. A single pathogenic allele (carrier state) is sufficient to flag reproductive risk, but clinical HFI requires biallelic disruption — either homozygous Y204X or compound heterozygosity with a second pathogenic ALDOB allele.
The Evidence
HFI is rare — the largest population prevalence study in Central Europe
estimated 1:26,10055 estimated 1:26,100, though
the true prevalence may be higher due to under-diagnosis in individuals who
learn to avoid sweet foods by experience. Three mutations dominate the ALDOB
allele spectrum in European patients:
p.A150P (64%), p.A175D (16%), and p.N335K (8%)66 p.A150P (64%), p.A175D (16%), and p.N335K (8%)
Davit-Spraul A et al.
Mol Genet Metab, 2008.
Y204X is a rare variant accounting for a small fraction of pathogenic alleles,
identified in Central European and likely other European family cohorts.
The clinical picture of untreated HFI in an affected individual (two non-functional ALDOB copies) is well documented: fructose ingestion triggers nausea, vomiting, abdominal pain, and sweating within 20–30 minutes. Recurrent exposure causes progressive liver and kidney damage — progressing to hepatomegaly, renal tubular acidosis, growth retardation, and in severe cases liver failure. Affected individuals characteristically develop an aversion to sweet foods and fruit, often unknowingly self-protecting before diagnosis.
Even among adherent patients, a study of 48 HFI patients over 10 years found
that mean daily fructose intake of 169 mg (unavoidable trace amounts) correlated
with abnormal carbohydrate-deficient transferrin glycosylation markers, indicating
that even minimal ongoing exposure has measurable biochemical consequences77 mean daily fructose intake of 169 mg (unavoidable trace amounts) correlated
with abnormal carbohydrate-deficient transferrin glycosylation markers, indicating
that even minimal ongoing exposure has measurable biochemical consequences
Di
Dato F et al. Nutrients, 2019.
Dietary restriction of all fructose, sucrose, and sorbitol — the three molecules
that yield fructose-1-phosphate — completely prevents symptoms and allows normal
organ function. Long-term adherent patients have
normal life expectancy and quality of life88 normal life expectancy and quality of life
Singh SK, Sarma MS. World J Clin
Pediatr, 2022.
Practical Actions
For biallelic carriers (affected individuals), the foundation of management is
strict, lifelong avoidance of fructose, sucrose, sorbitol, and their derivatives.
This requires label-reading expertise: sucrose (table sugar) hydrolyzes to
glucose + fructose; sorbitol is oxidized to fructose intracellularly; high-fructose
corn syrup, honey, agave, fruit juices, and many medications use these sugars.
Vitamin C supplementation is specifically needed because the restriction removes
most fruit sources — studies show that
30% of non-supplemented HFI patients develop vitamin C deficiency99 30% of non-supplemented HFI patients develop vitamin C deficiency
Cano A et al.
Eur J Clin Nutr, 2022.
For heterozygous carriers (one Y204X allele), full enzyme function is maintained with a single functional ALDOB copy. The primary concern is reproductive: a carrier who reproduces with another ALDOB carrier (of any pathogenic allele) faces a 25% probability per pregnancy of an affected child. The three most common European ALDOB alleles (A150P, A175D, N335K) account for ~84% of pathogenic alleles and can be tested for on a targeted panel.
Interactions
The most important interaction for this variant is compound heterozygosity with other pathogenic ALDOB alleles. Because Y204X is a complete loss-of-function allele, any second pathogenic ALDOB allele in trans will result in clinical HFI. The three common European alleles (p.A150P rs45436095, p.A175D rs78654866, p.N335K rs28936415) are the most likely compound heterozygous partners for Y204X in European-ancestry individuals. Carrier partners of Y204X carriers should be offered targeted ALDOB panel testing that includes these three common alleles plus sequencing of the entire coding region.
HMGCR Intron 13 — When Your Cholesterol Gene Edits Its Own Blueprint
Statins are among the most prescribed drugs in the world, yet up to half of
patients do not achieve their LDL-cholesterol target on standard doses.
Much of this variability traces back to the gene statins are designed to
inhibit:
HMGCR11 HMGCR
3-hydroxy-3-methylglutaryl-CoA reductase — the enzyme that catalyses the rate-limiting step in the mevalonate pathway, which produces cholesterol and other critical lipids in every cell of the body.
The rs3846662 variant sits in intron 13 of HMGCR and influences a molecular
editing process that determines how much drug-sensitive enzyme your body
actually makes.
The Mechanism
HMGCR pre-mRNA can be spliced two ways. The dominant product is the
full-length 888-amino-acid enzyme — the form that statins inhibit. The
alternative product, known as
HMGCR Δ13 or HMGCR13(-)22 HMGCR Δ13 or HMGCR13(-)
A shorter isoform in which the 24-amino-acid stretch encoded by exon 13 is excluded. This region overlaps the statin-binding domain of the enzyme,
lacks the segment of the catalytic domain where statins dock. The Δ13 form
retains partial enzymatic activity but is substantially less sensitive to
statin inhibition.
rs3846662 sits within the intron 13 branch-point region and controls the
ratio of full-length to Δ13 transcript.
Yu et al. 201433 Yu et al. 2014
Yu CY et al. HNRNPA1 regulates HMGCR alternative splicing and modulates cellular cholesterol metabolism. Hum Mol Genet, 2014
demonstrated that the RNA-binding protein HNRNPA1 preferentially binds the
A allele at this position and promotes exon 13 skipping: cells with the
A allele produce a higher fraction of Δ13 mRNA. Consistent with this, AA
homozygotes have the highest Δ13 ratio and the most attenuated statin
response, while GG homozygotes maintain a predominantly full-length
transcript and better statin sensitivity.
Medina & Krauss 200944 Medina & Krauss 2009
Medina MW, Krauss RM. The role of HMGCR alternative splicing in statin efficacy. Trends Cardiovasc Med, 2009
proposed that the Δ13 protein forms inactive heterodimers with full-length
HMGCR, further diluting the pool of drug-accessible enzyme beyond what the
mRNA ratio alone would predict.
The Evidence
The clinical link was established by
Medina et al. 200855 Medina et al. 2008
Medina MW et al. Alternative splicing of 3-hydroxy-3-methylglutaryl coenzyme A reductase is associated with plasma LDL cholesterol response to simvastatin. Circulation, 2008
in 170 lymphoblastoid cell lines from the Cholesterol and Pharmacogenetics
(CAP) study. Greater Δ13 expression was inversely correlated (p≤0.0001)
with in vivo reductions of total cholesterol, LDL-C, apoB, and
triglycerides, with the splicing ratio explaining 6–15% of the variation in
statin response — a sizeable fraction given the many factors involved.
Leduc et al. 201666 Leduc et al. 2016
Leduc V et al. Role of rs3846662 and HMGCR alternative splicing in statin efficacy and baseline lipid levels in familial hypercholesterolemia. Pharmacogenet Genomics, 2016
studied 37 French-Canadian familial hypercholesterolaemia patients and found
that women with the AA genotype achieved a significantly smaller LDL-C
reduction on statin therapy (38.4% vs 46.2%, p<0.05) compared with G
carriers. Strikingly, alternative splicing explained 22–55% of the variance
in statin response in this cohort. The sex-specific finding was notable: men
showed similar splicing ratios but no detectable difference in statin
response — likely reflecting oestrogen-dependent regulation of HMGCR
expression that amplifies splicing effects in women.
Cano-Corres et al. 201877 Cano-Corres et al. 2018
Cano-Corres R et al. Influence of 6 genetic variants on the efficacy of statins in patients with dyslipidaemia. J Clin Lab Analysis, 2018
confirmed in 100 patients that G allele carriers showed significantly lower
reduction in total cholesterol and non-HDL-C and were less likely to reach
therapeutic cholesterol targets. Note that this study found the G allele
direction — reflecting complex population-specific interactions between
baseline lipid levels and splicing efficiency that remain under active
investigation.
Practical Actions
For individuals with the AA or AG genotype, statin dose optimisation and closer monitoring of LDL-C response are warranted. If standard statin doses do not achieve LDL targets, the rs3846662 genotype provides a pharmacogenomic rationale for dose escalation, statin switching, or the addition of non-statin agents such as ezetimibe (which inhibits intestinal cholesterol absorption via a completely independent mechanism unaffected by HMGCR splicing).
Plant sterols and stanols (2 g/day from fortified foods or supplements) competitively inhibit dietary cholesterol absorption and can reduce LDL-C by 8–10% independently of the mevalonate pathway — making them a useful adjunct specifically when HMGCR is less statin-sensitive.
Interactions
This variant interacts with APOE genotype. Leduc et al. 2016 noted that the AA genotype at rs3846662 was associated with elevated baseline total and LDL-cholesterol specifically in individuals without APOE4 (rs429358 CC genotype), suggesting the two loci have partially overlapping effects on lipid setpoint and statin response.
Other HMGCR coding variants — rs17238540 and rs17244841 (both associated with statin response in the WOSCOPS trial) — are in partial linkage disequilibrium with rs3846662 and may tag the same underlying splicing haplotype. Interaction with rs7703051 (another HMGCR intron variant) has also been proposed in the literature.
SLC52A2 Leu123Pro — Riboflavin Transporter Deficiency and BVVLS2
Riboflavin (vitamin B2) is the molecular precursor to
FAD and FMN11 FAD and FMN
Flavin adenine dinucleotide and flavin mononucleotide — the two
active coenzyme forms of riboflavin that serve as electron carriers in over 100
cellular oxidoreduction reactions, including the mitochondrial electron transport
chain, fatty acid oxidation, and amino acid metabolism.
Unlike many vitamins that diffuse freely across cell membranes, riboflavin is a
polar molecule that cannot enter most cells without help. The brain and peripheral
nervous system are especially dependent on dedicated transport proteins to
maintain adequate intracellular riboflavin. The SLC52A2 gene encodes one of
those proteins — RFVT2 (riboflavin transporter 2), the principal riboflavin
transporter expressed in neurons and intestinal epithelium.
When SLC52A2 is non-functional, riboflavin cannot enter nerve cells efficiently,
FAD and FMN are depleted, and neuronal energy metabolism collapses. The clinical
consequence is
Brown-Vialetto-Van Laere syndrome type 222 Brown-Vialetto-Van Laere syndrome type 2
BVVLS2 (OMIM #614707) — named for
the three clinicians who independently described the phenotype between 1894 and
1959. The molecular cause was not identified until 2010 when mutations in
SLC52A3 were found; SLC52A2 was implicated in 2012.
(BVVLS2), a rare progressive neuronopathy with sensorineural deafness,
bulbar palsy, and respiratory compromise that is fatal without treatment.
Crucially, it is one of very few inherited neurodegenerative disorders that
responds dramatically to supplementation.
The Mechanism
The c.368T>C variant substitutes a leucine for a proline at position 123 of the RFVT2 protein (p.Leu123Pro). Proline is structurally unique — its side chain loops back to form a ring with the backbone nitrogen, introducing rigidity and a kink into a polypeptide chain. Replacing a flexible leucine with proline at position 123 disrupts the local fold of the transmembrane domain.
Functional transport assays33 Functional transport assays
Haack TB et al. Impaired riboflavin transport due
to missense mutations in SLC52A2 causes Brown-Vialetto-Van Laere syndrome.
J Inherit Metab Dis, 2012
demonstrated that cells overexpressing the p.Leu123Pro mutant showed
significantly reduced [³H]riboflavin uptake compared to wild-type RFVT2, confirming
that the structural change directly impairs transport activity. The same study
also identified p.Leu339Pro in the same patient as a compound heterozygous partner,
illustrating the typical presentation: most BVVLS2 patients carry two different
pathogenic SLC52A2 alleles rather than two identical ones.
The Evidence
BVVLS2 is rare but the treatment response is among the most striking in all of
metabolic genetics.
Foley et al. 201444 Foley et al. 2014
Foley AR et al. Treatable childhood neuronopathy caused by
mutations in riboflavin transporter RFVT2. Brain, 2014
described 18 patients from 13 families with SLC52A2 mutations, demonstrating
"significant and sustained clinical and biochemical improvements" with high-dose
oral riboflavin, including improvements in motor function, respiratory capacity,
and acylcarnitine profiles in 10 of 13 patients with preliminary data.
The natural history data makes the treatment case starkly.
Bosch et al. 201255 Bosch et al. 2012
Bosch AM et al. The Brown-Vialetto-Van Laere and Fazio
Londe syndrome revisited: natural history, genetics, treatment and future
perspectives. Orphanet J Rare Dis, 2012
reviewed 74 BVVLS and Fazio-Londe patients across all published cases: all 13
riboflavin-treated patients survived, while 28 of 61 untreated patients died —
most before age 4 if onset was in infancy. Without treatment, children typically
progress to ventilator dependence.
The standard dosing is established by clinical practice:
Cali et al. GeneReviews 201566 Cali et al. GeneReviews 2015
Cali E et al. Riboflavin Transporter Deficiency.
GeneReviews, University of Washington, 2015 (updated 2021)
specifies 10–50 mg/kg/day of oral riboflavin for riboflavin transporter deficiency,
with early initiation producing substantially better outcomes than late treatment.
ClinVar lists this variant (VCV000039576) as pathogenic for Brown-Vialetto-Van Laere syndrome 2 with five independent submitters including Cambridge Genomics Laboratory, OMIM, PreventionGenetics, and the Solve-RD Consortium.
Practical Actions
Because BVVLS2 is autosomal recessive, a single heterozygous copy of the p.Leu123Pro variant does not cause disease. Heterozygous carriers have one functional RFVT2 allele, which is sufficient for normal riboflavin transport under typical conditions. The clinical significance of carrier status is primarily reproductive: if both parents carry pathogenic SLC52A2 variants, each child has a 25% chance of inheriting biallelic mutations and developing BVVLS2.
For homozygous or compound heterozygous individuals, high-dose riboflavin is the only established disease-modifying treatment. It must be started immediately and continued lifelong — riboflavin supplementation does not cure the underlying transporter defect but compensates for it by driving passive diffusion through mass action. Sensorineural hearing loss, once established, is typically not recovered with treatment even when motor and respiratory function improve.
Interactions
Compound heterozygosity is the rule in BVVLS2: most patients carry p.Leu123Pro on one chromosome and a different pathogenic SLC52A2 variant on the other. The related gene SLC52A3 (RFVT3) causes BVVLS type 3 with an overlapping phenotype; both conditions respond to riboflavin. SLC52A1 (RFVT1) causes Fazio-Londe syndrome. When a patient presents with bulbar palsy and hearing loss, all three SLC52A genes should be sequenced to determine the causative gene, as the treatment (riboflavin) is the same for all three.
MYH7 Arg663Cys — A Charge-Altering Mutation at the Heart's Molecular Engine
Every heartbeat begins at the sarcomere — the basic contractile unit of cardiac muscle,
where the protein beta-myosin heavy chain11 beta-myosin heavy chain
the primary motor protein of the heart,
encoded by MYH7, that generates the forceful contraction driving blood into the aorta
with each beat physically pulls actin
filaments to shorten the cell. The Arg663Cys variant introduces a single amino acid
change in the myosin motor domain — the globular head where the force-generating power
stroke occurs — substituting the positively charged arginine at position 663 with a
neutral cysteine that is also capable of forming disulfide bonds. ClinVar classifies
this as Pathogenic/Likely Pathogenic22 Pathogenic/Likely Pathogenic
VCV000042874, criteria provided, 16 of 22 submitting
laboratories concur, no conflicts,
and it is essentially absent from gnomAD's population database of over 700,000 sequenced
individuals, consistent with a rare, high-penetrance disease allele.
The Mechanism
Codon 663 is situated in the myosin S1 (motor) domain33 myosin S1 (motor) domain
the ATPase-active globular
head region that forms cross-bridges with actin and undergoes the conformational
power stroke during muscle contraction.
Arginine-663 is highly conserved across vertebrate species44 highly conserved across vertebrate species
evolutionary conservation
of a residue indicates structural or functional importance; its replacement in any
vertebrate species produces disease,
underscoring its critical role in myosin function. The substitution of arginine (positively
charged, guanidinium side chain) with cysteine (neutral, thiol side chain) is a
charge-altering change — the same category of mutation that Watkins et al. (1992)55 Watkins et al. (1992)
established as prognostically severe in familial HCM, associated with markedly reduced
survival compared to charge-neutral variants.
The pathogenic mechanism is dominant-negative: the mutant MYH7 protein is synthesised
from the one mutant allele and incorporated into sarcomeres alongside normal protein
from the intact allele. Because sarcomeres assemble from mixtures of normal and mutant
myosin, even one copy of the variant disrupts cross-bridge kinetics across the entire
myofibril. Direct measurement in human HCM myocardium confirms that
MYH7 mutations reduce maximum sarcomere force to 73 kN/m² versus 113 kN/m² in
donor controls66 MYH7 mutations reduce maximum sarcomere force to 73 kN/m² versus 113 kN/m² in
donor controls
a 35% reduction in force-generating capacity despite near-normal myofibril
density, indicating intrinsic sarcomeric dysfunction rather than structural remodelling
alone. Downstream, abnormal calcium handling
compounds the defect: iPSC-derived cardiomyocytes from carriers of the closely related
Arg663His variant77 Arg663His variant
a different amino acid substitution at the identical codon 663,
also classified as pathogenic for HCM, rs371898076
showed elevated intracellular calcium, cellular enlargement, and contractile arrhythmia
— phenotypes prevented by calcium channel inhibition. The combined effect of impaired
force generation and disturbed calcium signalling drives progressive pathological
hypertrophy, myocyte disarray, and interstitial fibrosis over years to decades.
The Evidence
The variant's pathogenicity rests on multiple independent lines of evidence evaluated by the ClinGen Inherited Cardiomyopathy Expert Panel88 ClinGen Inherited Cardiomyopathy Expert Panel, which adapted the ACMG/AMP classification framework specifically for MYH7. Key criteria include: PS4 (identified in more than 15 unrelated HCM-affected individuals), PM1 (location in the motor domain mutational hotspot), PM5 (other pathogenic changes at the same codon — Arg663His and Arg663Ser), and PM2 (absent from population databases). No population controls in gnomAD carry the A allele at the relevant frequency in any ancestry group (observed count: 3 out of 1,401,470 alleles, frequency 0.0000021).
Among all sarcomeric HCM mutations, MYH7 pathogenic variants are associated with particularly adverse outcomes99 MYH7 pathogenic variants are associated with particularly adverse outcomes. In a Chinese cohort of 52 MYH7 mutation carriers compared to 18 MYBPC3 carriers, MYH7 patients had seven sudden cardiac deaths versus zero in the MYBPC3 group, and mean survival was 45 years versus 73 years — a 28-year reduction. Van Driest et al. (2004)1010 Van Driest et al. (2004) found MYH7 mutation carriers present with HCM nearly a decade earlier than sarcomere-negative patients (mean 33 vs 43 years), have substantially greater left ventricular wall thickness, and undergo myectomy at twice the rate.
The 1992 founding observation by Watkins et al.1111 Watkins et al. remains instructive: charge-altering arginine substitutions in MYH7 — the precise category that includes Arg663Cys — were associated with mean age at death of 33 years, versus near-normal survival for charge-neutral substitutions. Arg663Cys replaces a charged arginine with a neutral cysteine, placing it squarely in this high-risk mutation class.
Practical Actions
For a carrier of the Arg663Cys variant, the clinical priority is structured cardiac surveillance starting at the time of genetic diagnosis, before symptoms appear. Transthoracic echocardiography establishes whether hypertrophy is already present and quantifies the resting and provoked left ventricular outflow tract (LVOT) gradient — the key determinant of obstructive physiology requiring intervention. Cardiac MRI adds late gadolinium enhancement (LGE) mapping of myocardial fibrosis, which independently predicts sudden death risk beyond conventional HCM risk factors.
Sudden cardiac death (SCD) risk stratification is the most consequential early decision. Current ESC guidelines apply the HCM Risk-SCD calculator to estimate 5-year SCD probability; an ICD (implantable cardioverter-defibrillator) is recommended for estimated risk ≥6%, and is considered for risk 4–6%. Carrying a pathogenic MYH7 variant — particularly one in the charge-altering category — is itself a clinical marker of higher severity that cardiologists incorporate into the risk assessment.
Obstructive HCM (gradient ≥30 mmHg) responds to medical therapy with beta-blockers or disopyramide. Refractory obstruction may require septal reduction (surgical myectomy or alcohol septal ablation). Mavacamten — a first-in-class cardiac myosin inhibitor approved by the FDA for obstructive HCM — directly addresses the hypercontractile mechanism of sarcomere mutations and is an important option for carriers who develop symptomatic obstruction.
Strenuous competitive sport is contraindicated until formal HCM evaluation is complete, given the risk of arrhythmia-triggered SCD during extreme exertion.
Interactions
Codon 663 is a mutational hotspot1212 mutational hotspot
three independent pathogenic substitutions at the
same codon — Arg663His (rs371898076), Arg663Cys (this variant), and Arg663Ser — have
all been classified as pathogenic or likely pathogenic for HCM in separate individuals
and families. A carrier of
Arg663Cys could theoretically also carry a second sarcomeric HCM mutation in MYBPC3,
TNNT2, TNNI3, or another MYH7 variant; double sarcomeric mutation carriers have been
shown in clinical cohorts to face markedly elevated risk of end-stage heart failure and
malignant arrhythmias compared to single-variant carriers. Comprehensive sarcomere gene
panel testing at the time of diagnosis will detect compound genotypes that substantially
alter prognosis and management. If a second sarcomeric variant is found, the management
threshold for ICD implantation and specialist referral should be reviewed with a
cardiology team experienced in high-risk HCM.
PLG Intronic Variant — When Plasminogen Expression Runs Low
Plasminogen is a liver-secreted zymogen that circulates in the blood and anchors to cell surfaces, fibrin clots, and invading pathogens. When activated by tissue plasminogen activator (tPA) or urokinase (uPA)11 tissue plasminogen activator (tPA) or urokinase (uPA)
serine proteases that convert inactive plasminogen into the active enzyme plasmin, it becomes plasmin — a broad-spectrum protease that dissolves fibrin clots, clears damaged extracellular matrix, recruits macrophages, and helps the immune system track down and destroy pathogens. Rs4252130 tags a haplotype in the PLG gene region that subtly lowers plasminogen output, compressing every downstream function that depends on adequate plasmin activity.
The Mechanism
Rs4252130 (c.1587+155A>C) sits 155 base pairs into an intron of PLG on chromosome 6q26. The PLG gene is on the plus strand, so the alleles reported by genome files match the coding strand directly. The C allele is the variant form, present in about 29% of Europeans but nearly absent in East Asians (under 0.2%). This striking population stratification pattern is typical of regulatory haplotypes under positive or balancing selection22 positive or balancing selection. Though the intronic position itself may not be the causal site, it tags a regional LD block that includes multiple PLG variants associated with reduced plasma plasminogen levels and altered gene expression in liver and osteoblasts. The precise molecular mechanism — whether altered enhancer activity, splicing regulation, or transcription factor binding — has not been characterised for this specific position.
The Evidence
A GWAS of 3,456 healthy individuals33 A GWAS of 3,456 healthy individuals identified the PLG locus on chr6q26 as the primary genetic determinant of plasma plasminogen levels, with 9 of 11 genome-wide significant SNPs clustering near PLG or the adjacent LPA gene. The lead missense variant (rs4252129, R523W) reduces levels by 13.4% per allele, and adjacent intronic variants in LD tag the same haplotype. Heritability of plasminogen levels was estimated at 48–60%, with identified variants collectively explaining 6.8%.
Schaefer et al. 201544 Schaefer et al. 2015 identified PLG intronic variant rs4252120 — a near neighbour of rs4252130 with identical population allele frequency profiles — as significantly associated with aggressive periodontitis (OR 1.27, p=5.9×10⁻⁵) and shared genetic risk with coronary artery disease, implicating the PLG haplotype in both mucosal immunity and vascular biology.
Munz et al. 201755 Munz et al. 2017 refined the genetic signal downstream of PLG, finding rs1247559 associated with aggressive periodontitis (OR 1.33) and chronic periodontitis (OR 1.15), while also serving as an eQTL for PLG expression in osteoblasts (p=6.9×10⁻⁵). A landmark Science paper (Silva et al. 2021)66 Science paper (Silva et al. 2021) demonstrated the causal mechanism: when plasmin-mediated fibrinolysis fails, fibrin accumulates at mucosal surfaces, aberrantly activates neutrophils through αMβ2 integrin, and triggers destructive NET formation and oxidative bursts. PLG polymorphisms from the D-ARIC cohort were associated with common periodontal disease through this pathway.
Beyond periodontitis, plasminogen-deficient mice show 60–90% reductions in macrophage phagocytosis of apoptotic cells Ploplis & Castellino 201477 Ploplis & Castellino 2014, and plasmin is required for macrophage polarization from pro-inflammatory (M1) to resolution-phase (M2) phenotypes Heissig et al. 202088 Heissig et al. 2020.
Practical Actions
Carriers of one or two C alleles have modestly reduced plasminogen-driven fibrinolysis and macrophage function. This does not produce disease on its own but shifts the immune baseline toward less efficient mucosal clearance and wound resolution. The most actionable implications are in periodontal health maintenance and monitoring for conditions where fibrinolytic clearance matters — respiratory, mucosal, and wound-healing contexts.
Interactions
Interaction with the pathogenic PLG variant rs7301596599 rs73015965 (K38E, p.Lys38Glu) is possible in principle: a compound heterozygote carrying one severely reduced-activity K38E allele and one expression-reducing haplotype (tagged by rs4252130) would have lower total plasmin activity than either variant alone. This would be expected to increase risk for ligneous periodontitis or mucous membrane complications beyond what either variant predicts independently. However, direct empirical data on this specific combination are lacking.
The PLG region also interacts with LPA (lipoprotein-a): the LPA gene, immediately adjacent on 6q26, affects both plasminogen activation and cardiovascular risk. Carriers of C alleles at rs4252130 who also carry high-Lp(a) alleles at LPA may have compounded vascular risk through impaired fibrinolysis alongside elevated Lp(a).
The Brain's Neurosteroid Thermostat — CYP7B1 and GABAergic Tone
Deep inside brain neurons and astrocytes, a cytochrome P450 enzyme quietly shapes
the chemical environment that governs anxiety, mood, and stress resilience. The
CYP7B1 gene11 CYP7B1 gene
Cytochrome P450 family 7 subfamily B member 1; an oxysterol and
neurosteroid 7α-hydroxylase expressed prominently in brain, testis, and ovary
encodes the enzyme responsible for breaking down two pivotal neurosteroids —
DHEA22 DHEA
Dehydroepiandrosterone; the most abundant circulating adrenal steroid, a
precursor to both androgens and estrogens and a potent neuromodulator in its own right
and pregnenolone — in the brain. By controlling how quickly these substrates are
catabolized, CYP7B1 activity sets the local concentration of neurosteroids available
for downstream conversion into
allopregnanolone33 allopregnanolone
A potent positive allosteric modulator of GABA-A receptors;
the active ingredient of brexanolone (Zulresso) and zuranolone (Zurzuvae), both
FDA-approved for postpartum depression,
the brain's own benzodiazepine-like molecule. The rs4395923 intronic variant in
CYP7B1 was identified as a signal in the anxiety GWAS literature, consistent with
the gene's role as a master regulator of GABAergic neurosteroid tone.
The Mechanism
CYP7B1 catalyzes 7α-hydroxylation44 CYP7B1 catalyzes 7α-hydroxylation
The addition of a hydroxyl group at the 7α
carbon position; this converts the neurosteroid into a less active trihydroxy
metabolite that cannot modulate GABA-A receptors
of DHEA (Km ~14 μM) and pregnenolone (Km ~4 μM) in the brain. Both substrates
are precursors to allopregnanolone55 allopregnanolone
Also called 3α,5α-THP (tetrahydroprogesterone);
enhances GABA-A receptor chloride conductance via both synaptic and extrasynaptic
receptor subtypes, producing tonic inhibition
through the steroidogenic cascade. When CYP7B1 activity is lower — whether due to
reduced expression or a regulatory variant like rs4395923 — more substrate is
available for conversion to these GABAergic neurosteroids. Conversely, higher
CYP7B1 activity depletes the neurosteroid pool, reducing GABAergic tone and
potentially tipping the excitatory/inhibitory balance toward anxiety-like states.
The enzyme is expressed most abundantly in the
hippocampus66 hippocampus
The brain region central to memory consolidation, spatial navigation,
and stress-response regulation; CYP7B1 mRNA is higher here than in cortex, cerebellum,
or other brain regions
across multiple primate species, including humans. This regional specificity is
clinically meaningful: the hippocampus is a key node in the anxiety circuit, and
CYP7B1 expression there is approximately 50% lower in Alzheimer's disease compared
to controls. Estrogen further modulates this system through estrogen receptor β,
which suppresses CYP7B1-mediated DHEA catabolism in astrocytes, effectively raising
local neurosteroid concentrations and contributing to estrogen's neuroprotective effects.
The intronic rs4395923 variant likely influences CYP7B1 expression levels through cis-regulatory effects — consistent with how most common psychiatric GWAS variants operate. The precise molecular mechanism (altered splicing, intronic regulatory element, expression quantitative trait locus) has not been characterized in published literature.
The Evidence
The strongest human-genetic evidence linking rs4395923 to anxiety comes from large-scale GWAS of anxiety-related traits, where this CYP7B1 intronic variant reached association. The neurobiological rationale is strongly supported by mechanistic studies:
The founding Cyp7b neurosteroid study77 Cyp7b neurosteroid study
Rose et al. "Cyp7b, a novel brain cytochrome
P450, catalyzes the synthesis of neurosteroids 7alpha-hydroxy dehydroepiandrosterone
and 7alpha-hydroxy pregnenolone." Proc Natl Acad Sci USA, 1997
established that CYP7B1 is the dominant 7α-hydroxylase in brain tissue, converting
DHEA and pregnenolone into inactive triols. The enzyme's broad expression in
hippocampus, amygdala, and cortex — regions critical to emotional regulation —
positioned it as a neurosteroid gate.
The hippocampal expression study88 hippocampal expression study
Yau et al. "Dehydroepiandrosterone 7-hydroxylase
CYP7B: predominant expression in primate hippocampus and reduced expression in
Alzheimer's disease." Neuroscience, 2003
demonstrated that CYP7B1 is the primary neurosteroid-catabolizing enzyme in primate
hippocampus. Its 50% reduction in Alzheimer's disease suggests the enzyme's activity
directly shapes local neurosteroid concentrations, with downstream effects on GABA-A
receptor tone.
On the GABAergic side, allopregnanolone mediates anxiolysis99 allopregnanolone mediates anxiolysis
Antonoudiou et al.
"Allopregnanolone Mediates Affective Switching Through Modulation of Oscillatory States
in the Basolateral Amygdala." Biol Psychiatry, 2022
through δ-subunit-containing GABA-A receptors in the basolateral amygdala, the brain's
threat-detection hub. This tonic inhibitory pathway is precisely what CYP7B1 regulates
by controlling substrate availability upstream.
The clinical validation of this pathway comes from FDA-approved neurosteroid drugs:
brexanolone and zuranolone1010 brexanolone and zuranolone
Both approved by the FDA for postpartum depression;
zuranolone is oral, brexanolone is IV-administered
are synthetic allopregnanolone analogs that activate GABA-A receptors. Their efficacy
confirms that the DHEA → allopregnanolone → GABA-A axis is a modifiable target for
mood and anxiety disorders.
The evidence level is emerging: the GWAS signal links the variant to anxiety, and the mechanistic biology is well-established, but no functional studies have directly characterized the effect of rs4395923 on CYP7B1 expression or enzyme activity.
Practical Actions
Because CYP7B1 activity governs local neurosteroid concentrations, and these neurosteroids tonically modulate GABAergic inhibition, the actionable implications differ based on whether CYP7B1 activity is likely increased or decreased by the variant. Carriers of the A risk allele appear to have altered CYP7B1 function that shifts the neurosteroid equilibrium, which can be partially modulated through factors that influence DHEA levels and conversion rates — including stress management through specific non-generic means (e.g., sauna exposure has documented effects on DHEA), supplemental DHEA or pregnenolone (with medical oversight), and avoidance of compounds that suppress GABA-A signaling.
Interactions
CYP7B1 sits upstream of the progesterone → pregnenolone → allopregnanolone conversion chain, which also involves AKR1C enzymes (AKR1C1-4) and 5α-reductase (SRD5A1/SRD5A2). Variants in these downstream enzymes would interact with rs4395923 to determine final allopregnanolone concentrations. Additionally, CYP19A1 (aromatase) and CYP17A1 variants in the adrenal androgen synthesis pathway influence baseline DHEA availability, the substrate CYP7B1 acts upon. Estrogen status (estrogen receptor variants, menopause) strongly modulates CYP7B1 expression via ERβ — making gene-environment interactions with hormonal transitions particularly relevant.
IL6R Intron Variant — A Genetic Tag for Cardiovascular Inflammation Risk
Interleukin-6 (IL-6) is one of the body's most powerful inflammatory messengers.
When tissue is damaged, infected, or metabolically stressed, IL-6 floods the bloodstream,
triggering C-reactive protein (CRP) production in the liver, activating immune cells,
and — at chronically elevated levels — accelerating atherosclerosis and contributing
to cardiovascular disease. The IL6R gene encodes the
interleukin-6 receptor (IL-6R)11 interleukin-6 receptor (IL-6R)
the cell-surface protein that binds IL-6 and
initiates downstream JAK-STAT3 signaling cascades.
A soluble form of this receptor (sIL-6R), shed from cell surfaces, enables
"trans-signaling" — extending IL-6 activity to cells that do not themselves express
membrane-bound IL-6R. rs4537545 is an intronic variant in IL6R whose C allele tags
a haplotype associated with elevated inflammatory and lipid markers, and reduced
protection from coronary heart disease.
The Mechanism
rs4537545 sits within intron 9 of IL6R on chromosome 1q21 and does not itself
change the receptor's amino acid sequence. Its cardiovascular significance comes
from
linkage disequilibrium (LD)22 linkage disequilibrium (LD)
a statistical association between nearby genetic
variants, meaning they are inherited together more often than chance would predict
with the functional coding variant rs2228145 (Asp358Ala, p.Asp358Ala). The Asp358Ala
substitution in the membrane-proximal domain of IL-6R alters the cleavage efficiency
of the receptor's ectodomain by the metalloprotease ADAM17 (TACE), increasing release
of sIL-6R into circulation. Higher circulating sIL-6R amplifies IL-6 trans-signaling
in tissues that lack membrane-bound IL-6R — including vascular smooth muscle and
endothelial cells — while paradoxically reducing classic IL-6 signaling in hepatocytes,
which lowers CRP and fibrinogen production.
The rs4845625*T / rs4537545*C haplotype represents the pro-inflammatory configuration: reduced Asp358Ala activity, less sIL-6R shedding, stronger hepatic classic IL-6 signaling, and consequently higher circulating CRP and LDL cholesterol.
The Evidence
The most powerful evidence linking IL6R variants to cardiovascular disease comes from
a Mendelian randomization
meta-analysis coordinated by the IL6R MR Consortium33 meta-analysis coordinated by the IL6R MR Consortium
Swerdlow DI et al., Lancet 2012,
which combined data from 40 studies (up to 133,449 individuals for biomarker endpoints)
and 25,458 coronary heart disease (CHD) cases alongside 100,740 controls across
25 studies. The Asp358Ala allele (tagged by rs4537545 through LD) was associated
with a 9.45% increase in circulating IL-6 per allele (95% CI 8.34–10.57), an 8.35%
decrease in CRP per allele (95% CI 7.31–9.38), and a 0.85% decrease in fibrinogen per
allele — alongside a per-allele CHD odds ratio of 0.95 (95% CI 0.93–0.97, p=1.53×10⁻⁵).
This inverse relationship — higher IL-6, lower CRP — reflects the sIL-6R shedding
mechanism shifting IL-6 activity away from hepatic acute-phase protein synthesis.
The genetic evidence parallels clinical trial findings for IL-6R-blocking biologics
(tocilizumab, sarilumab), supporting IL-6R signaling as a causal cardiovascular target.
The specific haplotype containing rs4537545*C was directly linked to cardiovascular
biomarker risk in a
study by Arguinano et al.44 study by Arguinano et al.
Arguinano AA et al., Genes Immun 2017,
which examined two IL6R SNPs in French and Dutch cohorts. The rs4845625*T/rs4537545*C
haplotype was simultaneously associated with elevated CRP (P=0.009), higher LDL-C
(P=0.007), and elevated ApoB (P=0.009) — the full atherogenic triad. The rs4537545
T allele alone showed inverse associations with CRP (P=0.009), while the combination
of risk alleles at both loci amplified the biomarker effect beyond either SNP alone.
In a prospective cohort of
559 Danish patients with severe chronic heart failure55 559 Danish patients with severe chronic heart failure
Hansen PR et al., Pharmacogenomics J 2019,
IL6R pathway SNPs — including rs4537545 and the functionally linked rs2228145 —
were among independent predictors of cardiovascular death and all-cause mortality
over a mean 5-year follow-up (14-year observation window). Heterozygous carriers of
related IL6R variants showed hazard ratios of 1.37–1.39 for cardiovascular death
(p < 0.008), pointing to the prognostic importance of IL-6 signaling genetics in
established heart disease.
Practical Actions
For individuals carrying the CC genotype, the actionable priorities are monitoring inflammatory cardiovascular biomarkers (hs-CRP and LDL-C), since this genotype is associated with higher baseline levels of both. Omega-3 fatty acids at pharmacological doses reduce CRP, and high-sensitivity statin trials show reductions in both CRP and cardiovascular events even at baseline LDL levels that clinical guidelines would not normally treat.
For individuals on IL-6R-targeting biologics (tocilizumab or sarilumab) for rheumatoid arthritis or other indications, IL6R genotype may influence receptor sensitivity and treatment response — a clinically relevant pharmacogenomic consideration to discuss with a prescriber.
Interactions
rs4537545 is in LD with rs2228145 (the Asp358Ala coding variant) and rs8192284, which have been the primary variants studied in GWAS and Mendelian randomization analyses. The rs4845625 variant shows strong haplotypic interaction with rs4537545: the combination of rs4845625*T and rs4537545*C produces a larger simultaneous effect on CRP, LDL-C, and ApoB than either variant alone, suggesting that IL6R haplotype-level analysis captures more cardiovascular risk than individual SNPs. Individuals who carry rs4537545*C alongside rs4845625*T represent the highest-risk configuration for these inflammatory and lipid biomarkers. Interaction with the CRP gene variants (rs1205, rs1800947) may further modulate baseline CRP levels beyond the IL6R pathway contribution.