IRF5 rs4728142 — The Promoter Enhancer Switch: Causal Driver of Interferon Overactivation
Interferon Regulatory Factor 5 (IRF5) is the molecular ignition switch for the type I interferon
response — the branch of innate immunity responsible for fighting viral infections by triggering
widespread pro-inflammatory signaling. When IRF5 is overactive, the same cascade that defends
against viruses begins attacking the body's own tissues, driving systemic lupus erythematosus
(SLE), Sjögren's syndrome, systemic sclerosis, and other autoimmune diseases. rs4728142 sits
approximately 5 kilobases upstream of IRF5's transcription start sites, in a region now established
as an active enhancer11 active enhancer
An enhancer is a regulatory DNA element that boosts transcription of a
nearby gene; it doesn't have to be immediately adjacent — enhancers can act from tens of thousands
of base pairs away via DNA looping for the gene.
The A allele of rs4728142 has recently been confirmed as the leading causal variant for the 5'
IRF5 risk signal — not merely a tag for some other functional change, but the variant that directly
rewires how the IRF5 gene is regulated.
The Mechanism
rs4728142 operates through a newly described chromatin-looping mechanism. The A allele alters
the binding affinity of ZBTB3 (Zinc Finger and BTB Domain-Containing Protein 3)22 ZBTB3 (Zinc Finger and BTB Domain-Containing Protein 3)
A transcription
factor that recognizes specific DNA sequences and can recruit chromatin-remodeling complexes;
ZBTB3 belongs to the BTB-POZ family involved in gene regulation and cell fate decisions.
When ZBTB3 binds the rs4728142-A allele, it orchestrates a short-range chromatin loop that
physically connects the upstream enhancer region to the alternative (short-transcript) promoter
of IRF5. The consequence: the IRF5 gene preferentially produces its shorter isoforms at the expense
of the full-length transcript, and total IRF5 expression is markedly elevated. In lymphoblastoid
cells from European ancestry individuals, the A allele is associated with a 1.5-fold increase in
IRF5 mRNA33 1.5-fold increase in
IRF5 mRNA
Bonferroni-corrected P=0.0004 in CEU cell lines; smaller but significant effects also
seen in Asian-ancestry cells (1.2-fold, P=0.006).
In SLE patient monocytes, CRISPR-based interference with the enhancer function at this locus
attenuated production of lupus-associated cytokines, confirming the causal chain from variant
to expression to inflammatory output.
The rs4728142-A allele is in strong linkage disequilibrium44 linkage disequilibrium
LD measures how often two nearby
variants are inherited together; r²>0.6 means the two variants co-occur on the same chromosome
much more often than chance would predict (r²>0.6)
with the CGGGG insertion-deletion polymorphism in the IRF5 promoter. The CGGGG indel sits
64 base pairs upstream of the exon 1A transcription start site; the 4-copy insertion allele (4R)
creates an additional binding site for the transcription factor Sp1, which was confirmed by
electrophoretic mobility shift assays. The 4R allele increases IRF5 mRNA expression in peripheral
blood mononuclear cells and amplifies interferon-inducible gene expression (MX1, IFITM1) in
Sjögren's patients. rs4728142 and the CGGGG indel together provide complementary molecular
explanations for why the 5' region of IRF5 elevates interferon tone: the CGGGG indel creates
an extra Sp1 site at the promoter; rs4728142 directs an enhancer loop to that promoter.
The Evidence
The breadth of disease associations for rs4728142 is exceptional even among major autoimmune risk loci. The variant has been independently associated with six autoimmune conditions across multiple populations.
In systemic lupus erythematosus, a large meta-analysis of IRF5 variants across nine
populations found rs4728142 among the strongest independent signals P=1.34×10⁻⁸,
OR=1.22 (95% CI 1.14–1.30)55 P=1.34×10⁻⁸,
OR=1.22 (95% CI 1.14–1.30)
Pooled analysis across European, Asian, and admixed populations
from multiple case-control studies, placing it
among the most replicated non-HLA genetic risk factors for lupus.
In primary Sjögren's syndrome, the CGGGG indel (in LD with rs4728142-A) was found to
fully explain the IRF5-Sjögren's association in two independent Norwegian and British cohorts,
yielding OR=2.00 (95% CI 1.5–2.7, P=6.6×10⁻⁶)66 OR=2.00 (95% CI 1.5–2.7, P=6.6×10⁻⁶)
Norheim et al. 2012, Annals of the Rheumatic
Diseases; OR of 2.00 for carrying the 4R CGGGG allele vs the 3R allele.
Carriers of the 4R allele had measurably higher IRF5 mRNA in both peripheral blood and salivary
gland epithelial cells — the tissue most directly relevant to Sjögren's pathology.
In inflammatory bowel disease, the CGGGG indel was the lead signal in an independent IBD
GWAS analysis, with OR=1.81 for overall IBD (P=1.9×10⁻⁵) and OR=2.42 specifically for
ulcerative colitis (P=5.3×10⁻⁸)77 OR=1.81 for overall IBD (P=1.9×10⁻⁵) and OR=2.42 specifically for
ulcerative colitis (P=5.3×10⁻⁸)
Eames et al. 2007, Human Molecular Genetics; replication
in a second cohort OR=1.59. The UC association
is among the strongest single-gene effects in IBD genetics.
In multiple sclerosis, rs4728142 showed independent association across Spanish, Swedish, and Finnish cohorts (combined P<0.001, OR≈1.14), with allele-specific protein binding on EMSA suggesting direct transcription factor recruitment at the variant site rather than passive LD.
In systemic sclerosis, the three-SNP IRF5 haplotype incorporating rs4728142 reached
OR=1.75 (P=9.04×10⁻²²)88 OR=1.75 (P=9.04×10⁻²²)
Dieudé et al. 2013, PLOS ONE; five-country European study including
Spain, Germany, Netherlands, Italy, and UK, exceeding
the effect of rs10488631 alone and confirming additive contributions from the 5' and 3' haplotype
blocks.
Practical Implications
The A allele at rs4728142 acts as an immune-system volume dial stuck at a higher setting. The overexpressed IRF5 protein amplifies both the initial interferon alarm response and subsequent pro-inflammatory cytokine cascades (TNF-α, IL-6, IL-12). For most carriers this translates to modest increases in autoimmune disease susceptibility across multiple organ systems — not a predetermined path to disease, but a meaningful shift in the immune threshold.
Because rs4728142 affects the IRF5 enhancer rather than the coding sequence, its effects are partly modifiable by context: the immune system's activation state, the presence of viral triggers (particularly Epstein-Barr virus, strongly implicated in SLE pathogenesis), and environmental modulators of interferon tone including vitamin D. The discovery that CRISPR interference with this enhancer can attenuate lupus-associated cytokine production in patient monocytes indicates that this regulatory region is being studied as a potential therapeutic target.
Population stratification is striking at this locus: European and Latino populations carry the A allele at approximately 44% frequency — nearly half the population — while East Asian populations carry it at only 13%. This mirrors the population-level differences in autoimmune disease prevalence and suggests that variation at IRF5 contributes to the well-documented ethnic disparities in lupus and related conditions.
Interactions
rs4728142 functions within the IRF5 three-block haplotype architecture alongside rs2004640 (exon 1B splice site) and rs10488631 (3' haplotype tag). The 5' block tagged by rs4728142 and the 3' block tagged by rs10488631 operate independently and additively: individuals carrying both the rs4728142-A risk allele and the rs10488631-C risk allele carry contributions from both haplotype blocks, with combined effects substantially exceeding those of either alone. The three-block SSc haplotype data illustrating OR=1.75 for the combined haplotype versus OR=1.63 for rs10488631 alone quantifies this additive contribution.
Downstream of IRF5, STAT4 (rs7574865) amplifies cellular responsiveness to the interferons that IRF5 drives. IRF5 variants increase interferon production; STAT4 variants increase cellular sensitivity to that interferon. Studies in primary Sjögren's syndrome and SLE demonstrate striking additive effects: with all five IRF5 + STAT4 risk alleles, OR for Sjögren's reaches 6.78. rs4728142 risk carriers with concurrent STAT4 rs7574865 risk alleles thus face a compounded immune activation burden — elevated production and elevated responsiveness to what is produced.
The autoimmune pleiotropic nature of rs4728142 — associated with lupus, Sjögren's, IBD, MS, SSc, and RA — means the variant likely exerts a shared molecular mechanism (IRF5 overactivation) that manifests as different diseases depending on which other genetic and environmental factors are present. Individuals carrying this risk allele who develop symptoms in one autoimmune domain should be aware of overlapping risk across organ systems.
The Angiotensinogen Variant — Blood Pressure Genetics in the Renin-Angiotensin System
Angiotensinogen (AGT) is the essential precursor protein of the renin-angiotensin system (RAS), the body's
primary long-term regulator of blood pressure and fluid balance. The kidney enzyme renin cleaves AGT to
produce angiotensin I, which is then converted by ACE to angiotensin II11 angiotensin II
the vasoactive peptide that
raises blood pressure by constricting blood vessels and signaling the adrenal glands to release
aldosterone. The T174M variant (rs4762) is one of two
well-studied missense changes in AGT — alongside [M235T (rs699) | the more frequently studied AGT
variant, which is in strong linkage disequilibrium with T174M in many populations] — and both have been
investigated for decades as candidates for genetically elevated blood pressure.
The Mechanism
AGT sits on the minus strand of chromosome 1 at position 230,710,231 (GRCh38). The rs4762 G>A change
on the plus strand produces a Threonine-to-Methionine substitution at position 198 of the angiotensinogen
precursor22 Threonine-to-Methionine substitution at position 198 of the angiotensinogen
precursor
also described as T174M in older literature counting from the mature protein after signal
peptide cleavage. This amino acid change in the AGT protein is
thought to affect its rate of cleavage by renin and its plasma concentration33 plasma concentration
carriers of the A allele
tend to have higher circulating angiotensinogen levels,
shifting the equilibrium of the entire RAS toward higher angiotensin II output. Higher angiotensin II
drives sustained increases in vascular tone, sodium retention, and — over years — contributes to
hypertension and end-organ damage.
The Evidence
The AGT gene's role in hypertension was first established in 199244 first established in 1992
Jeunemaitre et al., "Molecular
basis of human hypertension: role of angiotensinogen." Cell 71:169-80,
when genetic linkage between AGT variants and hypertension was demonstrated in two large independent
family panels. This landmark paper showed that AGT molecular variants were inherited predispositions to
essential hypertension and that plasma AGT concentrations differed by genotype.
The cardiovascular consequences of rs4762 were quantified in a
2021 meta-analysis of 7,657 subjects across 11 studies55 2021 meta-analysis of 7,657 subjects across 11 studies
Li et al., "Myocardial Infarction and AGT
p.Thr174Met Polymorphism: A Meta-Analysis of 7657 Subjects." Cardiovascular Therapeutics
2021. The A (Met174) allele significantly increased
myocardial infarction risk: OR 2.26 (95% CI 1.35–3.77) under the recessive model, OR 1.13 (95% CI
1.02–1.26) under the dominant model, and OR 1.36 (95% CI 1.13–1.64) under the additive model. Effects
were strongest in Asian populations; Caucasian subgroup analyses did not reach statistical significance.
Park et al. 201366 Park et al. 2013
"Assessment of two missense polymorphisms (rs4762 and rs699) of the angiotensinogen
gene and stroke." Experimental and Therapeutic Medicine 5:343-349
examined 197 stroke patients and 301 controls, finding that the rs4762 A (T allele on coding strand)
allele was associated with [intracerebral hemorrhage | 16.2% allele frequency in ICH vs 9.6% in
controls, P=0.021] and correlated with worse neurological severity scores. Ischemic stroke was not
significantly associated.
In a Mexican cohort of 546 adults with diabetic nephropathy,
Vázquez-Moreno et al. 202177 Vázquez-Moreno et al. 2021
"AGT rs4762 is associated with diastolic blood pressure in Mexicans
with diabetic nephropathy." Journal of Diabetes and Its Complications 35(3)
found that the A allele predicted higher diastolic blood pressure specifically in those with established
kidney disease (β=2.84, P=0.026), an important finding because diastolic hypertension drives
progression of diabetic nephropathy.
A 2018 study in Han Chinese women88 2018 study in Han Chinese women
Zhou et al., Biomed Res Int 2018, n=156 preeclampsia / 286
controls identified rs4762 as one of seven alleles
significantly associated with preeclampsia susceptibility. Notably, a large
2016 meta-analysis of 95 case-control studies99 2016 meta-analysis of 95 case-control studies
Zhang et al., Medical Science Monitor,
n=16,646 PE patients / 28,901 controls found no overall
significant association of rs4762 with preeclampsia across all ancestries combined, suggesting
population-specific effects — consistent with the pattern seen for MI risk.
Practical Implications
The A allele raises circulating angiotensinogen and tips the RAS toward sustained vasoconstriction. For carriers, this is not destiny — the RAS is highly modifiable by both lifestyle and medication. RAS-targeting medications (ACE inhibitors, ARBs) act downstream of AGT and are among the most effective antihypertensives available. Dietary sodium directly amplifies RAS activation: each gram of excess sodium per day substantially raises angiotensin II-driven pressure in those with genetically elevated AGT. Conversely, [low-sodium dietary patterns | below 2,300 mg/day, or ideally 1,500 mg/day for those at elevated cardiovascular risk] are among the most potent non-pharmacological strategies for blunting AGT-mediated pressure elevation.
Interactions
rs4762 is in linkage disequilibrium with rs699 (M235T)1010 linkage disequilibrium with rs699 (M235T)
the two variants are often co-inherited and
studied as a haplotype block. Haplotype analyses
consistently show stronger associations with blood pressure and cardiovascular disease than either
SNP alone. The AGT gene also interacts with [rs5186 (AGTR1 A1166C) | the angiotensin II type 1
receptor variant] — when both the AGT signal peptide and the AT1 receptor are variant, the
downstream pressor response is amplified. In women, AGT haplotypes interact significantly with
hypertension status to modulate coronary artery disease risk, a gene-environment interaction that
may partly explain sex-specific cardiovascular risk patterns.
TMPRSS6 D512E — Iron Absorption's Hidden Brake
Iron deficiency is the world's most common nutritional deficiency, yet how much iron your gut
actually absorbs is controlled less by what you eat and more by a hormone called
hepcidin11 hepcidin
A 25-amino-acid peptide produced by the liver that acts as the master regulator
of systemic iron homeostasis; high hepcidin blocks iron absorption and recycling. The
TMPRSS6 gene encodes matriptase-222 matriptase-2
A type II transmembrane serine protease expressed
primarily in the liver that cleaves hemojuvelin, suppressing hepcidin production, whose
job is to keep hepcidin in check by cleaving a cell-surface protein called
hemojuvelin33 hemojuvelin
A co-receptor for bone morphogenetic proteins (BMPs) that, when intact on the
hepatocyte surface, drives hepcidin gene transcription. The rs4820268 variant — a single
nucleotide change in the protein-coding sequence of TMPRSS6 — alters one amino acid in
matriptase-2 and shifts the setpoint of this hormonal control system, with measurable
consequences for iron status that play out across the lifespan.
The Mechanism
The A allele of rs4820268 changes aspartic acid to glutamic acid at position 512 of the
matriptase-2 protein (p.Asp512Glu). This substitution falls within the functional region of
the enzyme and impairs its ability to suppress hepcidin production. With less matriptase-2
activity, hemojuvelin remains intact on hepatocyte surfaces, activating the
BMP/SMAD signaling pathway44 BMP/SMAD signaling pathway
Bone morphogenetic protein/SMAD — a signaling cascade that
drives transcription of the hepcidin gene HAMP in liver cells and sustaining elevated
hepcidin levels. Higher hepcidin causes ferroportin55 ferroportin
The only known mammalian iron exporter,
expressed on gut enterocytes, macrophages, and hepatocytes; hepcidin binds ferroportin, triggers
its internalization and degradation, blocking cellular iron export degradation in gut
enterocytes, reducing the amount of dietary iron that crosses from gut lining into blood.
The effect is additive: each additional A allele incrementally reduces matriptase-2 activity, raises hepcidin, and lowers net iron absorption. The GG genotype (no amino acid change, synonymous) represents wild-type enzyme function.
The Evidence
A 2025 systematic review66 2025 systematic review
Fauzan R et al. Impact of TMPRSS6 Genetic Variants on Maternal Iron
Status in Pregnancy: A Systematic Review. Birth Defects Research,
2025 of seven studies encompassing 1,094 pregnant
participants found that the A allele consistently correlated with lower serum iron, reduced
transferrin saturation, and elevated unsaturated iron-binding capacity. The review further
identified associations with increased risks of iron-deficiency anemia, gestational diabetes
mellitus, and preeclampsia — outcomes mediated through impaired iron absorption and downstream
metabolic disturbances.
A recall-by-genotype study77 recall-by-genotype study
Jallow MW et al. Common Variants in the TMPRSS6 Gene Alter
Hepcidin but not Plasma Iron in Response to Oral Iron in Healthy Gambian Adults. Current
Developments in Nutrition, 2021 in 251 adults
directly confirmed the hepcidin mechanism: AA homozygotes had baseline hepcidin of 9.50 ng/mL
compared to 3.27 ng/mL in GG homozygotes (P = 0.002) — nearly a three-fold difference. This
study also found that after a large oral iron dose (130 mg elemental iron), TMPRSS6 genotype
altered the hepcidin response but did not fully overcome the absorption disadvantage.
An Egyptian study of 160 children (Hamed et al. 202488 Hamed et al. 2024
Hamed HM et al. The association of
TMPRSS6 gene polymorphism with iron status in Egyptian children. BMC Pediatrics,
2024) found the GG genotype was linked to
the highest hepcidin gene expression, lowest serum ferroportin, and lowest iron stores,
confirming that the G allele (without the Asp512Glu change) confers the greatest risk in
this specific direction in some ancestry contexts — underscoring the importance of considering
both variants and population context together.
A Chinese Han study of 1,574 adults (Gan et al. 201299 Gan et al. 2012
Gan W et al. Association of TMPRSS6
polymorphisms with ferritin, hemoglobin, and type 2 diabetes risk in a Chinese Han population.
AJCN, 2012) found rs4820268 significantly
associated with plasma ferritin (P ≤ 0.006), hemoglobin (P ≤ 0.001), and iron overload risk
(P ≤ 0.007), also detecting a link to type 2 diabetes risk (P ≤ 0.031) — likely mediated
through iron's role in insulin signaling.
Practical Implications
For most people with adequate dietary iron and normal iron demand, even the AA genotype does not guarantee iron deficiency — the body's iron regulatory system has multiple compensatory mechanisms. The risk becomes clinically meaningful when demand increases (menstruation, pregnancy, adolescent growth) or intake is marginal (plant-based diets, food insecurity, malabsorption). In these contexts, impaired matriptase-2 function can tip the balance toward depleted iron stores faster than average.
Practical strategies to compensate: pair non-heme iron sources with vitamin C (which reduces ferric to ferrous iron and forms a soluble chelate resistant to hepcidin-mediated inhibition); prefer heme iron (meat, organ meats, shellfish) where possible — heme iron enters enterocytes via HCP1 rather than through ferroportin, making it less sensitive to elevated hepcidin; avoid tea, coffee, and calcium supplements within 60 minutes of iron-rich meals. For supplementation, iron bisglycinate is partially absorbed via peptide transporters, offering a partial bypass of the hepcidin-ferroportin bottleneck.
Serum ferritin is the most sensitive early marker of iron depletion; it falls before hemoglobin drops, allowing intervention before anemia develops.
Interactions
rs4820268 is in linkage disequilibrium with rs855791 — the other major TMPRSS6 coding variant (Ala736Val) — and the two together explain more variance in iron status than either alone. Carriers of risk alleles at both loci have compounded impairment of matriptase-2 function. In the context of hereditary hemochromatosis (HFE C282Y or H63D variants, rs1800562 or rs1799945), TMPRSS6 risk alleles may partially counteract the pathological iron overload driven by reduced hepcidin from HFE mutations — the same hepcidin-raising effect that impairs absorption in healthy people acts as a modest brake on iron loading in hemochromatosis carriers.
LDLR's Hidden Splice Switch — When Silent Mutations Aren't Silent
The low-density lipoprotein receptor (LDLR) pulls cholesterol-carrying LDL particles out of the bloodstream and into liver cells for clearance. Most people know that rare LDLR mutations cause familial hypercholesterolemia, a severe inherited condition of massively elevated cholesterol. But LDLR is also shaped by a network of common regulatory variants that subtly tune how much functional receptor your liver produces — variants that don't alter the protein sequence at all, yet still affect cholesterol levels measurably.
rs5925 is one such variant. Located in exon 13 of LDLR, it produces a synonymous change (c.1959T>C, p.Val653=): both alleles encode valine, so the protein sequence is identical regardless of which allele you carry. Yet [multiple population studies across Chilean, Egyptian, South Indian, and Chinese cohorts | Rojas 2019; Alsabbagh 2022; Jha 2019; Wang 2023] have associated the C allele with higher LDL cholesterol and cardiovascular risk. The molecular explanation involves mRNA splicing — a key step in gene regulation that occurs after transcription but before the protein is made.
The Mechanism
Synonymous variants can profoundly affect gene output by disrupting exonic splicing enhancers
(ESEs)11 exonic splicing enhancers
(ESEs)
Short RNA sequences within exons that signal to the spliceosome to include the exon in
the mature mRNA. If an ESE is disrupted, the
spliceosome may skip the exon, producing a truncated or non-functional protein.
The Lee et al. 2014 minigene study22 Lee et al. 2014 minigene study
Lee JD et al. Mutual effect of rs688 and rs5925 in
regulating low-density lipoprotein receptor splicing. DNA Cell Biol, 2014
demonstrated that rs5925 and the nearby synonymous variant rs688 (exon 12) jointly control LDLR
exon splicing efficiency. Using minigene constructs in human cell lines and confirmed in
leukocyte RNA from patients, they measured how efficiently exon 12 and exon 13 were retained in
mature LDLR mRNA across all four possible haplotype combinations:
- rs688-C / rs5925-C: 79.6% splicing efficiency (highest)
- rs688-C / rs5925-T: 76.7%
- rs688-T / rs5925-C: 69.0%
- rs688-T / rs5925-T: 68.5% (lowest)
rs5925 accounts for approximately 5.4% of splicing efficiency difference when rs688 carries the C allele. Critically, rs688-T (itself associated with reduced exon 12 inclusion) and rs5925-C rarely co-occur in high-splicing haplotypes in the populations where C-allele LDL elevation is observed — the predominant clinical haplotype combines rs688-T with rs5925-C, creating the TC combination with 69% efficiency versus the protective CC combination at 79.6%. This haplotype context explains why the population-level C allele effect on LDL is adverse despite rs5925-C appearing more efficient when paired with rs688-C.
The Evidence
Clinical evidence from multiple independent cohorts consistently links the C allele with higher
LDL cholesterol. In a study of 178 healthy subjects from northern Chile33 study of 178 healthy subjects from northern Chile
Rojas C et al.
JCLA, 2019, individuals carrying at least one T
allele had significantly lower total cholesterol, triglycerides, and LDL-C than CC homozygotes
(p<0.05). The distribution in this Latin American population (CC 19%, CT 53%, TT 28%) closely
matches Hardy-Weinberg predictions for a C allele frequency of ~45-53%, consistent with global
gnomAD data.
In a case-control study of 400 South Indians44 case-control study of 400 South Indians
Jha CK et al. Med Sci, 2019
(200 coronary artery disease patients, 200 matched controls), the heterozygous genotype at
rs5925 was significantly protective against CAD (OR=0.45, 95% CI 0.27-0.75, p=0.002),
demonstrating that the T allele confers cardiovascular protection beyond lipid levels alone.
An Egyptian study of 100 lupus nephritis patients55 Egyptian study of 100 lupus nephritis patients
Alsabbagh YA et al. Arch Rheumatol, 2022
found the C allele significantly more prevalent in patients than healthy controls (60% vs 45%,
p=0.003), with CC genotype carriers showing higher atherogenic index and LDL/HDL ratios. TT
and CT genotypes showed significantly lower TC, TG, and LDL-C than CC homozygotes.
A 2023 Chinese adolescent study66 2023 Chinese adolescent study
Wang J et al. Int J Mol Sci, 2023
involving 709 students confirmed that male C allele carriers had elevated total cholesterol and
LDL-C compared to TT homozygotes in the control group.
Notably, one Chilean pharmacogenomics study77 Chilean pharmacogenomics study
Lagos J et al. Int J Mol Sci, 2015
examining atorvastatin response in 139 hypercholesterolemic subjects found no significant
association between rs5925 and statin-induced LDL reduction (p=0.576). This suggests the
variant's effect on LDLR expression levels does not meaningfully alter the pharmacodynamic
response to statins, which primarily works through a separate mechanism (inhibiting cholesterol
synthesis and upregulating LDLR protein quantity rather than splicing efficiency).
Practical Actions
For CC homozygotes, the elevated LDL risk is modest in absolute terms — this is not a pathogenic variant, and the LDL difference between genotypes is in the range of a few mg/dL to perhaps 10-15 mg/dL in studies with the largest effects. The practical implication is attentiveness: more frequent LDL monitoring, prompt action when cholesterol trends upward, and consideration of dietary and pharmaceutical interventions earlier rather than later.
Dietary strategies with the strongest LDL-lowering evidence independent of genotype — soluble fiber, plant sterols, reduced saturated fat — are particularly relevant for CC carriers since their genetic baseline LDLR efficiency is lower. These interventions enhance cholesterol clearance through mechanisms that can partially compensate for reduced splicing efficiency.
Because statin response does not appear to track with rs5925 genotype, standard lipid management guidelines apply to CC homozygotes who require pharmacotherapy.
Interactions
rs5925 functions within a two-SNP regulatory module with rs688. The splicing efficiency of LDLR exon transcripts is jointly determined by both variants, with rs688 being the dominant partner (accounting for ~9.4% vs rs5925's ~5.4% of the variation in splicing efficiency). Together, the rs688-T / rs5925-C haplotype produces the lowest splicing output. Individuals carrying both risk alleles (rs688-T and rs5925-C) have the most reduced LDLR mRNA splicing efficiency and may face compounded LDL-raising effects.
This variant also operates in the context of the broader LDLR regulatory landscape. rs6511720 (intron 1) modulates LDLR transcription through a sterol response element — it affects how much mRNA is produced before splicing even occurs. A CC homozygote at rs5925 who also carries the rs6511720 GG genotype (common, lower-expression) would experience a compounded reduction in functional LDLR output: reduced transcription AND reduced splicing efficiency. Conversely, rs6511720-T carriers (enhanced transcription) may partially offset the splicing inefficiency of the rs5925-C allele by producing more total transcript.
Factor IX Malmö — The X-Linked DVT Modifier with an Unknown Mechanism
Coagulation factor IX sits at the centre of the intrinsic pathway of blood clotting. When
activated by factor XIa, it forms the tenase complex with factor VIIIa, which in turn
activates factor X to trigger fibrin clot formation. Severe loss-of-function mutations in F9
cause haemophilia B (Christmas disease). Factor IX Malmö — rs6048, the common p.Thr194Ala
missense variant11 p.Thr194Ala
missense variant
An amino acid change from threonine to alanine at position 194 of the
canonical pre-pro-protein; also reported as Ala148Thr in older literature using mature-protein
numbering that excludes the signal peptide and propeptide —
is a far subtler story: a common polymorphism that modestly shifts clotting risk in the
general population without causing haemophilia.
Because F9 is located on the X chromosome, males carry only one copy (hemizygous), while females can carry zero, one, or two copies of the protective G allele. Genotyping chips typically report hemizygous males as homozygous in the raw data, so a male reported as GG carries exactly one G allele, and a male reported as AA carries one A allele.
The Mechanism
At position 194 in the pre-pro-factor IX sequence, the common A allele encodes threonine;
the protective G allele encodes alanine. This substitution is located in the EGF-1 domain
of the factor IX protein, a region involved in calcium binding and factor VIIIa interaction.
Despite this structurally interesting location, a key finding from the primary association
study is that factor IX antigen levels and factor IX activation peptide levels — markers
of factor IX production and activation — did not differ significantly between AA and GG
carriers22 factor IX antigen levels and factor IX activation peptide levels — markers
of factor IX production and activation — did not differ significantly between AA and GG
carriers
Bezemer et al. 2009 measured both factor IX antigen and FIXa activation peptide
in controls from the LETS (n=191) and MEGA (n=823+484) studies; no significant genotype
effect. This means the mechanism of the G
allele's protective effect is genuinely unknown — it is not simply producing less factor IX
or generating less activated factor IX. Whether the alanine substitution subtly alters
factor IX's interaction with factor VIIIa, phospholipid membranes, or inhibitory proteins
such as antithrombin remains to be determined.
An adjacent intronic variant (rs422187) showed similar association to DVT in the Bezemer study, raising the possibility that rs6048 may partly or wholly tag a functional non-coding variant in linkage disequilibrium, rather than being the causal change itself.
The Evidence
The key association study is the 2009 analysis by Bezemer and colleagues33 2009 analysis by Bezemer and colleagues
Irene D Bezemer
et al., "F9 Malmö, factor IX and deep vein thrombosis," Haematologica 2009;94(5):693–9.
Combined LETS and MEGA case-control studies with 380+1,469 male cases and factor IX
measurements in 191+823+484 controls, which
combined two Dutch case-control studies (LETS and MEGA) to demonstrate that the G allele
of rs6048 was associated with a 20% reduction in DVT odds: OR 0.80 (95% CI 0.69–0.93).
This finding was subsequently replicated in large GWAS datasets. The Klarin 2019
genome-wide association study44 genome-wide association study
Derek Klarin et al., "Genome-wide association analysis of
venous thromboembolism identifies new risk loci and genetic overlap with arterial vascular
disease," Nature Genetics 2019;51:1574–1579. Over 650,000 participants across the Million
Veteran Program and UK Biobank identified the
rs6048-A allele among 33 genomic loci associated with VTE risk (OR ~1.07). The cross-ancestry
replication by Thibord et al. 202255 Thibord et al. 2022
Florian Thibord et al., "Cross-Ancestry Investigation
of Venous Thromboembolism Genomic Predictors," Circulation 2022;146:1225–1242. 135 VTE loci
identified across European, African, and Hispanic cohorts
confirmed rs6048 across European, African, and Hispanic populations.
Importantly, a study of VTE recurrence in 2,185 patients66 study of VTE recurrence in 2,185 patients
Roach et al. 2015, J Thromb
Haemost 2015;13(10):1815–22. Four European cohorts: MEGA, Leiden Thrombophilia Study,
Scottish Thrombophilia Study, Vienna Thrombosis Research Group
found that Factor IX Malmö did not explain the observed sex difference in VTE recurrence risk.
This suggests the variant's modest protective effect applies to first VTE events but does
not substantially alter recurrence dynamics.
The effect size is modest — OR ~0.80 for DVT first event — and the clinical significance classification by the ClinGen Coagulation Factor Deficiency Variant Curation Expert Panel (4-star review, February 2024) is benign for haemophilia B and protective for DVT. This is not a variant that eliminates thrombosis risk; it shifts it modestly on a population scale.
Practical Implications
For carriers of the GG genotype (females with two protective alleles) or males with the G allele, the practical implication is limited: a modest downward adjustment in baseline DVT risk. This does not substitute for standard DVT prevention measures during high-risk situations (surgery, prolonged immobility, hormonal contraceptives), particularly if other prothrombotic variants are present (Factor V Leiden, prothrombin G20210A).
For AA homozygotes and AG heterozygotes, no specific intervention is warranted based on this variant alone, since the common A allele simply represents population-average coagulation factor IX function.
The G allele frequency varies dramatically by ancestry — approximately 30% in Europeans and Ashkenazi Jews, 18% in South Asians, 13% in Africans, and less than 0.2% in East Asians. This means the protective variant is essentially absent in East Asian populations, and population-level DVT risk attributable to this locus differs substantially by ancestry.
Interactions
The most clinically relevant interactions are with other coagulation pathway variants. Factor V Leiden (rs6025) and prothrombin G20210A (rs1799963) are the two most common inherited thrombophilias and act at different points in the coagulation cascade. If a carrier of rs6048-G also carries prothrombotic variants, the modest protective effect of the G allele would be expected to partially, but incompletely, offset the elevated risk. The magnitude of this offset has not been directly quantified in published studies.
The adjacent intronic SNP rs422187 is in strong linkage disequilibrium with rs6048 and showed comparable DVT association in the Bezemer cohort. These variants cannot be distinguished functionally with current evidence.
HSD17B13 rs6834314 — A Tag SNP Telling the Same Story as rs72613567
In 2011, a landmark genome-wide association study scanning 61,089 individuals of Caucasian
and Asian Indian descent identified an intergenic variant near the HSD17B13 gene on
chromosome 4 as one of the strongest signals for plasma ALT concentrations — a key marker
of liver injury:
Chambers JC et al. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma. Nature Genetics, 201111 Chambers JC et al. Genome-wide association study identifies loci influencing concentrations of liver enzymes in plasma. Nature Genetics, 2011.
That variant was rs6834314, and its protective G allele was associated with lower liver
enzyme levels in the general population. However, subsequent functional work established
that rs6834314 itself has no direct effect on HSD17B13 gene expression or enzyme activity.
Its associations are entirely a consequence of
linkage disequilibrium22 linkage disequilibrium
Two variants are in LD when they co-occur on the same chromosome more often than expected by chance; r² measures how predictively one tags the other
with the causal splice variant rs72613567 (D'=0.995, r²=0.93). Knowing your
rs6834314 genotype tells you almost exactly what your rs72613567 genotype is —
these two variants are nearly perfect proxies for each other.
The Mechanism
rs6834314 is located approximately 11 kb downstream of the HSD17B13 gene in an intergenic
region and has no predicted regulatory or coding consequence. The variant's associations
with liver disease risk, liver inflammation, and liver enzyme levels are entirely attributable
to its co-inheritance with rs72613567, the
adenine insertion adjacent to the splice donor site of intron 633 adenine insertion adjacent to the splice donor site of intron 6
rs72613567 disrupts the splice donor site, generating an aberrant truncated transcript (isoform D) that encodes a non-functional HSD17B13 protein with reduced enzymatic activity
in HSD17B13. This was confirmed by Ma et al. (2019), who showed that rs6834314 genotype
was not associated with hepatic HSD17B13 expression levels after accounting for
the rs72613567 splice variant.
The Evidence
The clinical signal captured by rs6834314 mirrors the rs72613567 findings precisely, because the two variants tag the same underlying haplotype. In 768 Caucasian patients with biopsy-proven NAFLD, Ma et al. found the rs6834314 G allele (tagging the HSD17B13 loss-of-function haplotype) was associated with lower risk of liver inflammation (OR 0.77), reduced hepatocyte ballooning (OR 0.67), fewer Mallory-Denk bodies (OR 0.68), and lower serum transaminases and GGT. In the general population cohort (Michigan Genomics Initiative), the G allele was associated with a 21% reduction in cirrhosis risk (OR 0.79, p=7.5×10⁻⁴): Ma Y et al. 17-Beta Hydroxysteroid Dehydrogenase 13 Is a Hepatic Retinol Dehydrogenase Associated With Histological Features of Nonalcoholic Fatty Liver Disease. Hepatology, 201944 Ma Y et al. 17-Beta Hydroxysteroid Dehydrogenase 13 Is a Hepatic Retinol Dehydrogenase Associated With Histological Features of Nonalcoholic Fatty Liver Disease. Hepatology, 2019.
In a multi-ethnic Asian cohort, Seto et al. followed 165 biopsy-proven NAFLD patients for a mean of 89 months and found that each rs6834314 G allele was associated with lower odds of NAFLD (adjusted OR 0.59, 95% CI 0.40–0.87) and NASH (adjusted OR 0.48, 95% CI 0.31–0.75). Homozygous GG individuals showed markedly lower liver-related complication rates during follow-up. However, the protective effect was observed only in ethnic Chinese, not in Malays or Indians within the same cohort — likely reflecting population-specific LD patterns between rs6834314 and the causal rs72613567 insertion: Seto WK et al. Loss-of-function HSD17B13 variants, non-alcoholic steatohepatitis and adverse liver outcomes: results from a multi-ethnic Asian cohort. Clinical and Molecular Hepatology, 202155 Seto WK et al. Loss-of-function HSD17B13 variants, non-alcoholic steatohepatitis and adverse liver outcomes: results from a multi-ethnic Asian cohort. Clinical and Molecular Hepatology, 2021.
In Japanese patients with biopsy-proven NAFLD (n=290), rs6834314 G allele carriage abolished the fibrosis-promoting effect of PNPLA3 I148M: among HSD17B13 AA carriers, PNPLA3 GG conferred significantly higher advanced fibrosis risk (OR 2.4, p=0.041); among HSD17B13 AG/GG carriers, no such PNPLA3 effect was detected. The G allele was also associated with lower inflammation and ballooning prevalence: Seko Y et al. Attenuated effect of PNPLA3 on hepatic fibrosis by HSD17B13 in Japanese patients with non-alcoholic fatty liver disease. Liver International, 202066 Seko Y et al. Attenuated effect of PNPLA3 on hepatic fibrosis by HSD17B13 in Japanese patients with non-alcoholic fatty liver disease. Liver International, 2020.
Practical Implications
The clinical interpretation of rs6834314 is structurally identical to that of rs72613567, because they tag the same biological state. If your genome file includes rs6834314 but not rs72613567 (a scenario most likely on older Illumina arrays that predated specific addition of the HSD17B13 locus), your rs6834314 genotype tells you the same information: whether you carry the HSD17B13 loss-of-function haplotype. The G allele protective association with liver inflammation and fibrosis is real — it just originates from a functionally upstream cause.
One important caveat specific to rs6834314: because protection depends on LD, and LD patterns can differ between ancestral populations, the G allele is a reliable proxy for rs72613567 in European and East Asian populations (r²=0.93) but may be a less accurate proxy in other ancestries, particularly in populations not well-represented in the original LD reference panels.
Interactions
The most clinically important interaction is between the HSD17B13 loss-of-function haplotype (tagged by rs6834314 G) and PNPLA3 rs738409 (I148M). The high-risk PNPLA3 GG genotype strongly predicts advanced fibrosis in HSD17B13 AA carriers but not in G allele carriers — meaning the HSD17B13 protective haplotype functionally overrides much of the PNPLA3 fibrosis risk. This same interaction was originally described for rs72613567 in the Abul-Husn NEJM 2018 cohort and replicated here using rs6834314 as the proxy.
See also rs58542926 (TM6SF2 E167K), which operates in the same hepatic lipid droplet biology axis and further stratifies liver disease risk when combined with PNPLA3 and HSD17B13 genotypes.
DHCR7 Vitamin D Synthesis — Array Coverage Proxy Variant
Your skin makes vitamin D through a two-step process: ultraviolet B
light converts 7-dehydrocholesterol (7-DHC)11 7-dehydrocholesterol (7-DHC)
A cholesterol precursor
concentrated in the outer skin layers. It is the substrate for both
vitamin D synthesis and DHCR7-mediated cholesterol production
in the outer skin into previtamin D3, which spontaneously rearranges
into vitamin D3 (cholecalciferol). But the same 7-DHC molecule is
also the substrate for DHCR7 (7-dehydrocholesterol reductase), the enzyme
that converts it into cholesterol — making DHCR7 a molecular switch
governing how much sunlight exposure translates into vitamin D.
rs7944926 is an intronic variant in the NADSYN1 gene on chromosome
11, situated within the same large haplotype block22 haplotype block
A segment of DNA
inherited as a unit, typically because recombination within the block
is rare. SNPs within a haplotype block track together through generations
as the canonical DHCR7 vitamin D variant rs12785878, spanning 63–102 kb
depending on the population. The two variants are in near-perfect
linkage disequilibrium33 linkage disequilibrium
LD measures how strongly two variants travel
together in a population. r² ≈ 1.0 means the two SNPs are essentially
interchangeable as genetic proxies for each other (r² ≈ 1.0 in
Europeans), meaning rs7944926 tags exactly the same biological signal as
rs12785878. The A allele at rs7944926 co-segregates with the G allele
at rs12785878 — both associated with lower circulating 25-hydroxyvitamin D.
The Mechanism
The functional effect at this locus is attributed to regulatory changes
that alter DHCR7 transcription or activity. Higher DHCR7 activity
channels more 7-DHC toward cholesterol synthesis via the
Kandutsch-Russell pathway44 Kandutsch-Russell pathway
One of two routes for cellular cholesterol
synthesis. DHCR7 reduces the C7-8 double bond in 7-DHC using NADPH,
producing cholesterol on the smooth endoplasmic reticulum,
leaving less 7-DHC available for UV-driven vitamin D synthesis in the
skin. Cholesterol itself accelerates the proteasomal degradation of
DHCR7 protein, creating a feedback loop: when cholesterol is plentiful,
DHCR7 degrades, 7-DHC accumulates, and vitamin D production is favored.
Risk-allele carriers appear to disrupt this balance, sustaining higher
DHCR7 activity and reducing vitamin D synthesis capacity.
The Evidence
The
landmark 2010 Lancet GWAS55 landmark 2010 Lancet GWAS
Wang TJ et al. Common genetic determinants
of vitamin D insufficiency: a genome-wide association study. Lancet,
2010 in 33,996 Europeans
identified the DHCR7/NADSYN1 locus as one of three genome-wide significant
loci for 25(OH)D concentration (P = 2.1 × 10⁻²⁷). Carriers of two
risk alleles had mean 25(OH)D approximately 8 nmol/L lower than
non-carriers under the same conditions. Each additional risk allele
increased the odds of vitamin D insufficiency (below 75 nmol/L) by
about 21%.
An
evolutionary genetics study66 evolutionary genetics study
Kuan V et al. DHCR7 mutations linked to
higher vitamin D status allowed early human migration to northern latitudes.
BMC Evol Biol, 2013
demonstrated that rs7944926 specifically showed signatures of strong
positive selection in European and East Asian populations, with
FST values77 FST values
FST measures allele frequency differentiation between
populations. Values above the 95th percentile relative to genome-wide SNPs
indicate selection pressures, not just genetic drift
above the 95th percentile on chromosome 11. The G allele (protective,
higher vitamin D) rose dramatically in frequency at northern latitudes
— from ~15–16% in sub-Saharan Africa to ~72% in Europeans — consistent
with selection pressure to maintain vitamin D synthesis in low-UVB
environments. This population differentiation is among the strongest seen
anywhere in the human genome for this region.
The findings have been confirmed in massive replication. A
UK Biobank GWAS88 UK Biobank GWAS
Manousaki D et al. Genome-wide association study for
vitamin D levels reveals 69 independent loci. Am J Hum Genet,
2020 in 401,460 participants
and a concurrent
143-loci study99 143-loci study
Revez JA et al. Genome-wide association study identifies
143 loci associated with 25 hydroxyvitamin D concentration. Nat Commun,
2020 in 417,580 Europeans
both confirmed DHCR7/NADSYN1 among the strongest vitamin D loci.
rs7944926 has been used as a direct instrument in Mendelian randomization
studies of vitamin D-disease relationships, with each A allele associated
with approximately 2–3 nmol/L lower serum 25(OH)D.
Practical Implications
Because rs7944926 is an LD proxy for rs12785878, its clinical meaning is identical to that of its canonical partner: a genetic tendency toward reduced vitamin D3 synthesis from sunlight. The per-allele effect is modest in isolation, but it compounds with other vitamin D pathway variants (CYP2R1 rs10741657 for hepatic hydroxylation, GC rs2282679 for D-binding protein transport, CYP24A1 rs6013897 for active D degradation), limited sun exposure, higher latitude, and darker skin pigmentation.
The A allele does not impair absorption of dietary or supplemental vitamin D, making supplementation an effective countermeasure.
Array Coverage Note
This SNP is included in the GeneOps database as an array coverage proxy. If your genome file includes rs12785878 directly, that entry provides equivalent information. If rs12785878 was not genotyped on your array, rs7944926 captures the same biological signal. Both cannot give discordant results in a person with a high-quality genome file — if both are present, results should be consistent.
Evolutionary Context
The stark frequency gradient of the G (protective) allele — from ~12% in West African populations to ~72% in Europeans and ~42% in East Asians — is one of the clearest signatures of latitude-driven natural selection in the human genome. As populations migrated from equatorial Africa to regions with weaker UV radiation, variants that redirected more 7-DHC toward vitamin D synthesis conferred survival advantages against rickets, immune dysfunction, and reproductive impairment. rs7944926 marks this same adaptive sweep.
Interactions
rs7944926 is functionally equivalent to rs12785878 and participates in the same four-locus vitamin D pathway network. Wang et al. (2010) found that individuals in the highest quartile of a combined genetic risk score across DHCR7, CYP2R1 (rs10741657), GC (rs2282679), and CYP24A1 (rs6013897) had 2.47 times the odds of vitamin D insufficiency compared to the lowest quartile. If you carry risk alleles at multiple loci, compound vitamin D supplementation guidance is warranted.
MAPRE1/EB1 — The Spindle Tracking Protein at the Heart of a PCOS Locus
At the growing tip of every microtubule in a dividing cell sits a small protein called EB1, encoded by MAPRE1. Its job is deceptively simple: ride the plus ends of microtubules and recruit the molecular machinery that keeps the spindle — the apparatus responsible for separating chromosomes — working properly. When EB1 fails to do this job correctly during oocyte meiosis, chromosomes can missegregate, producing aneuploid eggs that either fail to fertilize or result in miscarriage. The rs853854 variant in MAPRE1 marks a locus that a major European GWAS meta-analysis linked to polycystic ovary syndrome susceptibility, connecting a cell-biology workhorse to one of the most common causes of female reproductive difficulty.
The Mechanism
MAPRE1 encodes EB111 EB1
end-binding protein 1; a plus-end tracking (+TIP)
protein that localizes to the growing tips of microtubules throughout the
cell cycle. EB1 acts as a master regulator of the +TIP network, recruiting
downstream effectors to microtubule ends and promoting spindle
assembly. rs853854 is an
intronic variant at the MAPRE1 locus on chromosome 20q11.21 — it does not
change the EB1 protein directly, but tags regulatory variation that likely
influences MAPRE1 expression levels or splicing in relevant tissues.
Single-cell eQTL analysis in immune cells has identified MAPRE1 as a causal PCOS gene with regulatory effects in NK cells; the risk-associated haplotype correlates with reduced MAPRE1 expression in these cells, suggesting broader immune and ovarian effects beyond the classical androgen excess pathway.
The link to oocyte biology is direct. Zhou et al. 202122 Zhou et al. 2021
EB1 Is Essential for Spindle Formation and Chromosome Alignment During
Oocyte Meiotic Maturation in Mice. Microsc Microanal 27:385–391
showed that EB1 protein localizes along the meiotic spindle, mirroring
α-tubulin distribution. When EB1 was depleted using a Trim-Away approach,
the consequences were severe: spindle disorganization, chromosome
misalignment and missegregation, reduced conversion to mature MII oocytes,
and decreased β-catenin at the cortical adherens junctional complex. The
impairment of EB1 function promotes chromosomal loss — fuelling aneuploidy
and potential fertilization failure.
The Evidence
Day et al. 201833 Day et al. 2018
Large-scale genome-wide meta-analysis of polycystic
ovary syndrome suggests shared genetic architecture for different diagnosis
criteria. PLoS Genet 14:e1007813
is the primary source of evidence for this variant. The study combined
10,074 PCOS cases and 103,164 controls of European ancestry in a
fixed-effect inverse-variance-weighted meta-analysis, identifying 14
genome-wide significant PCOS susceptibility loci. The rs853854 locus near
MAPRE1 was among three novel associations discovered, the others being
near PLGRKT and ZBTB16. The authors characterized MAPRE1 as a metabolic
candidate gene that interacts with LRP1 — a receptor that controls
adipogenesis — and may additionally mediate ovarian angiogenesis and
follicle development.
This variant has not yet been replicated at genome-wide significance in independent cohorts of comparable power, and no specific functional variant within the locus has been causally validated. The evidence level is therefore moderate: replicated within the European GWAS framework, biologically plausible via EB1's established spindle role, but lacking the cross-ethnic replication and molecular fine-mapping of more established PCOS loci like DENND1A or FSHR.
The allele frequency differs strikingly by ancestry: the T risk allele reaches 50% in Europeans but only 9% in East Asians and 16% in Africans. This suggests the variant, or a closely linked causal variant, may have experienced different selection pressures across populations.
Practical Actions
Because the MAPRE1 locus influences PCOS risk through a pathway that includes ovarian angiogenesis and potentially oocyte spindle stability, women with the T risk allele who are planning to conceive — or who are undergoing fertility treatment — have actionable considerations. Spindle stability during oocyte maturation is sensitive to oxidative stress and mitochondrial function; coenzyme Q10 supplementation has evidence for improving spindle integrity in eggs under oxidative stress. Anti-Müllerian hormone (AMH) testing combined with antral follicle count assesses the ovarian reserve and follicular health most directly affected by PCOS-related follicular arrest.
The MAPRE1 locus does not operate through the androgen excess pathway that dominates DENND1A-associated PCOS. Women carrying the rs853854 T allele alongside a normal androgen profile should not assume they are protected from PCOS-related reproductive difficulty — the MAPRE1 mechanism likely acts independently through follicular development and potential oocyte quality.
Interactions
The three novel loci identified in Day 2018 — MAPRE1, PLGRKT, and ZBTB16 — likely operate through mechanisms partly distinct from the earlier Chinese GWAS loci (DENND1A, THADA, FSHR/LHCGR). Carrying risk alleles at both MAPRE1 (rs853854) and DENND1A (rs7852296) would represent additive burden from two different PCOS pathways: spindle/follicular development (MAPRE1) and androgen excess/FSH resistance (DENND1A). No published compound effect size exists for this combination, but the biological mechanisms are sufficiently distinct that the combined phenotype may be more complex than either alone.
DNMT3A rs11683424 — Epigenetic Writer and Stress Modulator
Every cell in your body carries the same DNA sequence, yet a liver cell and a neuron behave completely differently. That feat of biological differentiation depends on DNA methylation — the chemical tagging of cytosines 11 Cytosines are one of the four DNA bases; methylation adds a methyl group (–CH₃) to position 5, creating 5-methylcytosine without changing the underlying sequence that silences or activates genes without altering the underlying code. DNMT3A (DNA methyltransferase 3 alpha) is one of the primary enzymes that writes these methyl tags during development and throughout adult life. Variants in DNMT3A alter how efficiently — and where — this enzyme deposits methylation marks, with downstream effects on gene expression, stress-response circuits, and immune cell balance.
The Mechanism
rs11683424 is an intronic variant in DNMT3A, located on chromosome 2 (GRCh38: 25,266,262). Intronic variants can alter 22 Introns are non-coding stretches within genes; variants here can change mRNA splicing, alter regulatory elements, or affect transcript levels without changing the protein sequence pre-mRNA splicing, disrupt internal regulatory elements such as intronic enhancers, or influence transcript stability — all without changing the amino acid sequence of the DNMT3A protein. The exact molecular mechanism for rs11683424 has not been characterized in published literature, but the functional associations documented — altered stress-response gene regulation and shifts in immune cell ratios — are consistent with subtle, tissue-specific changes in DNMT3A expression or isoform balance.
DNMT3A works by de novo methylation: it places methyl marks on previously unmethylated cytosines, primarily in the context of early development, neuronal differentiation, and immune cell programming. In the brain, DNMT3A is expressed in neurons and glia, where it helps establish methylation patterns at genes governing synaptic plasticity and stress-response pathways. In the bone marrow, DNMT3A is essential for normal hematopoietic stem cell differentiation — which explains why somatic DNMT3A mutations are among the most common drivers of clonal hematopoiesis and acute myeloid leukemia, though those are distinct from the germline variant rs11683424.
The Evidence
The stress-modulation evidence for rs11683424 centers on a multi-sample gene-environment
interaction study33 multi-sample gene-environment
interaction study
Pishva E et al. Epigenetic genes and emotional reactivity to daily life events. PLoS One, 2014 across 867 participants
spanning healthy volunteers, population controls, and psychiatric patients. The T allele
consistently buffered the effect of daily stressors on negative affect — carriers reported
reduced emotional reactivity to adverse events across three of five independent cohorts.
This buffering effect is biologically plausible: DNMT3A is expressed in stress-relevant
brain regions, and altered methylation at glucocorticoid-response genes could attenuate
the stress axis.
A second cross-sectional study44 cross-sectional study
Barliana MI et al. DNA methyltransferase 3A gene polymorphism contributes to daily life stress susceptibility. Psychol Res Behav Manag, 2017
in 129 healthy Indonesian adults (genotype distribution: CC 14%, CT 81%, TT 5%)
found that DNMT3A rs11683424 genotype significantly associated with daily life stress
susceptibility (p=0.04). Subjects with the CT genotype were the most frequently
classified under stress conditions. This finding is somewhat at odds with the buffering
narrative from Pishva 2014 — the two studies differ in population, stress measurement
(daily stress scales vs ecological momentary assessment), and direction of effect
framing, highlighting the need for caution in interpreting early-stage evidence.
From a different angle, a large GWAS in childhood leukemia55 GWAS in childhood leukemia
Kachuri L et al. Genetic determinants of blood-cell traits influence susceptibility to childhood acute lymphoblastic leukemia. Am J Hum Genet, 2021
identified rs11683424 as a genome-wide significant locus for neutrophil-to-lymphocyte ratio
(NLR; p=1×10⁻¹¹), with the C allele associated with higher NLR. NLR is an epigenetically
regulated immune metric, and the association underscores that DNMT3A variants influence
immune cell differentiation in vivo. Higher NLR reflects a relative neutrophil predominance
and is often used as an inflammatory marker.
A large case-control study66 large case-control study
Berking AC et al. DNMT3A and 3B variability and panic disorder. J Neural Transm, 2020 in 3,618
individuals found no major association between DNMT3A variants and panic disorder,
though minor protective signals emerged for anxiety dimensions in healthy controls —
consistent with the possibility of small, context-dependent effects.
Practical Actions
The actionable implications of rs11683424 center on supporting DNMT3A enzyme function through adequate methyl-donor supply. DNMT3A requires SAM (S-adenosylmethionine) as its methyl-group donor. SAM is synthesized from methionine and is directly sustained by the one-carbon / methylation cycle — the same pathway fed by folate, B12, B6, and betaine. If this pathway is constrained (e.g., by co-occurring MTHFR variants), DNMT3A activity can be limited by substrate availability, potentially amplifying whatever expression shift rs11683424 imparts.
For T allele carriers interested in stress modulation, the evidence from Pishva 2014 suggests the T allele itself may already buffer emotional stress reactivity — which is not necessarily a disadvantage. The CT and TT genotypes should focus on supporting the methylation substrate pool and monitoring stress-response biomarkers over time.
Interactions
The most relevant interaction is with MTHFR C677T (rs1801133). MTHFR reduces the supply of 5-methylTHF, which is upstream of homocysteine remethylation to methionine, the precursor of SAM. Individuals carrying both reduced-function MTHFR and the DNMT3A rs11683424 T allele face a potential dual constraint: reduced SAM availability plus altered DNMT3A expression. MTRR rs1801394 also participates in methionine regeneration and would compound any SAM limitation.
The DNMT3A rs1465764 variant (also in the same gene) was examined alongside rs11683424 in the Pishva 2014 study, and both moderated stress-response negative affect, suggesting that multiple regulatory variants in DNMT3A may act in concert.
Secondary TCF7L2 Variant
This is the second most-studied variant in the TCF7L2 gene, located in intron 4 approximately 50 kb from the primary variant rs7903146. While rs7903146 is the primary diabetes risk variant, rs12255372 provides additional information about your TCF7L2 haplotype. The two variants are in moderate linkage disequilibrium11 Linkage disequilibrium means these variants tend to be inherited together because they sit close on the same chromosome, within a 92-kb LD block, meaning they are often co-inherited but not always.
The Mechanism
Like rs7903146, this variant sits in a non-coding region and is thought to influence TCF7L2 expression levels, though rs7903146 appears to be the stronger functional driver. The T allele at this position is associated with decreased insulin secretion and impaired incretin response.
The Evidence
A meta-analysis of 28 studies22 meta-analysis of 28 studies
Wang et al. Association of rs12255372 in the TCF7L2 gene with type 2 diabetes mellitus: a meta-analysis. Braz J Med Biol Res, 2013 confirmed the association with
type 2 diabetes with an odds ratio of 1.39 (95% CI: 1.35-1.42). The effect is
consistent across European, African, and South Asian populations but weaker in
East Asian populations where the T allele is rare (~2% frequency).
The Pounds Lost trial33 Pounds Lost trial
Mattei et al. Am J Clin Nutr, 2012 also examined
rs12255372 and found that T allele carriers who consumed a lower-fat diet had
greater reductions in body adiposity, which could improve glycemic control.
Practical Implications
Having risk alleles at both rs7903146 and rs12255372 compounds your overall TCF7L2-related diabetes risk. The dietary recommendations are the same: moderate fat intake and a Mediterranean-style eating pattern.
Interactions
This variant is in moderate linkage disequilibrium with rs7903146. If you carry risk alleles at both positions, your overall TCF7L2-related risk is higher.