CRP Promoter Variant — Genetically Elevated Inflammation at the -286 Position
C-reactive protein is the liver's primary sentinel of systemic inflammation — a pentameric protein synthesized within hours of an IL-6 signal, capable of rising 1,000-fold during acute infection. But the baseline CRP level a person carries when healthy is substantially inherited: genetic factors explain 30–40% of inter-individual variation in resting CRP11 genetic factors explain 30–40% of inter-individual variation in resting CRP
Heritability estimates from twin studies; reviewed in Kathiresan et al. 2006. The rs3091244 variant sits at position -286 in the CRP gene promoter — one of the most functionally important regulatory positions identified for this gene — and is one of the most replicated genetic determinants of circulating CRP concentrations.
This is a triallelic variant, unusual among well-characterized SNPs: the -286 position can carry a C, T, or A allele on the coding strand (equivalent to G, A, or T on the genomic plus strand). Both the T and A coding alleles increase CRP transcription relative to C, with the A allele having the strongest effect. This creates a dosage gradient: CC < CT < TT and CA < AA at this position.
The Mechanism
The -286 position lies within a region of the CRP promoter that binds transcription factors involved in acute-phase gene regulation. Luciferase reporter assays in HepG2 hepatocytes22 Luciferase reporter assays in HepG2 hepatocytes
Wang et al. Frontiers in Immunology 2020 demonstrated that the A and T coding alleles confer significantly higher promoter activity than the C allele, confirming that the variant directly affects transcription rather than mRNA stability or translation. The CRP gene resides on chromosome 1q23.2 and is transcribed from the minus strand; alleles reported in genome files (plus-strand) are the complements of the conventionally cited coding alleles.
The Evidence
In a landmark study of 1,640 Framingham Heart Study participants, Kathiresan et al. (2006)33 Kathiresan et al. (2006) genotyped 13 CRP polymorphisms and found rs3091244 was the single strongest genetic predictor of serum CRP, with haplotypes carrying the T or A alleles associated with significantly higher levels (stepwise P<0.0001). The SNP explained 1.4% of total CRP variability — modest in isolation but clinically meaningful given that the 12 clinical covariates (including BMI) combined explained 26%.
Crawford et al. (Circulation, 2006)44 Crawford et al. (Circulation, 2006) confirmed the association in the diverse NHANES III cohort of 7,159 individuals spanning three ethnic groups. The AA genotype at rs3091244 was associated with elevated serum CRP and, importantly, with prevalent coronary heart disease in non-Hispanic white participants55 prevalent coronary heart disease in non-Hispanic white participants
After covariate adjustment, AA genotype associated with CHD; Crawford et al. 2006, establishing the first direct genetic link between this promoter variant and clinical cardiovascular disease endpoints.
The functional impact is also visible in inflammatory disease contexts. In 244 patients with ankylosing spondylitis66 244 patients with ankylosing spondylitis, those carrying the CA genotype (coding) had CRP levels of 18.6 mg/L versus 8.3 mg/L in CC homozygotes (p=0.02), a difference that persisted independent of disease activity scores — demonstrating that this genetic effect is constitutional, not simply a reflection of inflammatory load.
Saratzis et al. (J Vasc Surg, 2014)77 Saratzis et al. (J Vasc Surg, 2014) showed that carriers of the rare T (coding A) allele had an odds ratio of 4.88 for abdominal aortic aneurysm in the primary cohort, with larger aneurysm diameters and higher circulating inflammatory markers. This suggests the constitutively elevated CRP from this genotype may directly participate in vessel wall inflammation and structural remodeling.
Mendelian randomization analyses using rs3091244 as a genetic instrument88 Mendelian randomization analyses using rs3091244 as a genetic instrument have clarified the causal direction of the BMI–CRP relationship: elevated BMI causally raises CRP, but genetically elevated CRP does not cause weight gain — suggesting CRP is a downstream consequence of metabolic dysfunction rather than a driver of adiposity.
Practical Actions
Knowing your rs3091244 genotype adds resolution to hs-CRP testing. Carriers of the T or A coding allele who show elevated hs-CRP face a compounded risk: genetic predisposition to higher transcription on top of any lifestyle or metabolic contribution. For these individuals, reducing modifiable CRP drivers (visceral adiposity, smoking, physical inactivity) matters more, not less — because the genetic floor is already elevated. Statin therapy lowers CRP by ~37% independent of LDL effects (the JUPITER trial enrolled participants specifically on CRP ≥2 mg/L) and is particularly relevant when genetically elevated CRP co-occurs with borderline LDL.
For GG homozygotes (coding CC), genetic protection does not guarantee low CRP — obesity, smoking, and metabolic syndrome can push any genotype into the elevated range.
Interactions
rs3091244 does not act in isolation. It commonly exists in haplotypes with rs1205, rs1130864, rs2794521, and rs3093059. The combined haplotype architecture — not individual SNPs — best predicts CRP variability in population studies. The rs1205 (3' UTR) variant is the most clinically established companion, and compound haplotype effects have been documented in both cardiovascular and autoimmune disease contexts. If you carry risk alleles at both rs3091244 and rs1205, the additive effect on baseline CRP is larger than either alone.
ACAD9 R532W — Riboflavin-Responsive Mitochondrial Complex I Failure
Inside every cell, the mitochondrial electron transport chain converts the
chemical energy of nutrients into ATP — the universal cellular fuel.
Complex I11 Complex I
NADH:ubiquinone oxidoreductase, the largest of the five
respiratory chain complexes, built from 44 protein subunits and assembled
in a strictly ordered sequence
is the entry point for electrons from NADH. Without functional complex I,
tissues with the highest energy demands — heart muscle, skeletal muscle,
and brain — fail first. ACAD9 is not itself a component of complex I, but
it is one of the critical assembly factors required to build it: the protein
physically interacts with NDUFAF1 and ECSIT to scaffold the assembly of
complex I's proximal module, and its absence leaves the entire complex
unassembled.
The R532W variant — a single cytosine-to-thymine substitution at nucleotide
position 1,594 of the ACAD9 coding sequence — replaces the highly conserved
arginine at protein position 532 with a bulky tryptophan residue.
Protein structural modelling22 Protein structural modelling
Gerards M et al. Riboflavin-responsive
oxidative phosphorylation complex I deficiency caused by defective ACAD9:
new function for an old gene. Brain, 2011
shows that this substitution disrupts a stabilizing hydrogen bond within an
α-helix of the ACAD9 protein, impairing the protein's ability to participate
in complex I scaffolding. In the original family identified by Gerards et al.,
two siblings carried R532W in the homozygous state; a second unrelated
patient carried R532W alongside a different ACAD9 mutation (compound
heterozygous), confirming the allele is pathogenic in both configurations.
The Mechanism
ACAD9 belongs to the acyl-CoA dehydrogenase family and retains weak fatty
acid oxidation enzymatic activity toward long-chain substrates, but its
disease-causing role is firmly established as a complex I assembly factor.
Nouws et al.33 Nouws et al.
Nouws J et al. Acyl-CoA dehydrogenase 9 is required for
the biogenesis of oxidative phosphorylation complex I. Cell Metab,
2010 demonstrated that ACAD9
depletion by RNAi abolished complex I activity while leaving other respiratory
chain complexes intact. ACAD9 assembles into an early intermediate complex
during complex I biogenesis and is not a subunit of the final assembled
complex — it acts transiently during construction, then is released.
The R532W substitution destabilizes the ACAD9 protein's tertiary structure,
reducing both its FAD cofactor binding and its capacity to associate with
NDUFAF1. Schiff et al.44 Schiff et al.
Schiff M et al. Complex I assembly function and
fatty acid oxidation enzyme activity of ACAD9 both contribute to disease
severity in ACAD9 deficiency. Hum Mol Genet,
2015 showed that both the assembly
and the oxidation activities of ACAD9 contribute to disease severity, with
the assembly function being the dominant determinant of phenotype. Importantly,
ACAD9 requires FAD (derived from riboflavin, vitamin B2) as a cofactor — and
this FAD dependence provides the mechanistic basis for riboflavin
responsiveness: supplementing with high-dose riboflavin increases intracellular
FAD, partially stabilizing the mutant ACAD9 protein and rescuing complex I
assembly capacity.
The Evidence
The riboflavin responsiveness of ACAD9 deficiency is the most clinically
actionable finding in this disorder.
Gerards et al. 201155 Gerards et al. 2011
Gerards M et al. Brain, 2011
first demonstrated that high-dose riboflavin supplementation improved both
complex I enzyme activity and clinical symptoms (exercise intolerance,
lactic acidosis) in ACAD9-deficient patients. The largest subsequent cohort study,
Repp et al. 201866 Repp et al. 2018
Repp BM et al. Clinical, biochemical and genetic spectrum
of 70 patients with ACAD9 deficiency: is riboflavin supplementation effective?
Orphanet J Rare Dis, 2018,
recruited 70 ACAD9-deficient patients from European metabolic centers and found:
- Complex I activity improved in 9 of 15 patient-derived fibroblast lines (60%) following riboflavin supplementation in vitro
- Physicians reported a beneficial clinical effect in 20 of 31 treated patients (65%) for whom treatment data were available
- In patients with disease onset before age 1, riboflavin treatment was associated with statistically significant better survival (p = 5.34 × 10⁻⁵)
The same cohort documented the cardinal clinical features: cardiomyopathy in 85% (44/56 patients), muscular weakness in 75%, exercise intolerance in 72%, and elevated lactate as a near-universal biochemical marker. Severe intellectual disability was rare (1 patient), and over 70% of surviving patients maintained normal activities of daily living — underscoring that riboflavin response, when present, can dramatically improve prognosis.
An ACAD9 cardiac-specific knockout mouse model
Sinsheimer et al. 202177 Sinsheimer et al. 2021
Sinsheimer A et al. Development and characterization
of a mouse model for Acad9 deficiency. Mol Genet Metab,
2021
confirmed the primacy of complex I failure: knockout hearts lacked all supercomplexes
and isolated complex I activity, with only complex II retained, reproducing the
fatal neonatal cardiomyopathy seen in the most severely affected human patients.
Practical Actions
For individuals carrying two copies of the R532W allele (homozygous) or one copy paired with another ACAD9 pathogenic variant (compound heterozygous), the critical intervention is establishing care with a metabolic specialist and initiating high-dose riboflavin supplementation if not already underway. Cardiac monitoring is essential given the 85% prevalence of cardiomyopathy. Exercise-induced symptoms (muscle pain, weakness, dyspnea) should prompt immediate medical evaluation; lactic acid levels are a useful biomarker for disease monitoring and treatment response.
For heterozygous carriers, no symptoms are expected — ACAD9 deficiency is autosomal recessive and one functional copy is sufficient. The significance of carrier status is primarily reproductive.
Interactions
ACAD9 deficiency caused by R532W requires compound heterozygosity (R532W paired with a second pathogenic ACAD9 allele on the other chromosome) or homozygosity. The most commonly reported second alleles in the literature include R127Q (c.380G>A) and R469W (c.1405C>T), both identified in the original Gerards 2011 cohort. The Repp 2018 cohort identified 34 known and 18 novel variants — no biallelic loss-of-function (two null alleles) was identified in any patient, suggesting complete ACAD9 loss is incompatible with survival. The presence and nature of the second allele influences both phenotype severity and riboflavin responsiveness. Compound heterozygotes should have both alleles identified to guide prognosis.
MYL2 G162E — A Rare but Serious Cause of Familial Hypertrophic Cardiomyopathy
The heart's pumping force is generated by the sarcomere11 sarcomere
the basic contractile unit
of cardiac muscle, made of interdigitating thick and thin filaments,
a precisely organized molecular motor. The MYL2 gene encodes the ventricular
regulatory myosin light chain (RLC), a small protein that wraps around the neck of
beta-myosin heavy chain and modulates the force and speed of cardiac contraction.
The rs397516406 variant substitutes glutamic acid (a charged amino acid) for glycine
(the smallest, most flexible amino acid) at position 162 of the protein — a region
critical for calcium-dependent regulation of myosin motor activity. This change was
classified as likely pathogenic for hypertrophic cardiomyopathy22 classified as likely pathogenic for hypertrophic cardiomyopathy
ClinVar variation
ID 43479, reviewed by expert panel
and is absent from the gnomAD database of over 187,000 population controls, consistent
with a rare disease-causing variant rather than a benign polymorphism.
The Mechanism
The regulatory myosin light chain controls cardiac muscle contraction through two key
mechanisms: it stabilizes the myosin lever arm (the structural domain that converts
ATP hydrolysis into mechanical force), and it responds to calcium-dependent
phosphorylation33 calcium-dependent
phosphorylation
phosphorylation of the RLC at Ser15 by myosin light chain kinase
increases cross-bridge cycling rate and force production
that tunes contractile output to cardiac demand. Position 162 sits within the
C-terminal EF-hand domain44 C-terminal EF-hand domain
a calcium-binding structural motif common to many
regulatory proteins of the RLC. Replacing
the tiny, conformationally flexible glycine with a bulky charged glutamic acid at this
position is predicted to disrupt the local protein architecture and alter how the RLC
interacts with the myosin neck. The downstream consequence — as with most sarcomeric
HCM mutations — is hypercontractility and disordered calcium handling55 hypercontractility and disordered calcium handling
sarcomere
mutations increase myosin-actin cross-bridge stability, slowing relaxation and raising
resting force, which over years drives
the hypertrophic remodeling and fibrosis characteristic of HCM.
The Evidence
The pathogenicity of the G162E variant was established by Renaudin et al. (2018)66 Renaudin et al. (2018)
Mol Diagn Ther 2018 Apr;22(2):219-223
through family segregation analysis. Among 27 family members of the index patient —
a 51-year-old woman diagnosed with HCM — 16 carried the MYL2 p.Gly162Glu variant.
Twelve of these 16 carriers (75%) had clinical evidence of cardiomyopathy, while none
of the 11 non-carriers were affected. This cosegregation pattern is strong evidence
for dominant pathogenicity and documents that penetrance is high but incomplete,
with some carriers remaining subclinical or developing disease later in life.
MYL2 mutations as a group are a rare but recognized cause of HCM77 a rare but recognized cause of HCM
accounting for
approximately 1% of all genotype-positive HCM cases.
The clinical consequences of sarcomere-positive HCM are serious. In Olivotto et al.'s
2008 cohort of 203 HCM patients88 Olivotto et al.'s
2008 cohort of 203 HCM patients
Mayo Clin Proc 2008 Jun;83(6):630-638,
mutation-positive patients experienced major adverse cardiovascular events (death,
stroke, progression to advanced heart failure) at a rate of 25% versus 7% in mutation-negative
patients over 4 years of follow-up (hazard ratio 4.27, p=.008). Critically, adverse
events in the sarcomere-positive group occurred across all ages from 14 to 86 years,
while mutation-negative patients were largely protected until after age 65. Other MYL2
variants illustrate the range of severity: Arg58Gln has been linked to premature
sudden cardiac death in affected families99 premature
sudden cardiac death in affected families
early-onset disease with fatal arrhythmias
in young adults, while Glu22Lys follows
a more benign course.
Practical Actions
For carriers of the G162E variant, cardiac surveillance is the cornerstone of
management1010 cardiac surveillance is the cornerstone of
management
regular echocardiography detects structural changes before symptoms
emerge, enabling earlier intervention.
Annual or biennial echocardiography with assessment of left ventricular wall thickness,
outflow tract gradient, and diastolic function allows longitudinal monitoring for
HCM progression. Holter monitoring or cardiac event recording can detect subclinical
arrhythmias — atrial fibrillation and non-sustained ventricular tachycardia are
common in HCM and alter management thresholds for anticoagulation and defibrillator
implantation. Patients who develop obstructive HCM (resting or provoked left ventricular
outflow tract gradient ≥30 mmHg) may benefit from medical therapy with beta-blockers
or disopyramide; septal reduction therapy (alcohol ablation or surgical myectomy)
addresses severe refractory obstruction.
Because G162E follows autosomal dominant inheritance, each first-degree relative (parent, sibling, child) has a 50% probability of carrying the variant. Cascade genetic testing of first-degree relatives is the most efficient way to identify at-risk family members before they develop the structural changes that trigger symptoms and sudden cardiac death risk.
Interactions
MYL2 G162E acts as an autosomal dominant variant — a single copy is sufficient to cause
disease. The risk does not compound with another copy of the same variant, though HCM
severity can be modulated by additional variants in other sarcomeric genes (MYH7, MYBPC3,
TNNT2, TNNI3). Individuals found to carry two or more sarcomeric gene mutations have
been shown in clinical cohorts to face markedly elevated risk of end-stage progression
and ventricular arrhythmias1111 markedly elevated risk of end-stage progression
and ventricular arrhythmias
triple mutation carriers in the Girolami 2010 JACC study
often required ICD or transplantation by their fourth decade.
Comprehensive panel testing is recommended at the time of diagnosis.
MCM6 G-13915T — The East African Lactase Persistence Allele
Human populations that domesticated dairy animals faced a powerful selective pressure: adults who could digest lactose gained a calorie-dense, nutrient-rich food source that also served as a safer alternative to potentially contaminated water. Rather than a single evolutionary solution, natural selection produced at least five independent mutations on separate continents — all targeting the same regulatory switch in the MCM6 gene, all maintaining lactase enzyme expression into adulthood. The G-13915T variant (rs41380347) is one of these parallel solutions, found primarily among pastoralist peoples of the Horn of Africa and the Sudan: Oromo, Somali, Beja, and Sudanese Arab populations.
The Mechanism
The MCM6 gene sits immediately upstream of the LCT gene11 LCT gene
LCT encodes lactase-phlorizin
hydrolase, the intestinal brush-border enzyme that cleaves lactose into glucose and
galactose for absorption on chromosome 2.
Although MCM6 itself encodes a DNA replication helicase subunit22 DNA replication helicase subunit
Part of the MCM2-7
complex that unwinds DNA ahead of the replication fork,
its intron 13 contains a long-range enhancer element that controls LCT expression
roughly 13.9 kb away. In most mammals — and in most adult humans — this enhancer is
progressively silenced after weaning, reducing lactase production to near-zero by
early adulthood. A cluster of mutations in this enhancer region can override this
silencing program.
The -13915 position sits within an approximately 300 bp region in MCM6 intron 13 that
encompasses several transcription factor binding sites. The derived G allele at position
-13915 (reported as C on the GRCh38 plus strand) alters interaction with the Oct-1
transcription factor33 transcription factor
A ubiquitous Octamer-binding protein that regulates tissue-specific
gene expression. Olds et al. (2011) used
electrophoretic mobility shift assays and cell transfection experiments to demonstrate
that Oct-1 binds the -13915*G sequence and enhances transcription of an LCT
promoter-reporter construct — providing a molecular basis for how this single nucleotide
change maintains lactase production through adulthood.
The Evidence
The variant was first identified by
Ingram et al. (2007)44 Ingram et al. (2007)
Ingram CJE et al. A novel polymorphism associated with lactose
tolerance in Africa: multiple causes for lactase persistence? Hum Genet. 2007;120:779-88.
through a cohort study of Sudanese pastoralists. The European diagnostic variant
(C/T-13910, rs4988235) was absent in this population's lactase-persistent individuals,
leading the authors to search for population-specific alleles. They identified -13915*G
as significantly associated with lactose tolerance among Sudanese volunteers and confirmed
its presence in other East African and Middle Eastern pastoral groups.
Tishkoff et al. (2007)55 Tishkoff et al. (2007)
Tishkoff SA et al. Convergent adaptation of human lactase
persistence in Africa and Europe. Nat Genet. 2007 Jan;39(1):31-40.
genotyped 470 individuals from Tanzania, Kenya, and Sudan and identified T/G-13915 as
one of three African persistence alleles, distinct from the European C/T-13910 and from
each other at the haplotype level. Extended haplotype homozygosity analysis showed
selection signatures consistent with a sweep beginning approximately 7,000 years ago —
coinciding with the spread of cattle pastoralism across East Africa.
Jones et al. (2013)66 Jones et al. (2013)
Jones BL et al. Diversity of lactase persistence alleles in
Ethiopia: signature of a soft selective sweep. Am J Hum Genet. 2013;93(3):538-44.
examined more than 350 Ethiopian individuals from multiple ethnic groups and confirmed
-13915*G (rs41380347) as one of at least five independent functional persistence alleles.
They documented greater genetic diversity among lactose digesters than non-digesters
in Ethiopia — a pattern consistent with multiple independent mutations spreading
simultaneously (a soft selective sweep), rather than the single dominant haplotype
seen in the European lineage.
Hassan et al. (2016)77 Hassan et al. (2016)
Hassan HY et al. Genetic diversity of lactase persistence in
East African populations. BMC Res Notes. 2016;9:16.
surveyed 12 East African populations and found that the -13915*G and -13907*G variants
dominate among nomadic Arab and Beja communities in Sudan, reaching notably high
frequencies in these pastoral groups.
Population Context
Unlike the European rs4988235, which spread from a single founder variant to high frequency across Northern Europe (70–90% in Scandinavia), the African persistence alleles each remain concentrated in the pastoral populations among whom they originated. The -13915*G allele is most common in the Beja of northeastern Sudan and Eritrea, and among Afro-Asiatic-speaking pastoralists including Somali and Oromo communities. It is essentially absent in East Asian, South Asian, and European populations, and rare in global genetic databases (like gnomAD) that are dominated by European and African American samples not drawn from East African pastoral groups.
Practical Actions
The practical implications of this variant exactly parallel the European lactase persistence system. Individuals who carry no persistence allele (AA genotype) in any of the five known variants are likely to be lactose non-persistent and may experience bloating, gas, cramps, or diarrhea after consuming lactose-containing foods. Note that individuals of East African descent may carry any combination of these five alleles — a comprehensive lactase persistence assessment requires testing all known variants (rs4988235, rs145946881, rs41525747, rs41380347, and rs869051967).
Interactions
This variant is one of at least five independent lactase persistence mutations operating through the same MCM6 intron 13 enhancer region. The companion variants rs145946881 (-14010C, East African), rs41525747 (-13907G, East African/Afro-Asiatic), and rs4988235 (-13910T, European) all act through the same Oct-1/HNF1α enhancer mechanism to maintain LCT expression. Since each allele is independently sufficient to confer lactase persistence, compound carriage of two persistence alleles does not add incremental benefit — one copy of any persistence allele on one chromosome is sufficient to maintain lactase production. For individuals of East African ancestry, the non-persistent call is only meaningful when all relevant alleles have been tested and found absent.
ABCA1 — The Cholesterol Efflux Gatekeeper
ABCA111 ATP-Binding Cassette Transporter A1 — a cell-membrane protein that
pumps cholesterol and phospholipids out of cells onto lipid-poor apolipoprotein A-I,
creating nascent HDL particles is the rate-limiting step in reverse cholesterol
transport22 reverse cholesterol
transport
the process by which cholesterol is removed from peripheral tissues and
delivered to the liver for excretion. Without ABCA1, cells cannot shed excess
cholesterol and HDL formation collapses. Loss-of-function mutations in both copies of
ABCA1 cause Tangier disease33 Tangier disease
a rare autosomal recessive disorder characterized by
near-absent HDL, orange-tinged tonsils, and accelerated atherosclerosis; heterozygous
carriers have roughly half-normal HDL levels. rs4149268 operates at a far milder scale —
it is a common intronic variant that fine-tunes how much ABCA1 the liver makes.
The Mechanism
rs4149268 sits in intron 2 of ABCA1 on chromosome 9 (GRCh38 position 104,884,939).
The gene is transcribed from the minus strand, so the C allele reported in genome files
(plus-strand) corresponds to a G in coding-strand notation as used in many papers — these
are the same allele. Functional studies identified a nearby variant, rs2575875, in
near-perfect linkage disequilibrium (r²=0.96) with rs4149268. Richardson et al.44 Richardson et al.
Richardson et al. Allele-specific enhancers mediate associations between LCAT and ABCA1
polymorphisms and HDL metabolism. PLOS One, 2019
showed that the A allele of rs2575875 (tagging the C allele of rs4149268) creates a
STAT355 Signal Transducer and Activator of Transcription 3 — a transcription factor
that activates gene expression in response to cytokines and growth factors binding
site within an intronic enhancer that loops physically to the ABCA1 promoter. Chromatin
immunoprecipitation showed 8-fold enrichment of this allele with anti-STAT3 antibody,
and the enhancer drove allele-specific transcription in liver cell lines. Higher ABCA1
expression means more efficient cholesterol efflux and more nascent HDL particles.
The Evidence
The landmark GWAS meta-analysis by Willer et al.66 landmark GWAS meta-analysis by Willer et al.
Willer et al. Newly identified loci
that influence lipid concentrations and risk of coronary artery disease. Nature Genetics,
2008 in 8,816 individuals (with replication
in 11,569) found that rs4149268-C carriers had approximately +0.82 mg/dL higher
HDL-cholesterol compared to T/T homozygotes (p=10⁻¹⁰). The effect is small in absolute
terms — HDL levels typically range from 40 to 80 mg/dL — but the association was
genome-wide significant and biologically coherent with the ABCA1 pathway.
Replication in diverse populations has been mixed: the PAGE study77 PAGE study
Dumitrescu et al.
Genetic determinants of lipid traits in diverse populations from the PAGE study. PLOS
Genetics, 2011 could not replicate the
association in European Americans despite 99% statistical power, suggesting the effect
may be smaller than the original estimate implied or population-specific. The T allele
reaches ~67% frequency in African populations (versus ~38% in Europeans), which affects
absolute HDL distribution in those ancestries.
A study in a Chinese cohort (n=1,654) found that TT homozygotes were 1.76 times more
likely to have mild cognitive impairment88 mild cognitive impairment
MCI — a clinically recognized transitional
state between normal aging cognition and dementia, with impairment in memory or other
domains that does not yet meet dementia criteria than CC homozygotes (OR 1.76,
95% CI 1.11–2.80). This likely reflects the known link between chronically low HDL and
neurodegeneration, as ABCA1 drives cholesterol efflux in neurons and astrocytes, not
just hepatocytes.
Practical Actions
The effect size of rs4149268 on HDL is modest — roughly 0.82 mg/dL per C allele. For TT homozygotes, the genotype itself is not a clinical diagnosis of low HDL; many TT individuals have perfectly normal HDL levels due to lifestyle and other genetic factors. The priority is knowing your measured HDL, then targeting the lifestyle drivers that have the largest HDL-raising effect in people with genetically modest ABCA1 activity: regular aerobic exercise (the strongest HDL lifter), reduction of trans fats, and niacin-equivalent dietary choices. If HDL is confirmed low, discussion with a clinician about whether pharmacological support is appropriate is warranted.
Interactions
ABCA1 does not act alone. APOE ε499 APOE ε4
rs429358 C allele — the APOE4 determinant that
impairs LDL clearance and amyloid removal and ABCA1 T-allele homozygosity converge
on the same pathway: both reduce brain cholesterol homeostasis and are independent risk
factors for cognitive decline. Carrying ABCA1 TT alongside APOE4 may produce additive
vulnerability to neurodegeneration, though direct compound studies at these two loci
specifically are limited. The R219K variant in ABCA1 (rs2230806), studied extensively
for HDL effects, acts through a different mechanism (missense, not enhancer regulation)
and can interact with rs4149268 within the same gene.
NF-κB1 Intronic Variant — A Rare Protective Signal in the Immune Master Switch
The NFKB1 gene encodes NF-κB1 (Nuclear Factor kappa-light-chain-enhancer of activated B cells subunit 1)11 NF-κB1 (Nuclear Factor kappa-light-chain-enhancer of activated B cells subunit 1)
A key transcription factor that controls hundreds of immune and inflammatory genes in response to
pathogen detection, the essential p105/p50 subunit of
the canonical NF-κB signaling complex. When Toll-like receptors and NOD-like receptors detect
pathogen components, they trigger signaling cascades that liberate p50-p65 dimers to enter the
nucleus and activate innate immune genes including cytokines, antimicrobial peptides, and
co-stimulatory molecules. rs4648127 is an intronic variant in NFKB1 at chromosome 4 position
102,614,748 (GRCh38). The T allele at this position is uncommon (~5.8% in Europeans) and was
identified in a systematic candidate-gene study as independently associated with reduced lung
cancer risk — making it one of the few common genetic variants at the NFKB1 locus where the
minor allele confers protection rather than susceptibility.
The Mechanism
rs4648127 lies within an intron of NFKB1 and does not alter the protein sequence of p105 or
p50. Intronic variants at this locus are thought to influence regulatory elements — splicing
factor binding sites, enhancers, or transcription factor binding sequences embedded in intronic
DNA — that tune NFKB1 expression levels or splicing efficiency. The precise molecular mechanism
for rs4648127 has not been characterized at the protein or transcript level. Its position within
the NFKB1 locus places it in a genomic region that harbors multiple linkage disequilibrium22 linkage disequilibrium
The non-random co-inheritance of alleles at nearby chromosomal positions; variants in LD tend
to be co-inherited as a block blocks associated with
immune function and infection susceptibility. The nearby functional variant rs28362491 (a 4-bp
ATTG insertion-deletion in the NFKB1 5' regulatory region) is one candidate for the causal
signal; rs4648127 may tag independent regulatory variation or reflect the same underlying
functional effect through partial LD.
The direction of association for rs4648127 is notable: whereas rs230523 (another intronic NFKB1 variant) shows a susceptibility signal (C allele → increased infection risk), rs4648127 shows the opposite — the T allele appears protective against lung cancer. This divergent directionality across two variants in the same gene suggests the NFKB1 intronic region contains multiple partially independent regulatory elements that differentially affect immune gene expression in different tissue contexts or along different pathogen-response axes.
The Evidence
Shiels et al. (2012)33 Shiels et al. (2012)
Genetic variation in innate immunity and inflammation pathways associated
with lung cancer risk. Cancer. conducted a systematic
two-stage study: first screening 1,429 SNPs across innate immunity and inflammation genes in 378
lung cancer cases and 450 controls from the Prostate, Lung, Colorectal, and Ovarian Cancer
Screening Trial (PLCO), then replicating top signals in an independent GWAS dataset. Of 81 SNPs
initially significant, only rs4648127 in NFKB1 survived multiple testing correction. The CT/TT
genotype was associated with an odds ratio of 0.56 (95% CI: 0.37–0.86) in the PLCO discovery
cohort and 0.79 (95% CI: 0.69–0.90) in the GWAS replication. The protective direction was
consistent across both datasets.
Curtin et al. (2010)44 Curtin et al. (2010)
Exploring multilocus associations of inflammation genes and colorectal
cancer risk using hapConstructor. BMC Medical Genetics.
included rs4648127 among NFKB1 haplotype variants in a multilocus analysis of colorectal cancer
susceptibility, examining how combinations of NFKB1, IKBKB, and IL6 variants interact to affect
colon cancer risk.
Slattery et al. (2011)55 Slattery et al. (2011)
Interferon-signaling pathway associations with colon and rectal cancer
risk. Carcinogenesis. examined NFKB1 variants as
pathway interaction partners in 1,555 colon cancer cases and 754 rectal cancer cases, finding
that NF-κB1 variants significantly interact with interferon-pathway components to modulate
colorectal cancer susceptibility.
The broader NFKB1 locus context is established by two large studies: Tian et al. (2017)66 Tian et al. (2017) — a 23andMe GWAS of infection susceptibility in >200,000 Europeans identifying the NFKB1 locus as genome-wide significant for overall infection risk — and Chong et al. (2024)77 Chong et al. (2024) — a UK Biobank serology-based GWAS in 487,297 participants finding NFKB1 as the top locus influencing antibody responses to herpesviruses and polyomaviruses.
Practical Actions
The rs4648127 T allele is protective against lung cancer with an OR of ~0.79 in the GWAS replication — a reduction in relative risk of approximately 21% per protective allele copy. For CT heterozygotes, this represents meaningful but partial protection; for the rare TT homozygotes, the full protective effect applies. Importantly, this genetic protection operates through immune surveillance mechanisms, not through reduction in carcinogen exposure. The protective allele does not eliminate lung cancer risk from environmental exposures such as tobacco smoke or occupational carcinogens — it modulates how the immune system recognizes and responds to transformed cells. Carriers should avoid over-relying on this genetic signal as license to ignore environmental risk factors for lung cancer.
The variant does not alter standard innate immunity recommendations for infection prevention, which are covered by the nearby NFKB1 variant rs230523.
Interactions
rs4648127 is physically close to rs230523 on chromosome 4 within the same NFKB1 locus. These two variants show different association directions (rs230523 C allele → susceptibility; rs4648127 T allele → protection), suggesting they may tag distinct functional elements within the NFKB1 regulatory landscape. Users who carry the rs230523 C susceptibility allele and the rs4648127 T protective allele face partially offsetting effects: one variant subtly reducing NF-κB1 immune activation, the other associated with improved immune surveillance of malignant cells.
The most studied functional variant at this locus is rs28362491 (the -94ATTG regulatory insertion-deletion). Its LD relationship with rs4648127 has not been fully characterized in published literature, so the two should be interpreted as potentially independent signals until formal LD mapping is available.
The Hidden Switch in Your Brain's Nerve Growth Factory
Every neuron in your brain depends on a steady supply of mature
brain-derived neurotrophic factor11 brain-derived neurotrophic factor
BDNF is a protein that acts like
fertilizer for neurons — it promotes their survival, growth, and
connection-forming (synaptogenesis). Low BDNF is consistently found
in depression, anxiety, and neurodegenerative diseases (BDNF) to
survive, form memories, and regulate mood. But BDNF doesn't start out
active — it's made as a larger, inactive precursor called proBDNF, which
must be cleaved by the enzyme FURIN before it becomes the mature,
beneficial form. A variant in the FURIN gene's regulatory region,
rs4702, controls how much of this essential cleavage enzyme is produced —
and the G allele reduces FURIN levels enough to shift the BDNF system
toward its pro-apoptotic, rather than pro-survival, mode.
The Mechanism
The FURIN gene's 3' untranslated region (3'UTR) is a stretch of RNA
that doesn't code for protein but instead controls how much FURIN
protein is made. The rs4702 variant sits within this regulatory sequence.
When the G allele is present, it creates a binding site for a small
RNA molecule called
miR-338-3p22 miR-338-3p
microRNAs are short RNA sequences that bind to messenger
RNA and suppress translation. miR-338-3p is expressed in the brain and
is involved in neuronal differentiation and axon growth.
Hou and colleagues33 Hou and colleagues
Hou Y, Liang W, Zhang J et al. Schizophrenia-associated
rs4702 G allele-specific downregulation of FURIN expression by miR-338-3p
reduces BDNF production. Schizophr Res, 2018
demonstrated this mechanism in 2018 using luciferase reporter assays:
the G allele allows miR-338-3p to dock on FURIN's 3'UTR and suppress
its expression, while the A allele disrupts the binding site and preserves
FURIN levels. This allele-specific FURIN suppression was then shown to
reduce production of mature BDNF in human cells.
The consequences are significant. When FURIN levels are reduced, more
proBDNF accumulates instead of being converted to mature BDNF. These
two forms of BDNF have opposite effects:
mature BDNF binds TrkB receptors44 mature BDNF binds TrkB receptors
TrkB (tropomyosin receptor kinase B)
is the high-affinity receptor for mature BDNF. Activation promotes
neuronal survival, LTP, and memory consolidation and promotes neuronal
survival and synaptic plasticity, while
proBDNF binds p75NTR receptors55 proBDNF binds p75NTR receptors
p75NTR is a low-affinity neurotrophin
receptor that, when activated by proBDNF, triggers apoptosis
(programmed cell death) and long-term depression of synaptic strength
and can trigger apoptosis and synaptic weakening.
A 2021 review66 2021 review
Wang M, Xie Y, Qin D. Proteolytic cleavage of proBDNF
to mBDNF in neuropsychiatric and neurodegenerative diseases.
Brain Res Bull, 2021
concluded that "insufficient proBDNF transformation into mature BDNF
is potentially critical to disease pathogenesis" across depression,
anxiety, schizophrenia, Parkinson's disease, and Alzheimer's disease.
The Evidence
Schizophrenia association. The rs4702 G allele emerged as a
genome-wide significant locus from large schizophrenia GWAS studies.
The Hou et al. 2018 paper was specifically motivated by the 108 confirmed
schizophrenia loci from the Psychiatric Genomics Consortium GWAS and
identified rs4702 as a functional 3'UTR variant among these loci.
A 2019 Nature Genetics study77 2019 Nature Genetics study
Schrode N, Ho SM, Yamamuro K et al.
Synergistic effects of common schizophrenia risk variants. Nat Genet,
2019 using CRISPR-edited
iPSCs demonstrated that the rs4702 G allele affects neuronal morphology
and electrophysiological properties in excitatory neurons, with synergistic
effects when combined with other schizophrenia eQTL variants.
FURIN as a cis-eQTL. An eQTL study88 eQTL study
Turpeinen H, Seppälä I, Lyytikäinen
LP et al. A genome-wide expression quantitative trait loci analysis of
proprotein convertase subtilisin/kexin enzymes identifies a novel regulatory
gene variant for FURIN expression and blood pressure. Hum Genet, 2015
of over 1,400 human blood samples confirmed rs4702 as a significant cis-eQTL
for FURIN expression and found nominal associations with blood pressure
phenotypes (diastolic p=0.012, peripheral vascular resistance p=0.003),
suggesting the variant's regulatory effects extend beyond the brain.
Causal evidence from iPSCs. The
Dobrindt et al. 202199 Dobrindt et al. 2021
Dobrindt K, Hoagland DA, Seah C et al. Common
genetic variation in humans impacts in vitro susceptibility to SARS-CoV-2
infection. Stem Cell Reports, 2021
study used CRISPR-based allelic conversion to establish causality: GG
iPSC-derived neurons expressed ~0.74-fold less FURIN than AA counterparts,
confirming that the genotype drives FURIN expression differences, not just
associations.
PTSD and trauma. A
7-year prospective cohort study1010 7-year prospective cohort study
Tamman AJF, Wendt FR, Pathak GA et al.
Attachment style moderates polygenic risk for incident PTSD in US military
veterans. Biol Psychiatry, 2022
of 1,083 trauma-exposed veterans found that rs4702 showed the strongest
gene×environment interaction with cumulative lifetime trauma burden for
incident PTSD, suggesting the FURIN-BDNF axis is particularly relevant
to stress-related psychiatric outcomes.
Neuroprotection after brain injury. In a clinical study of 106 glioma
patients receiving radiotherapy,
Yang and colleagues1111 Yang and colleagues
Yang S, Fu ZZ, Zhang YQ et al. The G to A
transformation of rs4702 polymorphism in 3'UTR of FURIN reduced the risk
of radiotherapy-induced cognitive impairment in glioma patients.
J Cell Mol Med, 2022
found that rs4702-A carriers maintained higher FURIN and BDNF expression
post-treatment and experienced less cognitive impairment, providing
clinical evidence that the A allele's preservation of FURIN activity
matters under neurotoxic stress.
Practical Actions
For GG carriers, the priority is supporting the pathways that compensate for reduced FURIN-mediated proBDNF cleavage. Aerobic exercise is the most robustly evidence-backed intervention for raising mature BDNF levels directly, bypassing the FURIN bottleneck by stimulating BDNF gene transcription via FNDC5/irisin and PGC-1α pathways. Omega-3 fatty acids (EPA/DHA) support neuronal membrane health and have been shown to raise BDNF expression in depressed and healthy populations. Ensuring adequate zinc and magnesium intake matters because both cofactors support metalloprotease activity, including enzymes in the proBDNF processing cascade.
Stress management deserves particular attention for G allele carriers given the documented PTSD interaction: chronic stress suppresses BDNF expression independently of genotype, compounding the FURIN-related reduction. The rs4702 × trauma interaction data suggest that carriers under high cumulative stress load face disproportionately elevated psychiatric risk.
Interactions
The most clinically meaningful interaction is with rs6265 (BDNF Val66Met) in the BDNF gene itself. Both variants converge on reduced mature BDNF availability through different mechanisms: rs4702 G reduces FURIN expression, limiting proBDNF cleavage upstream; rs6265 T (the Met allele) impairs activity-dependent secretion of BDNF downstream. Individuals carrying both the rs4702 G allele and the rs6265 T allele face compound reduction in mature BDNF signaling — reduced production from proBDNF and reduced regulated release of whatever mature BDNF is produced. This combination is a candidate for a compound action targeting both upstream (FURIN support) and downstream (BDNF secretion) components of the neurotrophin pathway.
IRGM rs4958847 — Autophagy Failure at the Gut Frontier
The human gut is in a constant negotiation with bacteria. Most organisms are kept out by
the epithelial barrier, but some — including adherent-invasive Escherichia coli (AIEC)11 adherent-invasive Escherichia coli (AIEC)
A pathobiont consistently enriched in the ileal mucosa of Crohn's disease patients, where
it penetrates epithelial cells and replicates inside macrophages, evading conventional immune
clearance — breach the lining and invade cells
directly. The body's main weapon against these intracellular invaders is
[autophagy | A cellular recycling process that wraps damaged organelles and intracellular
pathogens in double-membrane vesicles called autophagosomes and delivers them to lysosomes
for destruction], a process in which IRGM (Immunity-Related GTPase Family M)22 IRGM (Immunity-Related GTPase Family M)
Encodes a GTPase that controls mitochondrial dynamics and serves as a master regulator
of selective autophagy, specifically xenophagy (pathogen clearance) and mitophagy
(damaged mitochondria elimination) plays a
central coordinating role. rs4958847 is a common intronic variant in IRGM that alters
the gene's expression level and disrupts this bacterial clearance program, raising the
risk for Crohn's disease — particularly the ileal subtype and its most severe complications.
The Mechanism
rs4958847 lies in intron 6 of IRGM (GRCh38 chr5:150860025, G>A) and acts through a regulatory rather than a coding mechanism. The A risk allele is associated with reduced IRGM expression in intestinal epithelial cells and macrophages. When IRGM function is diminished, the autophagy program falters in two distinct ways: intracellular pathogens such as Salmonella, Mycobacterium, and AIEC escape xenophagic destruction and persist in the mucosal compartment; and damaged mitochondria accumulate, releasing reactive oxygen species and pro-inflammatory signals that amplify the innate immune response.
The resulting inflammatory cycle — persistent intracellular bacteria → innate immune activation → epithelial damage → deeper bacterial invasion → more activation — is thought to underlie the transmural, granulomatous inflammation characteristic of ileal Crohn's disease. IRGM also interacts functionally with [ATG16L1 | Autophagy-related protein 16-like 1; the other major autophagy Crohn's gene whose T300A variant (rs2241880) is independently associated with CD susceptibility], and reduced activity of both proteins in the same individual compounds the xenophagy defect.
The Evidence
The IRGM locus was first replicated as a Crohn's disease susceptibility gene33 first replicated as a Crohn's disease susceptibility gene
Parkes et al. 2007 — Nature Genetics 39(7):830-2; genome-wide replication study, combined
P=2.1×10⁻¹⁰ in a multi-locus replication
analysis. rs4958847 specifically was established as a risk variant through a body of
subsequent replication and meta-analysis work.
The most definitive quantification comes from a 2013 meta-analysis of 25 studies44 2013 meta-analysis of 25 studies
Lu et al. 2013 — PLoS One; 20,590 IBD cases and 27,670 controls
which found rs4958847 A allele OR=1.18 (95% CI 1.08–1.29, P=0.0002) for CD, surviving
Bonferroni correction. Importantly, the association was specific to CD and was not
observed for ulcerative colitis, consistent with a mechanism centered on the ileum.
A separate pooled analysis by Palomino-Morales et al. 200955 Palomino-Morales et al. 2009
Genes Immun 10(4):356-64;
557 CD, 425 UC patients and 672 controls
found rs4958847 A allele OR=1.31 (P=2.78×10⁻¹⁷) for CD.
The effect size is larger in phenotype-restricted analyses. In a New Zealand Caucasian
cohort66 New Zealand Caucasian
cohort
Roberts et al. 2008 — Genes Immun; 507 CD patients, 475 UC, 576 controls specifically analysing ileal disease,
rs4958847 A allele OR reached 1.77 (95% CI 1.22–2.56, P=0.0022), confirming that the
risk is concentrated in the ileal subtype. An Italian study by Latiano et al. 200977 Latiano et al. 2009
Am J Gastroenterol; 823 CD patients including 265 paediatric cases and 578 controls found the A allele was associated with
fistulizing behavior (OR=1.54, P=0.037) and perianal fistulas (OR=1.55, P=0.045),
extending the signal to severe disease complications.
A 2012 surgical outcomes study88 2012 surgical outcomes study
Sehgal et al. — Dis Colon Rectum; 66 ileocolonic CD
patients who underwent ileocolectomy found
rs4958847 to be the most significant predictor of surgical recurrence: A-allele carriers
required reoperation on average every 6.87 years versus 11.43 years for GG individuals
(P=0.007), suggesting the variant marks not just susceptibility but ongoing disease
severity post-surgery.
Practical Implications
For A-allele carriers with established or suspected Crohn's disease, the IRGM autophagy defect points toward specific management strategies. Monitoring for ileal disease patterns and fistulizing complications is more relevant than for non-carriers. The autophagy connection also opens a specific pharmacological angle: [rapamycin and its analogues | mTOR inhibitors that upregulate autophagy; not currently standard CD therapy but relevant to the mechanistic rationale for existing medications] and several approved immunosuppressants used in IBD (azathioprine, 6-mercaptopurine) stimulate autophagy as part of their mechanism of action, which may partly explain their relevance in IRGM-variant carriers. Anti-TNF biologics are an established treatment for fistulizing CD — the complication subtype most strongly linked to rs4958847.
Interactions
The strongest documented interaction is with [ATG16L1 rs2241880 | T300A missense variant in ATG16L1, the other major autophagy Crohn's gene; A allele is the CD-risk allele at ~45% European frequency]. Both IRGM and ATG16L1 proteins function in the same autophagosome assembly pathway. Studies examining the two variants simultaneously have found additive — and in some analyses synergistic — effects on CD risk. An individual carrying A risk alleles at both rs4958847 and rs2241880 has substantially higher CD risk than either variant alone would predict. The combination is of particular relevance for ileal CD phenotype and post-surgical recurrence risk.
ANRIL's Second Voice — The 9p21.3 Independent CAD Signal at rs4977574
The 9p21.3 locus on chromosome 9 is the most robustly replicated genetic risk region for coronary artery disease ever identified, and rs4977574 is one of its key sentinels. Located within an intron of CDKN2B-AS1 — the gene encoding ANRIL11 ANRIL
Antisense Non-coding RNA in the INK4 Locus, a long non-coding RNA that epigenetically regulates the p16/p15 cell-cycle-inhibitor cluster — this variant captures a partially independent signal from the well-known rs1333049, with the two SNPs sharing an r² of approximately 0.89 in European populations. Together they provide better CAD risk stratification than either alone.
The G allele at rs4977574 is carried by approximately 47% of people of European and East Asian ancestry, but only 18% of those of African descent. This frequency means that roughly 72% of Europeans carry at least one G allele — making this one of the most common high-impact cardiovascular risk variants in the genome.
The Mechanism
ANRIL is a 3,834 bp long non-coding RNA that spans the 9p21.3 region and functions as a master regulator of the CDKN2A/CDKN2B gene cluster. It recruits Polycomb repressive complexes PRC1 and PRC222 Polycomb repressive complexes PRC1 and PRC2
Large chromatin-modifying protein complexes that silence genes by adding repressive histone marks (H3K27me3) to their promoters to silence p16-INK4a and p15-INK4b in vascular smooth muscle cells (VSMCs) and macrophages, keeping these cells in a proliferative, repair-competent state rather than entering cellular senescence.
The 9p21.3 risk haplotype tagged by rs4977574 disrupts enhancer elements within ANRIL's regulatory architecture. This impairs the ANRIL–Polycomb axis, allowing premature de-repression of p16 and p15 in vascular tissue — accelerating VSMC senescence, impairing vascular repair capacity, and promoting the pro-inflammatory senescence-associated secretory phenotype (SASP) that drives atherosclerotic plaque vulnerability.
A secondary mechanism operates through an interferon-γ–STAT1 pathway33 interferon-γ–STAT1 pathway
Risk SNPs at 9p21 disrupt a STAT1 binding site at an IFN-γ-responsive enhancer, altering expression of ANRIL and nearby genes in response to immune stimulation, contributing to vascular inflammation independent of the senescence pathway. The G allele's direct effect on lipid metabolism is also documented: GG carriers show higher total cholesterol, elevated LDL-C, and reduced HDL-C independently of other risk factors.
The Evidence
The variant's CAD association is established across multiple independent datasets. The Malmö Diet and Cancer Study44 Malmö Diet and Cancer Study
Prospective cohort, 23,949 individuals, 15 years of follow-up, 3,164 incident CVD events genotyped rs4977574 as the primary 9p21.3 SNP and found each G allele was associated with a 16% increased incidence of cardiovascular disease (HR 1.16; 95% CI 1.10–1.22). This prospective data, independent of retrospective studies, is particularly strong evidence.
A 2021 meta-analysis of 17 studies covering 40,979 subjects55 2021 meta-analysis of 17 studies covering 40,979 subjects
Li & Wang, Journal of Cellular and Molecular Medicine confirmed the G allele as a CHD risk factor under multiple genetic models (allelic OR 1.18; homozygous OR 1.39; recessive OR 1.36). An earlier Asian-specific meta-analysis of 12,005 subjects across 6 studies66 Asian-specific meta-analysis of 12,005 subjects across 6 studies
Xu et al., Medicine 2018 found allelic OR 1.18 (p = 0.010) and GG vs AA OR 1.46 (p = 0.002).
What makes rs4977574 especially actionable is the documented gene-diet interaction. In a Hispanic case-control study of 3,311 individuals77 Hispanic case-control study of 3,311 individuals
1,560 MI cases and 1,751 controls from Costa Rica — which used rs4977574 as its primary genotyped 9p21 SNP — a striking interaction with sugar-sweetened beverage (SSB) intake was identified: the per-allele OR for MI was 1.44 in those consuming more than 2 SSB servings per day, 1.21 at 1–2 servings/day, and a non-significant 0.97 in those consuming fewer than 1 serving per day (P-interaction = 0.005). The Malmö cohort separately found that vegetable intake interacts with rs4977574 to modify both CVD incidence and metabolic markers including HbA1C.
Practical Actions
For G-allele carriers, the most specific and actionable interventions are:
Eliminate sugar-sweetened beverages. The SSB–rs4977574 interaction is particularly strong and was identified with rs4977574 as the primary SNP. This is a genotype-directed intervention: the same SSB consumption that mildly elevates risk in the general population can amplify risk by 44% per G allele in high consumers.
Maximize vegetable intake. The Malmö cohort found that the G allele's association with elevated HbA1C was restricted to those in the lowest tertile of vegetable intake; higher vegetable intake was associated with lower HbA1C specifically among AG and GG carriers. The INTERHEART/FINRISK data (using proxy SNPs) showed raw vegetable and fruit intake dose-dependently attenuated MI risk from the 9p21 haplotype.
Monitor lipids from an early age. The G allele independently raises LDL-C and total cholesterol while reducing HDL-C. This lipid-altering effect stacks with the plaque-forming mechanism, making annual fasting lipid panels a genotype-specific monitoring priority from age 35.
Consider CAC scoring. Coronary artery calcium scoring detects subclinical atherosclerosis — the primary tissue-level consequence of 9p21.3 risk — and is an appropriate screening tool for G-allele carriers to guide statin therapy decisions.
Interactions
rs4977574 shares its 9p21.3 risk haplotype with rs1333049 (r² ≈ 0.89 in Europeans), rs10757278, and rs10757274. These SNPs largely capture the same biological signal through ANRIL dysregulation. However, rs4977574 is the SNP most consistently used in gene-diet interaction studies of this locus — the SSB interaction (PMID 26961926) and the Malmö vegetable/wine interaction (PMID 25551366) were both conducted with rs4977574 as the primary genotyped variant. Together with rs1333049, the two SNPs refine risk stratification beyond what either captures individually; individuals with risk alleles at both loci carry greater burden than either prediction alone.
The 9p21.3 locus shows a documented interaction with telomere length88 telomere length
Short telomeres combined with 9p21 risk alleles are associated with compounded adverse cardiovascular outcomes beyond either factor alone on cardiovascular prognosis. Telomere maintenance variants (e.g., rs12696304 in TERC) may compound with 9p21.3 for longevity risk when both results are available.
VWF c.7437+1G>T — A Splice Signal Silenced in the Bleeding Brake
Von Willebrand factor (VWF) is a large multimeric glycoprotein that performs two essential jobs in hemostasis: it tethers activated platelets to sites of vascular injury, and it escorts coagulation factor VIII through the bloodstream, protecting it from premature degradation. Without adequate VWF, even minor wounds fail to seal promptly and surgical procedures carry elevated bleeding risk. The rs61751290 variant disrupts the VWF gene at a particularly vulnerable position — the first nucleotide of intron 43 of the canonical VWF transcript (NM_000552.511 NM_000552.5).
The reference allele at plus-strand position 12:5,976,110 is C, which corresponds to
a G on the coding (minus) strand — precisely the G that forms the canonical GT splice
donor dinucleotide22 GT splice
donor dinucleotide
The GT at the start of virtually every intron in human genes is
the mandatory signal recognized by the spliceosome; mutation here disrupts mRNA
processing. The alternate T allele on
the plus strand changes this coding-strand G to an A, converting the intact GT to AT —
a sequence the spliceosome cannot recognize.
The Mechanism
Pre-mRNA splicing is carried out by the spliceosome, a large ribonucleoprotein complex
that excises introns and joins exons to form mature messenger RNA. The process depends
on highly conserved splice donor (5') and splice acceptor (3') sequences at each
intron boundary. The canonical splice donor is GU (GT in DNA)33 GU (GT in DNA)
Almost 99% of human
introns begin with GT; even single-nucleotide changes at this position abolish splicing
in the vast majority of tested transcripts.
Destroying the +1G of the splice donor at intron 43 is predicted by Ensembl's Variant
Effect Predictor to be a high-confidence loss-of-function (HC LoF) variant44 high-confidence loss-of-function (HC LoF) variant
Ensembl VEP flags splice_donor_variant at the +1 position as HIGH impact; LOFTEE
classifies it as high-confidence LoF because +1 mutations essentially always destroy
splicing.
When the splice donor is ablated, one of two outcomes typically follows: exon 43 is skipped entirely (joining exon 42 directly to exon 44), or the intron is retained in the mature mRNA. Both outcomes disrupt the VWF reading frame downstream of exon 43, generating either a premature stop codon or a transcript that is degraded by nonsense-mediated decay (NMD). The net result is that one copy of the VWF gene produces little or no functional protein — reducing circulating VWF levels and impairing primary hemostasis.
This mechanism is not speculative: analogous VWF splice donor +1 variants —
c.1156+1G>A, c.1729+1G>A, c.5455+1G>A, c.6798+1G>T, and c.7081+1G>A55 c.1156+1G>A, c.1729+1G>A, c.5455+1G>A, c.6798+1G>T, and c.7081+1G>A
All
classified Likely Pathogenic or Pathogenic in ClinVar for hereditary von Willebrand
disease; retrieved April 2026
— are all classified as pathogenic or likely pathogenic in ClinVar for hereditary
von Willebrand disease.
The Evidence
The rs61751290 variant itself has one ClinVar submission from the Academic Unit of
Haematology, University of Sheffield (accession RCV000086885.1), with clinical
significance listed as "not provided" — reflecting an early database entry without
formal classification. No PubMed publications currently cite this specific rsid.
The variant is extremely rare66 extremely rare
ExAC reports one carrier in 120,180 chromosomes;
GO-ESP reports one in 13,006; ALFA finds zero in 6,396; all ancestry-specific
frequencies are essentially zero,
appearing primarily in European populations when detected at all.
The classification of this variant is therefore based on mechanism rather than
direct epidemiological evidence — a well-established approach for rare null alleles
where functional consequences are unambiguous. The ACMG/AMP variant interpretation
framework77 ACMG/AMP variant interpretation
framework
Richards et al., Genetics in Medicine 2015
classifies +1 splice donor mutations in genes where haploinsufficiency is a known
disease mechanism as Likely Pathogenic (PVS1 + PM2), irrespective of allele
frequency. VWF haploinsufficiency (reduced protein from one allele) is the molecular
basis of type 1 VWD, which affects approximately 1% of the population.
The ISTH guideline-endorsed thresholds88 ISTH guideline-endorsed thresholds
James et al. 2017; Nichols et al. 2008
NHLBI guidelines for type 1 VWD are:
VWF antigen (VWF:Ag) or VWF ristocetin cofactor activity (VWF:RCo) below
30 IU/dL (30% of normal) is diagnostic for VWD; 30–50 IU/dL with bleeding symptoms
is also classified as VWD; 50–200 IU/dL is normal. Type 1 VWD typically presents
with levels in the 20–50 IU/dL range. Heterozygous carriers of null alleles often
have VWF:Ag around 50% of normal, with wide individual variation influenced by
ABO blood group, thyroid status, estrogen levels, and physical stress.
Practical Actions
Most heterozygous carriers will have mild symptoms at most — easy bruising, prolonged nosebleeds, heavier menstrual periods, or longer bleeding after cuts. The key step is establishing whether this variant actually produces VWF levels below the diagnostic threshold (30 IU/dL) or the symptomatic range (30–50 IU/dL). A VWF panel — antigen, ristocetin cofactor activity, and factor VIII — provides that answer. Blood group O independently lowers VWF by approximately 25%, so group O carriers of this variant face a compounded deficit and are more likely to meet the diagnostic threshold.
Desmopressin (DDAVP) is highly effective in many type 1 VWD patients: it stimulates the release of VWF stored in endothelial Weibel-Palade bodies, transiently raising VWF 3- to 5-fold. Whether this response is adequate depends on individual VWF stores and must be established via a formal DDAVP trial before the drug is used in an emergency.
Interactions
ABO blood group is the strongest modifier of VWF levels: blood group O reduces VWF antigen by approximately 25% through enhanced clearance of von Willebrand factor. A carrier of rs61751290 who is also blood group O (associated with rs8176719 and rs1053878 in the ABO gene) faces a compounded deficit and is substantially more likely to have VWF levels in the diagnostic range for type 1 VWD. A second VWF null allele — for example rs61750630 (VWF C2362F) in trans — would produce type 3 VWD-like severe deficiency. Thyroid disorders, estrogen (whether endogenous or from hormonal contraception), and acute illness all transiently raise VWF, potentially masking the underlying deficit if laboratory testing is done during these states.