PMM2 R141H — The Most Common CDG Mutation and Its Implications for Carriers
Every protein destined for export, membrane display, or passage through the
secretory pathway must be decorated with sugar chains — a process called
N-linked glycosylation11 N-linked glycosylation
The attachment of oligosaccharide chains to the
nitrogen of asparagine residues in proteins destined for the endoplasmic
reticulum. "N-linked" refers to the nitrogen atom of asparagine that anchors
the glycan.. This process depends on a supply of activated sugar building
blocks, and the enzyme phosphomannomutase 2 (PMM2) is the gatekeeper for one of
them: it converts mannose-6-phosphate to mannose-1-phosphate, the precursor of
GDP-mannose, which is incorporated into the oligosaccharide core. When PMM2 fails,
cells cannot build complete N-glycan chains, and hundreds of glycoproteins are
produced in aberrant forms — affecting brain development, liver function, blood
coagulation, hormones, and more.
The c.422G>A variant (rs28936415) changes arginine to histidine at position 141
of the PMM2 protein (p.Arg141His, commonly written R141H). It is the most
frequent pathogenic PMM2 allele worldwide,
particularly in European populations22 particularly in European populations
European carrier frequency approximately
1/79 in the Netherlands and 1/60 in Denmark per Schollen et al. 2000,
and accounts for roughly half of all disease-causing PMM2 alleles identified
in CDG patients.
The Mechanism
PMM2 functions as a homodimer — two identical PMM2 subunits join to form the active enzyme. The Arg141 residue sits in a conserved region critical for protein folding and stability. The R141H substitution introduces a smaller, less basic histidine in place of the bulky positively charged arginine, destabilizing the protein structure.
Protein folding studies33 Protein folding studies
Yuste-Checa P et al. The Effects of PMM2-CDG-Causing
Mutations on the Folding, Activity, and Stability of the PMM2 Protein. Hum Mutat,
2015 confirmed that R141H is a
destabilizing variant that substantially reduces enzyme activity, and that the
protein may be partially rescuable with pharmacological chaperones — a finding
relevant to emerging treatment strategies.
A critical fact distinguishes R141H from other PMM2 variants: homozygous R141H
(two copies of this exact mutation) has never been observed in any living person.
Population genetics analysis44 Population genetics analysis
Schollen E et al. Lack of Hardy-Weinberg equilibrium
for the most prevalent PMM2 mutation in CDG-Ia. Eur J Hum Genet, 2000
confirmed that given the observed carrier frequency (1/79 in Dutch neonates),
homozygous infants should appear in the population if they survived — but they
don't. R141H homozygosity leaves too little PMM2 activity for embryogenesis to
complete. All living PMM2-CDG patients who carry R141H are
compound heterozygotes55 compound heterozygotes
Carry two different pathogenic PMM2 alleles — R141H on
one chromosome and a different mutation (e.g. F119L, V44A, I132T) on the other.
Neither allele alone causes disease; both are needed.: they carry R141H on
one chromosome and a different, less severe PMM2 mutation on the other that
preserves just enough residual enzyme activity to allow survival.
The Evidence
PMM2-CDG (formerly CDG type Ia, or Jaeken syndrome) was the first CDG identified
and remains by far the most common, with over 1,000 cases worldwide. The PMM2
gene was discovered in
199766 1997
Matthijs G et al. Mutations in PMM2, a phosphomannomutase gene on
chromosome 16p13, in carbohydrate-deficient glycoprotein type I syndrome.
Nat Genet, 1997, with R141H among the
original mutations identified.
A landmark French cohort study
Schiff et al. 201777 Schiff et al. 2017
Clinical, laboratory and molecular findings and long-term
follow-up data in 96 French patients with PMM2-CDG. J Med Genet, 2017
of 96 PMM2-CDG patients documented the full clinical spectrum: cerebellar ataxia
in virtually all patients, intellectual disability, neonatal hypotonia, liver
involvement, cardiac abnormalities in 20%, and a mortality rate of ~12.5% (12
deaths at mean age 3.8 years, primarily from cardiac complications). R141H was
present in approximately 50% of all disease-causing alleles in this cohort.
Biochemical markers consistently show elevated liver enzymes, abnormal coagulation
parameters (low protein C, antithrombin), elevated TSH, and hypocholesterolemia.
ClinVar (VCV000007706) lists R141H as Pathogenic with criteria provided by over 50 independent submitters including Mayo Clinic Laboratories, ARUP, GeneDx, and Invitae — one of the most extensively classified pathogenic variants in the CDG field.
Practical Actions
For heterozygous carriers: A single R141H allele does not cause PMM2-CDG. Carriers have one fully functional PMM2 gene and maintain normal glycosylation. The primary significance is reproductive: if both partners carry a pathogenic PMM2 allele (any combination), each pregnancy has a 25% chance of inheriting compound heterozygous PMM2 variants and developing CDG.
For compound heterozygous individuals (disease): PMM2-CDG requires multidisciplinary
management by metabolic physicians, neurologists, cardiologists, and hepatologists.
Biochemical monitoring of transferrin glycoform analysis, liver enzymes, coagulation
factors, and thyroid function is essential. While no curative therapy exists,
emerging approaches include dietary mannose supplementation and repurposed drugs.
Epalrestat, an aldose reductase inhibitor approved for diabetic neuropathy in Japan,
showed 30–400% increases in PMM2 enzyme activity88 showed 30–400% increases in PMM2 enzyme activity
Iyer S et al. Repurposing the
aldose reductase inhibitor and diabetic neuropathy drug epalrestat for PMM2-CDG.
Dis Model Mech, 2019 in patient
fibroblasts across multiple PMM2 genotypes, and is the only small-molecule activator
of PMM2 identified to date.
Interactions
PMM2-CDG is defined by compound heterozygosity: one R141H allele paired with a second, distinct PMM2 pathogenic variant on the other chromosome. The most common compound heterozygous combinations include R141H + F119L and R141H + V44A, with phenotypic severity correlating broadly with residual enzyme activity — milder second alleles yield milder disease. Any individual carrying R141H who is diagnosed with PMM2-CDG should have full sequencing of both PMM2 alleles to characterize the specific compound genotype, as this informs prognosis and eligibility for emerging genotype-specific treatments.
LDLR Pro685Leu — A Pathogenic Familial Hypercholesterolemia Mutation
The LDL receptor (LDLR) is the liver's primary mechanism for clearing
low-density lipoprotein11 low-density lipoprotein
LDL cholesterol — the primary carrier of cholesterol in blood, often called "bad cholesterol" because accumulation in arteries drives atherosclerosis
from the bloodstream. Each hepatocyte displays roughly 50,000 LDL receptors
that continuously cycle between the cell surface and endosomes, capturing
LDL particles and internalizing them for degradation. When LDLR function
is impaired by even one pathogenic variant, LDL-C accumulates in the blood
from birth, silently damaging arteries for decades.
The Pro685Leu variant (c.2054C>T) is a well-characterized pathogenic mutation found in over 200 individuals with familial hypercholesterolemia across multiple ethnic groups — including populations in Japan, China, India, Zambia, Italy, and European cohorts — earning alternative names FH Zambia, FH Gujerat, FH Frosinone-1, and FH Kanazawa-2. Its worldwide distribution and consistent pathogenicity make it one of the more instructive examples of the global burden of monogenic FH.
The Mechanism
Pro685 sits in the EGF-precursor homology domain of the LDLR, a
calcium-dependent structural region22 calcium-dependent structural region
The EGF precursor homology domain (also called the β-propeller domain) coordinates calcium ions that are essential for receptor recycling after LDL release in the acidic endosome
essential for receptor recycling after each endocytic cycle. Proline at
this position is highly conserved across vertebrate species, and its
replacement with leucine disrupts the local protein conformation.
The functional consequence is a combined Class II/III defect:
the precursor form of the mutant receptor33 the precursor form of the mutant receptor
Normal LDLR is synthesized in the endoplasmic reticulum as a ~120 kDa precursor, processed to a mature ~160 kDa glycoprotein, and trafficked to the cell surface in approximately 45 minutes. The Pro685Leu variant slows this maturation and reduces surface expression.
is converted to the mature form more slowly than normal, and the
receptor that does reach the cell surface binds LDL with reduced
affinity. Functional assays consistently demonstrate 10–30% of normal
LDLR activity. Five of five in silico prediction tools (SIFT, PolyPhen-2,
REVEL score 0.883, etc.) independently rate the substitution as
deleterious.
Heterozygous carriers have approximately half the normal hepatic LDL receptor capacity, producing the classic FH phenotype. Rare homozygotes (both LDLR alleles affected) face near-complete loss of LDL clearance, causing extremely severe hypercholesterolemia with LDL-C often exceeding 500 mg/dL.
The Evidence
The landmark Rotterdam cohort study44 landmark Rotterdam cohort study
Versmissen et al. Efficacy of statins in familial hypercholesterolaemia: a long term cohort study. BMJ, 2008
of 1,950 genetically confirmed FH patients (mean 8.5 years of follow-up)
found that statin therapy reduced the cardiovascular event rate by 76%
(HR 0.24, 95% CI 0.18–0.30). Treated patients achieved LDL-C reductions
of 44–49% with standard statin doses, bringing absolute cardiovascular
event rates (11/1,000 person-years) close to those of the general population.
The Pro685Leu variant specifically was identified in
two Chinese FH pedigrees55 two Chinese FH pedigrees
Yao et al. Identification of LDLR mutations in two Chinese pedigrees with familial hypercholesterolemia. J Pediatr Endocrinol Metab, 2012
including a compound heterozygous child with severe FH. Across 39
ClinVar submissions from major diagnostic laboratories (Invitae, GeneDx,
Mayo Clinic Laboratories, Quest Diagnostics, ARUP Laboratories, Color
Diagnostics), zero submissions conflict with a Pathogenic classification.
ACMG/AMP criteria applied include PS4 (prevalence in affected
individuals), PP1_Strong (co-segregation with disease), PM2 (absent
from controls), and PS3_Supporting (functional studies).
Practical Actions
Single-copy (heterozygous) carriers should expect untreated LDL-C in the 190–400 mg/dL range. High-intensity statin therapy (atorvastatin 40–80 mg or rosuvastatin 20–40 mg) is the first-line intervention and can reduce LDL-C by 50–60%. Ezetimibe (10 mg daily) added to a statin provides an additional 20–25% LDL-C reduction through complementary mechanism (intestinal cholesterol absorption inhibition). If LDL-C targets are not met on statin + ezetimibe, PCSK9 inhibitors (alirocumab, evolocumab) add another 50–60% on top of background therapy — though their efficacy depends on residual LDL receptor function, so response monitoring is important.
Cascade screening of all first-degree relatives is recommended immediately on diagnosis: each first-degree relative has a 50% probability of carrying the same variant and will have been accumulating LDL-driven atherosclerosis since birth without knowing it. Children of carriers should be tested by age 8–10.
Interactions
The cardiovascular burden of LDLR Pro685Leu compounds with APOE genotype. APOE E4 carriers (rs429358 CT or CC) already have impaired LDL clearance through a separate mechanism; those who also carry an LDLR pathogenic variant face additive LDL elevation and may require more aggressive lipid-lowering therapy. Similarly, PCSK9 gain-of-function variants (which increase LDL receptor degradation) can worsen the FH phenotype.
The Prefrontal Phosphatase — When Stress Rewires the Brain
Most people associate depression with serotonin. LHPP tells a different story — one about a small enzyme in the prefrontal cortex that quietly prevents stress from reshaping your brain. When LHPP function is reduced, chronic stress finds an open door into the neural circuitry that governs mood, motivation, and resilience.
LHPP encodes phospholysine phosphohistidine inorganic pyrophosphate phosphatase11 phospholysine phosphohistidine inorganic pyrophosphate phosphatase
A HAD-family phosphatase that removes phosphate groups from histidine residues on
target proteins. Histidine phosphorylation is an understudied but functionally
important post-translational modification in mammalian cells, regulating kinase
signaling cascades. It was essentially unknown in psychiatry until the
CONVERGE consortium22 CONVERGE consortium
Collaborative Research on the Genetics of Depression in
East Asians — a large-scale genetics consortium focused on recurrent major
depressive disorder in Han Chinese women identified it in
a landmark 2015 Nature study33 a landmark 2015 Nature study
CONVERGE consortium. Sparse whole-genome
sequencing identifies two loci for major depressive disorder. Nature,
2015, sequencing 5,303 cases and
5,337 controls to find just two genome-wide significant loci — one near SIRT1 and
one in LHPP (P = 6.45 × 10⁻¹²). The rs35936514 variant sits in the 3' UTR and
flanking intronic region of LHPP on chromosome 10, likely affecting expression
levels rather than protein structure.
The Mechanism
LHPP acts as a brake on stress-induced neuronal dysfunction. In the
medial prefrontal cortex44 medial prefrontal cortex
A region of the prefrontal cortex critical for
executive function, emotional regulation, and the top-down suppression of the
stress response. It communicates extensively with the amygdala and hippocampus
to modulate fear and mood (mPFC), LHPP dephosphorylates a network of
histidine kinase substrates55 histidine kinase substrates
Proteins — including NME1/2 — that carry
phosphate groups on histidine amino acids. When these kinases are
over-active, downstream signaling cascades including MAPK and PI3K/AKT are
altered, reducing glutamatergic synaptic transmission including NME1/2.
When LHPP levels fall — whether from reduced expression due to the T allele or
from chronic stress itself suppressing LHPP — histidine phosphorylation rises
unchecked, weakening glutamatergic synaptic transmission in exactly the circuits
that should be most resilient.
Lin et al. (2023)66 Lin et al. (2023)
Lin D et al. LHPP, a risk factor for major depressive
disorder, regulates stress-induced depression-like behaviors through its
histidine phosphatase activity. Molecular Psychiatry,
2023 showed in mice that mPFC LHPP
deletion alone produced no spontaneous depression-like behavior — but on exposure
to chronic social defeat stress, LHPP-knockout mice showed dramatically augmented
behavioral deficits. Crucially, re-introducing wild-type LHPP reversed the
deficits, while a phosphatase-dead LHPP mutant did not, confirming the enzymatic
activity is required. A parallel mechanism runs through astrocytes:
Sha et al. (2023)77 Sha et al. (2023)
Sha L et al. LHPP-mediated inorganic pyrophosphate
hydrolysis-driven lysosomal acidification in astrocytes regulates adult
neurogenesis. Cell Reports,
2023 showed that LHPP normally drives
lysosomal acidification88 lysosomal acidification
Lysosomes require an acidic environment (pH ~5) to
function as the cell's recycling centers. In astrocytes, LHPP hydrolyzes
inorganic pyrophosphate to fuel this acidification. Without it, lysosomal
function is impaired, the C/EBPβ transcription factor accumulates, and
downstream chemokine signaling promoting neurogenesis is triggered in
astrocytes; in its absence, the C/EBPβ pathway triggers compensatory adult
neurogenesis that confers stress resilience. This counterintuitive finding
(less LHPP → more resilience in knockout mice) may reflect that the
full-knockout model removes a gene globally, while the rs35936514 T allele
reduces rather than abolishes LHPP expression in specific circuits.
In the ventral hippocampus, a complementary mechanism operates:
Zhuang et al. (2024)99 Zhuang et al. (2024)
Zhuang L et al. LHPP in glutamatergic neurons of
the ventral hippocampus mediates depression-like behavior by dephosphorylating
CaMKIIα and ERK. Biological Psychiatry,
2024 found that LHPP levels
increase in glutamatergic neurons during stress-induced depression, and
that knocking out LHPP in these neurons enhanced spontaneous activity and
stress resilience via the CaMKIIα/ERK pathway1010 CaMKIIα/ERK pathway
CaMKII (calcium/calmodulin-
dependent protein kinase II) and ERK (extracellular signal-regulated kinase)
regulate synaptic plasticity and BDNF expression. LHPP dephosphorylates both,
reducing their activity and impairing the BDNF/PSD95-mediated synaptic
strengthening that underlies mood resilience. Esketamine — but not
fluoxetine — reversed LHPP-induced depression-like behaviors, pointing
toward glutamatergic rather than serotonergic modulation as the relevant
therapeutic axis.
The Evidence
The CONVERGE study represents one of the cleanest GWAS findings in depression genetics. By restricting the sample to Han Chinese women with recurrent, severe MDD — reducing phenotypic and genetic heterogeneity — the consortium achieved genome-wide significance with just 10,640 individuals, far fewer than most psychiatric GWAS require. The LHPP signal (P = 6.45 × 10⁻¹²) was validated in an independent replication cohort. rs35936514 sits in intron 2 of LHPP and the flanking 3' UTR, and the T allele is associated with reduced LHPP expression in the prefrontal cortex.
At the neural circuit level, Cui et al. (2020)1111 Cui et al. (2020)
Cui L et al. Association of
LHPP genetic variation (rs35936514) with structural and functional connectivity
of hippocampal-corticolimbic neural circuitry. Brain Imaging and Behavior,
2020 examined 122 healthy
participants and found that T-allele carriers showed increased hippocampal
connectivity to the rostral anterior cingulate cortex and higher fractional
anisotropy in the fornix — structural and functional signatures consistent
with altered top-down emotional regulation. These brain connectivity differences
were present without clinical depression, suggesting the risk phenotype is
a continuous trait rather than a categorical diagnosis.
Earlier family-based work by
Neff et al. (2009)1212 Neff et al. (2009)
Neff CD et al. Evidence for HTR1A and LHPP as interacting
genetic risk factors in major depression. Molecular Psychiatry,
2009 found a chromosome 10 linkage
peak (HLOD = 4.4) in Utah pedigrees with familial depression and identified
disease-segregating LHPP SNPs in the linkage region. The LHPP effects in that
study were contingent on HTR1A genotype, hinting at an interaction between
serotonin receptor sensitivity and LHPP-mediated stress signaling.
Practical Implications
The LHPP mechanism converges on a clear biological theme: glutamatergic synaptic function under stress. T-allele carriers do not appear to have impaired mood at baseline — the risk emerges under chronic stress. This has direct implications for how carriers should think about their mental health strategy: minimizing the duration and intensity of chronic stressors matters more than managing acute stress reactions.
The esketamine finding from Zhuang et al. is clinically relevant: if depression does develop in T-allele carriers, the evidence points toward glutamate-targeting treatments (esketamine/ketamine) having particular mechanistic relevance versus standard SSRIs, which do not directly address the LHPP-CaMKII/ERK pathway.
Interactions
The Neff et al. (2009) data suggest a potential interaction between LHPP (rs35936514) and the HTR1A serotonin receptor gene. In that family-based study, the LHPP depression association was modulated by HTR1A genotype, suggesting that individuals carrying risk variants in both genes may have compounded vulnerability. The mechanistic intersection — serotonergic regulation of prefrontal excitatory tone — offers a plausible biological substrate.
The BDNF Val66Met variant (rs6265) is also relevant: since LHPP deficiency reduces CaMKII/ERK activity and thereby impairs BDNF/PSD95-mediated synaptic strengthening, T-allele carriers who also carry the BDNF Met allele (reduced activity-dependent BDNF secretion) may experience compounded vulnerability to stress-induced synaptic weakening in prefrontal and hippocampal circuits.
OX40 Ligand and the Immune Gate of Atherosclerosis
TNFSF4 encodes OX40 ligand (OX40L, also called CD252), a co-stimulatory molecule of the TNF superfamily that serves as the key amplifier of T-cell activation in inflammatory settings. OX40L is expressed on antigen-presenting cells — particularly macrophages, dendritic cells, and vascular endothelial cells — and engages OX40 on activated CD4+ T cells to extend their survival, promote cytokine secretion, and drive clonal expansion. Within atherosclerotic plaques, OX40L expression is markedly elevated compared to healthy vascular tissue, driven by oxidized LDL and other inflammatory mediators. The rs3850641 variant sits in the first intron of TNFSF4, in strong linkage disequilibrium with a promoter polymorphism (rs45454293) that regulates OX40L gene expression.
The Mechanism
The functional variant linked to rs3850641 is rs4545429311 rs45454293
A promoter SNP in LD with
rs3850641 at r²=0.70; the T-allele reduces TNFSF4 transcription by ~50% compared to
the C-allele in luciferase reporter assays, attributed to binding of a transcriptional
repressor. The rs3850641 SNP
itself has no direct effect on promoter activity, but because both SNPs travel together
on the same haplotype block — the TG haplotype combining rs45454293T and rs3850641G —
the G allele at rs3850641 tags a state of reduced OX40L expression in vascular tissue.
Paradoxically, lower OX40L expression appears to increase cardiovascular risk. This may
reflect an immunoregulatory role: OX40L signaling is also required for the proper
differentiation and persistence of regulatory T cells (Tregs). Disruption of OX40L in
the plaque microenvironment could impair Treg-mediated suppression of inflammatory
effector T cells, allowing uncontrolled T-cell–driven inflammation to accelerate plaque
growth and destabilization. Consistent with this, anti-OX40L antibody blockade in
LDL receptor-deficient mice22 anti-OX40L antibody blockade in
LDL receptor-deficient mice
Guo et al., Arterioscler Thromb Vasc Biol 2007 — anti-OX40L
antibody reduced atherosclerotic lesion area by approximately 53% in Ldlr−/− mice by
attenuating Th2-mediated responses and preserving anti-oxLDL IgM antibodies
attenuated atherogenesis, pointing to the pathway as a therapeutic target rather than a
simple risk amplifier.
Elevated soluble OX40L (sOX40L) in circulation — released primarily from activated platelets — correlates with acute coronary syndrome severity, plaque instability, and elevated matrix metalloproteinase (MMP-9, MMP-3) levels, reinforcing OX40L as a biomarker of active plaque inflammation.
The Evidence
The variant was first implicated by Wang et al. in Nature Genetics 200533 Wang et al. in Nature Genetics 2005
Positional
identification of TNFSF4, encoding OX40 ligand, as a gene that influences atherosclerosis
susceptibility, which used positional cloning
from the murine atherosclerosis susceptibility locus Ath1 to identify TNFSF4 as a
human candidate gene. The rs3850641 G allele was significantly more frequent in MI cases
than controls in two independent Swedish cohorts. Subsequent work by Cunningham et al.
PLoS ONE 201144 Cunningham et al.
PLoS ONE 2011
A Common Polymorphism in the Promoter Region of the TNFSF4 Gene Is
Associated with Lower Allele-Specific Expression and Risk of MI
identified rs45454293 as the functional promoter variant in LD with rs3850641 (r²=0.70),
with the TG haplotype significantly enriched in female MI patients vs controls in the
SCARF cohort (P=0.01), and demonstrated 50% reduced OX40L promoter activity for the
T-allele.
A Chinese Han case-control study (Li et al. Oncotarget 2018, PMID 29921578 — 454 cases, 512 controls)55 (Li et al. Oncotarget 2018, PMID 29921578 — 454 cases, 512 controls) found that homozygous GG carriers had significantly elevated MI risk (crude OR 2.00, 95% CI 1.04–3.86, P=0.039; adjusted OR 2.29, 95% CI 1.20–4.69, P=0.023), while the AG heterozygous genotype conferred no significant independent risk under additive or dominant models.
However, larger meta-analyses show overall null results in aggregate populations. A meta-analysis of 9 studies (Fu et al. 2016, PMID 27008001)66 meta-analysis of 9 studies (Fu et al. 2016, PMID 27008001) found allelic OR 1.10 (95% CI 0.96–1.27, P=0.174), and a systematic review of 11 studies covering 3,865 cases and 6,344 controls (Lu et al. 2018, PMID 29424751)77 systematic review of 11 studies covering 3,865 cases and 6,344 controls (Lu et al. 2018, PMID 29424751) found OR 1.02 (95% CI 0.89–1.17) for the G allele — no overall association. These null findings likely reflect population stratification (original signal primarily in European women), heterogeneity across disease definitions (CHD vs. MI specifically), and statistical power limitations for the rare GG genotype (only ~2% of individuals).
The GG-specific risk signal (recessive model OR ~2) remains coherent across the positive studies and is mechanistically plausible; the overall null meta-analysis findings reflect the rarity of GG homozygotes diluting allele-level effects. Evidence is classified as moderate — replicated in multiple independent studies under the recessive model, with a plausible regulatory mechanism, but no established clinical guidelines and significant heterogeneity across populations.
Practical Actions
For GG homozygotes, the elevated MI risk under the recessive model (OR ~2) warrants proactive cardiovascular monitoring. The risk appears particularly pronounced in women and in populations of East Asian and South Asian descent where OX40L pathway activity in the context of this haplotype may interact differently with other risk factors. Tracking inflammatory biomarkers such as hs-CRP and sOX40L alongside standard lipid panels provides the most relevant picture of vascular inflammatory burden for this genotype.
Interactions
rs3850641 is in linkage disequilibrium with the functional promoter SNP rs45454293 (r²=0.70); the TG haplotype combining both minor alleles carries the highest risk signal in women. rs2205960 and other upstream TNFSF4 promoter region SNPs (rs10912580, rs12039904, rs1234317) form a distinct haplotype block associated with systemic lupus erythematosus susceptibility, illustrating that TNFSF4 haplotypes have different disease consequences depending on which part of the regulatory region they tag.
Individuals carrying both rs3850641 GG and cardiovascular-risk variants in classical inflammatory pathways (e.g., IL-6 signaling genes, CRP variants) may have compounded atherosclerotic risk through synergistic immune activation in the vascular wall — an interaction that has not been formally studied but is mechanistically plausible given the T-cell co-stimulatory nature of the OX40L pathway.
MMEL1-TNFRSF14 — A Regulatory Checkpoint Between Peptide Processing and Immune Costimulation
On the short arm of chromosome 1, at position 1p36, two genes sit in close proximity with
distinct but complementary roles in immune homeostasis: MMEL1, encoding a neprilysin-like
metalloendopeptidase, and TNFRSF14, encoding HVEM11 HVEM
Herpes Virus Entry Mediator, also known
as CD270 — a TNF receptor superfamily member that regulates both activating and inhibitory
T-cell signals. The rs3890745 intronic
SNP22 SNP
Single nucleotide polymorphism — a single-letter DNA difference that varies between
people; this one lies within MMEL1 but is in linkage disequilibrium with regulatory elements
affecting both neighboring genes has been
consistently associated with rheumatoid arthritis susceptibility in multiple genome-wide
association studies. The C allele, carried at approximately 32% frequency in Europeans but
up to 53% in African populations, tags a risk haplotype at this locus.
The Mechanism
rs3890745 falls within an intron of MMEL1 (membrane metalloendopeptidase like 1), also known as NEP2 or neprilysin II. MMEL1 is a zinc-dependent neutral endopeptidase that cleaves bioactive peptides including neuropeptides and inflammatory mediators — a function that positions it as a regulator of local peptide signaling in immune and neural contexts. MMEL1 is expressed broadly, with highest levels in testis and small intestine, and weaker expression in brain, kidney, and immune tissues.
The more immunologically prominent neighbor is TNFRSF14 (HVEM/CD270). HVEM operates as a
dual-function immune checkpoint: it can deliver activating signals to T cells via binding
LIGHT33 LIGHT
TNFSF14, a ligand expressed on activated T cells that delivers costimulatory signals
through HVEM, promoting T-cell proliferation and cytokine production,
but it also transmits inhibitory signals through BTLA44 inhibitory signals through BTLA
B- and T-lymphocyte attenuator — an
inhibitory receptor expressed on lymphocytes that signals "stop" when bound to HVEM,
analogous to CTLA-4 and PD-1. This bidirectional
control makes HVEM a critical rheostat in peripheral immune tolerance. Disruption of the
HVEM/BTLA inhibitory axis has been implicated in multiple autoimmune conditions, including
rheumatoid arthritis, celiac disease, and inflammatory bowel disease.
As an intronic variant, rs3890745 does not change the MMEL1 protein sequence. It is presumed to act as a regulatory tag SNP in linkage disequilibrium with functional variants that alter expression levels of MMEL1, TNFRSF14, or both in immune cell populations. Expression quantitative trait locus (eQTL) data from immune cells would be required to identify the precise causal variant; the rs3890745 association most likely reflects a haplotype-level regulatory effect spanning both genes.
The Evidence
The MMEL1-TNFRSF14 locus was first established as an RA susceptibility region in a 2008
genome-wide association meta-analysis55 genome-wide association meta-analysis
Study design combining multiple GWAS cohorts to
increase statistical power — more samples yield more reliable effect estimates
by Raychaudhuri and colleagues (Nature Genetics), which analyzed 3,393 cases and 12,462
controls in the discovery phase and 3,929 anti-CCP-positive RA cases and 5,807 controls
in replication. The rs3890745 signal at this locus reached an overall p-value of 1.1×10⁻⁷,
placing it among confirmed RA susceptibility loci alongside CD40, KIF5A, and CDK6.
The association was replicated in the large trans-ethnic RA GWAS meta-analysis66 large trans-ethnic RA GWAS meta-analysis
Trans-ethnic
meta-analysis pooling cohorts of European, East Asian, and mixed ancestry — improves
fine-mapping precision by narrowing LD blocks
by Okada et al. (Nature, 2014), which confirmed an odds ratio of 1.18 (95% CI 1.10–1.27)
for the risk allele at this locus. An independent GWAS meta-analysis by Stahl et al.
(Nature Genetics, 2010)77 Stahl et al.
(Nature Genetics, 2010)
Discovery phase: 5,539 RA cases and 20,169 controls from European
populations; replication phase: 6,768 cases and 8,806 controls
confirmed the MMEL1 locus at p=4×10⁻⁶. A 2019 cross-trait GWAS analysis88 2019 cross-trait GWAS analysis
Examining
shared genetic architecture between autoimmune diseases and non-Hodgkin lymphomas using
regional overlap and polygenic risk scores
identified rs3890745 with OR 1.14 at genome-wide significance (p=2×10⁻⁸) in a combined
RA/DLBCL analysis, supporting shared immune-regulatory genetic architecture across
B-cell-driven conditions.
Beyond RA, variants at the MMEL1-TNFRSF14 locus have been associated with multiple sclerosis and primary biliary cirrhosis in separate GWAS, consistent with the broad role of HVEM-mediated immune costimulation in peripheral tolerance across different autoimmune phenotypes.
The per-allele odds ratio of approximately 1.12–1.18 classifies this as a moderate-effect susceptibility locus. This is consistent with the polygenic architecture of RA, where dozens of loci each contribute small incremental risk — the clinical impact of rs3890745 alone is modest, but it gains significance when combined with high-risk HLA-DRB1 shared epitope alleles or PTPN22 R620W.
Practical Actions
For C allele carriers, particularly CC homozygotes, the priority is early recognition of seropositive RA — the anti-CCP (ACPA)-positive, erosive subtype with the strongest genetic component. Because rs3890745 tags shared autoimmune genetic architecture, individuals with joint symptoms, unexplained morning stiffness, or a strong family history of RA should prioritize early autoantibody testing. Vitamin D3 supplementation has demonstrated a 22% reduction in incident autoimmune disease in the VITAL RCT and acts as a direct NF-kB and T-cell regulatory modulator.
Interactions
rs3890745 sits in the same chromosomal neighbourhood as a broader 1p36 autoimmune risk region. TNFRSF14/HVEM interacts directly with BTLA on T cells and B cells — variants in the BTLA locus (rs9288952) have been associated with RA in some GWAS, suggesting a potential compound effect through the HVEM-BTLA inhibitory axis. MMEL1 has homology with neprilysin (NEP/CD10), which processes inflammatory neuropeptides; the folate-methylation pathway (MTHFR, rs1801133) intersects with DNA methylation patterns at immune loci that regulate TNFRSF14 expression.
The strongest RA risk loci that may compound with rs3890745 include PTPN22 rs2476601 (T-cell signaling threshold), TRAF1-C5 rs10818488 (NF-kB regulation), and HLA-DRB1 shared epitope alleles (antigen presentation). These operate through distinct pathways and their combined carriage incrementally raises absolute RA risk.
FCGR3A V158F — The NK Cell Affinity Switch
Your body's natural killer (NK) cells carry a surface receptor called FcgammaRIIIa (CD16a)11 FcgammaRIIIa (CD16a)
The primary Fc receptor on NK cells that binds the constant region of IgG
antibodies, triggering antibody-dependent cellular cytotoxicity that connects innate immunity to antibody-mediated defense. This
receptor grabs the tail end of IgG antibodies already bound to target cells — infected cells, cancer cells, or cells flagged for destruction — and activates the NK cell to
kill. The V158F variant (rs396991) changes a single amino acid in the IgG-binding domain of this receptor, creating two versions with dramatically different binding
affinities22 dramatically different binding
affinities
The 158V isoform binds IgG1 and IgG3 with approximately 2-fold higher affinity than the 158F isoform. This
difference matters both for natural immune surveillance and, critically, for how well monoclonal antibody therapies work.
Genotyping accuracy warning: FCGR3A shares over 98% sequence homology33 over 98% sequence homology
Only four nucleotides differ between FCGR3A and FCGR3B in the genotyped
region with its neighboring gene FCGR3B. This extreme similarity can cause consumer genotyping chips and some research assays
to inadvertently read FCGR3B sequence instead of FCGR3A, producing incorrect genotype calls. Validated TaqMan assays show 100% accuracy in European and Asian populations
but 7.8% error rate in African populations and 1.1% in admixed American populations44 7.8% error rate in African populations and 1.1% in admixed American populations. If your result seems inconsistent with
clinical observations, consider confirmatory testing with a gene-specific assay.
The Mechanism
The FCGR3A gene encodes a transmembrane glycoprotein expressed primarily on NK cells, macrophages, and some T-cell subsets. The V158F polymorphism occurs in the
second extracellular immunoglobulin-like domain55 second extracellular immunoglobulin-like domain
This domain directly contacts the CH2 region of IgG, and the amino acid at position 158 sits at the binding
interface, precisely where IgG makes contact. Valine at position 158 (encoded by the C allele on the plus strand) creates a
receptor that binds IgG1 and IgG3 with approximately two-fold higher affinity than phenylalanine at the same position (A allele). This translates directly into
enhanced antibody-dependent cellular cytotoxicity (ADCC)66 enhanced antibody-dependent cellular cytotoxicity (ADCC)
ADCC is the process by which NK cells kill antibody-coated target cells; higher receptor affinity means more
efficient target recognition and killing.
The functional hierarchy is clear: V/V homozygotes show the strongest ADCC activity, V/F heterozygotes are intermediate, and F/F homozygotes have the weakest response. This gradient affects both natural immune surveillance against infected or abnormal cells and the therapeutic efficacy of monoclonal antibodies that depend on ADCC as their mechanism of action.
The Evidence
Monoclonal Antibody Therapy
The pharmacogenomic significance of V158F was first demonstrated in follicular lymphoma patients treated with rituximab77 follicular lymphoma patients treated with rituximab
Cartron et al. showed V/V patients achieved
significantly higher molecular response rates to rituximab monotherapy than V/F or F/F patients. For trastuzumab in
HER2-positive breast cancer, a study of Egyptian patients88 study of Egyptian patients
V/V genotype present in 29.6% of responders vs 8.4% of non-responders; median
progression-free survival 22 months for V/V vs 6 months for F/F (p=0.003) found V/V carriers had significantly
better overall survival and response rates. In follicular lymphoma treated with antibody-chemotherapy combinations99 follicular lymphoma treated with antibody-chemotherapy combinations
SWOG trials showed patients with at least one V allele had
better overall survival than F/F patients when treated with antibody-chemotherapy combinations, V allele carriers
showed improved outcomes specifically in the antibody-containing treatment arms.
However, results are not universally consistent. A large randomized study in follicular lymphoma1010 A large randomized study in follicular lymphoma
Analysis of 321 patients found no FCGR genotype predicted
initial response to rituximab or rituximab-chemotherapy combinations found no predictive value, highlighting that
tumor biology, immune microenvironment, and combination chemotherapy may modulate the receptor's influence. The evidence is strongest for rituximab monotherapy
and weakens when combined with intensive chemotherapy regimens.
Anti-TNF Therapy in Inflammatory Bowel Disease
In Crohn's disease, V/V carriers showed 100% biological response to infliximab compared to 69.8% of F carriers1111 V/V carriers showed 100% biological response to infliximab compared to 69.8% of F carriers,
consistent with enhanced ADCC against TNF-expressing cells. Paradoxically, V/V carriers also show faster infliximab clearance and higher anti-drug antibody
rates1212 faster infliximab clearance and higher anti-drug antibody
rates
37.5% of V/V patients developed anti-drug antibodies vs 10.6% for V/F and 5% for F/F (OR 6.08) — the
same high-affinity receptor that makes the drug work better also accelerates its elimination. This creates a clinical dilemma: V/V patients may respond better
initially but need closer therapeutic drug monitoring and potentially dose optimization to maintain response.
Autoimmune Disease Susceptibility
Meta-analyses link V158F to systemic lupus erythematosus susceptibility1313 Meta-analyses link V158F to systemic lupus erythematosus susceptibility
FCGR3A rs396991 shows association with SLE in recessive model (OR 1.26, p = 9.62
x 10-5) and lupus nephritis risk, particularly in non-European populations. The enhanced IgG binding by the V allele
may increase immune complex-mediated tissue damage in autoimmune conditions. In rheumatoid arthritis, the association is weaker and may interact with
HLA shared epitope status1414 HLA shared epitope status
The V allele may predispose shared epitope-positive individuals to RA.
Practical Implications
The clinical significance of V158F depends heavily on context. For individuals who may receive monoclonal antibody therapy (rituximab for lymphoma, trastuzumab for breast cancer, cetuximab for colorectal cancer), knowing your genotype could inform treatment expectations and potentially guide therapeutic decisions. F/F carriers receiving these therapies may benefit from higher doses, more frequent administration, or combination approaches that don't rely solely on ADCC.
For individuals on anti-TNF therapy (infliximab, adalimumab) for inflammatory bowel disease or rheumatoid arthritis, V/V carriers should be aware of higher anti-drug antibody risk and may benefit from proactive therapeutic drug monitoring rather than reactive testing only when treatment appears to be failing.
For all carriers of the F allele, strategies that support NK cell function take on added importance since the lower-affinity receptor means each NK cell-target interaction is less efficient. While the receptor affinity is genetically fixed, NK cell number and activation state are modifiable.
Interactions
FCGR3A V158F interacts with FCGR2A rs1801274 (H131R)1515 FCGR2A rs1801274 (H131R)
Another Fc gamma receptor polymorphism affecting IgG binding, located on the same chromosome; combined
low-affinity genotypes at both loci may compound reduced ADCC. In DLBCL, FCGR2A was the primary driver of survival
differences — FCGR3A was not independently associated with DLBCL survival — while FCGR3A showed predictive value in follicular lymphoma settings. In autoimmune disease, the
combined effect of low-affinity alleles at both FCGR3A and FCGR2A may influence susceptibility and treatment response to antibody-based therapies. Enhancer
SNPs rs4656317 and rs12071048 within FCGR3A are in strong linkage disequilibrium with rs396991 and influence NK cell ADCC through transcriptional regulation
of CD16a expression levels, potentially modifying the functional impact of V158F.
BTD Pro167Ser — A Rare Pathogenic Allele in the Biotin Recycling Enzyme
Biotin (vitamin B7) is indispensable for four
carboxylase enzymes11 carboxylase enzymes
pyruvate carboxylase, propionyl-CoA carboxylase,
3-methylcrotonyl-CoA carboxylase, and acetyl-CoA carboxylase — all require
biotin as a covalently attached cofactor to function that power fat
synthesis, amino acid metabolism, and gluconeogenesis. Unlike most vitamins,
biotin is largely recovered from food proteins rather than absorbed free —
the digestive breakdown of biotin-dependent enzymes in the diet releases
biocytin22 biocytin
biotin-ε-lysine, the product of proteolytic digestion of
holocarboxylases and other biotinylated proteins, which must then be
cleaved back to free biotin before the body can reuse it. The enzyme
responsible for this cleavage is biotinidase, encoded by the BTD gene on
chromosome 3p25.1.
The rs397507173 variant introduces a c.499C>T nucleotide change that substitutes
serine for proline at position 167 of the biotinidase protein (p.Pro167Ser).
ClinVar records two "Likely pathogenic" submissions for this allele in the
context of biotinidase deficiency, with a single "Uncertain significance"
submission also on record (VCV003339734; conflicting interpretations status).
The variant is absent from more than 250,000 control chromosomes in population
databases, consistent with a rare disease allele. It was first documented in
patients with biochemically confirmed biotinidase deficiency by
Iqbal et al.33 Iqbal et al.
Iqbal F et al. The identification of novel mutations in the
biotinidase gene using denaturing high pressure liquid chromatography (dHPLC).
Mol Genet Metab, 2010.
The Mechanism
Biotinidase is a member of the nitrilase superfamily. It cleaves the amide bond between biotin and lysine in biocytin, regenerating free biotin for re-attachment to newly synthesized apo-carboxylases. Without functional biotinidase, biocytin accumulates in urine (biotinuria), free biotin concentrations fall, and all four biotin-dependent carboxylases progressively lose activity. The proline at position 167 lies within the enzyme's catalytic domain; replacement with serine is predicted to disrupt local protein folding and reduce or abolish catalytic activity, consistent with ClinVar's pathogenicity assessment and in-silico tools predicting a damaging effect.
Biotinidase deficiency is classified by residual serum enzyme activity: profound deficiency (<10% of mean normal activity) causes severe neurological disease if untreated; partial deficiency (10–30% of mean normal) produces milder or stress-triggered symptoms. Heterozygous carriers typically retain ~50% of normal activity — sufficient for health under ordinary conditions but relevant for compound heterozygosity risk assessment.
The Evidence
The clinical significance of biotinidase deficiency is
well established44 well established
Wolf B. Biotinidase Deficiency. GeneReviews, 2000
(updated 2026): untreated
profound deficiency causes seizures, hypotonia, developmental delay, optic
atrophy, hearing loss, and alopecia, typically presenting in the first
weeks to years of life. Biotin supplementation is curative if started
before irreversible neurological damage occurs; some deficits (optic atrophy,
sensorineural hearing loss) may not fully reverse even with treatment.
Partial deficiency can remain asymptomatic for years but cause symptoms
under physiological stress — intercurrent illness, pregnancy, or periods
of increased biotin turnover.
The rs397507173 Pro167Ser allele itself has been reported in individuals with confirmed biotinidase deficiency, including at least one homozygous case (ClinVar record). Its rarity — absent in 251,442 control chromosomes across gnomAD datasets — and the in-silico evidence of functional disruption support the "likely pathogenic" classification. The conflicting "uncertain significance" submission likely reflects the limited published functional data specific to this single variant, rather than any evidence of benignity.
Newborn screening programs that test biotinidase activity on dried blood
spots identify profound deficiency at approximately
1 in 137,000 births and combined (profound + partial) deficiency at
1 in 61,000 births55 1 in 137,000 births and combined (profound + partial) deficiency at
1 in 61,000 births
Wolf B, GeneReviews 2026. BTD is a required
target on expanded newborn screening panels in the United States and
many European countries precisely because early biotin supplementation
is inexpensive, safe, and prevents irreversible harm.
Practical Actions
Heterozygous carriers (CT genotype) have sufficient biotinidase activity for normal health and do not require supplementation. The primary clinical relevance is reproductive: if both parents carry a pathogenic BTD allele, each pregnancy has a 25% probability of inheriting biallelic pathogenic variants and presenting with biotinidase deficiency. Carrier couples should receive genetic counseling before or during pregnancy.
Individuals who carry Pro167Ser in compound heterozygosity with a second pathogenic BTD variant (particularly the common profound-deficiency alleles) should have biotinidase enzyme activity measured to determine their actual activity level and need for supplementation.
For homozygous Pro167Ser (TT) individuals — an exceptionally rare genotype not yet documented in population databases — the expected clinical scenario is profound biotinidase deficiency requiring immediate and lifelong biotin supplementation, as for all confirmed profound cases.
Interactions
The key interactions for BTD pathogenic alleles involve compound heterozygosity: carrying one Pro167Ser allele together with a second BTD pathogenic variant on the other chromosome results in biallelic loss of function, producing partial or profound deficiency depending on the combined residual activity. The most common co-occurring allele in many populations is p.Asp444His (associated with partial deficiency), and the most common severe allele is the c.98_104delinsTCC frameshift. When Pro167Ser is paired with a severe loss-of-function allele, the phenotype is expected to be profound deficiency; paired with the milder p.Asp444His, the combined activity typically falls in the partial deficiency range (approximately 10–25% of normal). Any individual found to carry Pro167Ser should have their partner screened for BTD pathogenic variants before or during pregnancy, and any child of two carriers should receive biotinidase activity testing as part of newborn screening.
MYBPC3 Gly490Val — A Recessive Sarcomere Variant with Malignant Homozygous Phenotype
Cardiac myosin-binding protein C (cMyBP-C), encoded by MYBPC3 on chromosome 11, is a thick-filament
accessory protein that accounts for roughly 2% of myofibrillar protein mass11 roughly 2% of myofibrillar protein mass
cMyBP-C spans the
C-zone of each sarcomere half and contacts both myosin and actin.
Its C-terminal domains anchor it to the thick filament while its N-terminal domains regulate
cross-bridge cycling — slowing myosin-actin interaction at rest and permitting contraction when
phosphorylated by protein kinase A during adrenergic stimulation. MYBPC3 is the single most
commonly mutated gene in hypertrophic cardiomyopathy (HCM), a condition marked by asymmetric
left ventricular hypertrophy, diastolic dysfunction, and in severe cases, sudden cardiac death.
Most pathogenic MYBPC3 variants cause HCM through autosomal dominant haploinsufficiency — one defective copy is sufficient to disrupt sarcomere stoichiometry. The Gly490Val variant (rs397514752, c.1469G>T on the coding strand) is an important exception: it behaves recessively, causing disease only when both copies of MYBPC3 are affected.
The Mechanism
Glycine at position 490 falls within the C3 immunoglobulin-like domain22 C3 immunoglobulin-like domain
MYBPC3 contains 11
domains (C0-C10); C3 is an Ig-like domain in the middle segment of the protein
of cMyBP-C. Glycine residues are structurally unique — their lack of a side chain allows tight
turns and compact beta-strand arrangements that larger amino acids cannot adopt. Substituting the
bulky valine disrupts this local fold. The conserved glycine at position 490 is present across
vertebrate species, underscoring its structural importance.
In heterozygotes, one functional copy of MYBPC3 appears sufficient to maintain normal sarcomere stoichiometry, explaining why carriers remain phenotypically normal. In homozygotes, with no wild-type cMyBP-C produced, the sarcomere cannot regulate cross-bridge cycling properly, leading to the disorganized myocyte architecture (myofibre disarray) characteristic of severe HCM.
The Evidence
The variant was first reported in 2013 by Wang Y and colleagues33 first reported in 2013 by Wang Y and colleagues
PLoS One, n=1 pedigree, 2 affected
homozygous siblings, 5 unaffected heterozygous carriers, 376 controls
in a consanguineous Chinese family. Both homozygous brothers presented with classic HCM: maximum
wall thickness 17-18 mm, asymmetric ventricular hypertrophy on cardiac MRI, and diffuse
repolarization changes with large negative T waves. None of the five adult heterozygous family
members — including one aged 71 years — showed any clinical evidence of HCM by echocardiography
or ECG. The variant was absent from 376 Chinese controls and public variant databases at the time
of publication.
The recessive inheritance pattern at this site contrasts sharply with an adjacent variant, Gly490Arg, which causes dominant HCM. This domain-specific behavior illustrates how even nearby amino acid substitutions can produce fundamentally different inheritance modes depending on their effect on protein function versus stability.
Recessive MYBPC3 variants as a class are most dramatically illustrated by the Amish splice
mutation, where homozygous infants developed lethal HCM requiring transplantation in the first
year of life44 homozygous infants developed lethal HCM requiring transplantation in the first
year of life
Zahka K et al. Heart 2008; all surviving homozygous infants required cardiac
transplantation, while heterozygous parents were
unaffected. The Gly490Val variant appears to follow the same recessive pattern but with later
onset and less catastrophic severity.
Evidence note: rs397514752 is reported in a single family (n=2 homozygous affected individuals). The evidence level is emerging — the recessive behavior is internally consistent and mechanistically plausible, but independent replication in additional families is needed before clinical management protocols can be established with confidence.
Practical Actions
For the overwhelming majority of people who carry this variant at all, the relevant question is whether they are heterozygous or homozygous. Heterozygous carriers have no current evidence of increased personal cardiac risk. Their clinical relevance lies in reproductive planning: if both members of a couple carry this variant, each pregnancy carries a 25% chance of producing a homozygous child with HCM.
Homozygous individuals warrant full cardiological evaluation given the documented severe phenotype.
Interactions
Because the pathogenic effect of rs397514752 is recessive, the key interaction is between the two copies of this variant in homozygotes — compound heterozygosity with a second MYBPC3 pathogenic variant on the other allele would likely produce a similar biallelic loss-of-function phenotype. Heterozygous carriers of rs397514752 who also carry a dominant MYBPC3 pathogenic variant on the opposite allele could theoretically have a modified phenotype, but no published evidence addresses this specific combination.
MUC1 Variant — Gastric Mucus Protection and H. Pylori Susceptibility
The MUC1 gene encodes mucin-1, a membrane-bound glycoprotein11 membrane-bound glycoprotein
MUC1 is a large transmembrane mucin that forms a protective barrier on the surface of gastric epithelial cells that plays a crucial role in protecting the gastric lining from environmental insults, particularly the bacterium Helicobacter pylori (H. pylori)22 Helicobacter pylori (H. pylori)
H. pylori is the primary bacterial cause of gastric ulcers and gastric cancer, infecting about half the world's population. The rs4072037 variant, though synonymous33 synonymous
A synonymous variant doesn't change the amino acid sequence but can affect mRNA splicing and gene expression, significantly affects how effectively this protective barrier functions by influencing alternative splicing of the MUC1 gene.
The Mechanism
The rs4072037 variant (G>A) is located in exon 2 of the MUC1 gene at chromosome 1q22. Though it doesn't change the encoded amino acid (making it synonymous), the A variant disrupts normal splicing patterns, leading to production of a 27-nucleotide shorter transcript44 27-nucleotide shorter transcript
The T allele introduces an alternative splice site that removes 27 nucleotides from the mature mRNA. This altered MUC1 protein has reduced ability to block H. pylori adhesion to gastric mucosa. The protective C allele maintains normal MUC1 structure, which more effectively blocks H. pylori adhesin binding (BabA and SabA), limiting bacterial colonization.
The Evidence
Multiple meta-analyses55 Multiple meta-analyses
Liu et al. Meta-analysis of 9 studies with 10,410 cases and 11,437 controls have established that the C allele provides significant protection against gastric cancer, with an odds ratio of 0.70 (95% CI: 0.64-0.76). This protective effect is particularly strong in Asian populations, where the association reaches genome-wide significance. A larger meta-analysis66 larger meta-analysis
Gu et al. 17 studies with 12,551 cases and 13,436 controls confirmed that rs4072037 is associated with decreased cancer risk, especially gastric cancer in Asian populations. An Iranian case-control study77 Iranian case-control study
Shekarriz et al. 99 gastric cancer patients and 98 controls demonstrated a gene-environment interaction between rs4072037 genotype and H. pylori infection status in modulating gastric cancer risk.
The TT genotype is associated with approximately 2-fold increased gastric cancer risk compared to GG, and the effect is most pronounced for diffuse-type gastric cancer88 diffuse-type gastric cancer
Diffuse gastric cancer is a more aggressive subtype with worse prognosis. Studies across multiple ethnic groups consistently show this pattern, though the effect size is larger in Asian populations (where gastric cancer and H. pylori rates are higher) than in Caucasians.
Practical Implications
If you carry one or two copies of the T allele, your gastric mucus barrier may be less effective at preventing H. pylori colonization. This doesn't mean you'll definitely develop problems, but it suggests increased vigilance around gastric health, particularly if you're in a region with high H. pylori prevalence. Consider testing for H. pylori infection if you experience persistent digestive symptoms, as early treatment can prevent progression to more serious conditions.
The protective C allele has a global frequency of approximately 45-46%, meaning that a substantial portion of the population benefits from enhanced natural defense against H. pylori. Those with CC genotype have the strongest protective effect, but even GA carriers show intermediate protection compared to AA individuals.
Interactions
The rs4072037 variant's effect is most pronounced in the presence of H. pylori infection, demonstrating a critical gene-environment interaction. The variant also exists in linkage disequilibrium99 linkage disequilibrium
Linkage disequilibrium means these variants are often inherited together as a block with other MUC1 variants including rs2070803 and rs2075570, which also affect gastric cancer risk. The combined effect of multiple MUC1 variants may further modulate gastric mucosal protection, though individual risk from rs4072037 is well-established independent of other variants.
The Ghrelin Receptor Variant That Shapes Lean Body Mass and IGF-1
Ghrelin does far more than signal hunger. When it docks onto
GHSR-1a (growth hormone secretagogue receptor)11 GHSR-1a (growth hormone secretagogue receptor)
The canonical ghrelin receptor,
expressed in the pituitary gland, hypothalamus, and across peripheral tissues.
Named for its original discovery as the receptor mediating growth hormone
secretion in response to synthetic GH secretagogues.,
it triggers a powerful pulse of growth hormone release from the pituitary. That GH
pulse drives the liver to secrete
IGF-1 (insulin-like growth factor 1)22 IGF-1 (insulin-like growth factor 1)
The primary mediator of GH's anabolic
effects: stimulates muscle protein synthesis, promotes linear bone growth, and
regulates fat metabolism. Circulating IGF-1 levels closely track GH secretion
patterns over hours to days.,
and IGF-1 is the hormone that actually builds lean tissue — muscle, bone, connective
tissue. rs509035 is an intronic variant within the GHSR gene itself, sitting at
chromosome 3:172,445,659 (GRCh38). It does not change the receptor's amino acid
sequence, but GWAS data from populations totalling hundreds of thousands of individuals
shows that the A allele is associated with higher fat-free mass, greater adult stature,
and elevated serum IGF-1 — the three expected downstream consequences of enhanced
GHSR-mediated GH pulsatility.
The Mechanism
rs509035 sits within an intron of GHSR, likely in a regulatory element that influences how efficiently the gene is transcribed or the mRNA is processed. The exact molecular function has not been characterized by gel-shift or reporter assays (unlike the functionally annotated nearby promoter variant rs490683), so the mechanism is inferred from the downstream phenotype pattern rather than direct biochemical evidence. The convergence of three independent large GWAS signals — lean body mass, height, and IGF-1 — pointing to the same A allele strongly implies that A tracks with modestly elevated GHSR expression or activity, producing more frequent or larger GH pulses, more IGF-1, and consequently more lean tissue accretion over a lifetime.
Animal model support comes from studies of
GHSR knockout mice33 GHSR knockout mice
Labarthe et al. 2022 (PMID 33774644): GHS-R1a deletion
reduced pulsatile GH secretion, decreased linear growth, and reduced hypothalamic
GHRH mRNA expression in males — demonstrating that GHSR is causally required for
normal GH pulsatility and growth.,
which show reduced GH pulsatility and impaired linear growth when the receptor is
absent. If loss of GHSR reduces lean mass and growth, it follows that variants that
enhance GHSR signaling would move these phenotypes in the opposite direction — and
rs509035-A appears to do exactly that at the population level.
The Evidence
The strongest evidence comes from a sex-stratified GWAS in the UK Biobank by
Hübel et al. 201944 Hübel et al. 2019
Genomics of body fat percentage may contribute to sex bias
in anorexia nervosa. Am J Med Genet B Neuropsychiatr Genet. 2019;180(3):222-235.
N=155,961 healthy UK Biobank participants.,
which identified rs509035 as a genome-wide significant hit for fat-free mass (overall
beta = 0.188, p = 2×10⁻¹⁵; males beta = 0.214, p = 1×10⁻⁸; females beta = 0.151,
p = 5×10⁻⁹). The effect is present in both sexes but is modestly larger in men,
consistent with the testosterone–GH axis interaction in lean mass regulation.
Height GWAS results from
Wood et al. 201455 Wood et al. 2014
Nature Genetics, N=253,288 adults; the A allele (frequency 0.316)
shows beta = 0.031 SD per allele, p = 3×10⁻²³.
confirm that the lean mass signal has a growth axis explanation — the same allele
that promotes lean tissue accretion also produces slightly greater adult height.
The CHARGE Consortium IGF-1 meta-analysis by
Teumer et al. 201666 Teumer et al. 2016
Aging Cell, N=30,884 European adults; rs509035 A allele
associated with higher serum IGF-1, p = 2×10⁻⁸.
provides the mechanistic link: the A allele is associated with higher circulating
IGF-1, which is the molecular effector translating GH pulsatility into lean tissue
construction. Together, these three GWAS signals (fat-free mass, height, IGF-1)
form a coherent picture of enhanced GHSR-to-GH-to-IGF-1 anabolic signaling in A
allele carriers.
Glucose and insulin metabolism may also be modified. In the Finnish Diabetes
Prevention Study (Mager et al. 2008, N=507),
rs509035 was associated with several glucose and insulin measures during 3-year
follow-up77 rs509035 was associated with several glucose and insulin measures during 3-year
follow-up
Mager U et al. PLoS One. 2008;3(8):e2941. The paper studied 7 GHSR
SNPs in overweight adults with impaired glucose tolerance; rs509035 and rs490683
were the two variants showing significant metabolic associations.,
though the specific direction and magnitude of the effect for rs509035 were not
separately published in the abstract. IGF-1 itself has insulin-sensitizing effects
at the receptor level, so the higher IGF-1 in A allele carriers may partially
explain glucose metabolism differences.
Practical Actions
For GG individuals (the majority), the GHSR intronic variant confers typical lean mass accrual potential without the enhanced GH-axis signaling that A allele carriers appear to have. For those pursuing body composition goals, the GG genotype means that lean mass gains depend primarily on training stimulus and protein adequacy rather than a genetically elevated IGF-1 floor. Monitoring IGF-1 levels is more informative for AA individuals than GG individuals, as the GG baseline is uncharacterized relative to the A allele effect.
For A allele carriers (AG and AA), enhanced GHSR-mediated GH pulsatility confers a measurable lean mass and IGF-1 advantage. This is actionable in resistance training contexts: a higher IGF-1 floor generally means faster muscle protein synthesis recovery and potentially better hypertrophic response to resistance training. Adequate protein intake (1.6–2.2 g/kg body weight daily) is necessary to fully leverage the elevated anabolic signaling environment.
Interactions
rs509035 is an intronic variant in the same GHSR gene as the promoter variants rs490683 and rs9819506, which have independent associations with appetite drive and body weight change during dietary intervention. The functional NF-1 binding site variant rs490683 (promoter) affects ghrelin receptor expression level; rs509035 (intronic) appears to operate through a downstream mechanism influencing the GH/IGF-1 anabolic axis more than appetite per se. Individuals carrying rs490683-GG (elevated appetite drive) combined with rs509035-AA (elevated IGF-1) may have a phenotype characterized by both high appetite and efficient lean tissue accretion — the genetic profile of a mesomorphic body type that gains muscle readily but also experiences strong hunger during caloric restriction.