PMM2 R141H — The Most Common CDG Mutation and Its Implications for Carriers

Every protein destined for export, membrane display, or passage through the secretory pathway must be decorated with sugar chains — a process called N-linked glycosylation11 N-linked glycosylation
The attachment of oligosaccharide chains to the nitrogen of asparagine residues in proteins destined for the endoplasmic reticulum. "N-linked" refers to the nitrogen atom of asparagine that anchors the glycan.
. This process depends on a supply of activated sugar building blocks, and the enzyme phosphomannomutase 2 (PMM2) is the gatekeeper for one of them: it converts mannose-6-phosphate to mannose-1-phosphate, the precursor of GDP-mannose, which is incorporated into the oligosaccharide core. When PMM2 fails, cells cannot build complete N-glycan chains, and hundreds of glycoproteins are produced in aberrant forms — affecting brain development, liver function, blood coagulation, hormones, and more.

The c.422G>A variant (rs28936415) changes arginine to histidine at position 141 of the PMM2 protein (p.Arg141His, commonly written R141H). It is the most frequent pathogenic PMM2 allele worldwide, particularly in European populations22 particularly in European populations
European carrier frequency approximately 1/79 in the Netherlands and 1/60 in Denmark per Schollen et al. 2000
, and accounts for roughly half of all disease-causing PMM2 alleles identified in CDG patients.

The Mechanism

PMM2 functions as a homodimer — two identical PMM2 subunits join to form the active enzyme. The Arg141 residue sits in a conserved region critical for protein folding and stability. The R141H substitution introduces a smaller, less basic histidine in place of the bulky positively charged arginine, destabilizing the protein structure.

Protein folding studies33 Protein folding studies
Yuste-Checa P et al. The Effects of PMM2-CDG-Causing Mutations on the Folding, Activity, and Stability of the PMM2 Protein. Hum Mutat, 2015
confirmed that R141H is a destabilizing variant that substantially reduces enzyme activity, and that the protein may be partially rescuable with pharmacological chaperones — a finding relevant to emerging treatment strategies.

A critical fact distinguishes R141H from other PMM2 variants: homozygous R141H (two copies of this exact mutation) has never been observed in any living person. Population genetics analysis44 Population genetics analysis
Schollen E et al. Lack of Hardy-Weinberg equilibrium for the most prevalent PMM2 mutation in CDG-Ia. Eur J Hum Genet, 2000
confirmed that given the observed carrier frequency (1/79 in Dutch neonates), homozygous infants should appear in the population if they survived — but they don't. R141H homozygosity leaves too little PMM2 activity for embryogenesis to complete. All living PMM2-CDG patients who carry R141H are compound heterozygotes55 compound heterozygotes
Carry two different pathogenic PMM2 alleles — R141H on one chromosome and a different mutation (e.g. F119L, V44A, I132T) on the other. Neither allele alone causes disease; both are needed.
: they carry R141H on one chromosome and a different, less severe PMM2 mutation on the other that preserves just enough residual enzyme activity to allow survival.

The Evidence

PMM2-CDG (formerly CDG type Ia, or Jaeken syndrome) was the first CDG identified and remains by far the most common, with over 1,000 cases worldwide. The PMM2 gene was discovered in 199766 1997
Matthijs G et al. Mutations in PMM2, a phosphomannomutase gene on chromosome 16p13, in carbohydrate-deficient glycoprotein type I syndrome. Nat Genet, 1997
, with R141H among the original mutations identified.

A landmark French cohort study Schiff et al. 201777 Schiff et al. 2017
Clinical, laboratory and molecular findings and long-term follow-up data in 96 French patients with PMM2-CDG. J Med Genet, 2017
of 96 PMM2-CDG patients documented the full clinical spectrum: cerebellar ataxia in virtually all patients, intellectual disability, neonatal hypotonia, liver involvement, cardiac abnormalities in 20%, and a mortality rate of ~12.5% (12 deaths at mean age 3.8 years, primarily from cardiac complications). R141H was present in approximately 50% of all disease-causing alleles in this cohort. Biochemical markers consistently show elevated liver enzymes, abnormal coagulation parameters (low protein C, antithrombin), elevated TSH, and hypocholesterolemia.

ClinVar (VCV000007706) lists R141H as Pathogenic with criteria provided by over 50 independent submitters including Mayo Clinic Laboratories, ARUP, GeneDx, and Invitae — one of the most extensively classified pathogenic variants in the CDG field.

Practical Actions

For heterozygous carriers: A single R141H allele does not cause PMM2-CDG. Carriers have one fully functional PMM2 gene and maintain normal glycosylation. The primary significance is reproductive: if both partners carry a pathogenic PMM2 allele (any combination), each pregnancy has a 25% chance of inheriting compound heterozygous PMM2 variants and developing CDG.

For compound heterozygous individuals (disease): PMM2-CDG requires multidisciplinary management by metabolic physicians, neurologists, cardiologists, and hepatologists. Biochemical monitoring of transferrin glycoform analysis, liver enzymes, coagulation factors, and thyroid function is essential. While no curative therapy exists, emerging approaches include dietary mannose supplementation and repurposed drugs. Epalrestat, an aldose reductase inhibitor approved for diabetic neuropathy in Japan, showed 30–400% increases in PMM2 enzyme activity88 showed 30–400% increases in PMM2 enzyme activity
Iyer S et al. Repurposing the aldose reductase inhibitor and diabetic neuropathy drug epalrestat for PMM2-CDG. Dis Model Mech, 2019
in patient fibroblasts across multiple PMM2 genotypes, and is the only small-molecule activator of PMM2 identified to date.

Interactions

PMM2-CDG is defined by compound heterozygosity: one R141H allele paired with a second, distinct PMM2 pathogenic variant on the other chromosome. The most common compound heterozygous combinations include R141H + F119L and R141H + V44A, with phenotypic severity correlating broadly with residual enzyme activity — milder second alleles yield milder disease. Any individual carrying R141H who is diagnosed with PMM2-CDG should have full sequencing of both PMM2 alleles to characterize the specific compound genotype, as this informs prognosis and eligibility for emerging genotype-specific treatments.

rs28942084

LDLR LDLR Pro685Leu

Established Pathogenic

LDLR Pro685Leu — A Pathogenic Familial Hypercholesterolemia Mutation

The LDL receptor (LDLR) is the liver's primary mechanism for clearing low-density lipoprotein11 low-density lipoprotein
LDL cholesterol — the primary carrier of cholesterol in blood, often called "bad cholesterol" because accumulation in arteries drives atherosclerosis
from the bloodstream. Each hepatocyte displays roughly 50,000 LDL receptors that continuously cycle between the cell surface and endosomes, capturing LDL particles and internalizing them for degradation. When LDLR function is impaired by even one pathogenic variant, LDL-C accumulates in the blood from birth, silently damaging arteries for decades.

The Pro685Leu variant (c.2054C>T) is a well-characterized pathogenic mutation found in over 200 individuals with familial hypercholesterolemia across multiple ethnic groups — including populations in Japan, China, India, Zambia, Italy, and European cohorts — earning alternative names FH Zambia, FH Gujerat, FH Frosinone-1, and FH Kanazawa-2. Its worldwide distribution and consistent pathogenicity make it one of the more instructive examples of the global burden of monogenic FH.

The Mechanism

Pro685 sits in the EGF-precursor homology domain of the LDLR, a calcium-dependent structural region22 calcium-dependent structural region
The EGF precursor homology domain (also called the β-propeller domain) coordinates calcium ions that are essential for receptor recycling after LDL release in the acidic endosome
essential for receptor recycling after each endocytic cycle. Proline at this position is highly conserved across vertebrate species, and its replacement with leucine disrupts the local protein conformation.

The functional consequence is a combined Class II/III defect: the precursor form of the mutant receptor33 the precursor form of the mutant receptor
Normal LDLR is synthesized in the endoplasmic reticulum as a ~120 kDa precursor, processed to a mature ~160 kDa glycoprotein, and trafficked to the cell surface in approximately 45 minutes. The Pro685Leu variant slows this maturation and reduces surface expression.
is converted to the mature form more slowly than normal, and the receptor that does reach the cell surface binds LDL with reduced affinity. Functional assays consistently demonstrate 10–30% of normal LDLR activity. Five of five in silico prediction tools (SIFT, PolyPhen-2, REVEL score 0.883, etc.) independently rate the substitution as deleterious.

Heterozygous carriers have approximately half the normal hepatic LDL receptor capacity, producing the classic FH phenotype. Rare homozygotes (both LDLR alleles affected) face near-complete loss of LDL clearance, causing extremely severe hypercholesterolemia with LDL-C often exceeding 500 mg/dL.

The Evidence

The landmark Rotterdam cohort study44 landmark Rotterdam cohort study
Versmissen et al. Efficacy of statins in familial hypercholesterolaemia: a long term cohort study. BMJ, 2008
of 1,950 genetically confirmed FH patients (mean 8.5 years of follow-up) found that statin therapy reduced the cardiovascular event rate by 76% (HR 0.24, 95% CI 0.18–0.30). Treated patients achieved LDL-C reductions of 44–49% with standard statin doses, bringing absolute cardiovascular event rates (11/1,000 person-years) close to those of the general population.

The Pro685Leu variant specifically was identified in two Chinese FH pedigrees55 two Chinese FH pedigrees
Yao et al. Identification of LDLR mutations in two Chinese pedigrees with familial hypercholesterolemia. J Pediatr Endocrinol Metab, 2012
including a compound heterozygous child with severe FH. Across 39 ClinVar submissions from major diagnostic laboratories (Invitae, GeneDx, Mayo Clinic Laboratories, Quest Diagnostics, ARUP Laboratories, Color Diagnostics), zero submissions conflict with a Pathogenic classification. ACMG/AMP criteria applied include PS4 (prevalence in affected individuals), PP1_Strong (co-segregation with disease), PM2 (absent from controls), and PS3_Supporting (functional studies).

Practical Actions

Single-copy (heterozygous) carriers should expect untreated LDL-C in the 190–400 mg/dL range. High-intensity statin therapy (atorvastatin 40–80 mg or rosuvastatin 20–40 mg) is the first-line intervention and can reduce LDL-C by 50–60%. Ezetimibe (10 mg daily) added to a statin provides an additional 20–25% LDL-C reduction through complementary mechanism (intestinal cholesterol absorption inhibition). If LDL-C targets are not met on statin + ezetimibe, PCSK9 inhibitors (alirocumab, evolocumab) add another 50–60% on top of background therapy — though their efficacy depends on residual LDL receptor function, so response monitoring is important.

Cascade screening of all first-degree relatives is recommended immediately on diagnosis: each first-degree relative has a 50% probability of carrying the same variant and will have been accumulating LDL-driven atherosclerosis since birth without knowing it. Children of carriers should be tested by age 8–10.

Interactions

The cardiovascular burden of LDLR Pro685Leu compounds with APOE genotype. APOE E4 carriers (rs429358 CT or CC) already have impaired LDL clearance through a separate mechanism; those who also carry an LDLR pathogenic variant face additive LDL elevation and may require more aggressive lipid-lowering therapy. Similarly, PCSK9 gain-of-function variants (which increase LDL receptor degradation) can worsen the FH phenotype.

rs35936514

LHPP LHPP depression risk variant

Strong Risk Factor

The Prefrontal Phosphatase — When Stress Rewires the Brain

Most people associate depression with serotonin. LHPP tells a different story — one about a small enzyme in the prefrontal cortex that quietly prevents stress from reshaping your brain. When LHPP function is reduced, chronic stress finds an open door into the neural circuitry that governs mood, motivation, and resilience.

LHPP encodes phospholysine phosphohistidine inorganic pyrophosphate phosphatase11 phospholysine phosphohistidine inorganic pyrophosphate phosphatase
A HAD-family phosphatase that removes phosphate groups from histidine residues on target proteins. Histidine phosphorylation is an understudied but functionally important post-translational modification in mammalian cells, regulating kinase signaling cascades
. It was essentially unknown in psychiatry until the CONVERGE consortium22 CONVERGE consortium
Collaborative Research on the Genetics of Depression in East Asians — a large-scale genetics consortium focused on recurrent major depressive disorder in Han Chinese women
identified it in a landmark 2015 Nature study33 a landmark 2015 Nature study
CONVERGE consortium. Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature, 2015
, sequencing 5,303 cases and 5,337 controls to find just two genome-wide significant loci — one near SIRT1 and one in LHPP (P = 6.45 × 10⁻¹²). The rs35936514 variant sits in the 3' UTR and flanking intronic region of LHPP on chromosome 10, likely affecting expression levels rather than protein structure.

The Mechanism

LHPP acts as a brake on stress-induced neuronal dysfunction. In the medial prefrontal cortex44 medial prefrontal cortex
A region of the prefrontal cortex critical for executive function, emotional regulation, and the top-down suppression of the stress response. It communicates extensively with the amygdala and hippocampus to modulate fear and mood
(mPFC), LHPP dephosphorylates a network of histidine kinase substrates55 histidine kinase substrates
Proteins — including NME1/2 — that carry phosphate groups on histidine amino acids. When these kinases are over-active, downstream signaling cascades including MAPK and PI3K/AKT are altered, reducing glutamatergic synaptic transmission
including NME1/2. When LHPP levels fall — whether from reduced expression due to the T allele or from chronic stress itself suppressing LHPP — histidine phosphorylation rises unchecked, weakening glutamatergic synaptic transmission in exactly the circuits that should be most resilient.

Lin et al. (2023)66 Lin et al. (2023)
Lin D et al. LHPP, a risk factor for major depressive disorder, regulates stress-induced depression-like behaviors through its histidine phosphatase activity. Molecular Psychiatry, 2023
showed in mice that mPFC LHPP deletion alone produced no spontaneous depression-like behavior — but on exposure to chronic social defeat stress, LHPP-knockout mice showed dramatically augmented behavioral deficits. Crucially, re-introducing wild-type LHPP reversed the deficits, while a phosphatase-dead LHPP mutant did not, confirming the enzymatic activity is required. A parallel mechanism runs through astrocytes: Sha et al. (2023)77 Sha et al. (2023)
Sha L et al. LHPP-mediated inorganic pyrophosphate hydrolysis-driven lysosomal acidification in astrocytes regulates adult neurogenesis. Cell Reports, 2023
showed that LHPP normally drives lysosomal acidification88 lysosomal acidification
Lysosomes require an acidic environment (pH ~5) to function as the cell's recycling centers. In astrocytes, LHPP hydrolyzes inorganic pyrophosphate to fuel this acidification. Without it, lysosomal function is impaired, the C/EBPβ transcription factor accumulates, and downstream chemokine signaling promoting neurogenesis is triggered
in astrocytes; in its absence, the C/EBPβ pathway triggers compensatory adult neurogenesis that confers stress resilience. This counterintuitive finding (less LHPP → more resilience in knockout mice) may reflect that the full-knockout model removes a gene globally, while the rs35936514 T allele reduces rather than abolishes LHPP expression in specific circuits.

In the ventral hippocampus, a complementary mechanism operates: Zhuang et al. (2024)99 Zhuang et al. (2024)
Zhuang L et al. LHPP in glutamatergic neurons of the ventral hippocampus mediates depression-like behavior by dephosphorylating CaMKIIα and ERK. Biological Psychiatry, 2024
found that LHPP levels increase in glutamatergic neurons during stress-induced depression, and that knocking out LHPP in these neurons enhanced spontaneous activity and stress resilience via the CaMKIIα/ERK pathway1010 CaMKIIα/ERK pathway
CaMKII (calcium/calmodulin- dependent protein kinase II) and ERK (extracellular signal-regulated kinase) regulate synaptic plasticity and BDNF expression. LHPP dephosphorylates both, reducing their activity and impairing the BDNF/PSD95-mediated synaptic strengthening that underlies mood resilience
. Esketamine — but not fluoxetine — reversed LHPP-induced depression-like behaviors, pointing toward glutamatergic rather than serotonergic modulation as the relevant therapeutic axis.

The Evidence

The CONVERGE study represents one of the cleanest GWAS findings in depression genetics. By restricting the sample to Han Chinese women with recurrent, severe MDD — reducing phenotypic and genetic heterogeneity — the consortium achieved genome-wide significance with just 10,640 individuals, far fewer than most psychiatric GWAS require. The LHPP signal (P = 6.45 × 10⁻¹²) was validated in an independent replication cohort. rs35936514 sits in intron 2 of LHPP and the flanking 3' UTR, and the T allele is associated with reduced LHPP expression in the prefrontal cortex.

At the neural circuit level, Cui et al. (2020)1111 Cui et al. (2020)
Cui L et al. Association of LHPP genetic variation (rs35936514) with structural and functional connectivity of hippocampal-corticolimbic neural circuitry. Brain Imaging and Behavior, 2020
examined 122 healthy participants and found that T-allele carriers showed increased hippocampal connectivity to the rostral anterior cingulate cortex and higher fractional anisotropy in the fornix — structural and functional signatures consistent with altered top-down emotional regulation. These brain connectivity differences were present without clinical depression, suggesting the risk phenotype is a continuous trait rather than a categorical diagnosis.

Earlier family-based work by Neff et al. (2009)1212 Neff et al. (2009)
Neff CD et al. Evidence for HTR1A and LHPP as interacting genetic risk factors in major depression. Molecular Psychiatry, 2009
found a chromosome 10 linkage peak (HLOD = 4.4) in Utah pedigrees with familial depression and identified disease-segregating LHPP SNPs in the linkage region. The LHPP effects in that study were contingent on HTR1A genotype, hinting at an interaction between serotonin receptor sensitivity and LHPP-mediated stress signaling.

Practical Implications

The LHPP mechanism converges on a clear biological theme: glutamatergic synaptic function under stress. T-allele carriers do not appear to have impaired mood at baseline — the risk emerges under chronic stress. This has direct implications for how carriers should think about their mental health strategy: minimizing the duration and intensity of chronic stressors matters more than managing acute stress reactions.

The esketamine finding from Zhuang et al. is clinically relevant: if depression does develop in T-allele carriers, the evidence points toward glutamate-targeting treatments (esketamine/ketamine) having particular mechanistic relevance versus standard SSRIs, which do not directly address the LHPP-CaMKII/ERK pathway.

Interactions

The Neff et al. (2009) data suggest a potential interaction between LHPP (rs35936514) and the HTR1A serotonin receptor gene. In that family-based study, the LHPP depression association was modulated by HTR1A genotype, suggesting that individuals carrying risk variants in both genes may have compounded vulnerability. The mechanistic intersection — serotonergic regulation of prefrontal excitatory tone — offers a plausible biological substrate.

The BDNF Val66Met variant (rs6265) is also relevant: since LHPP deficiency reduces CaMKII/ERK activity and thereby impairs BDNF/PSD95-mediated synaptic strengthening, T-allele carriers who also carry the BDNF Met allele (reduced activity-dependent BDNF secretion) may experience compounded vulnerability to stress-induced synaptic weakening in prefrontal and hippocampal circuits.

rs3850641

TNFSF4 TNFSF4 (OX40L) Intron 1 Variant

Moderate Risk Factor

OX40 Ligand and the Immune Gate of Atherosclerosis

TNFSF4 encodes OX40 ligand (OX40L, also called CD252), a co-stimulatory molecule of the TNF superfamily that serves as the key amplifier of T-cell activation in inflammatory settings. OX40L is expressed on antigen-presenting cells — particularly macrophages, dendritic cells, and vascular endothelial cells — and engages OX40 on activated CD4+ T cells to extend their survival, promote cytokine secretion, and drive clonal expansion. Within atherosclerotic plaques, OX40L expression is markedly elevated compared to healthy vascular tissue, driven by oxidized LDL and other inflammatory mediators. The rs3850641 variant sits in the first intron of TNFSF4, in strong linkage disequilibrium with a promoter polymorphism (rs45454293) that regulates OX40L gene expression.

The Mechanism

The functional variant linked to rs3850641 is rs4545429311 rs45454293
A promoter SNP in LD with rs3850641 at r²=0.70; the T-allele reduces TNFSF4 transcription by ~50% compared to the C-allele in luciferase reporter assays, attributed to binding of a transcriptional repressor
. The rs3850641 SNP itself has no direct effect on promoter activity, but because both SNPs travel together on the same haplotype block — the TG haplotype combining rs45454293T and rs3850641G — the G allele at rs3850641 tags a state of reduced OX40L expression in vascular tissue.

Paradoxically, lower OX40L expression appears to increase cardiovascular risk. This may reflect an immunoregulatory role: OX40L signaling is also required for the proper differentiation and persistence of regulatory T cells (Tregs). Disruption of OX40L in the plaque microenvironment could impair Treg-mediated suppression of inflammatory effector T cells, allowing uncontrolled T-cell–driven inflammation to accelerate plaque growth and destabilization. Consistent with this, anti-OX40L antibody blockade in LDL receptor-deficient mice22 anti-OX40L antibody blockade in LDL receptor-deficient mice
Guo et al., Arterioscler Thromb Vasc Biol 2007 — anti-OX40L antibody reduced atherosclerotic lesion area by approximately 53% in Ldlr−/− mice by attenuating Th2-mediated responses and preserving anti-oxLDL IgM antibodies
attenuated atherogenesis, pointing to the pathway as a therapeutic target rather than a simple risk amplifier.

Elevated soluble OX40L (sOX40L) in circulation — released primarily from activated platelets — correlates with acute coronary syndrome severity, plaque instability, and elevated matrix metalloproteinase (MMP-9, MMP-3) levels, reinforcing OX40L as a biomarker of active plaque inflammation.

The Evidence

The variant was first implicated by Wang et al. in Nature Genetics 200533 Wang et al. in Nature Genetics 2005
Positional identification of TNFSF4, encoding OX40 ligand, as a gene that influences atherosclerosis susceptibility
, which used positional cloning from the murine atherosclerosis susceptibility locus Ath1 to identify TNFSF4 as a human candidate gene. The rs3850641 G allele was significantly more frequent in MI cases than controls in two independent Swedish cohorts. Subsequent work by Cunningham et al. PLoS ONE 201144 Cunningham et al. PLoS ONE 2011
A Common Polymorphism in the Promoter Region of the TNFSF4 Gene Is Associated with Lower Allele-Specific Expression and Risk of MI
identified rs45454293 as the functional promoter variant in LD with rs3850641 (r²=0.70), with the TG haplotype significantly enriched in female MI patients vs controls in the SCARF cohort (P=0.01), and demonstrated 50% reduced OX40L promoter activity for the T-allele.

A Chinese Han case-control study (Li et al. Oncotarget 2018, PMID 29921578 — 454 cases, 512 controls)55 (Li et al. Oncotarget 2018, PMID 29921578 — 454 cases, 512 controls) found that homozygous GG carriers had significantly elevated MI risk (crude OR 2.00, 95% CI 1.04–3.86, P=0.039; adjusted OR 2.29, 95% CI 1.20–4.69, P=0.023), while the AG heterozygous genotype conferred no significant independent risk under additive or dominant models.

However, larger meta-analyses show overall null results in aggregate populations. A meta-analysis of 9 studies (Fu et al. 2016, PMID 27008001)66 meta-analysis of 9 studies (Fu et al. 2016, PMID 27008001) found allelic OR 1.10 (95% CI 0.96–1.27, P=0.174), and a systematic review of 11 studies covering 3,865 cases and 6,344 controls (Lu et al. 2018, PMID 29424751)77 systematic review of 11 studies covering 3,865 cases and 6,344 controls (Lu et al. 2018, PMID 29424751) found OR 1.02 (95% CI 0.89–1.17) for the G allele — no overall association. These null findings likely reflect population stratification (original signal primarily in European women), heterogeneity across disease definitions (CHD vs. MI specifically), and statistical power limitations for the rare GG genotype (only ~2% of individuals).

The GG-specific risk signal (recessive model OR ~2) remains coherent across the positive studies and is mechanistically plausible; the overall null meta-analysis findings reflect the rarity of GG homozygotes diluting allele-level effects. Evidence is classified as moderate — replicated in multiple independent studies under the recessive model, with a plausible regulatory mechanism, but no established clinical guidelines and significant heterogeneity across populations.

Practical Actions

For GG homozygotes, the elevated MI risk under the recessive model (OR ~2) warrants proactive cardiovascular monitoring. The risk appears particularly pronounced in women and in populations of East Asian and South Asian descent where OX40L pathway activity in the context of this haplotype may interact differently with other risk factors. Tracking inflammatory biomarkers such as hs-CRP and sOX40L alongside standard lipid panels provides the most relevant picture of vascular inflammatory burden for this genotype.

Interactions

rs3850641 is in linkage disequilibrium with the functional promoter SNP rs45454293 (r²=0.70); the TG haplotype combining both minor alleles carries the highest risk signal in women. rs2205960 and other upstream TNFSF4 promoter region SNPs (rs10912580, rs12039904, rs1234317) form a distinct haplotype block associated with systemic lupus erythematosus susceptibility, illustrating that TNFSF4 haplotypes have different disease consequences depending on which part of the regulatory region they tag.

Individuals carrying both rs3850641 GG and cardiovascular-risk variants in classical inflammatory pathways (e.g., IL-6 signaling genes, CRP variants) may have compounded atherosclerotic risk through synergistic immune activation in the vascular wall — an interaction that has not been formally studied but is mechanistically plausible given the T-cell co-stimulatory nature of the OX40L pathway.

MMEL1-TNFRSF14 — A Regulatory Checkpoint Between Peptide Processing and Immune Costimulation

On the short arm of chromosome 1, at position 1p36, two genes sit in close proximity with distinct but complementary roles in immune homeostasis: MMEL1, encoding a neprilysin-like metalloendopeptidase, and TNFRSF14, encoding HVEM11 HVEM
Herpes Virus Entry Mediator, also known as CD270 — a TNF receptor superfamily member that regulates both activating and inhibitory T-cell signals
. The rs3890745 intronic SNP22 SNP
Single nucleotide polymorphism — a single-letter DNA difference that varies between people; this one lies within MMEL1 but is in linkage disequilibrium with regulatory elements affecting both neighboring genes
has been consistently associated with rheumatoid arthritis susceptibility in multiple genome-wide association studies. The C allele, carried at approximately 32% frequency in Europeans but up to 53% in African populations, tags a risk haplotype at this locus.

The Mechanism

rs3890745 falls within an intron of MMEL1 (membrane metalloendopeptidase like 1), also known as NEP2 or neprilysin II. MMEL1 is a zinc-dependent neutral endopeptidase that cleaves bioactive peptides including neuropeptides and inflammatory mediators — a function that positions it as a regulator of local peptide signaling in immune and neural contexts. MMEL1 is expressed broadly, with highest levels in testis and small intestine, and weaker expression in brain, kidney, and immune tissues.

The more immunologically prominent neighbor is TNFRSF14 (HVEM/CD270). HVEM operates as a dual-function immune checkpoint: it can deliver activating signals to T cells via binding LIGHT33 LIGHT
TNFSF14, a ligand expressed on activated T cells that delivers costimulatory signals through HVEM, promoting T-cell proliferation and cytokine production
, but it also transmits inhibitory signals through BTLA44 inhibitory signals through BTLA
B- and T-lymphocyte attenuator — an inhibitory receptor expressed on lymphocytes that signals "stop" when bound to HVEM, analogous to CTLA-4 and PD-1
. This bidirectional control makes HVEM a critical rheostat in peripheral immune tolerance. Disruption of the HVEM/BTLA inhibitory axis has been implicated in multiple autoimmune conditions, including rheumatoid arthritis, celiac disease, and inflammatory bowel disease.

As an intronic variant, rs3890745 does not change the MMEL1 protein sequence. It is presumed to act as a regulatory tag SNP in linkage disequilibrium with functional variants that alter expression levels of MMEL1, TNFRSF14, or both in immune cell populations. Expression quantitative trait locus (eQTL) data from immune cells would be required to identify the precise causal variant; the rs3890745 association most likely reflects a haplotype-level regulatory effect spanning both genes.

The Evidence

The MMEL1-TNFRSF14 locus was first established as an RA susceptibility region in a 2008 genome-wide association meta-analysis55 genome-wide association meta-analysis
Study design combining multiple GWAS cohorts to increase statistical power — more samples yield more reliable effect estimates
by Raychaudhuri and colleagues (Nature Genetics), which analyzed 3,393 cases and 12,462 controls in the discovery phase and 3,929 anti-CCP-positive RA cases and 5,807 controls in replication. The rs3890745 signal at this locus reached an overall p-value of 1.1×10⁻⁷, placing it among confirmed RA susceptibility loci alongside CD40, KIF5A, and CDK6.

The association was replicated in the large trans-ethnic RA GWAS meta-analysis66 large trans-ethnic RA GWAS meta-analysis
Trans-ethnic meta-analysis pooling cohorts of European, East Asian, and mixed ancestry — improves fine-mapping precision by narrowing LD blocks
by Okada et al. (Nature, 2014), which confirmed an odds ratio of 1.18 (95% CI 1.10–1.27) for the risk allele at this locus. An independent GWAS meta-analysis by Stahl et al. (Nature Genetics, 2010)77 Stahl et al. (Nature Genetics, 2010)
Discovery phase: 5,539 RA cases and 20,169 controls from European populations; replication phase: 6,768 cases and 8,806 controls
confirmed the MMEL1 locus at p=4×10⁻⁶. A 2019 cross-trait GWAS analysis88 2019 cross-trait GWAS analysis
Examining shared genetic architecture between autoimmune diseases and non-Hodgkin lymphomas using regional overlap and polygenic risk scores
identified rs3890745 with OR 1.14 at genome-wide significance (p=2×10⁻⁸) in a combined RA/DLBCL analysis, supporting shared immune-regulatory genetic architecture across B-cell-driven conditions.

Beyond RA, variants at the MMEL1-TNFRSF14 locus have been associated with multiple sclerosis and primary biliary cirrhosis in separate GWAS, consistent with the broad role of HVEM-mediated immune costimulation in peripheral tolerance across different autoimmune phenotypes.

The per-allele odds ratio of approximately 1.12–1.18 classifies this as a moderate-effect susceptibility locus. This is consistent with the polygenic architecture of RA, where dozens of loci each contribute small incremental risk — the clinical impact of rs3890745 alone is modest, but it gains significance when combined with high-risk HLA-DRB1 shared epitope alleles or PTPN22 R620W.

Practical Actions

For C allele carriers, particularly CC homozygotes, the priority is early recognition of seropositive RA — the anti-CCP (ACPA)-positive, erosive subtype with the strongest genetic component. Because rs3890745 tags shared autoimmune genetic architecture, individuals with joint symptoms, unexplained morning stiffness, or a strong family history of RA should prioritize early autoantibody testing. Vitamin D3 supplementation has demonstrated a 22% reduction in incident autoimmune disease in the VITAL RCT and acts as a direct NF-kB and T-cell regulatory modulator.

Interactions

rs3890745 sits in the same chromosomal neighbourhood as a broader 1p36 autoimmune risk region. TNFRSF14/HVEM interacts directly with BTLA on T cells and B cells — variants in the BTLA locus (rs9288952) have been associated with RA in some GWAS, suggesting a potential compound effect through the HVEM-BTLA inhibitory axis. MMEL1 has homology with neprilysin (NEP/CD10), which processes inflammatory neuropeptides; the folate-methylation pathway (MTHFR, rs1801133) intersects with DNA methylation patterns at immune loci that regulate TNFRSF14 expression.

The strongest RA risk loci that may compound with rs3890745 include PTPN22 rs2476601 (T-cell signaling threshold), TRAF1-C5 rs10818488 (NF-kB regulation), and HLA-DRB1 shared epitope alleles (antigen presentation). These operate through distinct pathways and their combined carriage incrementally raises absolute RA risk.

FCGR3A V158F — The NK Cell Affinity Switch

Your body's natural killer (NK) cells carry a surface receptor called FcgammaRIIIa (CD16a)11 FcgammaRIIIa (CD16a)
The primary Fc receptor on NK cells that binds the constant region of IgG antibodies, triggering antibody-dependent cellular cytotoxicity
that connects innate immunity to antibody-mediated defense. This receptor grabs the tail end of IgG antibodies already bound to target cells — infected cells, cancer cells, or cells flagged for destruction — and activates the NK cell to kill. The V158F variant (rs396991) changes a single amino acid in the IgG-binding domain of this receptor, creating two versions with dramatically different binding affinities22 dramatically different binding affinities
The 158V isoform binds IgG1 and IgG3 with approximately 2-fold higher affinity than the 158F isoform
. This difference matters both for natural immune surveillance and, critically, for how well monoclonal antibody therapies work.

Genotyping accuracy warning: FCGR3A shares over 98% sequence homology33 over 98% sequence homology
Only four nucleotides differ between FCGR3A and FCGR3B in the genotyped region
with its neighboring gene FCGR3B. This extreme similarity can cause consumer genotyping chips and some research assays to inadvertently read FCGR3B sequence instead of FCGR3A, producing incorrect genotype calls. Validated TaqMan assays show 100% accuracy in European and Asian populations but 7.8% error rate in African populations and 1.1% in admixed American populations44 7.8% error rate in African populations and 1.1% in admixed American populations. If your result seems inconsistent with clinical observations, consider confirmatory testing with a gene-specific assay.

The Mechanism

The FCGR3A gene encodes a transmembrane glycoprotein expressed primarily on NK cells, macrophages, and some T-cell subsets. The V158F polymorphism occurs in the second extracellular immunoglobulin-like domain55 second extracellular immunoglobulin-like domain
This domain directly contacts the CH2 region of IgG, and the amino acid at position 158 sits at the binding interface
, precisely where IgG makes contact. Valine at position 158 (encoded by the C allele on the plus strand) creates a receptor that binds IgG1 and IgG3 with approximately two-fold higher affinity than phenylalanine at the same position (A allele). This translates directly into enhanced antibody-dependent cellular cytotoxicity (ADCC)66 enhanced antibody-dependent cellular cytotoxicity (ADCC)
ADCC is the process by which NK cells kill antibody-coated target cells; higher receptor affinity means more efficient target recognition and killing
.

The functional hierarchy is clear: V/V homozygotes show the strongest ADCC activity, V/F heterozygotes are intermediate, and F/F homozygotes have the weakest response. This gradient affects both natural immune surveillance against infected or abnormal cells and the therapeutic efficacy of monoclonal antibodies that depend on ADCC as their mechanism of action.

The Evidence

Monoclonal Antibody Therapy

The pharmacogenomic significance of V158F was first demonstrated in follicular lymphoma patients treated with rituximab77 follicular lymphoma patients treated with rituximab
Cartron et al. showed V/V patients achieved significantly higher molecular response rates to rituximab monotherapy than V/F or F/F patients
. For trastuzumab in HER2-positive breast cancer, a study of Egyptian patients88 study of Egyptian patients
V/V genotype present in 29.6% of responders vs 8.4% of non-responders; median progression-free survival 22 months for V/V vs 6 months for F/F (p=0.003)
found V/V carriers had significantly better overall survival and response rates. In follicular lymphoma treated with antibody-chemotherapy combinations99 follicular lymphoma treated with antibody-chemotherapy combinations
SWOG trials showed patients with at least one V allele had better overall survival than F/F patients when treated with antibody-chemotherapy combinations
, V allele carriers showed improved outcomes specifically in the antibody-containing treatment arms.

However, results are not universally consistent. A large randomized study in follicular lymphoma1010 A large randomized study in follicular lymphoma
Analysis of 321 patients found no FCGR genotype predicted initial response to rituximab or rituximab-chemotherapy combinations
found no predictive value, highlighting that tumor biology, immune microenvironment, and combination chemotherapy may modulate the receptor's influence. The evidence is strongest for rituximab monotherapy and weakens when combined with intensive chemotherapy regimens.

Anti-TNF Therapy in Inflammatory Bowel Disease

In Crohn's disease, V/V carriers showed 100% biological response to infliximab compared to 69.8% of F carriers1111 V/V carriers showed 100% biological response to infliximab compared to 69.8% of F carriers, consistent with enhanced ADCC against TNF-expressing cells. Paradoxically, V/V carriers also show faster infliximab clearance and higher anti-drug antibody rates1212 faster infliximab clearance and higher anti-drug antibody rates
37.5% of V/V patients developed anti-drug antibodies vs 10.6% for V/F and 5% for F/F (OR 6.08)
— the same high-affinity receptor that makes the drug work better also accelerates its elimination. This creates a clinical dilemma: V/V patients may respond better initially but need closer therapeutic drug monitoring and potentially dose optimization to maintain response.

Autoimmune Disease Susceptibility

Meta-analyses link V158F to systemic lupus erythematosus susceptibility1313 Meta-analyses link V158F to systemic lupus erythematosus susceptibility
FCGR3A rs396991 shows association with SLE in recessive model (OR 1.26, p = 9.62 x 10-5)
and lupus nephritis risk, particularly in non-European populations. The enhanced IgG binding by the V allele may increase immune complex-mediated tissue damage in autoimmune conditions. In rheumatoid arthritis, the association is weaker and may interact with HLA shared epitope status1414 HLA shared epitope status
The V allele may predispose shared epitope-positive individuals to RA
.

Practical Implications

The clinical significance of V158F depends heavily on context. For individuals who may receive monoclonal antibody therapy (rituximab for lymphoma, trastuzumab for breast cancer, cetuximab for colorectal cancer), knowing your genotype could inform treatment expectations and potentially guide therapeutic decisions. F/F carriers receiving these therapies may benefit from higher doses, more frequent administration, or combination approaches that don't rely solely on ADCC.

For individuals on anti-TNF therapy (infliximab, adalimumab) for inflammatory bowel disease or rheumatoid arthritis, V/V carriers should be aware of higher anti-drug antibody risk and may benefit from proactive therapeutic drug monitoring rather than reactive testing only when treatment appears to be failing.

For all carriers of the F allele, strategies that support NK cell function take on added importance since the lower-affinity receptor means each NK cell-target interaction is less efficient. While the receptor affinity is genetically fixed, NK cell number and activation state are modifiable.

Interactions

FCGR3A V158F interacts with FCGR2A rs1801274 (H131R)1515 FCGR2A rs1801274 (H131R)
Another Fc gamma receptor polymorphism affecting IgG binding, located on the same chromosome; combined low-affinity genotypes at both loci may compound reduced ADCC
. In DLBCL, FCGR2A was the primary driver of survival differences — FCGR3A was not independently associated with DLBCL survival — while FCGR3A showed predictive value in follicular lymphoma settings. In autoimmune disease, the combined effect of low-affinity alleles at both FCGR3A and FCGR2A may influence susceptibility and treatment response to antibody-based therapies. Enhancer SNPs rs4656317 and rs12071048 within FCGR3A are in strong linkage disequilibrium with rs396991 and influence NK cell ADCC through transcriptional regulation of CD16a expression levels, potentially modifying the functional impact of V158F.

BTD Pro167Ser — A Rare Pathogenic Allele in the Biotin Recycling Enzyme

Biotin (vitamin B7) is indispensable for four carboxylase enzymes11 carboxylase enzymes
pyruvate carboxylase, propionyl-CoA carboxylase, 3-methylcrotonyl-CoA carboxylase, and acetyl-CoA carboxylase — all require biotin as a covalently attached cofactor to function
that power fat synthesis, amino acid metabolism, and gluconeogenesis. Unlike most vitamins, biotin is largely recovered from food proteins rather than absorbed free — the digestive breakdown of biotin-dependent enzymes in the diet releases biocytin22 biocytin
biotin-ε-lysine, the product of proteolytic digestion of holocarboxylases and other biotinylated proteins
, which must then be cleaved back to free biotin before the body can reuse it. The enzyme responsible for this cleavage is biotinidase, encoded by the BTD gene on chromosome 3p25.1.

The rs397507173 variant introduces a c.499C>T nucleotide change that substitutes serine for proline at position 167 of the biotinidase protein (p.Pro167Ser). ClinVar records two "Likely pathogenic" submissions for this allele in the context of biotinidase deficiency, with a single "Uncertain significance" submission also on record (VCV003339734; conflicting interpretations status). The variant is absent from more than 250,000 control chromosomes in population databases, consistent with a rare disease allele. It was first documented in patients with biochemically confirmed biotinidase deficiency by Iqbal et al.33 Iqbal et al.
Iqbal F et al. The identification of novel mutations in the biotinidase gene using denaturing high pressure liquid chromatography (dHPLC). Mol Genet Metab, 2010
.

The Mechanism

Biotinidase is a member of the nitrilase superfamily. It cleaves the amide bond between biotin and lysine in biocytin, regenerating free biotin for re-attachment to newly synthesized apo-carboxylases. Without functional biotinidase, biocytin accumulates in urine (biotinuria), free biotin concentrations fall, and all four biotin-dependent carboxylases progressively lose activity. The proline at position 167 lies within the enzyme's catalytic domain; replacement with serine is predicted to disrupt local protein folding and reduce or abolish catalytic activity, consistent with ClinVar's pathogenicity assessment and in-silico tools predicting a damaging effect.

Biotinidase deficiency is classified by residual serum enzyme activity: profound deficiency (<10% of mean normal activity) causes severe neurological disease if untreated; partial deficiency (10–30% of mean normal) produces milder or stress-triggered symptoms. Heterozygous carriers typically retain ~50% of normal activity — sufficient for health under ordinary conditions but relevant for compound heterozygosity risk assessment.

The Evidence

The clinical significance of biotinidase deficiency is well established44 well established
Wolf B. Biotinidase Deficiency. GeneReviews, 2000 (updated 2026)
: untreated profound deficiency causes seizures, hypotonia, developmental delay, optic atrophy, hearing loss, and alopecia, typically presenting in the first weeks to years of life. Biotin supplementation is curative if started before irreversible neurological damage occurs; some deficits (optic atrophy, sensorineural hearing loss) may not fully reverse even with treatment. Partial deficiency can remain asymptomatic for years but cause symptoms under physiological stress — intercurrent illness, pregnancy, or periods of increased biotin turnover.

The rs397507173 Pro167Ser allele itself has been reported in individuals with confirmed biotinidase deficiency, including at least one homozygous case (ClinVar record). Its rarity — absent in 251,442 control chromosomes across gnomAD datasets — and the in-silico evidence of functional disruption support the "likely pathogenic" classification. The conflicting "uncertain significance" submission likely reflects the limited published functional data specific to this single variant, rather than any evidence of benignity.

Newborn screening programs that test biotinidase activity on dried blood spots identify profound deficiency at approximately 1 in 137,000 births and combined (profound + partial) deficiency at 1 in 61,000 births55 1 in 137,000 births and combined (profound + partial) deficiency at 1 in 61,000 births
Wolf B, GeneReviews 2026
. BTD is a required target on expanded newborn screening panels in the United States and many European countries precisely because early biotin supplementation is inexpensive, safe, and prevents irreversible harm.

Practical Actions

Heterozygous carriers (CT genotype) have sufficient biotinidase activity for normal health and do not require supplementation. The primary clinical relevance is reproductive: if both parents carry a pathogenic BTD allele, each pregnancy has a 25% probability of inheriting biallelic pathogenic variants and presenting with biotinidase deficiency. Carrier couples should receive genetic counseling before or during pregnancy.

Individuals who carry Pro167Ser in compound heterozygosity with a second pathogenic BTD variant (particularly the common profound-deficiency alleles) should have biotinidase enzyme activity measured to determine their actual activity level and need for supplementation.

For homozygous Pro167Ser (TT) individuals — an exceptionally rare genotype not yet documented in population databases — the expected clinical scenario is profound biotinidase deficiency requiring immediate and lifelong biotin supplementation, as for all confirmed profound cases.

Interactions

The key interactions for BTD pathogenic alleles involve compound heterozygosity: carrying one Pro167Ser allele together with a second BTD pathogenic variant on the other chromosome results in biallelic loss of function, producing partial or profound deficiency depending on the combined residual activity. The most common co-occurring allele in many populations is p.Asp444His (associated with partial deficiency), and the most common severe allele is the c.98_104delinsTCC frameshift. When Pro167Ser is paired with a severe loss-of-function allele, the phenotype is expected to be profound deficiency; paired with the milder p.Asp444His, the combined activity typically falls in the partial deficiency range (approximately 10–25% of normal). Any individual found to carry Pro167Ser should have their partner screened for BTD pathogenic variants before or during pregnancy, and any child of two carriers should receive biotinidase activity testing as part of newborn screening.

MYBPC3 Gly490Val — A Recessive Sarcomere Variant with Malignant Homozygous Phenotype

Cardiac myosin-binding protein C (cMyBP-C), encoded by MYBPC3 on chromosome 11, is a thick-filament accessory protein that accounts for roughly 2% of myofibrillar protein mass11 roughly 2% of myofibrillar protein mass
cMyBP-C spans the C-zone of each sarcomere half and contacts both myosin and actin
. Its C-terminal domains anchor it to the thick filament while its N-terminal domains regulate cross-bridge cycling — slowing myosin-actin interaction at rest and permitting contraction when phosphorylated by protein kinase A during adrenergic stimulation. MYBPC3 is the single most commonly mutated gene in hypertrophic cardiomyopathy (HCM), a condition marked by asymmetric left ventricular hypertrophy, diastolic dysfunction, and in severe cases, sudden cardiac death.

Most pathogenic MYBPC3 variants cause HCM through autosomal dominant haploinsufficiency — one defective copy is sufficient to disrupt sarcomere stoichiometry. The Gly490Val variant (rs397514752, c.1469G>T on the coding strand) is an important exception: it behaves recessively, causing disease only when both copies of MYBPC3 are affected.

The Mechanism

Glycine at position 490 falls within the C3 immunoglobulin-like domain22 C3 immunoglobulin-like domain
MYBPC3 contains 11 domains (C0-C10); C3 is an Ig-like domain in the middle segment of the protein
of cMyBP-C. Glycine residues are structurally unique — their lack of a side chain allows tight turns and compact beta-strand arrangements that larger amino acids cannot adopt. Substituting the bulky valine disrupts this local fold. The conserved glycine at position 490 is present across vertebrate species, underscoring its structural importance.

In heterozygotes, one functional copy of MYBPC3 appears sufficient to maintain normal sarcomere stoichiometry, explaining why carriers remain phenotypically normal. In homozygotes, with no wild-type cMyBP-C produced, the sarcomere cannot regulate cross-bridge cycling properly, leading to the disorganized myocyte architecture (myofibre disarray) characteristic of severe HCM.

The Evidence

The variant was first reported in 2013 by Wang Y and colleagues33 first reported in 2013 by Wang Y and colleagues
PLoS One, n=1 pedigree, 2 affected homozygous siblings, 5 unaffected heterozygous carriers, 376 controls
in a consanguineous Chinese family. Both homozygous brothers presented with classic HCM: maximum wall thickness 17-18 mm, asymmetric ventricular hypertrophy on cardiac MRI, and diffuse repolarization changes with large negative T waves. None of the five adult heterozygous family members — including one aged 71 years — showed any clinical evidence of HCM by echocardiography or ECG. The variant was absent from 376 Chinese controls and public variant databases at the time of publication.

The recessive inheritance pattern at this site contrasts sharply with an adjacent variant, Gly490Arg, which causes dominant HCM. This domain-specific behavior illustrates how even nearby amino acid substitutions can produce fundamentally different inheritance modes depending on their effect on protein function versus stability.

Recessive MYBPC3 variants as a class are most dramatically illustrated by the Amish splice mutation, where homozygous infants developed lethal HCM requiring transplantation in the first year of life44 homozygous infants developed lethal HCM requiring transplantation in the first year of life
Zahka K et al. Heart 2008; all surviving homozygous infants required cardiac transplantation
, while heterozygous parents were unaffected. The Gly490Val variant appears to follow the same recessive pattern but with later onset and less catastrophic severity.

Evidence note: rs397514752 is reported in a single family (n=2 homozygous affected individuals). The evidence level is emerging — the recessive behavior is internally consistent and mechanistically plausible, but independent replication in additional families is needed before clinical management protocols can be established with confidence.

Practical Actions

For the overwhelming majority of people who carry this variant at all, the relevant question is whether they are heterozygous or homozygous. Heterozygous carriers have no current evidence of increased personal cardiac risk. Their clinical relevance lies in reproductive planning: if both members of a couple carry this variant, each pregnancy carries a 25% chance of producing a homozygous child with HCM.

Homozygous individuals warrant full cardiological evaluation given the documented severe phenotype.

Interactions

Because the pathogenic effect of rs397514752 is recessive, the key interaction is between the two copies of this variant in homozygotes — compound heterozygosity with a second MYBPC3 pathogenic variant on the other allele would likely produce a similar biallelic loss-of-function phenotype. Heterozygous carriers of rs397514752 who also carry a dominant MYBPC3 pathogenic variant on the opposite allele could theoretically have a modified phenotype, but no published evidence addresses this specific combination.

MUC1 Variant — Gastric Mucus Protection and H. Pylori Susceptibility

The MUC1 gene encodes mucin-1, a membrane-bound glycoprotein11 membrane-bound glycoprotein
MUC1 is a large transmembrane mucin that forms a protective barrier on the surface of gastric epithelial cells
that plays a crucial role in protecting the gastric lining from environmental insults, particularly the bacterium Helicobacter pylori (H. pylori)22 Helicobacter pylori (H. pylori)
H. pylori is the primary bacterial cause of gastric ulcers and gastric cancer, infecting about half the world's population
. The rs4072037 variant, though synonymous33 synonymous
A synonymous variant doesn't change the amino acid sequence but can affect mRNA splicing and gene expression
, significantly affects how effectively this protective barrier functions by influencing alternative splicing of the MUC1 gene.

The Mechanism

The rs4072037 variant (G>A) is located in exon 2 of the MUC1 gene at chromosome 1q22. Though it doesn't change the encoded amino acid (making it synonymous), the A variant disrupts normal splicing patterns, leading to production of a 27-nucleotide shorter transcript44 27-nucleotide shorter transcript
The T allele introduces an alternative splice site that removes 27 nucleotides from the mature mRNA
. This altered MUC1 protein has reduced ability to block H. pylori adhesion to gastric mucosa. The protective C allele maintains normal MUC1 structure, which more effectively blocks H. pylori adhesin binding (BabA and SabA), limiting bacterial colonization.

The Evidence

Multiple meta-analyses55 Multiple meta-analyses
Liu et al. Meta-analysis of 9 studies with 10,410 cases and 11,437 controls
have established that the C allele provides significant protection against gastric cancer, with an odds ratio of 0.70 (95% CI: 0.64-0.76). This protective effect is particularly strong in Asian populations, where the association reaches genome-wide significance. A larger meta-analysis66 larger meta-analysis
Gu et al. 17 studies with 12,551 cases and 13,436 controls
confirmed that rs4072037 is associated with decreased cancer risk, especially gastric cancer in Asian populations. An Iranian case-control study77 Iranian case-control study
Shekarriz et al. 99 gastric cancer patients and 98 controls
demonstrated a gene-environment interaction between rs4072037 genotype and H. pylori infection status in modulating gastric cancer risk.

The TT genotype is associated with approximately 2-fold increased gastric cancer risk compared to GG, and the effect is most pronounced for diffuse-type gastric cancer88 diffuse-type gastric cancer
Diffuse gastric cancer is a more aggressive subtype with worse prognosis
. Studies across multiple ethnic groups consistently show this pattern, though the effect size is larger in Asian populations (where gastric cancer and H. pylori rates are higher) than in Caucasians.

Practical Implications

If you carry one or two copies of the T allele, your gastric mucus barrier may be less effective at preventing H. pylori colonization. This doesn't mean you'll definitely develop problems, but it suggests increased vigilance around gastric health, particularly if you're in a region with high H. pylori prevalence. Consider testing for H. pylori infection if you experience persistent digestive symptoms, as early treatment can prevent progression to more serious conditions.

The protective C allele has a global frequency of approximately 45-46%, meaning that a substantial portion of the population benefits from enhanced natural defense against H. pylori. Those with CC genotype have the strongest protective effect, but even GA carriers show intermediate protection compared to AA individuals.

Interactions

The rs4072037 variant's effect is most pronounced in the presence of H. pylori infection, demonstrating a critical gene-environment interaction. The variant also exists in linkage disequilibrium99 linkage disequilibrium
Linkage disequilibrium means these variants are often inherited together as a block
with other MUC1 variants including rs2070803 and rs2075570, which also affect gastric cancer risk. The combined effect of multiple MUC1 variants may further modulate gastric mucosal protection, though individual risk from rs4072037 is well-established independent of other variants.

rs509035

GHSR GHSR Metabolic Syndrome Variant

Strong Risk Factor

The Ghrelin Receptor Variant That Shapes Lean Body Mass and IGF-1

Ghrelin does far more than signal hunger. When it docks onto GHSR-1a (growth hormone secretagogue receptor)11 GHSR-1a (growth hormone secretagogue receptor)
The canonical ghrelin receptor, expressed in the pituitary gland, hypothalamus, and across peripheral tissues. Named for its original discovery as the receptor mediating growth hormone secretion in response to synthetic GH secretagogues.
, it triggers a powerful pulse of growth hormone release from the pituitary. That GH pulse drives the liver to secrete IGF-1 (insulin-like growth factor 1)22 IGF-1 (insulin-like growth factor 1)
The primary mediator of GH's anabolic effects: stimulates muscle protein synthesis, promotes linear bone growth, and regulates fat metabolism. Circulating IGF-1 levels closely track GH secretion patterns over hours to days.
, and IGF-1 is the hormone that actually builds lean tissue — muscle, bone, connective tissue. rs509035 is an intronic variant within the GHSR gene itself, sitting at chromosome 3:172,445,659 (GRCh38). It does not change the receptor's amino acid sequence, but GWAS data from populations totalling hundreds of thousands of individuals shows that the A allele is associated with higher fat-free mass, greater adult stature, and elevated serum IGF-1 — the three expected downstream consequences of enhanced GHSR-mediated GH pulsatility.

The Mechanism

rs509035 sits within an intron of GHSR, likely in a regulatory element that influences how efficiently the gene is transcribed or the mRNA is processed. The exact molecular function has not been characterized by gel-shift or reporter assays (unlike the functionally annotated nearby promoter variant rs490683), so the mechanism is inferred from the downstream phenotype pattern rather than direct biochemical evidence. The convergence of three independent large GWAS signals — lean body mass, height, and IGF-1 — pointing to the same A allele strongly implies that A tracks with modestly elevated GHSR expression or activity, producing more frequent or larger GH pulses, more IGF-1, and consequently more lean tissue accretion over a lifetime.

Animal model support comes from studies of GHSR knockout mice33 GHSR knockout mice
Labarthe et al. 2022 (PMID 33774644): GHS-R1a deletion reduced pulsatile GH secretion, decreased linear growth, and reduced hypothalamic GHRH mRNA expression in males — demonstrating that GHSR is causally required for normal GH pulsatility and growth.
, which show reduced GH pulsatility and impaired linear growth when the receptor is absent. If loss of GHSR reduces lean mass and growth, it follows that variants that enhance GHSR signaling would move these phenotypes in the opposite direction — and rs509035-A appears to do exactly that at the population level.

The Evidence

The strongest evidence comes from a sex-stratified GWAS in the UK Biobank by Hübel et al. 201944 Hübel et al. 2019
Genomics of body fat percentage may contribute to sex bias in anorexia nervosa. Am J Med Genet B Neuropsychiatr Genet. 2019;180(3):222-235. N=155,961 healthy UK Biobank participants.
, which identified rs509035 as a genome-wide significant hit for fat-free mass (overall beta = 0.188, p = 2×10⁻¹⁵; males beta = 0.214, p = 1×10⁻⁸; females beta = 0.151, p = 5×10⁻⁹). The effect is present in both sexes but is modestly larger in men, consistent with the testosterone–GH axis interaction in lean mass regulation.

Height GWAS results from Wood et al. 201455 Wood et al. 2014
Nature Genetics, N=253,288 adults; the A allele (frequency 0.316) shows beta = 0.031 SD per allele, p = 3×10⁻²³.
confirm that the lean mass signal has a growth axis explanation — the same allele that promotes lean tissue accretion also produces slightly greater adult height.

The CHARGE Consortium IGF-1 meta-analysis by Teumer et al. 201666 Teumer et al. 2016
Aging Cell, N=30,884 European adults; rs509035 A allele associated with higher serum IGF-1, p = 2×10⁻⁸.
provides the mechanistic link: the A allele is associated with higher circulating IGF-1, which is the molecular effector translating GH pulsatility into lean tissue construction. Together, these three GWAS signals (fat-free mass, height, IGF-1) form a coherent picture of enhanced GHSR-to-GH-to-IGF-1 anabolic signaling in A allele carriers.

Glucose and insulin metabolism may also be modified. In the Finnish Diabetes Prevention Study (Mager et al. 2008, N=507), rs509035 was associated with several glucose and insulin measures during 3-year follow-up77 rs509035 was associated with several glucose and insulin measures during 3-year follow-up
Mager U et al. PLoS One. 2008;3(8):e2941. The paper studied 7 GHSR SNPs in overweight adults with impaired glucose tolerance; rs509035 and rs490683 were the two variants showing significant metabolic associations.
, though the specific direction and magnitude of the effect for rs509035 were not separately published in the abstract. IGF-1 itself has insulin-sensitizing effects at the receptor level, so the higher IGF-1 in A allele carriers may partially explain glucose metabolism differences.

Practical Actions

For GG individuals (the majority), the GHSR intronic variant confers typical lean mass accrual potential without the enhanced GH-axis signaling that A allele carriers appear to have. For those pursuing body composition goals, the GG genotype means that lean mass gains depend primarily on training stimulus and protein adequacy rather than a genetically elevated IGF-1 floor. Monitoring IGF-1 levels is more informative for AA individuals than GG individuals, as the GG baseline is uncharacterized relative to the A allele effect.

For A allele carriers (AG and AA), enhanced GHSR-mediated GH pulsatility confers a measurable lean mass and IGF-1 advantage. This is actionable in resistance training contexts: a higher IGF-1 floor generally means faster muscle protein synthesis recovery and potentially better hypertrophic response to resistance training. Adequate protein intake (1.6–2.2 g/kg body weight daily) is necessary to fully leverage the elevated anabolic signaling environment.

Interactions

rs509035 is an intronic variant in the same GHSR gene as the promoter variants rs490683 and rs9819506, which have independent associations with appetite drive and body weight change during dietary intervention. The functional NF-1 binding site variant rs490683 (promoter) affects ghrelin receptor expression level; rs509035 (intronic) appears to operate through a downstream mechanism influencing the GH/IGF-1 anabolic axis more than appetite per se. Individuals carrying rs490683-GG (elevated appetite drive) combined with rs509035-AA (elevated IGF-1) may have a phenotype characterized by both high appetite and efficient lean tissue accretion — the genetic profile of a mesomorphic body type that gains muscle readily but also experiences strong hunger during caloric restriction.