TGF-β1 R25P — When the Body's Scar-Maker Works Overtime

Transforming growth factor beta 1 (TGF-β1) is one of the most versatile signaling molecules in the cardiovascular system — it can stabilize arterial plaques, suppress excessive inflammation, and regulate the repair of damaged heart muscle. But when TGF-β1 production tips too high or its signaling becomes dysregulated, the same molecule drives fibrosis, atherosclerosis progression, and myocardial remodeling after injury. The rs1800471 variant (R25P, Arg25Pro) sits in the signal peptide of the TGFB1 gene on chromosome 19q1311 chromosome 19q13
NC_000019.10:g.41352971C>G; TGFB1 encodes the inactive pre-pro-protein, cleaved to the active dimer
, altering how efficiently the protein is threaded into the endoplasmic reticulum and ultimately secreted. The minor G allele (Pro25 on plus strand, coding C at position 915) influences TGF-β1 levels and has been linked to cardiovascular outcomes in multiple cohorts — though the direction and magnitude of risk are modest and population-dependent.

The Mechanism

The signal peptide is the N-terminal zip code that directs newly synthesized proteins into the secretory pathway. The Arg25Pro substitution changes a positively charged arginine to a neutral proline22 changes a positively charged arginine to a neutral proline
Proline introduces a rigid kink in the peptide backbone, potentially disrupting signal-peptide secondary structure and slowing translocation
in this critical region. Functional studies show that different haplotypes formed by the codon 10 (rs1800470, L10P) and codon 25 (rs1800471, R25P) variants stratify individuals into high, intermediate, and low TGF-β1 producers33 high, intermediate, and low TGF-β1 producers
Leukocyte in-vitro assays measuring TGF-β1 in cell supernatants after stimulation classify genotype combinations into production categories
. The GG coding genotype (homozygous Arg25, corresponding to plus-strand CC) is most commonly associated with highest TGF-β1 secretion capacity; the heterozygous state produces intermediate levels.

TGF-β1 has a dual and context-dependent role in the vasculature. At physiological levels it stabilizes atherosclerotic plaques44 stabilizes atherosclerotic plaques
TGF-β1 promotes smooth muscle cell contractility and fibrous cap formation, reducing plaque vulnerability to rupture
and suppresses macrophage-mediated inflammation. At chronically elevated levels, it drives myocardial and vascular fibrosis55 drives myocardial and vascular fibrosis
Activates cardiac fibroblasts, increases collagen deposition, promotes atrial and ventricular remodeling
— contributing to arrhythmia susceptibility, reduced cardiac compliance, and post-MI scarring.

The Evidence

The most comprehensive genetic evidence comes from a 2012 meta-analysis of 5 studies66 2012 meta-analysis of 5 studies
Qu et al., BMC Medical Genetics, 14,452 participants total
that found minor allele (coding C, plus-strand G) carriers had a pooled OR of 1.16 (95% CI 1.02–1.32) for coronary heart disease. The heterozygous GC (coding, plus-strand CG) genotype conferred OR 1.15 (1.01–1.31) versus homozygous major allele carriers. However, statistical significance disappeared after Egger's test correction for publication bias, highlighting the modest and population-heterogeneous nature of the association.

Orthogonal evidence from fibrotic and electrophysiological studies is more consistent. A study of essential hypertensive patients77 study of essential hypertensive patients
Chen et al., Journal of Biomedical Science 2010, n=259
found the GG coding genotype was significantly overrepresented in patients with both hypertension and atrial fibrillation versus hypertension alone (P=0.009). GG coding carriers had the highest serum TGF-β1 levels among AF-positive subjects, with a 3.09-fold elevated AF risk in this context. Since atrial fibrosis is the primary substrate for atrial fibrillation, this provides a mechanistic link from the genotype through TGF-β1 to cardiac remodeling.

A Russian cohort study of MI patients88 Russian cohort study of MI patients
Contribution of TGFB1 to MI susceptibility, PMC3408705
found the GG coding genotype (high producer) significantly enriched in MI cases versus controls (OR 1.76, 95% CI 1.05–2.97, P=0.048). A study of TGFB1 and coronary heart disease with angiographic confirmation99 TGFB1 and coronary heart disease with angiographic confirmation
Lamblin et al., PMID 16543493
similarly linked codon 25 variants to MI in patients with confirmed CHD. The Rotterdam Study (n=5,972)1010 Rotterdam Study (n=5,972)
Verhoeven et al., Thrombosis and Haemostasis 2006
found no association with MI, illustrating the population-specific heterogeneity that characterizes this SNP.

Additional context: rs1800471 has been associated with hypertension and CKD progression1111 hypertension and CKD progression
Polish cohort, Archives of Medical Science 2013
, consistent with the broader role of TGF-β1 in cardiovascular-renal fibrotic pathways.

Practical Actions

The cardiovascular implications of rs1800471 are primarily mediated through TGF-β1's effect on vascular inflammation and fibrosis. Carrying the minor G allele (plus strand) is associated with lower TGF-β1 production, and the heterozygous state shows modest CHD risk in aggregate analyses. Monitoring for early signs of cardiovascular disease and specific management of fibrosis-related risk factors (blood pressure, myocardial stiffness) are most relevant to this variant's biology.

Interactions

rs1800471 (codon 25) and rs1800470 (codon 10, L10P) are the two primary coding variants in the TGFB1 signal peptide. Together they form haplotypes that stratify TGF-β1 production capacity. The codon 10 high-producer T allele (rs1800470) and codon 25 G coding allele (rs1800471) combine into distinct HIGH, INTERMEDIATE, and LOW producer haplotypes. Carriers of both high-producer alleles across both codons show the strongest elevation in TGF-β1 secretion and likely the highest cumulative cardiovascular risk through fibrotic pathways. The rs1800469 promoter variant (-509 C/T) provides additional regulation of TGF-β1 transcription and should be interpreted alongside the coding variants.

FOXO3's Second Longevity Signal — A Distinct Regulatory Haplotype

FOXO3 is one of only two human genes — along with APOE — whose longevity associations have been consistently replicated across multiple independent populations and ancestries. The first FOXO3 signal, rs280229211 rs2802292
the well-characterized HSF1-binding enhancer variant in intron 2
, has been studied intensively since 2008. But FOXO3's longevity architecture is more complex: rs1935949 tags a second, independent haplotype block spanning 121 kilobases of FOXO3 intron 3 that independently contributes to centenarian enrichment through a distinct regulatory mechanism.

Pawlikowska et al. 200922 Pawlikowska et al. 2009
Association of common genetic variation in the insulin/IGF1 signaling pathway with human longevity. Aging Cell. 2009
identified rs1935949 among longevity-associated variants across three Caucasian cohorts — the Study of Osteoporotic Fractures, the Cardiovascular Health Study, and Ashkenazi Jewish Centenarians — finding a significant longevity association in women (OR = 1.35, 95% CI 1.15–1.57, adjusted p = 0.0093).

The Mechanism

rs1935949 sits in intron 3 of FOXO3, approximately 1.7 kilobase-pairs from an intron-exon boundary, and is in strong linkage disequilibrium (r² = 0.96) with rs4946935 — the functionally characterized variant in the same haplotype block. Flachsbart et al. 201733 Flachsbart et al. 2017
Identification and characterization of two functional variants in the human longevity gene FOXO3. Nat Commun. 2017
demonstrated that the longevity allele of rs4946935 creates an allele-dependent binding site for serum response factor (SRF) — a transcription factor distinct from the heat shock factor 1 (HSF1) that binds the rs2802292 site. In luciferase reporter assays, the longevity alleles at rs4946935 (and by proxy, rs1935949) show substantial enhancer activity that is specifically reversed by IGF-1 treatment.

This IGF-1 reversibility is mechanistically significant. The insulin/IGF-1 signaling (IIS) pathway44 insulin/IGF-1 signaling (IIS) pathway
the PI3K-AKT cascade that phosphorylates and excludes FOXO3 from the nucleus
is the primary evolutionary brake on FOXO3 activity: high IGF-1 levels drive AKT activation, which phosphorylates FOXO3 and traps it in the cytoplasm, preventing it from activating stress-response and longevity genes. The longevity allele at rs1935949's haplotype creates an enhancer that is most active when IGF-1 is low — exactly the dietary and metabolic condition under which FOXO3's protective functions matter most. This explains why the variant was first identified in a study of IGF-1 signaling pathway genetics and why its effects are magnified by caloric restriction and fasting contexts.

An eQTL database search confirms that the longevity alleles associated with the rs1935949/rs4946935 haplotype block are associated with higher FOXO3 mRNA expression across multiple human tissues, consistent with the enhancer model. The haplotype containing rs1935949 (haplotype block 1, alongside rs13217795, rs2764264, rs9400239, and rs9486902, D'>0.86 spanning 121 kb) is structurally distinct55 structurally distinct
the haplotype blocks were identified by LD analysis across 12 FOXO3 SNPs
from the block containing rs2802292, allowing independent additive effects on longevity.

The Evidence

Bao et al. 201466 Bao et al. 2014
Association between FOXO3A gene polymorphisms and human longevity: a meta-analysis. Asian J Androl. 2014
pooled five of its eleven studies that genotyped rs1935949, encompassing 1,435 cases and 2,098 controls, finding significant overall longevity association (OR = 1.14, 95% CI 1.01–1.27). Gender-stratified results suggested the overall association was not driven primarily by females in this cohort, though the sex-specific picture is complex and differs across studies.

The largest replication effort comes from Bae et al. 201877 Bae et al. 2018
Effects of FOXO3 polymorphisms on survival to extreme longevity in four centenarian studies. J Gerontol A Biol Sci Med Sci. 2018
, which analyzed 2,072 cases and 6,194 controls across the Long Life Family Study, New England Centenarian Study, Southern Italian Centenarian Study, and Longevity Genes Project. rs1935949 showed a meta-analysis beta of 0.18 (SE 0.044, p = 6.60×10⁻⁵), corresponding to approximately OR = 1.20 for extreme longevity (defined as surviving to the oldest 1 percentile of the 1900 U.S. birth cohort — ≥96 years for males, ≥100 for females). Centenarian enrichment of the protective A allele was observed in all four cohorts, though effect sizes varied.

Evidence is strongest in Caucasian populations. Studies in Chinese Han populations did not replicate the rs1935949 association, consistent with population-specific LD patterns and potentially different regulatory architectures across ancestries.

Practical Implications

The IGF-1-reversible nature of this haplotype's regulatory effect suggests a dietary angle that is more explicit than for rs2802292. Dietary interventions that lower circulating IGF-1 — particularly plant-protein-predominant diets and periodic fasting — reduce IGF-1-mediated suppression of FOXO3 at this enhancer. Caloric restriction extending maximum lifespan in model organisms is mechanistically connected to reduced IIS signaling and consequent FOXO3 activation; the rs1935949 haplotype may amplify this response.

The additive architecture of FOXO3 longevity signals is worth appreciating. Individuals who carry protective alleles at both rs1935949 and rs2802292 benefit from independent contributions to FOXO3 expression through distinct transcription factor binding sites (SRF at the rs4946935/rs1935949 haplotype; HSF1 at rs2802292). The two mechanisms respond to different cellular stresses — nutrient status for the IGF-1-sensitive haplotype, heat shock and oxidative stress for the HSF1 site — making them genuinely complementary.

Interactions

rs1935949 and rs2802292 are in different haplotype blocks across FOXO3 and have independent longevity effects. Individuals carrying protective alleles at both loci likely experience additive longevity benefit through distinct molecular mechanisms (SRF-mediated nutrient-sensing regulation vs. HSF1-mediated stress response regulation). A compound action is worth considering for individuals carrying beneficial alleles at both rsids, as the combined message — targeting both IGF-1 reduction and stress-response activation — is more specific than either alone.

rs4946935 (r² = 0.96 with rs1935949) is the functional proxy variant for the enhancer mechanism; individuals who have both in their genome data will likely have concordant results. rs13217795, also in haplotype block 1, has been separately shown to regulate an alternative FOXO3 promoter (FOXO3-TR isoform expression), adding another regulatory dimension to the same haplotype block.

MYL3 Arg94His — A High-Penetrance Sarcomeric Variant for Hypertrophic Cardiomyopathy

Your heart's pumping power comes from millions of sarcomeres11 sarcomeres
The sarcomere is the basic contractile unit of cardiac muscle. Each sarcomere contains an interdigitating lattice of thick (myosin) and thin (actin) filaments that slide past each other to produce contraction
working in concert. The MYL3 gene encodes the essential myosin light chain (ELC), a structural component of the myosin thick filament that stabilizes the lever arm region and regulates the force and speed of cardiac muscle contraction. The Arg94His substitution — a single amino acid change replacing arginine with histidine at position 94 of the ELC — disrupts this regulation in a way that causes the heart muscle to thicken abnormally, a condition called hypertrophic cardiomyopathy (HCM).

HCM is the most common inherited cardiac disease, affecting approximately 1 in 500 people overall. However, MYL3 mutations are a rare cause of HCM — responsible for less than 1% of all sarcomeric HCM cases. The Arg94His variant in particular has been documented in a small number of families worldwide and is classified as pathogenic by eight independent clinical laboratories22 pathogenic by eight independent clinical laboratories
ClinVar VCV000031777: submissions from LabCorp Genetics, Women's Health and Genetics/LabCorp, Ambry Genetics, GeneDx, Mass General Brigham, NIH All of Us, Color Diagnostics, and CHEO Genetics
.

The Mechanism

The MYL3 ELC wraps around the myosin heavy chain lever arm33 lever arm
The lever arm is the portion of myosin that amplifies the small conformational change at the catalytic domain into a large stroke movement that pulls actin filaments. Light chains stabilize this lever arm against mechanical stress
. Arginine-94 sits in a functionally critical region of the ELC. Replacing the positively charged arginine with histidine (which carries a smaller, uncharged side chain at physiological pH) is predicted to disrupt local protein folding and the ELC's interaction with the myosin heavy chain, altering the super relaxed state of myosin — a low-energy resting state that is disrupted in HCM-associated ELC mutations. The net result is hypercontractility of the sarcomere, leading over time to pathological left ventricular hypertrophy, diastolic dysfunction, and in some cases left ventricular outflow tract obstruction.

The Evidence

The primary evidence for MYL3 Arg94His comes from a Japanese family study44 Japanese family study
Nomura et al., J Cardiol, 2016 (N=7 family members, 5 affected; followed by registry screening of 600 HCM patients)
that used whole-exome sequencing combined with bioinformatic filtering to identify the variant. Among carriers in that family, disease penetrance was 88% — very high for an HCM gene. All clinically affected carriers showed asymmetric septal hypertrophy with a maximum left ventricular wall thickness of 18±3mm without outflow obstruction. Two additional carriers were found in a registry of 600 HCM patients, supporting that this is a genuine pathogenic variant rather than a private family mutation.

By contrast, other MYL3 variants (such as the p.Val79Ile variant55 p.Val79Ile variant
Andersen et al., Biochem Res Int, 2012 — nine heterozygous carriers in a Danish family
) show much lower penetrance (~40%) and late onset. This variability is consistent with the broader GeneReviews HCM data66 GeneReviews HCM data
Cirino et al., updated 2025 — MYL3 overall penetrance ~32%, the lowest among sarcomeric HCM genes when all variants are pooled
. The Arg94His variant appears to be at the higher end of MYL3 penetrance, based on the Nomura family data, though the evidence is limited to a small number of families. Eight independent clinical laboratories have independently classified it as pathogenic or likely pathogenic, providing strong multi-submitter validation.

In terms of population frequency, Arg94His is extremely rare: gnomAD v4 exomes identified only 7 T alleles among over 730,000 alleles examined globally, exclusively in European and Latino ancestry groups.

Practical Actions

For carriers of MYL3 Arg94His, the central priorities are: (1) confirm the diagnosis with a specialist, (2) establish regular cardiac surveillance, and (3) arrange cascade testing for first-degree relatives.

Unlike many common HCM genes (MYBPC3, MYH7) where genotype-specific prognosis is well-characterized, MYL3 Arg94His has been observed in relatively few families. Standard HCM management — echocardiography every 1–2 years, ambulatory ECG monitoring, exercise testing, and SCD risk stratification — applies. Drug therapy (beta-blockers, calcium channel blockers) addresses symptoms; septal reduction therapy (myectomy or alcohol ablation) is reserved for refractory obstructive cases. The newer selective cardiac myosin inhibitor mavacamten is approved for obstructive HCM and may be relevant if outflow obstruction develops.

Interactions

MYL3 variants have been observed to interact with other sarcomeric HCM genes. Individuals carrying two sarcomeric pathogenic variants (compound or digenic) show substantially worse outcomes — a hazard ratio of 7.5 for adverse events such as cardiac transplantation or ICD placement77 hazard ratio of 7.5 for adverse events such as cardiac transplantation or ICD placement
Cirino et al., GeneReviews, 2025 — based on registry data across sarcomeric HCM genes
compared to single-variant carriers. This means that if a carrier of MYL3 Arg94His also carries a pathogenic variant in MYBPC3, MYH7, TNNT2, or another sarcomeric gene, clinical severity is likely to be substantially greater.

MYL3 is an essential light chain (encoded by chromosome 3), distinct from the regulatory light chain encoded by MYL2. Variants in ACTC1 (cardiac actin, rs193922680) affect the thin filament partner of myosin and can produce overlapping HCM phenotypes through a different molecular mechanism.

ACADVL p.Gly289Arg — A Pathogenic Variant in the Long-Chain Fat Burning Engine

Every time your body burns fat for energy during fasting, sleep, or prolonged exercise, it relies on a chain of enzymes inside the mitochondria. The first and rate-limiting step for very long-chain fatty acids (14–20 carbons) is carried out by very-long-chain acyl-CoA dehydrogenase11 very-long-chain acyl-CoA dehydrogenase
VLCAD — encoded by ACADVL on chromosome 17p13. The enzyme sits on the inner mitochondrial membrane and initiates beta-oxidation of long-chain fatty acids, producing the acetyl-CoA and reduced electron carriers (FADH₂) that feed the Krebs cycle and electron transport chain
. This SNP, rs200788251, represents a c.865G>A transition that replaces the glycine at position 289 with the much bulkier, positively charged arginine (p.Gly289Arg). Glycine-289 sits in a structurally conserved region of the enzyme and is under strong evolutionary constraint; the substitution reduces VLCAD activity to approximately 15% of normal in vitro.

The Mechanism

VLCAD22 VLCAD
very-long-chain acyl-CoA dehydrogenase, a homodimeric flavoprotein that removes two hydrogen atoms from the acyl-CoA thioester, generating a trans-2-enoyl-CoA and transferring electrons to electron transfer flavoprotein (ETF) for entry into the respiratory chain
depends on precise folding of its active site for catalysis. The p.Gly289Arg substitution introduces a bulky, charged side chain where glycine's compact structure is required for proper protein folding. Functional studies demonstrate that fibroblasts from a compound heterozygous individual carrying the p.Gly289Arg allele alongside a second pathogenic variant showed no detectable enzyme activity and absent protein expression on western blot — indicating the substitution causes protein misfolding and accelerated degradation.

Because ACADVL operates as a homodimer, a single non-functional allele (the heterozygous carrier state) still produces enough functional enzyme dimers — with approximately 50% enzyme activity — to support normal fatty acid oxidation under ordinary dietary conditions. Disease emerges only when both alleles are compromised.

The Evidence

The variant is classified likely pathogenic by the ClinGen ACADVL Variant Curation Expert Panel33 likely pathogenic by the ClinGen ACADVL Variant Curation Expert Panel
ClinVar VCV000370981, 4-star expert panel review, last evaluated June 2023, based on ACMG/AMP classification criteria
. All twelve diagnostic laboratory submitters in ClinVar classify the variant as pathogenic or likely pathogenic. The glycine at codon 289 is conserved across vertebrates, computational tools (SIFT, PolyPhen-2) predict a deleterious effect, and in vitro functional studies confirm the consequence.

A US newborn screening study of 693 individuals44 US newborn screening study of 693 individuals
Pena et al. Recurrent ACADVL molecular findings in individuals with a positive newborn screen for very long chain acyl-coA dehydrogenase (VLCAD) deficiency in the United States. Mol Genet Metab, 2016
identified p.Gly289Arg on six alleles from affected individuals, establishing it among the recurrent pathogenic alleles seen in clinical practice in the United States.

VLCAD deficiency occurs in approximately 1 in 30,000–100,000 births. Expanded newborn screening with acylcarnitine profiling — measuring C14:1 acylcarnitine55 C14:1 acylcarnitine
tetradecadienoylcarnitine, the characteristic accumulation product of impaired VLCAD-mediated beta-oxidation; a C14:1 level above 1 µmol/L on dried blood spot strongly suggests VLCAD deficiency
— now identifies most affected individuals presymptomatically.

Three phenotypes are documented in homozygous or compound heterozygous individuals: - Severe neonatal form: cardiomyopathy (hypertrophic or dilated), pericardial effusion, arrhythmia, and metabolic crisis in the first weeks of life — historically associated with significant mortality. - Infantile hepatic form: hypoketotic hypoglycemia, hepatomegaly, and liver dysfunction, typically presenting during intercurrent illness. - Late-onset myopathic form: the most common presentation in the current NBS era — episodic rhabdomyolysis triggered by prolonged exercise, fasting, or illness; muscle cramps, pain, dark urine (myoglobinuria), and markedly elevated creatine kinase (CK).

Practical Actions

Heterozygous carriers (AG genotype) are clinically asymptomatic. The primary practical significance of carrier status is reproductive: if both parents carry ACADVL pathogenic variants, each child has a 25% risk of VLCAD deficiency. Partner carrier testing before or during pregnancy is the key action.

For individuals with biallelic pathogenic variants (AA genotype, or compound heterozygous for two different ACADVL pathogenic alleles), the consensus-based nutrition management guidelines66 consensus-based nutrition management guidelines
Vockley et al. 2021, Mol Genet Metab. PMID 33093005
recommend: avoidance of fasting (using age-appropriate maximum fasting intervals), restriction of long-chain fatty acid intake, MCT supplementation as an alternative fat source bypassing the enzymatic block, and carnitine supplementation to support acylcarnitine clearance. During exercise, consuming easily metabolized carbohydrates before and after activity significantly reduces rhabdomyolysis risk.

Interactions

This variant causes disease in the autosomal recessive setting: a second loss-of-function ACADVL variant on the other chromosome must be present. Most affected patients identified through newborn screening are compound heterozygous (two different pathogenic variants) rather than homozygous for p.Gly289Arg. Genotyping at this single SNP position will detect the AG carrier state but cannot determine whether a second pathogenic variant exists elsewhere in ACADVL — full gene sequencing is required for a complete clinical picture in symptomatic individuals or those with positive NBS results.

rs2066702

ADH1B Arg370Cys

Strong Protective

The African Flush Gene — ADH1B*3 and Alcohol Protection in African-Ancestry Populations

Alcohol dehydrogenase 1B (ADH1B) catalyzes the first step of alcohol metabolism, converting ethanol into acetaldehyde in the liver. Most people carry the common Arg370 form (ADH1B*1). The ADH1B*2 allele (His48Arg, rs1229984) is common in East Asian populations and encodes an enzyme roughly 100-fold more active than the common form. But there is a third functional variant — ADH1B*3 (Arg370Cys, rs2066702) — that is found almost exclusively in populations of African descent.

ADH1B*3 encodes a superactive enzyme with substantially higher ethanol oxidation kinetics than the common ADH1B*1 form. Like ADH1B*2, it accelerates the conversion of ethanol to acetaldehyde, producing an aversive physiological response that acts as a natural deterrent against heavy drinking. In African American populations, where this allele reaches frequencies of 15–25%, it is one of the strongest genetic predictors of alcohol consumption patterns ever identified.

This variant is functionally and epidemiologically distinct from rs1229984. The two alleles reside at different positions in the ADH1B protein (residue 48 versus residue 370), arise from independent mutational events, are distributed across different populations, and contribute independently to alcohol use disorder risk. A person of African descent may carry ADH1B*3 with no ADH1B*2 allele, and vice versa for East Asians.

The Mechanism

The rs2066702 A allele, on the plus (forward) strand of chromosome 4, corresponds to the Cys370 substitution in the ADH1B protein — the ADH1B*3 allele. The gene is located on the minus strand, so the A allele in genome files is the complement of the T allele in the coding sequence notation used in older literature.

At the protein level, replacing Arginine with Cysteine at position 370 alters the active-site geometry of the enzyme in a way that increases its catalytic efficiency for ethanol oxidation. The result is accelerated production of [acetaldehyde | A reactive aldehyde intermediate; classified as a Group 1 human carcinogen by the IARC; causes flushing, nausea, and tachycardia] after alcohol ingestion — similar in kind to what ADH1B*2 produces in East Asian populations, though the mechanistic basis (different residue, different kinetic parameters) is independent.

Because the downstream enzyme ALDH2, which clears acetaldehyde, operates at a fixed rate, any increase in acetaldehyde production from faster ADH1B activity creates a transient acetaldehyde surplus. Carriers of ADH1B*3 experience faster and more aversive responses to alcohol, which behaviorally reduces both the amount consumed and the likelihood of developing alcohol use disorder.

The Evidence

Genome-wide significance in African Americans:

The first genome-wide association study for maximum number of alcoholic drinks consumed in a 24-hour period in African Americans11 The first genome-wide association study for maximum number of alcoholic drinks consumed in a 24-hour period in African Americans
Xu K et al. Genomewide Association Study for Maximum Number of Alcoholic Drinks in European Americans and African Americans. Alcohol and Alcoholism, 2015
identified rs2066702 as the peak genome-wide significant signal in African Americans (p = 2.50×10⁻¹⁰). Eight SNPs in the region on chromosome 4 reached significance, all tagging the ADH1B*3 haplotype.

A subsequent GWAS of maximum habitual alcohol intake in 17,029 African American veterans from the VA Million Veteran Program22 A subsequent GWAS of maximum habitual alcohol intake in 17,029 African American veterans from the VA Million Veteran Program
Gelernter J et al. Genome-wide Association Study of Maximum Habitual Alcohol Intake in >140,000 U.S. European and African American Veterans. Biological Psychiatry, 2019
confirmed rs2066702 as the lead locus for African Americans (p = 2.3×10⁻¹²), a far stronger signal than any other variant identified in that ancestry group.

Clinical alcohol use metrics:

In a large electronic health record study of 57,677 African American veterans33 In a large electronic health record study of 57,677 African American veterans
Justice AC et al. Validating Harmful Alcohol Use as a Phenotype for Genetic Discovery Using Phosphatidylethanol and a Polymorphism in ADH1B. Alcohol and Alcoholism, 2018
, 34.3% of participants carried at least one A allele (minor allele frequency 19.0%). Carrying the protective A allele was associated with approximately half the odds of high AUDIT-C scores (aOR = 0.54) and an aOR of 0.51 when combining AUDIT-C with ICD-coded alcohol use disorder — demonstrating that the protective effect is clinically meaningful and detectable across multiple measurement approaches.

AUD diagnostic criteria:

Among African American participants in a detailed AUD phenotyping study44 Among African American participants in a detailed AUD phenotyping study
Hart AB et al. Which alcohol use disorder criteria contribute to the association of ADH1B with alcohol dependence? Addiction Biology, 2016
, individuals homozygous for the major (non-protective) allele GG endorsed significantly more DSM-IV and DSM-5 AUD criteria (p = 1.9×10⁻⁹). The criterion most strongly linked to this variant was tolerance — the need for increasing amounts of alcohol to achieve the desired effect — suggesting that ADH1B*3 carriers develop tolerance more slowly, consistent with the variant's faster acetaldehyde production making high-dose alcohol less pleasant.

Prenatal exposure and developmental protection:

A longitudinal study in African American families55 A longitudinal study in African American families
Dodge NC et al. Protective effects of the alcohol dehydrogenase-ADH1B*3 allele on attention and behavior problems in adolescents exposed to alcohol during pregnancy. Alcoholism: Clinical and Experimental Research, 2014
found that maternal ADH1B*3 carrier status shielded adolescent offspring from behavioral and attention consequences of prenatal alcohol exposure, with allele frequencies of 17.6% in mothers and 21.0% in adolescents — consistent with the expected 15–20% frequency in African American populations. The proposed mechanism is that faster maternal alcohol metabolism reduces peak blood alcohol concentration reaching the fetus.

Practical Actions

For GG carriers (most common in non-African populations): Without the ADH1B*3 protective allele, you lack one of the biological deterrents against heavy drinking. If you are of African descent, your GG genotype means you metabolize alcohol at the common rate — higher-tolerance drinking is biologically more accessible to you, which removes a natural brake on consumption. This is relevant when considering whether your drinking patterns are influenced by behavioral versus genetic factors.

For AG and AA carriers (primarily found in people of African descent): Your ADH1B*3 allele accelerates alcohol-to-acetaldehyde conversion, producing faster aversive responses to alcohol. This is one of the strongest inherited protective factors against alcohol use disorder in populations of African ancestry. The biological deterrent is real — but, as with ADH1B*2 in East Asian populations, it can be overridden socially. Environmental factors (peer norms, social offers) can diminish the genetic protection when drinking pressures are high.

Interactions

ADH1B*3 (rs2066702) and ADH1B*2 (rs1229984, His48Arg) reside in the same gene but at different protein positions. They arise independently and distribute across different populations. In African American populations where both alleles may occasionally co-occur, combined diplotype analysis is informative. However, the primary co-variant to consider alongside rs2066702 in African-ancestry individuals is ALDH2 (rs671), which controls the downstream clearance of the acetaldehyde that ADH1B*3 produces more rapidly.

ADH1C variants (rs1693482, rs698) are pathway partners that also affect the rate of alcohol oxidation. Combined diplotype analyses have documented that ADH1B and ADH1C allele combinations affect liver disease risk and alcohol metabolism outcomes beyond either variant alone.

MUC1 Near-Gene Variant — The Stomach's Mucus Shield and Gastric Cancer Risk

The gastric epithelium is under constant assault — acid, pepsin, ingested pathogens, and the chronic coloniser Helicobacter pylori11 Helicobacter pylori
A gram-negative bacterium that infects the stomach lining of roughly half the global population; the primary causal agent of peptic ulcers and a major risk factor for gastric cancer
. The first line of defence is a thick mucus layer anchored by mucin glycoproteins, the most important of which is MUC122 MUC1
Mucin 1, a high-molecular-weight, heavily O-glycosylated transmembrane protein expressed on the apical surface of gastric epithelial cells; it provides both steric hindrance against pathogens and acts as a releasable decoy
. rs2070803 is a regulatory variant near the MUC1 gene on chromosome 1q22 that influences how much of this protective protein the stomach produces. Carrying one or two copies of the A allele is associated with lower MUC1 surface expression — and, in some of the largest genetic studies ever conducted on gastric cancer, a meaningfully elevated risk of developing diffuse-type gastric cancer33 diffuse-type gastric cancer
One of two major histological subtypes of gastric cancer (alongside intestinal-type); diffuse-type spreads through the stomach wall without forming a distinct mass and carries a worse prognosis
.

The Mechanism

rs2070803 maps approximately 585 base pairs upstream of the MUC1 transcription start site on the reverse strand, placing it in the gene's regulatory region. It is in strong linkage disequilibrium44 linkage disequilibrium
Non-random co-inheritance of nearby variants; alleles at linked sites are more often inherited together than expected by chance
with the functionally characterised variant rs4072037, which directly alters MUC1 promoter activity and controls the ratio of major MUC1 splice variants produced in the gastric epithelium.

Carriers of the A allele at rs2070803 produce less MUC1 protein at the gastric mucosal surface. This reduction compromises the epithelium in two ways. First, MUC1's extracellular domain normally acts as a steric barrier against bacterial adhesion — its dense glycan coat is physically too large to allow most bacteria to reach the underlying cell membrane. Second, MUC1 functions as a releasable decoy: shed extracellular domain fragments bind pathogen adhesins and carry them away from the epithelium. With less surface MUC1, both mechanisms are weakened.

H. pylori further compounds genetically reduced expression. The virulence factor CagA55 CagA
Cytotoxin-associated gene A protein, injected directly into gastric epithelial cells by H. pylori; once inside, it is tyrosine-phosphorylated and disrupts multiple host signalling pathways
is injected into host cells, where it undergoes phosphorylation and promotes binding of MUC1's cytoplasmic tail to β-catenin. The resulting MUC1–β-catenin complex translocates to the nucleus, activating proliferative gene programmes including cyclin-D1. Carriers who already have lower baseline MUC1 expression due to the A allele are therefore more vulnerable to this CagA-driven oncogenic cascade.

The Evidence

The primary evidence comes from a genome-wide association study66 genome-wide association study
Saeki N et al. A functional single nucleotide polymorphism in mucin 1, at chromosome 1q22, determines susceptibility to diffuse-type gastric cancer. Gastroenterology, 2011
conducted across three panels of Japanese and Korean individuals (discovery: 606 cases/1,264 controls; validation 1: 304 cases/1,465 controls; validation 2: 452 cases/372 controls). rs2070803 reached genome-wide significance with p = 4.33 × 10⁻¹³ and a meta-analytic odds ratio of 1.71 for diffuse-type gastric cancer. Notably, the association was specific to diffuse-type and not intestinal-type gastric cancer.

A complementary case-control study77 case-control study
Xu L et al. Risk of gastric cancer is associated with the MUC1 568 A/G polymorphism. Int J Oncol, 2009
directly measured MUC1 protein in gastric tissue: AA genotype carriers showed significantly lower MUC1 immunostaining (r = −0.179, p = 0.004) and a 1.81-fold increased gastric cancer risk, providing the functional link between genotype and reduced mucosal protection.

A systematic review and meta-analysis88 systematic review and meta-analysis
Giraldi L et al. MUC1, MUC5AC, and MUC6 polymorphisms, Helicobacter pylori infection, and gastric cancer. Eur J Cancer Prev, 2018
of 21 studies confirmed the protective effect of the G allele (OR 0.66, 95% CI 0.57–0.78 for dominant model AG/GG vs. AA), with consistent findings across Asian (OR 0.73) and White European (OR 0.48) populations.

The risk is further amplified when rs2070803 co-occurs with the PSCA risk variant rs2294008. Carriers of risk alleles at both loci face a dramatically higher combined gastric cancer risk, as these two genes act in partially overlapping mucosal defence pathways.

Practical Implications

This variant is a genuine gastric cancer susceptibility signal — not a certainty, but a meaningful elevation in lifetime risk that is actionable. The most direct intervention is targeted H. pylori screening, since the genetic risk is substantially mediated through the bacteria's ability to exploit weakened mucosal defences. Current evidence suggests that eradicating H. pylori reduces gastric cancer incidence by roughly 35–45% in infected individuals, and for A allele carriers the absolute benefit may be larger because the residual mucosal vulnerability is genetically amplified.

Dietary and supplement strategies that support mucosal integrity — specifically those with evidence in the gastric context — complement surveillance without replacing it. Antral gastroscopy99 gastroscopy
Direct visual inspection of the stomach lining using an endoscope
surveillance intervals recommended by gastroenterologists should be followed by anyone who also has chronic gastritis, intestinal metaplasia, or a family history of gastric cancer.

Interactions

rs2070803 is in strong linkage disequilibrium with rs4072037, the functionally characterised MUC1 promoter variant. These two SNPs tag the same haplotype in most studies and their associations are not independent — the risk captured by rs2070803 is largely the same risk captured by rs4072037.

The PSCA variant rs2294008 represents a separate pathway (prostate stem cell antigen, expressed in gastric epithelium) that interacts epistatically with MUC1 variants. Carriers of both MUC1 and PSCA risk alleles face substantially higher combined risk than either alone, as documented in the original Saeki et al. GWAS.

ABCG2 V12M — When a Transporter Loses Its Place at the Membrane

ABCG2 — also called Breast Cancer Resistance Protein (BCRP) — is one of the most versatile efflux pumps in the body. It sits at the apical surface of intestinal cells, renal tubules, and the blood–brain barrier, where it exports a wide range of substances including uric acid11 uric acid
a metabolic waste product of purine breakdown
, porphyrins, anticancer drugs, antibiotics, and antiretroviral medications. The rs2231137 variant encodes a valine-to-methionine substitution at position 12 of the protein — a residue in the first transmembrane segment that influences how ABCG2 folds and reaches the cell membrane.

The Mechanism

The V12M change (T allele on the plus strand; the protein letter change is coded on the minus strand as G>A at nucleotide 34 of the coding sequence) sits at the very beginning of the ABCG2 protein's first predicted transmembrane helix. One cell model study found 22 Kamo et al., Intl J Cancer, 2004 that V12M disrupts apical plasma membrane localization — the protein is synthesized normally but fails to traffic to the correct compartment. Cells expressing V12M showed drug efflux less than one-tenth that of wild-type. However, a second study using membrane vesicles from HEK293 cells 33 Kondo et al., Pharm Res, 2004 found transport activity for estrone sulfate, DHEAS, and methotrexate was comparable to wild-type. The discrepancy likely reflects cell-type and assay differences, making V12M's intrinsic functional impact harder to pin down than the better-characterised Q141K variant (rs2231142), which consistently reduces urate transport by ~53%.

The Evidence

Population genetics paints an interesting picture. The T allele (Met12) occurs at about 4% in Europeans but reaches 27% in East Asians and 23% in Latino/Admixed American populations — a striking frequency difference suggesting different selective pressures across ancestries.

A meta-analysis of 52 studies44 meta-analysis of 52 studies
Shi et al., BMC Med Genet, 2020
found that carrying one or two T alleles at rs2231137 was associated with 36–57% lower odds of gout compared to CC homozygotes. This protective direction for gout is somewhat counterintuitive if V12M reduces urate transport, and may reflect incomplete LD with nearby protective haplotypes, population stratification, or context-dependent functional effects.

In drug-handling contexts, the picture is different. A study of 149 HIV-infected patients 55 Rodrigues et al., Mol Genet Genomic Med, 2024 found that the T allele and the heterozygous CT genotype were associated with increased risk of antiretroviral-induced hepatotoxicity (OR 1.50 for the allele; haplotype OR up to 2.49 in combination with rs2231142). In a small study of 93 Iranian children, 66 Barakat et al., Seizure, 2022 each T allele was associated with approximately 2.4-fold higher odds of drug-resistant epilepsy, suggesting altered antiepileptic drug transport.

An epistatic interaction study in 4,914 Chinese participants 77 Tang et al., Hereditas, 2020 found that rs2231137 interacts with the PKD2 variant rs2728121 to influence urate levels and gout risk, with stronger effects in males (Pint = 0.004).

Practical Actions

The main clinical implications of this variant are in the context of specific drugs whose clearance depends on ABCG2 — particularly antiretrovirals used in HIV treatment and certain antiepileptic drugs. Carriers of the T allele, especially those also carrying risk alleles at the companion ABCG2 variant rs2231142 (Q141K), may have a compounded reduction in ABCG2 transport capacity that warrants monitoring of drug levels and liver enzymes when starting hepatically-cleared medications.

Interactions

The most clinically important interaction is with rs2231142 (Q141K), the dominant functional variant in ABCG2. Q141K alone reduces urate transport by ~53% and drives most of the ABCG2-gout association. Carrying T alleles at both rs2231137 (V12M) and rs2231142 (Q141K) may compound transporter impairment beyond either variant alone — a haplotype combination studied for drug resistance in cancer cells (PMID 40806557). The 12M/141K haplotype showed 10.7-fold resistance to the anticancer drug mitoxantrone, substantially higher than either variant alone.

There is also a documented epistatic interaction with rs2728121 in the PKD2 gene, which co-regulates urate levels and gout susceptibility through separate renal mechanisms.

SLC28A2 rs2413775 — A Promoter Switch on the CNT2 Nucleoside Gate

Every cell in your body depends on a continuous supply of nucleosides — the molecular building blocks of DNA and RNA — and specialized transport proteins ferry these molecules across cell membranes. One such transporter, CNT211 CNT2
Concentrative Nucleoside Transporter 2, encoded by SLC28A2 on chromosome 15; the "concentrative" label reflects that it moves nucleosides against their concentration gradient, powered by co-transported sodium ions
, sits at a critical junction in the intestinal wall and kidney tubules, governing how efficiently purines and certain nucleoside-based drugs enter and leave the body. The rs2413775 variant sits 146 base pairs upstream of the SLC28A2 coding sequence, inside the gene's proximal promoter, and alters how much CNT2 protein the liver and kidney produce.

The Mechanism

The variant creates a T-to-A change at position −146 relative to the SLC28A2 transcription start site, directly within a binding site recognized by hepatic nuclear factor 122 hepatic nuclear factor 1
HNF1α and HNF1β are transcription factors that control gene expression in liver, kidney, and intestine; they bind specific DNA sequences and recruit the RNA polymerase machinery
(HNF1). The A allele enhances HNF1α and HNF1β binding affinity at this site — demonstrated by electrophoretic mobility shift assays showing superior binding of -146A oligonucleotides at all competitor concentrations tested. The result is a gain-of-function33 gain-of-function
The A allele increases, not decreases, CNT2 expression; this is the globally predominant allele (~62% globally), so "higher expression" is the population norm
effect: Yee et al. (2009)44 Yee et al. (2009)
Yee SW et al. Identification and characterization of proximal promoter polymorphisms in the human concentrative nucleoside transporter 2 (SLC28A2). J Pharmacol Exp Ther. 2009;328(3):699-707
showed that constructs carrying the A allele drove 1.8-fold greater luciferase activity than T-allele constructs in HepG2 hepatoblastoma cells (p < 0.001), with the effect confirmed in renal and colorectal cell lines and in mouse liver in vivo. Individuals carrying the T allele — the GRCh38 reference base but the global minor allele — produce less CNT2 protein per cell.

The Evidence

The foundational functional work was published in 2009 by the Pharmacogenomics of Membrane Transporters consortium55 Pharmacogenomics of Membrane Transporters consortium
PMT — a UCSF-led consortium dedicated to characterizing how genetic variation in drug transporters affects pharmacokinetics in diverse populations
. Yee et al. (2009)66 Yee et al. (2009) sequenced the SLC28A2 promoter in 272 individuals across five ethnic groups and identified rs2413775 as the only promoter variant with an allele frequency exceeding 20% in all populations studied. In a parallel analysis, Hesselson et al. (2009)77 Hesselson et al. (2009)
Hesselson SE et al. Genetic variation in the proximal promoter of ABC and SLC superfamilies: liver and kidney specific expression and promoter activity predict variation. PLoS ONE. 2009;4(9):e6942
confirmed that the T allele was associated with lower promoter activity in hepatic tissue across a broader survey of SLC family promoter variants.

The clinical pharmacogenomic relevance of SLC28A2 expression level has been demonstrated in studies of ribavirin, a purine nucleoside analog used to treat hepatitis C. CNT2 mediates intestinal absorption and renal tubular reabsorption of ribavirin; higher CNT2 expression translates to greater intracellular drug accumulation. A multicenter study by Ampuero et al. (2015)88 Ampuero et al. (2015)
Ampuero J et al. Role of ITPA and SLC28A2 genes in the prediction of anaemia associated with protease inhibitor plus ribavirin and peginterferon in hepatitis C treatment. J Clin Virol. 2015;68:8-13
found that the related SLC28A2 variant rs11854484 TT genotype independently predicted clinically significant ribavirin-induced anemia (OR 2.33, 95% CI 1.10–4.95, p = 0.027) — demonstrating that SLC28A2 expression differences have measurable hematological consequences during nucleoside analog therapy.

Beyond drug transport, SLC28A2 plays a role in purine nucleoside homeostasis. Zhou et al. (2019)99 Zhou et al. (2019)
Zhou Z et al. Common variants in the SLC28A2 gene are associated with serum uric acid level and hyperuricemia and gout in Han Chinese. Hereditas. 2019;156:7
identified multiple SLC28A2 variants significantly associated with serum uric acid levels and gout risk in 4,015 Han Chinese participants, confirming CNT2's role in purine clearance and urate homeostasis.

Clinical translation of rs2413775 specifically awaits prospective pharmacokinetic/pharmacodynamic studies — the existing evidence is mechanistically compelling but direct outcome data for this exact promoter variant remain limited, warranting an emerging evidence classification.

Practical Actions

For individuals receiving nucleoside analog drugs that are CNT2 substrates — primarily ribavirin and didanosine — the T allele may signal reduced transporter expression and potentially lower intracellular drug accumulation. This could affect both therapeutic efficacy (less drug reaching target cells) and tolerability (potentially less drug accumulation in red blood cells, which drives ribavirin- induced hemolytic anemia). The clinical significance for any individual patient depends on the specific drug, dose, and co-administered agents.

For general health, CNT2 also transports dietary purine nucleosides across the intestinal epithelium. Reduced CNT2 expression in T-allele carriers may modestly affect purine absorption from high-purine foods (red meat, organ meat, shellfish, beer), though the magnitude of this effect on serum uric acid has not been characterized for rs2413775 specifically.

Interactions

CNT2 works alongside the equilibrative nucleoside transporters ENT1 (SLC29A1) and ENT2 (SLC29A2) and the concentrative transporter CNT3 (SLC28A3) to govern nucleoside flux. Studies of ribavirin pharmacogenomics consistently include SLC29A1 variants alongside SLC28A2, as the two transporter families serve partly overlapping substrates in different tissue compartments.

The coding-region variant rs11854484 in SLC28A2 has been more extensively studied in clinical cohorts for ribavirin-induced anemia and treatment outcomes than rs2413775 itself, and these variants are not in strong linkage disequilibrium — they tag independent aspects of CNT2 biology (promoter expression level vs. protein function).

CD226 Intronic Variant — A Regulatory Signal at the DNAM-1 Autoimmune Locus

CD226 (also known as DNAM-1, DNAX Accessory Molecule-1)11 CD226 (also known as DNAM-1, DNAX Accessory Molecule-1)
A co-stimulatory receptor expressed on T cells, natural killer cells, and myeloid cells that delivers activating signals when it binds PVR (CD155) or Nectin-2 (CD112) on antigen-presenting cells and stressed tissues
is one of the few genes in the human genome where both coding and non-coding genetic variation independently contribute to autoimmune risk. rs2469434 sits deep within an intron of CD226 on chromosome 18q22, roughly 12,000 base pairs from the functionally well-characterised coding variant rs763361 (Gly307Ser). The two variants capture distinct signals at the same locus: rs763361 changes the protein's cytoplasmic signalling domain, while rs2469434 is thought to act through regulatory mechanisms — altering CD226 expression levels in immune cells rather than protein structure.

The Mechanism

rs2469434 maps to the notation c.728-3564 in the canonical CD226 transcript (NM_006566.4), placing it 3,564 nucleotides upstream of a major exon boundary — far from splice sites and beyond the reach of conventional splicing effects. The most plausible functional interpretation is that this variant lies within an intronic enhancer or silencer element22 enhancer or silencer element
Regulatory DNA sequences within introns that bind transcription factors and modulate gene expression levels without altering the protein sequence
, modulating CD226 transcription in immune cell types. The GWAS signal for altered neutrophil count and total white blood cell count — two haematopoietic phenotypes with high expression in circulating immune cells — is consistent with a cis-regulatory effect on CD226 expression during immune cell maturation and differentiation.

CD226 itself delivers co-stimulatory signals that lower the activation threshold of both innate NK cells and adaptive T cells. Higher expression of the receptor (or dysregulated expression timing) translates into lower thresholds for immune cell activation — the cellular substrate for autoimmune tissue damage. The C risk allele at rs2469434 is associated with increased neutrophil count and elevated WBC in the largest multi-ethnic blood cell GWAS to date (Chen MH et al. 2020, 746,667 individuals)33 (Chen MH et al. 2020, 746,667 individuals), consistent with a CD226 regulatory effect on myeloid cell development or survival.

The Evidence

The rheumatoid arthritis association at rs2469434 was established by the landmark Okada et al. 2014 RA GWAS meta-analysis44 Okada et al. 2014 RA GWAS meta-analysis
Okada Y et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature, 2014; >100,000 subjects, 29,880 RA cases, 73,758 controls
, which mapped 42 novel RA risk loci including the CD226 18q22 region. The C allele shows an odds ratio of approximately 1.07 (95% CI 1.05–1.10) in European populations and 1.11 (95% CI 1.07–1.15) in East Asian populations, with genome-wide significance (P=9×10⁻¹⁰ and 1×10⁻⁸ respectively). These effect sizes are modest — consistent with the polygenic architecture of RA, where each individual variant contributes a small increment to cumulative risk.

The multiple sclerosis association comes from the International MS Genetics Consortium 2019 study55 International MS Genetics Consortium 2019 study
Published in Science, 47,429 MS cases and 68,374 controls; established 200 autosomal susceptibility variants outside the MHC
, which found rs2469434-C associated with MS susceptibility at OR 1.055 (P=3×10⁻⁸).

Importantly, the haematopoietic phenotype associations — neutrophil count (P=10⁻¹⁶) and total WBC count (P=5×10⁻¹³) from Chen MH et al. 2020 — provide mechanistic support for the autoimmune risk associations. Both phenotypes are cellular traits with well-known links to immune activation and autoimmune disease activity: elevated neutrophil counts are observed in active RA and MS flares, and circulating neutrophil levels are an intermediate phenotype between CD226 regulatory variants and clinical autoimmune disease expression.

Practical Implications

The per-allele ORs from this intronic variant (1.07–1.11 for RA, 1.055 for MS) are modest but real, and they act independently of the better-characterised rs763361 (Gly307Ser) coding variant. For CC homozygotes carrying two C alleles, the cumulative effect is approximately the square of the per-allele OR — roughly 1.14–1.23 × the risk of TT homozygotes, depending on the disease. In the context of RA and MS, which are both T-cell-mediated diseases with well-defined early biomarkers, monitoring for early signs is the primary actionable response.

Interactions

rs2469434 and rs763361 both map to the CD226 locus and independently associate with overlapping autoimmune diseases. Their independence as signals means a person can carry risk alleles at both variants, producing a compound effect at the CD226 locus beyond what either variant predicts alone. More broadly, the CD226 18q22 locus interacts pathogenically with other T-cell checkpoint genes: PTPN22 rs2476601 (LYP phosphatase), CTLA4 rs3087243 and rs231775 (inhibitory co-receptor), and IL2RA rs2104286 (IL-2 receptor alpha). All four genes converge on the T-cell activation/tolerance decision, and elevated CD226 expression from rs2469434 would amplify the effect of any co-existing checkpoint gene risk alleles.

CNR2 rs2501401: Immune Tuning Through the Endocannabinoid System

The CNR2 gene encodes the cannabinoid receptor 211 cannabinoid receptor 2
CB2, a G-protein coupled receptor expressed predominantly on immune cells including macrophages, B cells, T cells, and natural killer cells
. Unlike its brain-dominant counterpart CB1, CB2 is the immune system's endocannabinoid tuning knob — when activated by endogenous ligands such as 2-arachidonoylglycerol (2-AG) and anandamide, it suppresses pro-inflammatory cytokine release, reduces immune cell migration into tissues, and dampens excessive immune activation. rs2501401 is an intronic variant in CNR2 that functions as a cis-eQTL22 cis-eQTL
expression quantitative trait locus — a variant that regulates how much of the nearby gene's protein gets made
, with the minor A allele associated with lower CB2 expression in whole blood (G allele NES = +0.128, p = 3.02 × 10⁻⁷ in GTEx v10, meaning each G copy raises expression).

The Mechanism

Although rs2501401 does not change any amino acid in the CB2 protein, its intronic location likely affects regulatory elements — enhancers, splice regulatory sequences, or chromatin accessibility signals — that control transcriptional output of CNR2 in immune cells. The G allele (carried by ~80% of Europeans and Africans) is associated with higher CB2 expression. The A allele (minor, ~20% in Europeans but ~65% in East Asians) correlates with lower CB2 levels in blood, which may reduce the brake on inflammatory signalling. CB2 activation normally suppresses NF-κB and MAPK pathways33 suppresses NF-κB and MAPK pathways
the molecular switches that drive cytokine production in macrophages and T cells
, so lower expression could leave immune cells in a more pro-inflammatory baseline state.

The Evidence

Direct clinical association studies for rs2501401 specifically have not been published. The evidence connecting CNR2 variation to autoimmune risk comes primarily from studies of the functional coding variant Q63R (rs35761398).

A 2025 Iranian cohort study (n=240) found that CNR2 Q63R RR carriers showed >2.5-fold increased rheumatoid arthritis risk44 CNR2 Q63R RR carriers showed >2.5-fold increased rheumatoid arthritis risk
Nateghi et al. International Journal of Genomics, 2025
under multiple inheritance models. A 2020 case-control study (n=200)55 2020 case-control study (n=200) in Iran reported OR 2.70 (95% CI 1.47–4.97, p=0.001) for multiple sclerosis under the dominant model for the same Q63R variant. Studies in pediatric immune thrombocytopenic purpura similarly found OR ~2.35 for the minor allele, replicating across Egyptian and Italian cohorts66 replicating across Egyptian and Italian cohorts. In chronic HCV infection, the variant was the only independent predictor of immune-mediated disorders77 only independent predictor of immune-mediated disorders
cryoglobulinemia, autoimmune thyroiditis, ANA positivity, p=0.005
(49.4% vs 24.1% in controls).

The eQTL relationship for rs2501401 is confirmed in GTEx whole-blood data (p = 3.02 × 10⁻⁷), establishing that this intronic variant meaningfully influences CNR2 transcript levels. Mechanistically, CB2 has been shown to directly suppress T-cell JAK/STAT signalling88 directly suppress T-cell JAK/STAT signalling
Xiong et al. Signal Transduction and Targeted Therapy, 2022
, and increased CB2 expression is seen in inflamed intestinal tissue of children with IBD, suggesting a compensatory anti-inflammatory role.

The weight of evidence places this variant's evidence level at emerging: the eQTL is statistically robust, but a direct clinical trial or large GWAS using rs2501401 as the index SNP has not been published.

Practical Actions

For people carrying the A allele (lower CB2 expression), the primary strategy is supporting endocannabinoid tone through precursor nutrients and direct CB2 agonism, combined with early monitoring of inflammatory markers. Omega-3 fatty acids (EPA and DHA) are substrates for 2-arachidonoylglycerol (2-AG) synthesis, the most abundant endogenous CB2 ligand, making omega-3 sufficiency especially relevant when receptor expression is reduced. Palmitoylethanolamide (PEA) — an endogenous CB2 agonist — has clinical trial evidence in inflammatory and neuropathic pain conditions and can supplement the reduced endocannabinoid tone directly. For those with AA genotype and a personal or family history of autoimmune disease, periodic monitoring of inflammatory markers (CRP, ESR) provides early warning before symptoms become established.

Interactions

The most relevant interaction is with the functional coding variant rs35761398 (Q63R) in the same CNR2 gene. rs35761398 changes glutamine to arginine at position 63, altering receptor signalling efficacy. Carrying both rs2501401 AA (lower expression) and rs35761398 RR (altered receptor function) could theoretically compound impaired CB2 pathway activity, though no published study has examined this specific combination. rs35761398 is the variant with direct clinical association data and should be interpreted alongside rs2501401 when available.