rs1801272

CYP2A6 *2 (L160H)

Established Risk Factor

CYP2A6*2 — The Nicotine Slow-Burn Variant

Your liver converts nicotine to cotinine in minutes. The enzyme doing this work — CYP2A6 — is encoded by one of the most polymorphic drug-metabolism genes in the human genome. The CYP2A6*2 allele11 CYP2A6*2 allele
rs1801272, also called L160H or Leu160His
swaps a single amino acid at position 160 of the protein, replacing leucine with histidine. The result: the enzyme is catalytically dead. People carrying this variant metabolize nicotine far more slowly than average — and the downstream consequences touch everything from smoking behaviour to cancer risk to dosing of several unrelated drugs.

The Mechanism

CYP2A6 is responsible for roughly 70–80% of hepatic nicotine clearance, converting nicotine to its primary metabolite cotinine22 cotinine
half-life ~16 hours; used as a biomarker for tobacco exposure
. The L160H substitution disrupts the haem-binding domain of the enzyme, abolishing catalytic activity entirely. Heterozygous carriers (*1/*2) have approximately 50% reduced activity; homozygous carriers (*2/*2) have essentially zero CYP2A6 function via this pathway. Because CYP2A6 also handles ~20% of CYP2B6-independent nicotine metabolism and the bulk of cotinine further oxidation to 3-hydroxycotinine33 ~20% of CYP2B6-independent nicotine metabolism and the bulk of cotinine further oxidation to 3-hydroxycotinine, the impact cascades across the entire nicotine-clearance pathway.

Beyond nicotine, CYP2A6 is the primary metaboliser of coumarin (the fragrance compound), the aromatase inhibitor letrozole, the prodrug tegafur (activated to 5-fluorouracil), and contributes meaningfully to efavirenz and valproic acid clearance. Poor metabolisers of this enzyme are not just slow smokers — they are pharmacokinetically different across a clinically significant drug panel.

The Evidence

The smoking-cessation advantage of slow metabolisers is well-documented. A systematic review of 34 studies44 systematic review of 34 studies
Jones et al. Nicotine Tob Res, 2022
found that reduced-function CYP2A6 carriers had more than twice the odds of quitting unaided compared to normal metabolisers (OR 2.05, 95% CI 1.23–3.42) in European ancestry populations. The mechanism: when nicotine lingers longer in plasma, its aversive and satiating effects are amplified, and the urge to re-dose is suppressed.

Reduced carcinogen activation is a parallel benefit. CYP2A6 activates tobacco-specific nitrosamines (TSNA) — including NNK, a potent lung carcinogen — as well as polycyclic aromatic hydrocarbons. A nested case-control study of 325 lung cancer cases55 nested case-control study of 325 lung cancer cases
Yuan et al. Int J Cancer, 2016
in Chinese men found poor metabolisers had an odds ratio of 0.64 for lung cancer, largely mediated by lower carcinogen-equivalent intake. A similar pattern has been observed in other Asian cohorts with high frequencies of CYP2A6 null alleles.

The picture is inverted for nicotine replacement therapy (NRT). When nicotine is delivered externally and continuously (patch, gum, lozenge), the slow-clearance advantage disappears: nicotine accumulates more, side effects (nausea, palpitations, insomnia) become more prominent, and the cessation benefit narrows. The 2022 meta-analysis66 2022 meta-analysis reported that with NRT, the cessation benefit was attenuated; with bupropion, intermediate/slow metabolisers actually showed worse outcomes (OR 0.86, 95% CI 0.79–0.94).

Practical Actions

For smokers: the natural tendency in poor metabolisers is to smoke fewer cigarettes per day and to find quitting easier — align with this biology rather than against it. If NRT is used, lower-dose formulations are appropriate; standard doses may cause accumulation and side effects. Varenicline (Champix/Chantix) is not primarily metabolised by CYP2A6 and is not affected.

For non-smoking contexts, this variant has direct drug-dosing implications. Letrozole (used for breast cancer and fertility), tegafur (colorectal cancer prodrug), and efavirenz (HIV therapy) all require attention when CYP2A6 activity is absent. Standard doses may result in elevated exposure; discuss genotype-informed dosing with oncology or infectious-disease prescribers.

Interactions

CYP2A6 activity is also influenced by other variants in the same gene. The CYP2A6 deletion alleles (*4A, *4B, *4C, *4D) completely eliminate one gene copy and are most common in East Asian populations. Compound heterozygotes carrying *2 on one chromosome and a deletion on the other are effectively null metabolisers. Other partial-activity alleles (*7, *10, *12, *17) can compound the effect in trans. Full phenotype prediction requires haplotype-based analysis (e.g. PharmCAT or star-allele calling) rather than individual SNP calls. Related variants rs28399433 (*4 deletion tag) and rs5031016 (*7) are relevant for East Asian ancestry individuals in particular.

ACTN3 R577X — The Sprint Gene

The ACTN3 gene encodes alpha-actinin-311 alpha-actinin-3
A structural protein found exclusively in type II (fast-twitch) muscle fibers, where it anchors the contractile apparatus at the Z-disc
, a structural protein found exclusively in fast-twitch (type II) muscle fibers. It is arguably the most replicated finding in exercise genetics. A single C-to-T change at position 577 converts an arginine codon to a premature stop codon, completely abolishing protein production. About 1.5 billion people worldwide carry two copies of the T allele and produce no alpha-actinin-3 at all — yet they are perfectly healthy. This makes ACTN3 R577X one of the most common "loss of function" variants in the human genome.

The Mechanism

Alpha-actinin-3 is a sarcomeric22 sarcomeric
Sarcomere: the basic contractile unit of skeletal muscle, bounded by Z-discs
protein that crosslinks actin filaments at the Z-disc of fast-twitch muscle fibers. It plays a structural and signaling role in these fibers, contributing to their ability to generate rapid, forceful contractions. When the R577X stop codon (T allele) is present on both chromosomes, the protein is entirely absent. Its closely related paralog, alpha-actinin-233 alpha-actinin-2
ACTN2 is expressed in all muscle fibers and partially compensates for ACTN3 loss, explaining why XX individuals have no disease phenotype
, partially compensates for this loss, which is why deficiency causes no disease.

However, the compensation is imperfect. Fast-twitch fibers lacking alpha-actinin-3 undergo a subtle remodeling: they shift toward slower, more oxidative contractile properties44 contractile properties
Including changes in myosin heavy chain isoforms and sarcoplasmic reticulum calcium handling
, improved aerobic enzyme activity, and enhanced fatigue recovery. In essence, fast-twitch fibers in XX individuals behave a bit more like slow-twitch fibers.

The Evidence

The landmark 2003 study55 landmark 2003 study
Yang N et al. ACTN3 genotype is associated with human elite athletic performance. Am J Hum Genet, 2003
by Yang and colleagues at the Australian Institute of Sport found that the RR genotype was significantly overrepresented among elite sprint and power athletes, while no female power athlete or Olympic sprinter in their cohort had the XX genotype. This has since been replicated extensively.

A meta-analysis of 44 studies66 meta-analysis of 44 studies
Houweling PJ et al. Association of the ACTN3 R577X polymorphism with elite power sports: A meta-analysis. PLoS One, 2019
covering 20,753 participants found the R allele at OR 1.21 (95% CI 1.07-1.37) in power athletes versus controls. The most recent systematic review77 recent systematic review
El Ouali M et al. Systematic review and meta-analysis of ACTN3 R577X in power vs endurance athletes. Sports Med Open, 2024
of 25 studies (14,541 participants) confirmed RR overrepresentation in power athletes with OR 1.48 (95% CI 1.25-1.75, p < 0.00001) versus controls, while the XX genotype was significantly underrepresented (OR 0.63).

The biological mechanism was confirmed in ACTN3 knockout mice88 ACTN3 knockout mice
MacArthur DG et al. Loss of ACTN3 gene function alters mouse muscle metabolism. Nat Genet, 2007
, which showed a clear shift in fast-fiber metabolism toward aerobic pathways, reduced fast fiber diameter, and increased endurance capacity.

Beyond Athletics

ACTN3 R577X is more than a "speed gene." The XX genotype has been associated with superior cold tolerance99 superior cold tolerance
Wyckelsma VL et al. Loss of alpha-actinin-3 provides superior cold resilience and muscle heat generation. Am J Hum Genet, 2021
— XX individuals maintain core body temperature better during cold exposure through altered muscle thermogenesis (increased muscle tone rather than shivering). This may explain why the X allele increased in frequency as humans migrated to colder climates, reaching its highest prevalence in South Asian and East Asian populations.

The XX genotype has also been linked to increased injury susceptibility1010 increased injury susceptibility
Systematic review of ACTN3 R577X and non-contact injury risk in trained athletes
, particularly non-contact muscle injuries and ligament damage, as well as greater exercise-induced muscle damage after eccentric exercise. In older adults, alpha-actinin-3 deficiency is associated with reduced muscle strength, decreased bone mineral density, and potentially faster sarcopenic decline.

Practical Implications

For CC (RR) individuals: your fast-twitch fibers are optimized for explosive power. You may have a natural advantage in sprinting, jumping, and strength sports. High-intensity interval training and power-focused resistance training align well with your fiber type profile.

For TT (XX) individuals: your muscle fibers are shifted toward endurance and aerobic efficiency. You may excel in longer-duration activities and recover from aerobic exercise more effectively. Pay extra attention to gradual eccentric loading progression and injury prevention, since your connective tissues may be more vulnerable to high-force impacts.

For CT (RX) individuals: you have an intermediate profile with one functional copy, giving you a versatile mix of power and endurance capacity. Most elite athletes across disciplines carry this genotype.

Interactions

ACTN3 R577X has been studied alongside ACE I/D (angiotensin-converting enzyme insertion/deletion polymorphism) and PPARA variants in exercise genetics. The ACE DD genotype combined with ACTN3 RR appears to compound power/sprint advantages, while ACE II plus ACTN3 XX may compound endurance traits. However, these interactions are based on observational athlete cohort data and remain at the level of moderate evidence.

rs1884614

HNF4A HNF4A P2 Promoter rs1884614

Moderate Risk Factor

HNF4A P2 Promoter — The Beta-Cell Insulin Hormone Switch

Your pancreatic beta cells carry a dedicated gene control switch — the P2 promoter of HNF4A11 P2 promoter of HNF4A
HNF4A (Hepatocyte Nuclear Factor 4 Alpha) has two promoters: P1 drives expression in adult liver, while P2 drives a distinct set of HNF4A isoforms (exons 7–12) exclusively in pancreatic beta cells and fetal liver. These P2-driven isoforms control the insulin secretion gene network. Rare inactivating mutations at the P2 locus cause MODY1 (maturity-onset diabetes of the young type 1)
— that coordinates dozens of genes required for glucose-stimulated insulin secretion. rs1884614 is a common intronic variant located within the P2 haplotype block on chromosome 20. It does not change any amino acid, but the T allele tags a haplotype associated with subtly reduced P2 promoter activity — an attenuated version of the same biological axis disrupted in MODY1.

The Mechanism

The HNF4A P2 promoter region22 HNF4A P2 promoter region
Located approximately 46 kb upstream of the P1/liver promoter, the P2 promoter is active specifically in pancreatic beta cells. P2-driven HNF4A isoforms directly regulate glucokinase (the beta-cell glucose sensor), the Kir6.2 potassium channel subunit (gating insulin release), and the insulin gene itself.
governs a transcription factor network that scales the insulin secretion response to incoming glucose. rs1884614 falls within an intronic region that is annotated as a non-coding transcript variant in R3HDML-AS1 (a nearby lncRNA on the minus strand), but its biological significance arises from its tight co-inheritance with the functional P2 haplotype. The T allele is in near-perfect linkage disequilibrium (r²>0.95) with rs4810424, rs1884613, and rs2144908 — all of which tag the same P2 promoter risk haplotype associated with reduced beta-cell HNF4A expression.

The practical result: beta cells carrying the risk haplotype produce less HNF4A protein from the P2 transcripts, quieting the downstream insulin secretion machinery. This manifests as a blunted glucose-stimulated insulin secretion33 glucose-stimulated insulin secretion
Specifically the acute first-phase and sustained second-phase insulin response to a glucose challenge, not basal fasting insulin — the deficit is a dynamic secretory defect rather than a structural one. This is why an oral glucose tolerance test (OGTT) reveals it while fasting glucose can remain normal for years.
response — quantifiable as a smaller insulin area-under-curve during an oral glucose challenge.

In established type 2 diabetes, the P2 isoform undergoes paradoxical re-activation in the liver: chronically elevated glucagon activates TET3, which demethylates the P2 promoter44 TET3, which demethylates the P2 promoter
Li et al. Nature Communications 2020 (PMID 31953394) — TET3 is recruited by FOXA2 to demethylate the P2 promoter in hepatocytes, switching on the fetal HNF4A isoform and driving excess hepatic glucose output. This feed-forward loop worsens hyperglycemia in established T2D.
, driving excess hepatic glucose production. Carriers of the P2 risk haplotype may be more susceptible to this re-activation under metabolic stress.

The Evidence

The variant was identified as part of the HNF4A P2 haplotype signal in Silander et al. Diabetes 200455 Silander et al. Diabetes 2004
Silander K et al. Genetic variation near the hepatocyte nuclear factor-4 alpha gene predicts susceptibility to type 2 diabetes. Diabetes 2004. PMID:15047633
(495 Finnish families, rs2144908 OR 1.33, P=0.011 — rs2144908 is in r²≈0.99 LD with rs1884614). Direct association of rs1884614 was confirmed by Hansen et al. Diabetologia 200566 Hansen et al. Diabetologia 2005
Hansen SK et al. Variation near the hepatocyte nuclear factor (HNF)-4alpha gene associates with type 2 diabetes in the Danish population. Diabetologia 2005. PMID:15735891
in a large Danish case-control study (1,400 T2D cases, 4,700 glucose-tolerant controls): T allele OR 1.14 (P=0.02) for T2D and elevated 2-hour post-OGTT glucose (P=0.05). In Damcott et al. Diabetes 200477 Damcott et al. Diabetes 2004
Damcott CM et al. Polymorphisms in both promoters of hepatocyte nuclear factor 4-alpha are associated with type 2 diabetes in the Amish. Diabetes 2004. PMID:15561969
(n=698 non-diabetic Amish), the T allele was directly associated with higher glucose area-under-curve during an OGTT (P=0.022) — a direct measure of blunted beta-cell secretory capacity.

A key mechanistic study by Tokunaga et al. Endocrine Journal 200888 Tokunaga et al. Endocrine Journal 2008
Tokunaga A et al. A common P2 promoter polymorphism of the hepatocyte nuclear factor-4alpha gene is associated with insulin secretion in non-obese Japanese with type 2 diabetes. Endocr J 2008. PMID:18654034
(349 Japanese T2D patients, 203 controls) refined the phenotypic target: the TT genotype was associated with reduced insulin secretion AUC specifically in non-obese subjects (BMI <25 kg/m²; P=0.027), but not in obese subjects. This is clinically important — in lean individuals, obesity-related insulin resistance cannot mask the HNF4A secretory deficit, making the genotype's impact visible as impaired glucose-stimulated insulin secretion per se.

Under immunological stress — which places extraordinary demand on beta-cell secretory reserve — the variant's impact becomes more pronounced. Yang et al. Transplantation 201199 Yang et al. Transplantation 2011
Yang J et al. Genetic and clinical risk factors of new-onset diabetes after transplantation in Hispanic kidney transplant recipients. Transplantation 2011. PMID:21544032
found that the TT genotype carried OR 2.44 (95% CI 1.42–4.48, P=0.002) for new-onset diabetes after transplantation in 303 Hispanic kidney recipients — the strongest single genetic predictor of post-transplant diabetes in that study. Calcineurin inhibitors (tacrolimus, cyclosporine) further impair beta-cell function by reducing calcineurin-NFAT signaling, compounding the HNF4A P2 transcriptional deficit.

Practical Actions

Because the risk mechanism is reduced HNF4A-driven insulin secretory capacity rather than insulin resistance, the central strategy is limiting the acute insulin secretory demand placed on beta cells at each meal. This means favouring lower-glycemic-load carbohydrates — legumes, intact grains, non-starchy vegetables — that produce slower, lower glucose peaks requiring less peak insulin release. Annual fasting glucose and HbA1c screening allows detection of emerging secretory deficit before overt diabetes develops. An oral glucose tolerance test (OGTT) is more sensitive than fasting glucose for this variant's mechanism, since the deficit is specifically in glucose-stimulated (not basal) insulin secretion.

For non-obese T carriers, the evidence is most robust: lean individuals cannot compensate for the secretory deficit through insulin resistance-driven hyperinsulinemia, making the HNF4A P2 haplotype effect most visible and most actionable in this body-composition context.

For anyone anticipating immunosuppressive therapy (kidney, liver, or heart transplant), the TT genotype's 2.44-fold post-transplant diabetes risk is clinically relevant information to share with the transplant team before initiation of calcineurin inhibitors.

Interactions

rs1884614 is in near-perfect LD (r²>0.95) with the companion P2 haplotype tags rs1884613, rs4810424, and rs2144908 — all currently in the GeneOps database. These variants probe the same causal P2 haplotype signal and are not independent risk factors: carrying the T allele at rs1884614 and the G allele at rs1884613 conveys no additional risk beyond either alone. Their independent database entries exist for chip coverage breadth — different genotyping arrays tag the haplotype through different SNPs.

The gene-gene interactions most relevant to this locus are with WFS1 rs10010131 (beta-cell ER homeostasis; combined OR 3.0 in Ashkenazi subjects, Neuman 2010, PMID:20361036) and TCF7L2 rs7903146 (Wnt-driven incretin signaling; combined OR 2.4). Both represent independent beta-cell stress pathways that amplify the HNF4A P2 transcriptional deficit.

LIPG — The Endothelial Lipase That Shapes Your HDL

Your HDL cholesterol is not just a passive bystander in cardiovascular health — it is actively dismantled and rebuilt by a family of enzymes, and endothelial lipase (EL)11 endothelial lipase (EL)
Encoded by the LIPG gene on chromosome 18; the only lipase secreted specifically from vascular endothelial cells
is one of the primary drivers of HDL catabolism. Unlike lipoprotein lipase (which targets triglyceride-rich particles), EL preferentially hydrolyzes the phospholipid surface of HDL, accelerating its breakdown and removal from circulation. Higher EL activity means lower HDL; lower EL activity means higher HDL. The rs2000813 variant is a common missense change in LIPG that, through an indirect regulatory mechanism, is associated with modestly higher HDL cholesterol in carriers of the T allele.

The Mechanism

The rs2000813 variant produces a Thr111Ile substitution22 Thr111Ile substitution
A change from threonine to isoleucine at amino acid position 111 of the endothelial lipase protein
in the LIPG protein. Careful in vitro studies have shown that the Ile111 variant has essentially identical phospholipase activity, protein stability, and regulation by ANGPTL3 and ANGPTL433 ANGPTL3 and ANGPTL4
Angiopoietin-like proteins 3 and 4 are endogenous inhibitors of endothelial lipase; they bind and inactivate EL on the surface of vascular cells
compared to the wild-type enzyme. The amino acid change itself is therefore functionally silent.

The HDL association arises through a different route. The Ile111-encoding T allele is in high linkage disequilibrium44 high linkage disequilibrium
LD means the two alleles are inherited together on the same chromosomal segment so frequently that knowing one predicts the other with high accuracy
(R²=0.8) with rs34474737, a variant in the 5' UTR of LIPG that directly reduces promoter activity and lowers EL expression. Carriers of the T allele at rs2000813 therefore tend to have lower circulating EL levels — not because the enzyme works differently, but because less of it is produced. Lower EL expression → slower HDL phospholipid hydrolysis → higher HDL-C. The coding variant is a marker for the regulatory variant, not the causal agent.

The Evidence

The GLGC (Global Lipids Genetics Consortium)55 GLGC (Global Lipids Genetics Consortium)
One of the largest GWAS consortia for lipid traits, combining data from dozens of cohorts and hundreds of thousands of participants
meta-analysis found rs2000813 associated with HDL-C at genome-wide significance (β = −0.15 SD per T allele, P = 1.92×10⁻¹⁴), corresponding to approximately 2 mg/dL higher HDL-C per T allele carried. The T allele was also associated with higher HDL2, HDL3, and apoA-I.

Huang et al. (2019)66 Huang et al. (2019)
LIPG SNPs, their haplotypes and gene-environment interactions on serum lipid levels; 2,498 adults from Maonan and Han Chinese populations (Lipids in Health and Disease)
replicated the HDL and apoA-I association in two East Asian ethnic groups after Bonferroni correction, adding population diversity to the dataset.

Crucially, higher HDL from this variant does not appear to translate into cardiovascular protection. Vergeer et al. (2009)77 Vergeer et al. (2009)
T111I variant in the endothelial lipase gene and risk of coronary heart disease in three independent populations; combined n=4,107 CHD cases from DCH, Nurses' Health Study, and Health Professionals Follow-up Study (European Heart Journal)
found a pooled CHD odds ratio of 0.95 (95% CI 0.85–1.06) — statistically null. Mean HDL differences between T allele carriers and non-carriers were only 0–1 mg/dL within each cohort, consistent with the modest effect size seen in the GWAS. This finding aligns with the broader body of evidence showing that genetically elevated HDL does not always protect against atherosclerosis88 genetically elevated HDL does not always protect against atherosclerosis
The HDL hypothesis has faced repeated setbacks in Mendelian randomization studies, suggesting HDL quantity and HDL function are different traits
.

The 2024 biochemical study99 2024 biochemical study
Johansen et al., Endothelial lipase variant T111I does not alter inhibition by angiopoietin-like proteins, Scientific Reports
definitively confirmed the protein-level neutrality of T111I and reframed rs2000813 as a marker SNP for a regulatory haplotype — an important caution against inferring direct functional consequences from coding variant annotations alone.

Practical Actions

For CC homozygotes (the common genotype with highest EL expression), the actionable insight is awareness: lower baseline HDL from higher EL activity is a genetically normal pattern, not evidence of lifestyle failure. If HDL runs low on a standard lipid panel, diet quality — specifically the ratio of polyunsaturated to saturated fat and omega-3 intake — can modulate EL activity through inflammatory pathways. LIPG expression is upregulated by pro-inflammatory cytokines, so any strategy that reduces systemic inflammation may blunt EL-driven HDL catabolism.

For CT and TT carriers, the modestly elevated HDL reflects lower EL expression rather than superior reverse cholesterol transport. Standard lipid monitoring remains appropriate; the slight HDL advantage does not warrant different cholesterol management targets.

Interactions

The rs2000813 T allele is inherited as part of a broader LIPG haplotype that also includes rs3813082, rs3744843, and the causal regulatory variant rs34474737. Haplotype analysis consistently shows stronger lipid associations than any single SNP alone. A compound action combining rs2000813 (CT or TT) with rs2278236 in ANGPTL4 (a direct inhibitor of endothelial lipase) could be informative: carriers of HDL-lowering ANGPTL4 variants on top of the CC (high-EL) LIPG genotype would represent a double-hit for depressed HDL, potentially warranting targeted omega-3 monitoring and dietary fat quality review. Both variants are in the lipid-fat-metabolism pathway.

PON1's Hidden Regulator — How an Intronic Variant Shapes HDL's Antioxidant Power

Paraoxonase-1 (PON1) is one of the most important enzymes you've likely never heard of. It travels attached to HDL particles11 HDL particles
High-density lipoprotein, the "good cholesterol" that ferries cholesterol from tissues back to the liver
in the bloodstream, where its primary job is to hydrolyze oxidized phospholipids22 oxidized phospholipids
Lipid peroxides that accumulate on LDL and artery walls, triggering the inflammatory cascade that leads to atherosclerotic plaque
before they can damage artery walls. PON1 activity varies enormously between individuals — as much as 40-fold — and that variation is largely genetic. Most research focuses on the well-known coding variants Q192R (rs662) and L55M (rs854560), but a 2012 genome-wide analysis of the PON gene cluster found that intronic variants like rs2237583 contribute meaningfully to explaining that activity gap.

The Mechanism

rs2237583 sits in an intron of PON1 — a non-coding stretch between exons — and does not change any amino acid in the final protein. Instead, it appears to affect PON1 gene expression33 gene expression
The process by which DNA is transcribed into mRNA and then translated into protein; intronic variants can alter splicing efficiency, RNA stability, or regulatory element binding
through regulatory mechanisms. The PON1 gene is transcribed on the minus strand of chromosome 7, and the rs2237583 T allele (plus strand) corresponds to the A allele in coding-strand notation. Carriers of the T allele show measurably higher PON1 arylesterase activity, which is the functional readout most tightly linked to HDL's capacity to protect LDL from oxidative modification. The T allele is the minor allele globally (~24–27% in European populations, ~63% in East Asians), meaning the majority of people carry the activity-limiting C allele.

The Evidence

Two large-scale genetic studies establish this variant's functional relevance. A comprehensive tag-SNP analysis of the PON gene cluster44 comprehensive tag-SNP analysis of the PON gene cluster
Kim et al. 2012. Additional Common Polymorphisms in the PON Gene Cluster Predict PON1 Activity but Not Vascular Disease. J Lipids.
in 1,328 Caucasian males found that rs2237583 independently predicted arylesterase activity beyond the four established functional PON1 SNPs (β=+11.36 per T allele, p=2.82×10⁻⁴), explaining an additional 0.5% of total variance. All ten SNPs together explained 30.1% of activity variance. Critically, none of these activity-predicting SNPs — including rs2237583 — independently predicted carotid artery disease status in that cohort. A genome-wide association study of PON1 activity55 genome-wide association study of PON1 activity
Kim et al. 2013. Novel common and rare genetic determinants of paraoxonase activity. J Lipid Res.
confirmed rs2237583 at genome-wide significance (p=3.88×10⁻⁸), underscoring its robust biological effect on enzyme activity.

The gap between "predicts PON1 activity" and "predicts cardiovascular events" is important context. The EPIC-Norfolk prospective study66 EPIC-Norfolk prospective study
Birjmohun et al. 2009. Both paraoxonase-1 genotype and activity do not predict the risk of future coronary artery disease. PLoS One.
(1,138 CAD cases, 2,237 controls, 6-year follow-up) found that while PON1 activity inversely associated with CAD risk at the univariate level, this association disappeared after adjusting for HDL-C and HDL particle markers — suggesting PON1's protective effect is inseparable from the HDL that carries it, rather than being an independent causal factor. Yet prospective data from a high-risk cardiac catheterization cohort77 prospective data from a high-risk cardiac catheterization cohort
Bhattacharya et al. 2008. JAMA.
(1,339 patients, 44-month follow-up) found stark outcome differences: the lowest PON1 activity quartile had 3.4-fold higher risk of major adverse cardiac events compared to the highest quartile (adjusted HR), and the QQ192 genotype (lower activity) was associated with HR 2.05 for all-cause mortality. The discrepancy likely reflects population differences — in a high-risk cohort, enzyme activity variation matters more.

Practical Implications

The T allele of rs2237583 is associated with modestly higher PON1 arylesterase activity, which in turn corresponds to slightly better HDL antioxidant function. The practical implication is not about an elevated cardiovascular risk so much as an opportunity: PON1 activity is highly modifiable by diet. A review of dietary PON1 modulators88 A review of dietary PON1 modulators
Goldberg et al. 2017. The Search for Dietary Supplements to Elevate or Activate Circulating Paraoxonases. Nutrients.
found that pomegranate increased PON1 activity by 83% in patients with carotid artery stenosis over 12 months, quercetin by approximately 29% in serum, and catechins by up to 150% in hemodialyzed patients, though most human data comes from small or special-population studies. The mechanism involves polyphenols upregulating PON1 gene transcription through aryl hydrocarbon receptor signaling. Individuals carrying the common C allele (lower baseline activity) stand to gain the most from dietary optimization of PON1 function.

Interactions

rs2237583 acts within a multi-locus PON1 activity architecture. The four established functional SNPs — the Q192R (rs662) and L55M (rs854560) coding variants, plus two promoter polymorphisms (rs705379 and rs854571) — together explain ~25% of PON1 activity variance; rs2237583 adds an additional 0.5% independently. In the Kim et al. 2012 stepwise regression99 Kim et al. 2012 stepwise regression
Kim DS et al. Additional Common Polymorphisms in the PON Gene Cluster. J Lipids.
, rs2237583 and the other intronic SNPs were in low linkage disequilibrium (r²<0.60) with each other and with the coding variants, suggesting they capture distinct regulatory signals. The combined genotype across multiple PON1 loci determines an individual's overall PON1 activity phenotype — someone unfavorable at rs2237583, rs662 (QQ), and rs705379 simultaneously would have substantially lower activity than the sum of individual effects. PON1 activity also interacts with dietary polyphenol intake in a nutrigenetic relationship: a nutrigenetic observational study1010 nutrigenetic observational study
Rosales-Corral et al. 2016. Interaction between polyphenols intake and PON1 gene variants on markers of cardiovascular disease. J Transl Med.
found that the relationship between polyphenol consumption and cardiovascular disease markers differed significantly by PON1 genotype, with low-activity genotypes benefiting more from high polyphenol intakes.

CFH A473A — The Complement Haplotype Tag SNP for Macular Degeneration Risk

Age-related macular degeneration (AMD) is the leading cause of irreversible vision loss in people over 65, and dysregulation of the complement immune system is its best-understood molecular driver. rs2274700 is a synonymous variant in exon 10 of CFH (Complement Factor H) — it does not change the amino acid sequence at position 473 (still alanine). Its significance lies not in any direct protein effect but in what it tags: rs2274700 is in complete linkage disequilibrium11 complete linkage disequilibrium
LD measures how often two variants are inherited together; r²=1 means these two variants are perfectly correlated across populations
with rs1410996, a well-characterized AMD risk variant in intron 1 of CFH that appears consistently in the GWAS-defined complement-risk haplotype.

In European populations, the G allele of rs2274700 occurs at roughly 60% frequency — making it the common allele — and it is this common allele that confers AMD risk. The protective A allele, found in only about 40% of Europeans, is consistently associated with lower AMD risk across multiple ethnic groups and is associated with better response to complement-targeted and anti-VEGF treatments. The 2022 Huan et al. study22 2022 Huan et al. study
Identifying Novel Genes and Variants in Immune and Coagulation Pathways Associated with Macular Degeneration. Ophthalmology Science. 2022
confirmed the A allele protective OR of 0.64 (P=4.5×10⁻⁴) and established the perfect LD with rs1410996, explaining why this synonymous coding variant consistently appears in AMD association studies despite having no direct protein effect.

The Mechanism

CFH is a critical brake on the alternative complement pathway33 alternative complement pathway
The complement system is an arm of innate immunity that can destroy pathogens and damaged cells via protein cascades; the alternative pathway runs continuously at a low level and must be tightly regulated to prevent self-damage
. It acts primarily at mucosal and epithelial surfaces — including Bruch's membrane and the retinal pigment epithelium (RPE) — where it suppresses complement-mediated attack on healthy host tissue. Age-related accumulation of oxidized lipids, cellular debris, and advanced glycation end-products in the sub-retinal space provides an increasing stimulus for complement activation. Without adequate CFH suppression, this drives chronic inflammation that damages photoreceptors and the RPE, ultimately resulting in drusen (yellow lipid-protein deposits under the retina), geographic atrophy (dry AMD), or choroidal neovascularization (wet AMD).

The G-allele haplotype tagged by rs2274700 is associated with reduced CFH expression or function at retinal surfaces relative to the A-allele haplotype. Because rs2274700 is a synonymous variant in perfect LD with the functional intronic variant rs1410996, it is likely that the causal mechanism operates through altered splicing efficiency, mRNA stability, or regulatory element binding within the CFH genomic region, rather than through any amino acid change. In East Asian populations where the Y402H variant (rs1061170) shows weak association, rs2274700 and rs1410996 remain significant, suggesting they tag distinct functional variation within CFH independent of Y402H.

The Evidence

The AMD-rs2274700 association is well-replicated across ethnicities. Francis et al. 200744 Francis et al. 2007
Haplotypes in the complement factor H (CFH) gene: associations with drusen and advanced age-related macular degeneration. PLoS One. 2007
identified rs2274700 as the most strongly associated CFH SNP in their haplotype analysis across three independent AMD populations, reaching p<10⁻⁹ in combination with Y402H and rs1061147 — covering both early drusen formation and advanced AMD.

The large-scale Lu et al. 2018 meta-analysis55 Lu et al. 2018 meta-analysis
53 studies, 110,747 participants
of complement gene polymorphisms and AMD found a pooled heterozygote-model OR of 0.53 (95% CI 0.40–0.70) for rs2274700, consistent with a protective A-allele effect across Caucasian and Asian populations. Babanejad et al. 201666 Babanejad et al. 2016
Iranian case-control study, 100 AMD patients vs 100 controls
independently confirmed the G allele as the risk allele with a significant case-vs-control frequency difference (p<0.001).

A notable nuance is the age-dependent effect documented by Adams et al. 201277 Adams et al. 2012
Melbourne Collaborative Cohort Study, 2,294 cases and 2,294 controls ages 48–86. Human Molecular Genetics. 2012
: in participants under 55, the risk genotype showed a paradoxical inverse (protective) association with early AMD; in those over 75, the association reversed to a significant risk signal. This age-modulation likely reflects the progressive breakdown of complement regulation as the sub-retinal environment accumulates oxidative stress with aging, at which point the G-allele haplotype's reduced CFH function becomes clinically relevant.

The clinical relevance extends to treatment: Cui et al. 202588 Cui et al. 2025
BMJ Open Ophthalmology; 104 neovascular AMD patients treated with combercept anti-VEGF
found that the A allele of rs2274700 was associated with significantly better treatment response to the anti-VEGF agent combercept, suggesting that genetic profiling of CFH variants may help guide treatment selection in neovascular AMD.

Practical Implications

rs2274700 genotype information is most useful as part of a multi-variant CFH risk profile alongside the Y402H variant (rs1061170) and ARMS2 A69S (rs10490924). The G allele at rs2274700 acts in the same complement-dysregulation direction as the C allele at Y402H — both impair CFH's protective function at the retina. Carrying GG at rs2274700 without the Y402H risk allele still meaningfully elevates AMD risk, particularly in East Asian populations where Y402H is rare but rs2274700/rs1410996 remain strongly associated.

The age-dependency finding argues for beginning retinal monitoring earlier than population guidelines suggest for GG homozygotes, since complement dysregulation appears to become clinically relevant as other age-related retinal stressors accumulate. Supplementation with lutein, zeaxanthin, and omega-3 fatty acids has an evidence base for AMD prevention that is relevant across all CFH risk genotypes.

Interactions

rs2274700 is in complete LD with rs1410996 (r²=1), meaning they are interchangeable markers for the same underlying CFH haplotype risk. The Y402H variant (rs1061170) is partially correlated — both mark complement-risk haplotypes but measure partially independent variation, as evidenced by rs2274700's independent significance in East Asian populations where Y402H shows no association. Combined high-risk genotypes at rs2274700 (GG) and rs10490924/ARMS2 (TT) confer synergistically elevated AMD risk through complementary pathogenic pathways: complement dysregulation (CFH) and retinal oxidative stress (ARMS2).

ABCA1 rs2853579 — A Common Regulatory Tag in the HDL Biogenesis Gene

ABCA111 ATP-Binding Cassette Transporter A1 — a cell-membrane pump that loads cholesterol and phospholipids onto lipid-poor apolipoprotein A-I, forming nascent HDL particles is the primary gatekeeper of reverse cholesterol transport22 reverse cholesterol transport
the pathway by which peripheral tissues offload excess cholesterol back to the liver for processing and excretion
. Without ABCA1, cells cannot shed excess cholesterol and HDL formation collapses. Loss-of-function mutations in both copies cause Tangier disease33 Tangier disease
a rare autosomal recessive disorder with near-absent HDL, cholesterol-laden macrophage deposits, and accelerated atherosclerosis
. rs2853579 operates at a far subtler level — it is a common synonymous coding variant within ABCA1 exon 15 whose effect on HDL arises not from any protein change but from its co-inheritance with nearby regulatory variation that modulates how much ABCA1 the liver produces.

The Mechanism

rs2853579 sits at chr9:104,828,991 (GRCh38) within the coding sequence of ABCA1 at transcript position c.2040 on NM_005502.4. The ABCA1 gene is transcribed from the minus strand; genome files report the plus strand, where the reference allele is G and the minor allele is T. The G>T change corresponds to a C>A substitution on the coding strand — an ATC→ATA codon change that encodes isoleucine at position 680 in both cases (p.Ile680Ile). This synonymous44 synonymous
a variant that changes the DNA sequence but not the resulting amino acid — sometimes called a "silent" mutation
variant therefore cannot alter ABCA1 protein function directly. Any influence on HDL-C must arise through linkage disequilibrium55 linkage disequilibrium
LD — the tendency for nearby alleles to be inherited together; a tag SNP statistically "marks" a causal variant nearby without being functionally responsible itself
with a causal regulatory element.

The ABCA1 locus contains several intronic enhancers governing hepatic expression. Richardson et al.66 Richardson et al.
Richardson et al. Allele-specific enhancers mediate associations between LCAT and ABCA1 polymorphisms and HDL metabolism. PLOS One, 2019
identified that the intronic variant rs2575875 creates an allele-specific STAT377 Signal Transducer and Activator of Transcription 3 — a transcription factor that activates gene expression in response to cytokines and growth factors in the liver binding site that loops physically to the ABCA1 promoter and drives hepatic transporter expression. Multiple GWAS tag SNPs across the ABCA1 locus — including rs4149268, rs1883025, and rs2853579 — show overlapping HDL-C associations consistent with tagging the same or related regulatory haplotype. The G allele of rs2853579 marks the lower-expression haplotype; the T allele marks the higher-expression haplotype.

The Evidence

The association was identified in a large EHR-based GWAS88 EHR-based GWAS
Hoffmann TJ et al. A large electronic-health-record-based genome-wide study of serum lipids. Nature Genetics, 2018
of 94,674 ancestrally diverse Kaiser Permanente participants using longitudinal untreated lipid measurements. rs2853579-G was associated with decreased HDL-C (β = −0.046 mmol/L, p = 6×10⁻¹⁶) and decreased total cholesterol (β = −0.053 mmol/L, p = 1×10⁻¹⁹) at genome-wide significance. A second analysis from the same dataset found p = 2×10⁻²⁰ for HDL-C with β = −0.044 mmol/L per G allele — equivalent to roughly −1.7 mg/dL per allele. For context, typical HDL-C ranges from 40 to 80 mg/dL, so the per-allele effect is modest but statistically robust and biologically coherent.

Population frequencies from gnomAD show remarkable ancestry variation: the T allele (higher HDL) is rare in Europeans (~12%) and South Asians (~15%) but reaches 66% in East Asian populations and 45% in African populations. In East Asia, T is the major allele — meaning the lower-HDL GG genotype is actually the minor one in that ancestry. ClinVar classifies this variant as benign for Tangier disease and hypoalphalipoproteinemia, consistent with the GWAS data showing small common-variant effects rather than the large functional disruptions that characterize those disorders.

Practical Actions

For most carriers of the common GG genotype (~73% of the global population), this variant represents the population-typical baseline for ABCA1-driven HDL production. The most actionable information at this locus is your measured HDL-C. The per-allele effect of ~1.7 mg/dL for the T allele translates to approximately +3.4 mg/dL for TT homozygotes — a meaningful directional tendency but one that is easily matched or exceeded by lifestyle factors: regular aerobic exercise raises HDL by 3–6 mg/dL in most people; eliminating trans fats adds 2–4 mg/dL; replacement of saturated fat with monounsaturated fat (olive oil, avocado, nuts) provides additional benefit.

Carriers of the rare TT genotype (~2% in Europeans, ~44% in East Asians) have a genetically favorable signal at this locus. Confirmed low HDL despite the favorable TT genotype would warrant a broader workup of other lipid-modifying variants, dietary patterns, and metabolic factors.

Interactions

rs2853579 sits within the same ABCA1 gene as rs4149268 and rs1883025, two other intronic variants previously associated with HDL-C in GWAS. Whether they represent independent regulatory signals or partial correlates of the same haplotype has not been fully resolved at the individual-study level. The R219K missense variant (rs2230806, ABCA1) alters protein-level cholesterol efflux capacity through a distinct mechanism from regulatory tag SNPs and represents an independent functional layer. Carriers of other low-HDL variants — such as CETP (rs708272) or LIPC variants — can compound the rs2853579 G-allele effect through additive lipid pathway disruption.

The Z Allele — Alpha-1 Antitrypsin's Most Common Deficiency Variant

Alpha-1 antitrypsin (AAT) is the body's primary defense against neutrophil elastase, a powerful enzyme11 powerful enzyme
Neutrophil elastase is released by white blood cells during inflammation and can break down elastin, the protein that gives lung tissue its elasticity
that can destroy lung tissue if left unchecked. The Z allele (Glu342Lys) is the most common genetic variant causing severe AAT deficiency, affecting approximately 1 in 2,000 to 3,500 births22 1 in 2,000 to 3,500 births
The ZZ genotype occurs in about 1:2,000-3,500 newborns in populations of European descent, though most remain undiagnosed
. This single amino acid change — glutamic acid to lysine at position 342 — causes the protein to misfold and polymerize inside liver cells, leading to both lung disease (from lack of AAT in circulation) and liver disease (from toxic accumulation in the liver).

The Mechanism

The Z variant creates a structural instability33 structural instability
The substitution of acidic glutamic acid with basic lysine at position 342 disrupts protein folding, causing AAT molecules to link together (polymerize) in the endoplasmic reticulum of liver cells
that prevents normal secretion from liver cells. Instead of being released into the bloodstream, approximately 85% of Z variant AAT gets retained in hepatocytes as large protein polymers. ZZ homozygotes have serum AAT levels at only 10-20% of normal44 10-20% of normal
Normal AAT levels are approximately 20-53 µM (150-350 mg/dL); ZZ individuals typically have <11 µM
, while MZ heterozygotes have approximately 60% of normal levels. This dual pathology — loss of function in the lungs and toxic gain of function in the liver — makes the Z allele unique among common genetic disorders.

The molecular consequence is a loss of protease-antiprotease balance in the lungs. Neutrophil elastase, normally kept in check by AAT, breaks down elastin and collagen in alveolar walls55 alveolar walls
The tiny air sacs in the lungs where oxygen and carbon dioxide exchange occurs
. Without sufficient AAT protection, this leads to panlobular emphysema — progressive destruction of lung tissue starting in the bases and spreading throughout the lungs.

The Evidence

The clinical significance of the Z allele is well established through decades of research. ZZ homozygotes face 80-100% risk of developing emphysema66 ZZ homozygotes face 80-100% risk of developing emphysema
Based on ClinGen classification and long-term follow-up studies of diagnosed individuals
and 10-15% risk of liver cirrhosis by adulthood. The risk is dramatically modified by environmental factors, particularly smoking77 smoking
Smoking increases COPD risk in ZZ individuals and accelerates disease onset by 10-15 years compared to non-smokers
.

MZ heterozygotes (carriers) were long considered "safe," but recent large population studies have overturned this assumption. A meta-analysis of six studies88 meta-analysis of six studies
Dahl et al., European Respiratory Journal, 2005
found MZ smokers have 3.26-fold increased odds of COPD compared to MM individuals (95% CI: 1.24-8.57). Non-smoking MZ carriers do not appear to have increased lung disease risk, demonstrating a clear gene-environment interaction99 gene-environment interaction
The triple combination of MZ genotype, smoking, and occupational dust/fume exposure compounds risk beyond any single factor
.

For liver disease, a large cohort study1010 large cohort study
Published in Hepatology, 2018
found MZ heterozygotes have 1.53 odds ratio for cirrhosis compared to MM individuals, with risk amplified by higher BMI. Among ZZ children, 18% develop clinically recognized liver abnormalities and 2.4% develop cirrhosis in childhood1111 2.4% develop cirrhosis in childhood
Swedish newborn screening study following 200,000 children
, though most ZZ children remain clinically well.

Practical Implications

The Z allele is one of the most actionable genetic findings in genomics. Smoking avoidance is critical — the difference between a normal lifespan and severe disability by age 40. Augmentation therapy1212 Augmentation therapy
Intravenous infusions of pooled human AAT, administered weekly at 60 mg/kg, raise serum levels into the protective range
is available for ZZ individuals with established lung disease, and has been shown to slow emphysema progression in randomized controlled trials1313 randomized controlled trials
The RAPID trial demonstrated significant reduction in lung density loss: 1.5 g/L/year with treatment vs 2.6 g/L/year with placebo (p=0.07)
. The therapy is not curative but can meaningfully slow disease progression when started early.

For MZ carriers, counseling about smoking and occupational exposures is essential. Vapors, gases, dusts, and fumes1414 Vapors, gases, dusts, and fumes
Agricultural chemicals, welding fumes, silica dust, and other occupational exposures interact with MZ genotype to increase COPD risk
common in agriculture, welding, and industrial settings pose added risk. Air pollution and long-term ozone exposure are also independent risk factors for lung impairment in both ZZ and MZ individuals.

Liver monitoring is warranted for all ZZ individuals and should be considered for MZ carriers with other liver disease risk factors. The variable clinical presentation means some ZZ individuals develop life-threatening liver disease in childhood while others remain asymptomatic into adulthood. Genetic counseling and family testing is recommended — first-degree relatives of diagnosed individuals should be offered testing to enable preventive measures.

Interactions

The Z allele interacts significantly with the S allele (rs17580)1515 S allele (rs17580)
The S allele (Glu264Val) causes milder AAT deficiency, with serum levels at 60% of normal
. SZ compound heterozygotes have AAT levels intermediate between MZ and ZZ, with 20-50% risk of emphysema depending on smoking exposure. The combination warrants similar preventive counseling as for MZ carriers, particularly regarding smoking avoidance.

Beyond SERPINA1, other genes modify lung disease risk in AAT deficiency. Cryptic SERPINA1 haplotypes1616 Cryptic SERPINA1 haplotypes
Six haplotypes with a common backbone of five SNPs were found to increase COPD risk 6-50 fold
, the highest risk reported for COPD genetics. Variants in SERPINE2 (encoding another protease inhibitor) are associated with emphysema severity in autopsy studies.

The relationship between AAT deficiency and liver disease in other chronic conditions is complex. Z allele carriage increases liver disease risk in cystic fibrosis1717 Z allele carriage increases liver disease risk in cystic fibrosis
4.17-fold increased odds of CF-related liver disease
and chronic hepatitis C, suggesting that AAT deficiency exacerbates liver injury from other causes. However, hepatocellular carcinoma risk in ZZ cirrhosis is lower (0.88%/year) than in cirrhosis from viral hepatitis or NASH, challenging earlier assumptions about cancer risk.

A compound implication for MZ + active smoking + occupational dust/fume exposure would be warranted given the documented three-way interaction, with recommendations for aggressive exposure reduction and earlier pulmonary function monitoring.

A Genetic Crossroads Between Gut Immunity and Inflammatory Disease

Your MST1 gene encodes macrophage-stimulating protein (MSP), a critical regulator of innate immune responses11 innate immune responses
The first-line defense system that responds to pathogens without prior exposure
to bacterial challenges in the gut. The rs3197999 variant causes an arginine-to-cysteine substitution at position 689 (R689C) in the protein, located within a critical receptor-binding domain that determines how effectively MSP activates immune cells.

This variant sits at the intersection of immune surveillance and inflammatory disease. It's one of the most consistently replicated genetic risk factors22 It's one of the most consistently replicated genetic risk factors
Confirmed across multiple populations and study designs
for inflammatory bowel disease (IBD) and primary sclerosing cholangitis (PSC), appearing in genome-wide association studies with odds ratios of 1.20 for IBD and 1.51 for PSC. Unlike many genetic variants that simply reduce protein function, the Cys689 variant actually enhances certain aspects of macrophage activity—a gain of function that paradoxically increases inflammation risk.

The Mechanism

MSP is secreted primarily by the liver and circulates in blood as an inactive precursor. When cleaved by proteases at sites of tissue injury or bacterial invasion, it becomes biologically active and binds to the RON receptor tyrosine kinase on macrophages, epithelial cells, and other immune cells33 epithelial cells, and other immune cells
RON is expressed in tissues throughout the body but especially abundant in gut-associated immune cells
. This triggers signaling cascades that regulate chemotaxis (cell migration toward infection sites), phagocytosis (engulfment of bacteria), and cytokine production.

The R689C substitution occurs in the serine protease homology domain44 occurs in the serine protease homology domain
This domain forms the receptor-binding surface but is catalytically inactive—MSP is a "dead" protease that retained its binding structure
of the MSP β-chain. Functional studies using macrophage-like cell lines showed that the Cys689 variant significantly increases the stimulatory effect55 significantly increases the stimulatory effect
Enhanced chemotaxis and proliferation compared to wild-type Arg689
of MSP on chemotaxis and proliferation. This gain of function suggests the variant creates a hyperactive immune response—macrophages migrate more aggressively and respond more vigorously to bacterial signals.

Paradoxically, individuals with the AA genotype have approximately 10-fold lower MSP binding affinity66 approximately 10-fold lower MSP binding affinity
Measured in receptor-binding assays
to RON and profoundly decreased serum MSP levels. The mechanism appears to involve altered protein stability or secretion efficiency. So the variant simultaneously increases cellular responsiveness when MSP binds but reduces overall circulating MSP—a complex functional profile that may dysregulate normal immune homeostasis in the gut.

The Evidence

The initial discovery came from gene-centric mapping77 The initial discovery came from gene-centric mapping
Rather than unbiased GWAS, this study specifically targeted chromosome 3p21 based on prior linkage evidence
of the 3p21 IBD linkage region in 2008. Fisher and colleagues identified rs3197999 with P=3.62×10⁻⁶ in a combined screen of 1,020 IBD patients and replication in 745 additional cases. The variant showed association with both Crohn's disease and ulcerative colitis.

A 2008 meta-analysis88 A 2008 meta-analysis
Barrett et al., combining data from three studies with 3,230 Crohn's cases and 4,829 controls
confirmed the association with genome-wide significance (P=1.15×10⁻¹², OR=1.20). A parallel study in 3,133 ulcerative colitis patients and 4,494 controls replicated the finding (combined P=3.8×10⁻⁹).

The variant's role extends beyond classic IBD. A 2011 genome-wide association study in primary sclerosing cholangitis99 A 2011 genome-wide association study in primary sclerosing cholangitis
Melum et al., 715 Scandinavian and German PSC cases vs 2,962 controls
identified rs3197999 as one of two non-HLA susceptibility loci with P=1.1×10⁻¹⁶ and OR=1.51. PSC is a chronic cholestatic liver disease characterized by bile duct inflammation and frequently co-occurs with IBD. Homozygous AA carriers also show increased risk1010 Homozygous AA carriers also show increased risk
OR=1.97 for overall cholangiocarcinoma, OR=1.84 for PSC-unrelated biliary tract cancer
of cholangiocarcinoma, the feared cancer complication of chronic bile duct inflammation.

Gene-gene interaction analysis1111 Gene-gene interaction analysis
In a Chinese Crohn's disease cohort of 1,590 cases and 1,478 controls
identified significant interactions between MST1 and JAK2, IL23R, and PTGER4—all genes involved in inflammatory signaling pathways. This suggests MST1 participates in broader networks regulating mucosal immunity rather than acting in isolation.

A 2024 pediatric study1212 A 2024 pediatric study
367 pediatric IBD patients (197 Crohn's, 170 ulcerative colitis)
found the CC genotype was positively associated with systemic steroid use in Crohn's disease and more common in female CD patients, suggesting the variant may influence disease severity or treatment requirements beyond simple susceptibility.

Practical Actions

If you carry one or two copies of the A allele, your genetic profile suggests a heightened inflammatory response to gut bacterial challenges. This doesn't guarantee you'll develop IBD—most carriers remain healthy—but it warrants attention to gut barrier health and inflammatory triggers.

Dietary patterns matter significantly1313 Dietary patterns matter significantly
Multiple studies show Mediterranean diet adherence reduces IBD risk and disease activity
. The Mediterranean diet's benefits likely operate through multiple mechanisms: omega-3 fatty acids modulate inflammatory signaling, polyphenols reduce oxidative stress, and fiber feeds beneficial bacteria that produce short-chain fatty acids (SCFAs) like butyrate, which strengthen gut barrier integrity. Higher adherence to Mediterranean diet1414 Higher adherence to Mediterranean diet
In first-degree relatives of Crohn's patients, a population already at higher genetic risk
was associated with reduced intestinal inflammation and lower risk of later-onset Crohn's disease.

Monitoring inflammatory markers can catch subclinical inflammation before symptoms appear. C-reactive protein (CRP) and fecal calprotectin1515 C-reactive protein (CRP) and fecal calprotectin
Calprotectin is more specific for intestinal inflammation and correlates better with endoscopic findings than CRP
are the most validated biomarkers in IBD. Fecal calprotectin <250 μg/g identifies mucosal healing with 94% sensitivity, and rising levels can predict relapse earlier than clinical symptoms.

Probiotics have mixed evidence in IBD. For ulcerative colitis specifically1616 For ulcerative colitis specifically
Not for Crohn's disease, where most studies show no benefit over placebo
, certain strains show promise: Escherichia coli Nissle 1917 for maintenance of remission, and VSL#3 (a multi-strain probiotic) for inducing remission in mild to moderately active UC. VSL#3 is the only probiotic with strong evidence1717 VSL#3 is the only probiotic with strong evidence
Particularly for pouchitis, an inflammatory condition of the surgically created intestinal pouch
in common IBD practice.

Interactions

The rs3197999 variant lies in a complex genomic region on chromosome 3p21 that contains 10 genes within a 336 kb associated interval. Notably, it's in linkage disequilibrium1818 Notably, it's in linkage disequilibrium
Almost complete cosegregation of minor alleles, D'=0.60, r²=0.35
with rs1050450 in GPX1 (glutathione peroxidase 1), which causes a Pro198Leu substitution that reduces GPx-1 antioxidant enzyme activity. Some researchers have proposed that GPX1, rather than MST1, might be the pathophysiologically relevant gene at this locus.

The functional distinction is important: MST1 R689C affects innate immune activation (macrophage chemotaxis and bacterial response), while GPX1 Pro198Leu affects antioxidant capacity (ability to neutralize reactive oxygen species produced during inflammation). Both mechanisms could plausibly contribute to IBD pathogenesis. It's possible that the true causal variant is neither rs3197999 nor rs1050450 but another variant in linkage disequilibrium with both, or that both variants independently contribute to disease risk through complementary pathways. Given the proximity and LD structure, individuals with the MST1 A allele often also carry the GPX1 Leu198 allele, potentially compounding inflammatory susceptibility through both enhanced immune activation and reduced antioxidant defense.

Gene-gene interaction studies have identified significant epistasis between MST1 and several inflammatory pathway genes. The MST1–JAK2 interaction was replicated across original and validation datasets, and MST1 showed consistent interactions with IL23R (interleukin-23 receptor, a validated IBD susceptibility gene targeted by biologics like ustekinumab) and PTGER4 (prostaglandin E receptor 4, involved in inflammatory signaling). These interactions suggest that MST1 genetic effects may be amplified or modified by variation in other immune genes, and that personalized risk assessment should eventually incorporate multi-locus profiles rather than single-SNP analysis.

The "Bliss Gene" — FAAH and Your Endocannabinoid System

The FAAH gene encodes fatty acid amide hydrolase11 fatty acid amide hydrolase
an enzyme that breaks down anandamide, your brain's natural "bliss molecule"
, which binds to the same receptors as THC from cannabis. The rs324420 variant changes a single amino acid at position 129 from proline to threonine, making the enzyme more vulnerable to breakdown. Carriers of the A allele produce less stable FAAH enzyme, resulting in elevated anandamide levels throughout the brain and body22 elevated anandamide levels throughout the brain and body.

This common variant occurs in approximately 38% of Europeans (33% heterozygous AC, 5% homozygous AA), with markedly higher frequencies in African populations (~45%) and lower in East Asians (~10%)33 markedly higher frequencies in African populations (~45%) and lower in East Asians (~10%). The geographic distribution has led researchers to investigate whether this variant influences not just individual psychology but national happiness levels across cultures44 national happiness levels across cultures.

The Mechanism

The Pro129Thr substitution doesn't change FAAH's catalytic activity — the enzyme still breaks down anandamide at normal rates when it's present. The critical difference is protein stability55 protein stability. The threonine variant (A allele) is more sensitive to proteolytic degradation, meaning cells produce approximately 50% less FAAH enzyme in AA homozygotes compared to CC individuals. This leads to chronically elevated anandamide signaling through CB1 cannabinoid receptors, primarily in the amygdala, prefrontal cortex, and hippocampus — brain regions central to fear processing, stress response, and emotional regulation66 fear processing, stress response, and emotional regulation.

Elevated anandamide acts like a natural anxiolytic, dampening the amygdala's threat response and enhancing fronto-amygdala connectivity. In neuroimaging studies, A-allele carriers show reduced amygdala activation when viewing threatening faces77 A-allele carriers show reduced amygdala activation when viewing threatening faces and faster fear extinction learning — the ability to unlearn fear associations after a threat is no longer present.

The Evidence

Fear extinction and trauma response: Multiple fMRI studies demonstrate that A-allele carriers show enhanced fear extinction recall88 Multiple fMRI studies demonstrate that A-allele carriers show enhanced fear extinction recall, the neurobiological foundation of exposure therapy for PTSD and anxiety disorders. In a study of 55 healthy adults, AC heterozygotes showed significantly greater extinction recall compared to CC individuals, with distinct neural activation patterns in the ventromedial prefrontal cortex. This suggests the A allele may confer resilience to trauma-related symptoms, though one large veteran study (N=949) found no protective effect against PTSD development after military deployment99 one large veteran study (N=949) found no protective effect against PTSD development after military deployment.

Pain sensitivity: Women with the AA genotype undergoing breast cancer surgery reported significantly less sensitivity to cold pain1010 Women with the AA genotype undergoing breast cancer surgery reported significantly less sensitivity to cold pain (β = −1.48, 95% CI −2.14 to −0.8) and required less postoperative analgesia. A rare case report documented a Scottish woman with complete pain insensitivity, anxiety immunity, and accelerated wound healing1111 a Scottish woman with complete pain insensitivity, anxiety immunity, and accelerated wound healing, who carried both the AA genotype and a rare deletion in FAAH-OUT, a regulatory pseudogene.

Mood and well-being: The relationship with happiness is complex. A longitudinal study of 2,822 individuals found that each A-allele was associated with lower subjective well-being scores1212 A longitudinal study of 2,822 individuals found that each A-allele was associated with lower subjective well-being scores at both timepoints (B: −0.52, p = 0.007). However, cross-national studies show countries with higher A-allele frequencies report greater national happiness1313 countries with higher A-allele frequencies report greater national happiness, with Ghana, Nigeria, and Mexico ranking highest. The paradox may reflect that chronically elevated anandamide leads to CB1 receptor downregulation over time, potentially disrupting normal reward processing.

Substance use: The A allele shows divergent effects across substances1414 divergent effects across substances. For cannabis, AA individuals have roughly half the dependence rate of CC carriers (11% vs 26%) and report less subjective pleasure from marijuana — likely because they already have elevated endogenous cannabinoid tone. However, for alcohol, A-allele carriers with European ancestry show more severe alcohol dependence1515 A-allele carriers with European ancestry show more severe alcohol dependence when they do develop problems (13 more binge drinking days over a 90-day period), potentially via an indirect pathway through lower subjective well-being. The A allele has also been associated with increased risk of methamphetamine dependence in Malaysian populations1616 increased risk of methamphetamine dependence in Malaysian populations (OR 2.0-3.7 depending on ethnicity).

Athletic performance: Elite volleyball and rink-hockey players with the A allele were 2-3 times more likely to achieve "super athlete" status1717 Elite volleyball and rink-hockey players with the A allele were 2-3 times more likely to achieve "super athlete" status, possibly due to enhanced stress coping and pain tolerance during training and competition. However, contradictory evidence exists, with some studies showing the AA genotype more common in sedentary individuals than elite athletes.

Practical Implications

If you carry the A allele, your endocannabinoid system is running at a higher baseline, with downstream effects on how you process fear, pain, stress, and reward. This may make you more resilient to acute stressors and physical pain, but potentially more vulnerable to mood disturbances if you experienced childhood trauma, which can interact with the A allele to increase anxiety and depression risk1818 interact with the A allele to increase anxiety and depression risk.

For mental health treatment, A-allele carriers may respond particularly well to exposure-based therapies for anxiety and PTSD, given the enhanced fear extinction consolidation. FAAH inhibitors — drugs designed to mimic the A-allele effect — are in clinical trials for anxiety disorders and showed promise in a 12-week trial for social anxiety1919 promise in a 12-week trial for social anxiety, though results have been mixed.

For pain management, AA individuals may require less analgesia for acute pain but should still discuss post-operative pain plans with providers, as individual variation is substantial. The reduced pain sensitivity doesn't eliminate pain entirely.

For substance use, recognize that if you have the A allele, cannabis will likely be less appealing and addictive, but alcohol may pose greater risk if you develop problematic use patterns, particularly if you also struggle with low mood.

Interactions

The FAAH rs324420 variant interacts with other endocannabinoid and stress-response genes. CNR1 rs2180619 (cannabinoid receptor 1) and CRHR1 rs110402 (corticotropin-releasing hormone receptor 1) have been studied for interactions with FAAH in stress response, though a large veteran study found no significant gene-gene interaction effects2020 a large veteran study found no significant gene-gene interaction effects. The COMT rs4680 variant (which affects dopamine metabolism) has been shown to interact with FAAH rs324420 in modulating placebo analgesia response2121 interact with FAAH rs324420 in modulating placebo analgesia response.

Early life stress represents a critical environmental interaction. Individuals with the A allele who experienced repetitive childhood trauma show significantly higher anxiety and depression scores2222 Individuals with the A allele who experienced repetitive childhood trauma show significantly higher anxiety and depression scores compared to CC carriers with similar trauma exposure, suggesting chronically elevated anandamide during neurodevelopment may disrupt normal CB1 receptor function and long-term stress response systems.