CYP2A6*2 — The Nicotine Slow-Burn Variant
Your liver converts nicotine to cotinine in minutes. The enzyme doing this work — CYP2A6 —
is encoded by one of the most polymorphic drug-metabolism genes in the human genome. The
CYP2A6*2 allele11 CYP2A6*2 allele
rs1801272, also called L160H or Leu160His
swaps a single amino acid at position 160 of the protein, replacing leucine with histidine. The
result: the enzyme is catalytically dead. People carrying this variant metabolize nicotine far
more slowly than average — and the downstream consequences touch everything from smoking
behaviour to cancer risk to dosing of several unrelated drugs.
The Mechanism
CYP2A6 is responsible for roughly 70–80% of hepatic nicotine clearance, converting nicotine to
its primary metabolite cotinine22 cotinine
half-life ~16 hours; used as a biomarker for tobacco exposure.
The L160H substitution disrupts the haem-binding domain of the enzyme, abolishing catalytic
activity entirely. Heterozygous carriers (*1/*2) have approximately 50% reduced activity;
homozygous carriers (*2/*2) have essentially zero CYP2A6 function via this pathway. Because
CYP2A6 also handles ~20% of CYP2B6-independent nicotine metabolism and the bulk of cotinine
further oxidation to 3-hydroxycotinine33 ~20% of CYP2B6-independent nicotine metabolism and the bulk of cotinine
further oxidation to 3-hydroxycotinine, the impact
cascades across the entire nicotine-clearance pathway.
Beyond nicotine, CYP2A6 is the primary metaboliser of coumarin (the fragrance compound), the aromatase inhibitor letrozole, the prodrug tegafur (activated to 5-fluorouracil), and contributes meaningfully to efavirenz and valproic acid clearance. Poor metabolisers of this enzyme are not just slow smokers — they are pharmacokinetically different across a clinically significant drug panel.
The Evidence
The smoking-cessation advantage of slow metabolisers is well-documented. A systematic review
of 34 studies44 systematic review
of 34 studies
Jones et al. Nicotine Tob Res, 2022
found that reduced-function CYP2A6 carriers had more than twice the odds of quitting unaided
compared to normal metabolisers (OR 2.05, 95% CI 1.23–3.42) in European ancestry populations.
The mechanism: when nicotine lingers longer in plasma, its aversive and satiating effects are
amplified, and the urge to re-dose is suppressed.
Reduced carcinogen activation is a parallel benefit. CYP2A6 activates tobacco-specific
nitrosamines (TSNA) — including NNK, a potent lung carcinogen — as well as polycyclic aromatic
hydrocarbons. A nested case-control study of 325 lung cancer cases55 nested case-control study of 325 lung cancer cases
Yuan et al. Int J Cancer,
2016 in Chinese men found poor metabolisers had
an odds ratio of 0.64 for lung cancer, largely mediated by lower carcinogen-equivalent intake.
A similar pattern has been observed in other Asian cohorts with high frequencies of CYP2A6
null alleles.
The picture is inverted for nicotine replacement therapy (NRT). When nicotine is delivered externally and continuously (patch, gum, lozenge), the slow-clearance advantage disappears: nicotine accumulates more, side effects (nausea, palpitations, insomnia) become more prominent, and the cessation benefit narrows. The 2022 meta-analysis66 2022 meta-analysis reported that with NRT, the cessation benefit was attenuated; with bupropion, intermediate/slow metabolisers actually showed worse outcomes (OR 0.86, 95% CI 0.79–0.94).
Practical Actions
For smokers: the natural tendency in poor metabolisers is to smoke fewer cigarettes per day and to find quitting easier — align with this biology rather than against it. If NRT is used, lower-dose formulations are appropriate; standard doses may cause accumulation and side effects. Varenicline (Champix/Chantix) is not primarily metabolised by CYP2A6 and is not affected.
For non-smoking contexts, this variant has direct drug-dosing implications. Letrozole (used for breast cancer and fertility), tegafur (colorectal cancer prodrug), and efavirenz (HIV therapy) all require attention when CYP2A6 activity is absent. Standard doses may result in elevated exposure; discuss genotype-informed dosing with oncology or infectious-disease prescribers.
Interactions
CYP2A6 activity is also influenced by other variants in the same gene. The CYP2A6 deletion alleles (*4A, *4B, *4C, *4D) completely eliminate one gene copy and are most common in East Asian populations. Compound heterozygotes carrying *2 on one chromosome and a deletion on the other are effectively null metabolisers. Other partial-activity alleles (*7, *10, *12, *17) can compound the effect in trans. Full phenotype prediction requires haplotype-based analysis (e.g. PharmCAT or star-allele calling) rather than individual SNP calls. Related variants rs28399433 (*4 deletion tag) and rs5031016 (*7) are relevant for East Asian ancestry individuals in particular.
ACTN3 R577X — The Sprint Gene
The ACTN3 gene encodes alpha-actinin-311 alpha-actinin-3
A structural protein found exclusively in type II (fast-twitch) muscle fibers, where it anchors the contractile apparatus at the Z-disc, a structural protein found exclusively in
fast-twitch (type II) muscle fibers. It is arguably the most replicated finding
in exercise genetics. A single C-to-T change at position 577 converts an arginine
codon to a premature stop codon, completely abolishing protein production. About
1.5 billion people worldwide carry two copies of the T allele and produce no
alpha-actinin-3 at all — yet they are perfectly healthy. This makes ACTN3 R577X
one of the most common "loss of function" variants in the human genome.
The Mechanism
Alpha-actinin-3 is a sarcomeric22 sarcomeric
Sarcomere: the basic contractile unit of skeletal muscle, bounded by Z-discs protein that crosslinks actin
filaments at the Z-disc of fast-twitch muscle fibers. It plays a structural and
signaling role in these fibers, contributing to their ability to generate rapid,
forceful contractions. When the R577X stop codon (T allele) is present on both
chromosomes, the protein is entirely absent. Its closely related paralog,
alpha-actinin-233 alpha-actinin-2
ACTN2 is expressed in all muscle fibers and partially compensates for ACTN3 loss, explaining why XX individuals have no disease phenotype,
partially compensates for this loss, which is why deficiency causes no disease.
However, the compensation is imperfect. Fast-twitch fibers lacking alpha-actinin-3
undergo a subtle remodeling: they shift toward slower, more oxidative
contractile properties44 contractile properties
Including changes in myosin heavy chain isoforms and sarcoplasmic reticulum calcium handling,
improved aerobic enzyme activity, and enhanced fatigue recovery. In essence,
fast-twitch fibers in XX individuals behave a bit more like slow-twitch fibers.
The Evidence
The landmark 2003 study55 landmark 2003 study
Yang N et al. ACTN3 genotype is associated with human elite athletic performance. Am J Hum Genet, 2003
by Yang and colleagues at the Australian Institute of Sport found that the RR
genotype was significantly overrepresented among elite sprint and power athletes,
while no female power athlete or Olympic sprinter in their cohort had the XX
genotype. This has since been replicated extensively.
A meta-analysis of 44 studies66 meta-analysis of 44 studies
Houweling PJ et al. Association of the ACTN3 R577X polymorphism with elite power sports: A meta-analysis. PLoS One, 2019
covering 20,753 participants found the R allele at OR 1.21 (95% CI 1.07-1.37) in
power athletes versus controls. The most recent systematic review77 recent systematic review
El Ouali M et al. Systematic review and meta-analysis of ACTN3 R577X in power vs endurance athletes. Sports Med Open, 2024
of 25 studies (14,541 participants) confirmed RR overrepresentation in power
athletes with OR 1.48 (95% CI 1.25-1.75, p < 0.00001) versus controls, while
the XX genotype was significantly underrepresented (OR 0.63).
The biological mechanism was confirmed in ACTN3 knockout mice88 ACTN3 knockout mice
MacArthur DG et al. Loss of ACTN3 gene function alters mouse muscle metabolism. Nat Genet, 2007,
which showed a clear shift in fast-fiber metabolism toward aerobic pathways,
reduced fast fiber diameter, and increased endurance capacity.
Beyond Athletics
ACTN3 R577X is more than a "speed gene." The XX genotype has been associated with
superior cold tolerance99 superior cold tolerance
Wyckelsma VL et al. Loss of alpha-actinin-3 provides superior cold resilience and muscle heat generation. Am J Hum Genet, 2021 —
XX individuals maintain core body temperature better during cold exposure through
altered muscle thermogenesis (increased muscle tone rather than shivering). This
may explain why the X allele increased in frequency as humans migrated to colder
climates, reaching its highest prevalence in South Asian and East Asian populations.
The XX genotype has also been linked to increased injury susceptibility1010 increased injury susceptibility
Systematic review of ACTN3 R577X and non-contact injury risk in trained athletes,
particularly non-contact muscle injuries and ligament damage, as well as greater
exercise-induced muscle damage after eccentric exercise. In older adults,
alpha-actinin-3 deficiency is associated with reduced muscle strength, decreased
bone mineral density, and potentially faster sarcopenic decline.
Practical Implications
For CC (RR) individuals: your fast-twitch fibers are optimized for explosive power. You may have a natural advantage in sprinting, jumping, and strength sports. High-intensity interval training and power-focused resistance training align well with your fiber type profile.
For TT (XX) individuals: your muscle fibers are shifted toward endurance and aerobic efficiency. You may excel in longer-duration activities and recover from aerobic exercise more effectively. Pay extra attention to gradual eccentric loading progression and injury prevention, since your connective tissues may be more vulnerable to high-force impacts.
For CT (RX) individuals: you have an intermediate profile with one functional copy, giving you a versatile mix of power and endurance capacity. Most elite athletes across disciplines carry this genotype.
Interactions
ACTN3 R577X has been studied alongside ACE I/D (angiotensin-converting enzyme insertion/deletion polymorphism) and PPARA variants in exercise genetics. The ACE DD genotype combined with ACTN3 RR appears to compound power/sprint advantages, while ACE II plus ACTN3 XX may compound endurance traits. However, these interactions are based on observational athlete cohort data and remain at the level of moderate evidence.
HNF4A P2 Promoter — The Beta-Cell Insulin Hormone Switch
Your pancreatic beta cells carry a dedicated gene control switch — the
P2 promoter of HNF4A11 P2 promoter of HNF4A
HNF4A (Hepatocyte Nuclear Factor 4 Alpha) has two
promoters: P1 drives expression in adult liver, while P2 drives a distinct
set of HNF4A isoforms (exons 7–12) exclusively in pancreatic beta cells and
fetal liver. These P2-driven isoforms control the insulin secretion gene network.
Rare inactivating mutations at the P2 locus cause MODY1 (maturity-onset diabetes
of the young type 1) — that coordinates
dozens of genes required for glucose-stimulated insulin secretion. rs1884614 is a
common intronic variant located within the P2 haplotype block on chromosome 20.
It does not change any amino acid, but the T allele tags a haplotype associated
with subtly reduced P2 promoter activity — an attenuated version of the same
biological axis disrupted in MODY1.
The Mechanism
The HNF4A P2 promoter region22 HNF4A P2 promoter region
Located approximately 46 kb upstream of the
P1/liver promoter, the P2 promoter is active specifically in pancreatic beta cells.
P2-driven HNF4A isoforms directly regulate glucokinase (the beta-cell glucose
sensor), the Kir6.2 potassium channel subunit (gating insulin release), and the
insulin gene itself. governs a transcription factor network that scales the
insulin secretion response to incoming glucose. rs1884614 falls within an intronic
region that is annotated as a non-coding transcript variant in R3HDML-AS1 (a nearby
lncRNA on the minus strand), but its biological significance arises from its tight
co-inheritance with the functional P2 haplotype. The T allele is in near-perfect
linkage disequilibrium (r²>0.95) with rs4810424, rs1884613, and rs2144908 — all of
which tag the same P2 promoter risk haplotype associated with reduced beta-cell
HNF4A expression.
The practical result: beta cells carrying the risk haplotype produce less HNF4A
protein from the P2 transcripts, quieting the downstream insulin secretion machinery.
This manifests as a blunted
glucose-stimulated insulin secretion33 glucose-stimulated insulin secretion
Specifically the acute first-phase and
sustained second-phase insulin response to a glucose challenge, not basal fasting
insulin — the deficit is a dynamic secretory defect rather than a structural one.
This is why an oral glucose tolerance test (OGTT) reveals it while fasting glucose
can remain normal for years. response — quantifiable as a smaller insulin
area-under-curve during an oral glucose challenge.
In established type 2 diabetes, the P2 isoform undergoes paradoxical re-activation
in the liver: chronically elevated glucagon activates
TET3, which demethylates the P2 promoter44 TET3, which demethylates the P2 promoter
Li et al. Nature Communications 2020
(PMID 31953394) — TET3 is recruited by FOXA2 to demethylate the P2 promoter in
hepatocytes, switching on the fetal HNF4A isoform and driving excess hepatic glucose
output. This feed-forward loop worsens hyperglycemia in established T2D.,
driving excess hepatic glucose production. Carriers of the P2 risk haplotype may
be more susceptible to this re-activation under metabolic stress.
The Evidence
The variant was identified as part of the HNF4A P2 haplotype signal in
Silander et al. Diabetes 200455 Silander et al. Diabetes 2004
Silander K et al. Genetic variation near the
hepatocyte nuclear factor-4 alpha gene predicts susceptibility to type 2 diabetes.
Diabetes 2004. PMID:15047633
(495 Finnish families, rs2144908 OR 1.33, P=0.011 — rs2144908 is in r²≈0.99 LD
with rs1884614). Direct association of rs1884614 was confirmed by
Hansen et al. Diabetologia 200566 Hansen et al. Diabetologia 2005
Hansen SK et al. Variation near the hepatocyte
nuclear factor (HNF)-4alpha gene associates with type 2 diabetes in the Danish
population. Diabetologia 2005. PMID:15735891
in a large Danish case-control study (1,400 T2D cases, 4,700 glucose-tolerant
controls): T allele OR 1.14 (P=0.02) for T2D and elevated 2-hour post-OGTT
glucose (P=0.05). In
Damcott et al. Diabetes 200477 Damcott et al. Diabetes 2004
Damcott CM et al. Polymorphisms in both promoters
of hepatocyte nuclear factor 4-alpha are associated with type 2 diabetes in the Amish.
Diabetes 2004. PMID:15561969 (n=698
non-diabetic Amish), the T allele was directly associated with higher glucose
area-under-curve during an OGTT (P=0.022) — a direct measure of blunted beta-cell
secretory capacity.
A key mechanistic study by
Tokunaga et al. Endocrine Journal 200888 Tokunaga et al. Endocrine Journal 2008
Tokunaga A et al. A common P2 promoter
polymorphism of the hepatocyte nuclear factor-4alpha gene is associated with insulin
secretion in non-obese Japanese with type 2 diabetes. Endocr J 2008. PMID:18654034
(349 Japanese T2D patients, 203 controls) refined the phenotypic target: the TT
genotype was associated with reduced insulin secretion AUC specifically in non-obese
subjects (BMI <25 kg/m²; P=0.027), but not in obese subjects. This is clinically
important — in lean individuals, obesity-related insulin resistance cannot mask the
HNF4A secretory deficit, making the genotype's impact visible as impaired
glucose-stimulated insulin secretion per se.
Under immunological stress — which places extraordinary demand on beta-cell secretory
reserve — the variant's impact becomes more pronounced.
Yang et al. Transplantation 201199 Yang et al. Transplantation 2011
Yang J et al. Genetic and clinical risk factors
of new-onset diabetes after transplantation in Hispanic kidney transplant recipients.
Transplantation 2011. PMID:21544032
found that the TT genotype carried OR 2.44 (95% CI 1.42–4.48, P=0.002) for
new-onset diabetes after transplantation in 303 Hispanic kidney recipients — the
strongest single genetic predictor of post-transplant diabetes in that study.
Calcineurin inhibitors (tacrolimus, cyclosporine) further impair beta-cell function
by reducing calcineurin-NFAT signaling, compounding the HNF4A P2 transcriptional
deficit.
Practical Actions
Because the risk mechanism is reduced HNF4A-driven insulin secretory capacity rather than insulin resistance, the central strategy is limiting the acute insulin secretory demand placed on beta cells at each meal. This means favouring lower-glycemic-load carbohydrates — legumes, intact grains, non-starchy vegetables — that produce slower, lower glucose peaks requiring less peak insulin release. Annual fasting glucose and HbA1c screening allows detection of emerging secretory deficit before overt diabetes develops. An oral glucose tolerance test (OGTT) is more sensitive than fasting glucose for this variant's mechanism, since the deficit is specifically in glucose-stimulated (not basal) insulin secretion.
For non-obese T carriers, the evidence is most robust: lean individuals cannot compensate for the secretory deficit through insulin resistance-driven hyperinsulinemia, making the HNF4A P2 haplotype effect most visible and most actionable in this body-composition context.
For anyone anticipating immunosuppressive therapy (kidney, liver, or heart transplant), the TT genotype's 2.44-fold post-transplant diabetes risk is clinically relevant information to share with the transplant team before initiation of calcineurin inhibitors.
Interactions
rs1884614 is in near-perfect LD (r²>0.95) with the companion P2 haplotype tags rs1884613, rs4810424, and rs2144908 — all currently in the GeneOps database. These variants probe the same causal P2 haplotype signal and are not independent risk factors: carrying the T allele at rs1884614 and the G allele at rs1884613 conveys no additional risk beyond either alone. Their independent database entries exist for chip coverage breadth — different genotyping arrays tag the haplotype through different SNPs.
The gene-gene interactions most relevant to this locus are with WFS1 rs10010131 (beta-cell ER homeostasis; combined OR 3.0 in Ashkenazi subjects, Neuman 2010, PMID:20361036) and TCF7L2 rs7903146 (Wnt-driven incretin signaling; combined OR 2.4). Both represent independent beta-cell stress pathways that amplify the HNF4A P2 transcriptional deficit.
LIPG — The Endothelial Lipase That Shapes Your HDL
Your HDL cholesterol is not just a passive bystander in cardiovascular health — it is
actively dismantled and rebuilt by a family of enzymes, and endothelial lipase (EL)11 endothelial lipase (EL)
Encoded by the LIPG gene on chromosome 18; the only lipase secreted specifically
from vascular endothelial cells is one of
the primary drivers of HDL catabolism. Unlike lipoprotein lipase (which targets
triglyceride-rich particles), EL preferentially hydrolyzes the phospholipid surface
of HDL, accelerating its breakdown and removal from circulation. Higher EL activity
means lower HDL; lower EL activity means higher HDL. The rs2000813 variant is a
common missense change in LIPG that, through an indirect regulatory mechanism, is
associated with modestly higher HDL cholesterol in carriers of the T allele.
The Mechanism
The rs2000813 variant produces a Thr111Ile substitution22 Thr111Ile substitution
A change from threonine
to isoleucine at amino acid position 111 of the endothelial lipase protein
in the LIPG protein. Careful in vitro studies have shown that the Ile111 variant
has essentially identical phospholipase activity, protein stability, and regulation by
ANGPTL3 and ANGPTL433 ANGPTL3 and ANGPTL4
Angiopoietin-like proteins 3 and 4 are endogenous inhibitors
of endothelial lipase; they bind and inactivate EL on the surface of vascular
cells compared to the wild-type enzyme.
The amino acid change itself is therefore functionally silent.
The HDL association arises through a different route. The Ile111-encoding T allele
is in high linkage disequilibrium44 high linkage disequilibrium
LD means the two alleles are inherited together
on the same chromosomal segment so frequently that knowing one predicts the other
with high accuracy (R²=0.8) with
rs34474737, a variant in the 5' UTR of LIPG that directly reduces promoter activity
and lowers EL expression. Carriers of the T allele at rs2000813 therefore tend to
have lower circulating EL levels — not because the enzyme works differently, but
because less of it is produced. Lower EL expression → slower HDL phospholipid
hydrolysis → higher HDL-C. The coding variant is a marker for the regulatory variant,
not the causal agent.
The Evidence
The GLGC (Global Lipids Genetics Consortium)55 GLGC (Global Lipids Genetics Consortium)
One of the largest GWAS consortia
for lipid traits, combining data from dozens of cohorts and hundreds of thousands
of participants meta-analysis
found rs2000813 associated with HDL-C at genome-wide significance (β = −0.15 SD
per T allele, P = 1.92×10⁻¹⁴), corresponding to approximately 2 mg/dL higher
HDL-C per T allele carried. The T allele was also associated with higher HDL2,
HDL3, and apoA-I.
Huang et al. (2019)66 Huang et al. (2019)
LIPG SNPs, their haplotypes and gene-environment interactions
on serum lipid levels; 2,498 adults from Maonan and Han Chinese populations
(Lipids in Health and Disease)
replicated the HDL and apoA-I association in two East Asian ethnic groups after
Bonferroni correction, adding population diversity to the dataset.
Crucially, higher HDL from this variant does not appear to translate into
cardiovascular protection. Vergeer et al. (2009)77 Vergeer et al. (2009)
T111I variant in the
endothelial lipase gene and risk of coronary heart disease in three independent
populations; combined n=4,107 CHD cases from DCH, Nurses' Health Study, and
Health Professionals Follow-up Study (European Heart Journal)
found a pooled CHD odds ratio of 0.95 (95% CI 0.85–1.06) — statistically
null. Mean HDL differences between T allele carriers and non-carriers were
only 0–1 mg/dL within each cohort, consistent with the modest effect size
seen in the GWAS. This finding aligns with the broader body of evidence
showing that genetically elevated HDL does not always protect against
atherosclerosis88 genetically elevated HDL does not always protect against
atherosclerosis
The HDL hypothesis has faced repeated setbacks in
Mendelian randomization studies, suggesting HDL quantity and HDL function
are different traits.
The 2024 biochemical study99 2024 biochemical study
Johansen et al., Endothelial lipase variant T111I
does not alter inhibition by angiopoietin-like proteins, Scientific Reports
definitively confirmed the protein-level neutrality of T111I and reframed
rs2000813 as a marker SNP for a regulatory haplotype — an important caution
against inferring direct functional consequences from coding variant annotations alone.
Practical Actions
For CC homozygotes (the common genotype with highest EL expression), the actionable insight is awareness: lower baseline HDL from higher EL activity is a genetically normal pattern, not evidence of lifestyle failure. If HDL runs low on a standard lipid panel, diet quality — specifically the ratio of polyunsaturated to saturated fat and omega-3 intake — can modulate EL activity through inflammatory pathways. LIPG expression is upregulated by pro-inflammatory cytokines, so any strategy that reduces systemic inflammation may blunt EL-driven HDL catabolism.
For CT and TT carriers, the modestly elevated HDL reflects lower EL expression rather than superior reverse cholesterol transport. Standard lipid monitoring remains appropriate; the slight HDL advantage does not warrant different cholesterol management targets.
Interactions
The rs2000813 T allele is inherited as part of a broader LIPG haplotype that also includes rs3813082, rs3744843, and the causal regulatory variant rs34474737. Haplotype analysis consistently shows stronger lipid associations than any single SNP alone. A compound action combining rs2000813 (CT or TT) with rs2278236 in ANGPTL4 (a direct inhibitor of endothelial lipase) could be informative: carriers of HDL-lowering ANGPTL4 variants on top of the CC (high-EL) LIPG genotype would represent a double-hit for depressed HDL, potentially warranting targeted omega-3 monitoring and dietary fat quality review. Both variants are in the lipid-fat-metabolism pathway.
PON1's Hidden Regulator — How an Intronic Variant Shapes HDL's Antioxidant Power
Paraoxonase-1 (PON1) is one of the most important enzymes you've likely never heard of. It travels attached to HDL particles11 HDL particles
High-density lipoprotein, the "good cholesterol" that ferries cholesterol from tissues back to the liver in the bloodstream, where its primary job is to hydrolyze oxidized phospholipids22 oxidized phospholipids
Lipid peroxides that accumulate on LDL and artery walls, triggering the inflammatory cascade that leads to atherosclerotic plaque before they can damage artery walls. PON1 activity varies enormously between individuals — as much as 40-fold — and that variation is largely genetic. Most research focuses on the well-known coding variants Q192R (rs662) and L55M (rs854560), but a 2012 genome-wide analysis of the PON gene cluster found that intronic variants like rs2237583 contribute meaningfully to explaining that activity gap.
The Mechanism
rs2237583 sits in an intron of PON1 — a non-coding stretch between exons — and does not change any amino acid in the final protein. Instead, it appears to affect PON1 gene expression33 gene expression
The process by which DNA is transcribed into mRNA and then translated into protein; intronic variants can alter splicing efficiency, RNA stability, or regulatory element binding through regulatory mechanisms. The PON1 gene is transcribed on the minus strand of chromosome 7, and the rs2237583 T allele (plus strand) corresponds to the A allele in coding-strand notation. Carriers of the T allele show measurably higher PON1 arylesterase activity, which is the functional readout most tightly linked to HDL's capacity to protect LDL from oxidative modification. The T allele is the minor allele globally (~24–27% in European populations, ~63% in East Asians), meaning the majority of people carry the activity-limiting C allele.
The Evidence
Two large-scale genetic studies establish this variant's functional relevance. A comprehensive tag-SNP analysis of the PON gene cluster44 comprehensive tag-SNP analysis of the PON gene cluster
Kim et al. 2012. Additional Common Polymorphisms in the PON Gene Cluster Predict PON1 Activity but Not Vascular Disease. J Lipids. in 1,328 Caucasian males found that rs2237583 independently predicted arylesterase activity beyond the four established functional PON1 SNPs (β=+11.36 per T allele, p=2.82×10⁻⁴), explaining an additional 0.5% of total variance. All ten SNPs together explained 30.1% of activity variance. Critically, none of these activity-predicting SNPs — including rs2237583 — independently predicted carotid artery disease status in that cohort. A genome-wide association study of PON1 activity55 genome-wide association study of PON1 activity
Kim et al. 2013. Novel common and rare genetic determinants of paraoxonase activity. J Lipid Res. confirmed rs2237583 at genome-wide significance (p=3.88×10⁻⁸), underscoring its robust biological effect on enzyme activity.
The gap between "predicts PON1 activity" and "predicts cardiovascular events" is important context. The EPIC-Norfolk prospective study66 EPIC-Norfolk prospective study
Birjmohun et al. 2009. Both paraoxonase-1 genotype and activity do not predict the risk of future coronary artery disease. PLoS One. (1,138 CAD cases, 2,237 controls, 6-year follow-up) found that while PON1 activity inversely associated with CAD risk at the univariate level, this association disappeared after adjusting for HDL-C and HDL particle markers — suggesting PON1's protective effect is inseparable from the HDL that carries it, rather than being an independent causal factor. Yet prospective data from a high-risk cardiac catheterization cohort77 prospective data from a high-risk cardiac catheterization cohort
Bhattacharya et al. 2008. JAMA. (1,339 patients, 44-month follow-up) found stark outcome differences: the lowest PON1 activity quartile had 3.4-fold higher risk of major adverse cardiac events compared to the highest quartile (adjusted HR), and the QQ192 genotype (lower activity) was associated with HR 2.05 for all-cause mortality. The discrepancy likely reflects population differences — in a high-risk cohort, enzyme activity variation matters more.
Practical Implications
The T allele of rs2237583 is associated with modestly higher PON1 arylesterase activity, which in turn corresponds to slightly better HDL antioxidant function. The practical implication is not about an elevated cardiovascular risk so much as an opportunity: PON1 activity is highly modifiable by diet. A review of dietary PON1 modulators88 A review of dietary PON1 modulators
Goldberg et al. 2017. The Search for Dietary Supplements to Elevate or Activate Circulating Paraoxonases. Nutrients. found that pomegranate increased PON1 activity by 83% in patients with carotid artery stenosis over 12 months, quercetin by approximately 29% in serum, and catechins by up to 150% in hemodialyzed patients, though most human data comes from small or special-population studies. The mechanism involves polyphenols upregulating PON1 gene transcription through aryl hydrocarbon receptor signaling. Individuals carrying the common C allele (lower baseline activity) stand to gain the most from dietary optimization of PON1 function.
Interactions
rs2237583 acts within a multi-locus PON1 activity architecture. The four established functional SNPs — the Q192R (rs662) and L55M (rs854560) coding variants, plus two promoter polymorphisms (rs705379 and rs854571) — together explain ~25% of PON1 activity variance; rs2237583 adds an additional 0.5% independently. In the Kim et al. 2012 stepwise regression99 Kim et al. 2012 stepwise regression
Kim DS et al. Additional Common Polymorphisms in the PON Gene Cluster. J Lipids., rs2237583 and the other intronic SNPs were in low linkage disequilibrium (r²<0.60) with each other and with the coding variants, suggesting they capture distinct regulatory signals. The combined genotype across multiple PON1 loci determines an individual's overall PON1 activity phenotype — someone unfavorable at rs2237583, rs662 (QQ), and rs705379 simultaneously would have substantially lower activity than the sum of individual effects. PON1 activity also interacts with dietary polyphenol intake in a nutrigenetic relationship: a nutrigenetic observational study1010 nutrigenetic observational study
Rosales-Corral et al. 2016. Interaction between polyphenols intake and PON1 gene variants on markers of cardiovascular disease. J Transl Med. found that the relationship between polyphenol consumption and cardiovascular disease markers differed significantly by PON1 genotype, with low-activity genotypes benefiting more from high polyphenol intakes.
CFH A473A — The Complement Haplotype Tag SNP for Macular Degeneration Risk
Age-related macular degeneration (AMD) is the leading cause of irreversible vision loss
in people over 65, and dysregulation of the complement immune system is its best-understood
molecular driver. rs2274700 is a synonymous variant in exon 10 of CFH (Complement Factor H) —
it does not change the amino acid sequence at position 473 (still alanine). Its significance
lies not in any direct protein effect but in what it tags: rs2274700 is in complete linkage
disequilibrium11 complete linkage
disequilibrium
LD measures how often two variants are inherited together; r²=1 means these
two variants are perfectly correlated across populations
with rs1410996, a well-characterized AMD risk variant in intron 1 of CFH that appears
consistently in the GWAS-defined complement-risk haplotype.
In European populations, the G allele of rs2274700 occurs at roughly 60% frequency — making
it the common allele — and it is this common allele that confers AMD risk. The protective
A allele, found in only about 40% of Europeans, is consistently associated with lower AMD
risk across multiple ethnic groups and is associated with better response to complement-targeted
and anti-VEGF treatments. The 2022 Huan et al. study22 2022 Huan et al. study
Identifying Novel Genes and Variants
in Immune and Coagulation Pathways Associated with Macular Degeneration. Ophthalmology Science.
2022 confirmed the A allele protective OR of 0.64
(P=4.5×10⁻⁴) and established the perfect LD with rs1410996, explaining why this synonymous
coding variant consistently appears in AMD association studies despite having no direct protein
effect.
The Mechanism
CFH is a critical brake on the alternative complement pathway33 alternative complement pathway
The complement system is an
arm of innate immunity that can destroy pathogens and damaged cells via protein cascades; the
alternative pathway runs continuously at a low level and must be tightly regulated to prevent
self-damage. It acts primarily at mucosal and
epithelial surfaces — including Bruch's membrane and the retinal pigment epithelium (RPE) —
where it suppresses complement-mediated attack on healthy host tissue. Age-related
accumulation of oxidized lipids, cellular debris, and advanced glycation end-products in
the sub-retinal space provides an increasing stimulus for complement activation. Without
adequate CFH suppression, this drives chronic inflammation that damages photoreceptors and
the RPE, ultimately resulting in drusen (yellow lipid-protein deposits under the retina),
geographic atrophy (dry AMD), or choroidal neovascularization (wet AMD).
The G-allele haplotype tagged by rs2274700 is associated with reduced CFH expression or function at retinal surfaces relative to the A-allele haplotype. Because rs2274700 is a synonymous variant in perfect LD with the functional intronic variant rs1410996, it is likely that the causal mechanism operates through altered splicing efficiency, mRNA stability, or regulatory element binding within the CFH genomic region, rather than through any amino acid change. In East Asian populations where the Y402H variant (rs1061170) shows weak association, rs2274700 and rs1410996 remain significant, suggesting they tag distinct functional variation within CFH independent of Y402H.
The Evidence
The AMD-rs2274700 association is well-replicated across ethnicities. Francis et al. 200744 Francis et al. 2007
Haplotypes in the complement factor H (CFH) gene: associations with drusen and advanced
age-related macular degeneration. PLoS One. 2007
identified rs2274700 as the most strongly associated CFH SNP in their haplotype analysis
across three independent AMD populations, reaching p<10⁻⁹ in combination with Y402H and
rs1061147 — covering both early drusen formation and advanced AMD.
The large-scale Lu et al. 2018 meta-analysis55 Lu et al. 2018 meta-analysis
53 studies, 110,747 participants
of complement gene polymorphisms and AMD found a pooled heterozygote-model OR of 0.53
(95% CI 0.40–0.70) for rs2274700, consistent with a protective A-allele effect across
Caucasian and Asian populations. Babanejad et al. 201666 Babanejad et al. 2016
Iranian case-control study,
100 AMD patients vs 100 controls independently
confirmed the G allele as the risk allele with a significant case-vs-control frequency
difference (p<0.001).
A notable nuance is the age-dependent effect documented by Adams et al. 201277 Adams et al. 2012
Melbourne
Collaborative Cohort Study, 2,294 cases and 2,294 controls ages 48–86. Human Molecular
Genetics. 2012: in participants under 55, the
risk genotype showed a paradoxical inverse (protective) association with early AMD; in those
over 75, the association reversed to a significant risk signal. This age-modulation likely
reflects the progressive breakdown of complement regulation as the sub-retinal environment
accumulates oxidative stress with aging, at which point the G-allele haplotype's reduced
CFH function becomes clinically relevant.
The clinical relevance extends to treatment: Cui et al. 202588 Cui et al. 2025
BMJ Open Ophthalmology;
104 neovascular AMD patients treated with combercept anti-VEGF
found that the A allele of rs2274700 was associated with significantly better treatment
response to the anti-VEGF agent combercept, suggesting that genetic profiling of CFH
variants may help guide treatment selection in neovascular AMD.
Practical Implications
rs2274700 genotype information is most useful as part of a multi-variant CFH risk profile alongside the Y402H variant (rs1061170) and ARMS2 A69S (rs10490924). The G allele at rs2274700 acts in the same complement-dysregulation direction as the C allele at Y402H — both impair CFH's protective function at the retina. Carrying GG at rs2274700 without the Y402H risk allele still meaningfully elevates AMD risk, particularly in East Asian populations where Y402H is rare but rs2274700/rs1410996 remain strongly associated.
The age-dependency finding argues for beginning retinal monitoring earlier than population guidelines suggest for GG homozygotes, since complement dysregulation appears to become clinically relevant as other age-related retinal stressors accumulate. Supplementation with lutein, zeaxanthin, and omega-3 fatty acids has an evidence base for AMD prevention that is relevant across all CFH risk genotypes.
Interactions
rs2274700 is in complete LD with rs1410996 (r²=1), meaning they are interchangeable markers for the same underlying CFH haplotype risk. The Y402H variant (rs1061170) is partially correlated — both mark complement-risk haplotypes but measure partially independent variation, as evidenced by rs2274700's independent significance in East Asian populations where Y402H shows no association. Combined high-risk genotypes at rs2274700 (GG) and rs10490924/ARMS2 (TT) confer synergistically elevated AMD risk through complementary pathogenic pathways: complement dysregulation (CFH) and retinal oxidative stress (ARMS2).
ABCA1 rs2853579 — A Common Regulatory Tag in the HDL Biogenesis Gene
ABCA111 ATP-Binding Cassette Transporter A1 — a cell-membrane pump that loads
cholesterol and phospholipids onto lipid-poor apolipoprotein A-I, forming nascent
HDL particles is the primary gatekeeper of reverse cholesterol transport22 reverse cholesterol transport
the pathway by which peripheral tissues offload excess cholesterol back to the
liver for processing and excretion. Without ABCA1, cells cannot shed excess
cholesterol and HDL formation collapses. Loss-of-function mutations in both copies
cause Tangier disease33 Tangier disease
a rare autosomal recessive disorder with near-absent HDL,
cholesterol-laden macrophage deposits, and accelerated atherosclerosis. rs2853579
operates at a far subtler level — it is a common synonymous coding variant within
ABCA1 exon 15 whose effect on HDL arises not from any protein change but from its
co-inheritance with nearby regulatory variation that modulates how much ABCA1 the
liver produces.
The Mechanism
rs2853579 sits at chr9:104,828,991 (GRCh38) within the coding sequence of ABCA1
at transcript position c.2040 on NM_005502.4. The ABCA1 gene is transcribed from
the minus strand; genome files report the plus strand, where the reference allele
is G and the minor allele is T. The G>T change corresponds to a C>A substitution
on the coding strand — an ATC→ATA codon change that encodes isoleucine at position
680 in both cases (p.Ile680Ile). This synonymous44 synonymous
a variant that changes the DNA
sequence but not the resulting amino acid — sometimes called a "silent" mutation
variant therefore cannot alter ABCA1 protein function directly. Any influence on
HDL-C must arise through linkage disequilibrium55 linkage disequilibrium
LD — the tendency for nearby alleles
to be inherited together; a tag SNP statistically "marks" a causal variant nearby
without being functionally responsible itself with a causal regulatory element.
The ABCA1 locus contains several intronic enhancers governing hepatic expression.
Richardson et al.66 Richardson et al.
Richardson et al. Allele-specific enhancers mediate associations
between LCAT and ABCA1 polymorphisms and HDL metabolism. PLOS One, 2019
identified that the intronic variant rs2575875 creates an allele-specific STAT377 Signal Transducer and Activator of Transcription 3 — a transcription factor that
activates gene expression in response to cytokines and growth factors in the liver
binding site that loops physically to the ABCA1 promoter and drives hepatic transporter
expression. Multiple GWAS tag SNPs across the ABCA1 locus — including rs4149268,
rs1883025, and rs2853579 — show overlapping HDL-C associations consistent with tagging
the same or related regulatory haplotype. The G allele of rs2853579 marks the
lower-expression haplotype; the T allele marks the higher-expression haplotype.
The Evidence
The association was identified in a large EHR-based GWAS88 EHR-based GWAS
Hoffmann TJ et al.
A large electronic-health-record-based genome-wide study of serum lipids. Nature
Genetics, 2018 of 94,674 ancestrally
diverse Kaiser Permanente participants using longitudinal untreated lipid measurements.
rs2853579-G was associated with decreased HDL-C (β = −0.046 mmol/L, p = 6×10⁻¹⁶)
and decreased total cholesterol (β = −0.053 mmol/L, p = 1×10⁻¹⁹) at genome-wide
significance. A second analysis from the same dataset found p = 2×10⁻²⁰ for HDL-C
with β = −0.044 mmol/L per G allele — equivalent to roughly −1.7 mg/dL per allele.
For context, typical HDL-C ranges from 40 to 80 mg/dL, so the per-allele effect
is modest but statistically robust and biologically coherent.
Population frequencies from gnomAD show remarkable ancestry variation: the T allele (higher HDL) is rare in Europeans (~12%) and South Asians (~15%) but reaches 66% in East Asian populations and 45% in African populations. In East Asia, T is the major allele — meaning the lower-HDL GG genotype is actually the minor one in that ancestry. ClinVar classifies this variant as benign for Tangier disease and hypoalphalipoproteinemia, consistent with the GWAS data showing small common-variant effects rather than the large functional disruptions that characterize those disorders.
Practical Actions
For most carriers of the common GG genotype (~73% of the global population), this variant represents the population-typical baseline for ABCA1-driven HDL production. The most actionable information at this locus is your measured HDL-C. The per-allele effect of ~1.7 mg/dL for the T allele translates to approximately +3.4 mg/dL for TT homozygotes — a meaningful directional tendency but one that is easily matched or exceeded by lifestyle factors: regular aerobic exercise raises HDL by 3–6 mg/dL in most people; eliminating trans fats adds 2–4 mg/dL; replacement of saturated fat with monounsaturated fat (olive oil, avocado, nuts) provides additional benefit.
Carriers of the rare TT genotype (~2% in Europeans, ~44% in East Asians) have a genetically favorable signal at this locus. Confirmed low HDL despite the favorable TT genotype would warrant a broader workup of other lipid-modifying variants, dietary patterns, and metabolic factors.
Interactions
rs2853579 sits within the same ABCA1 gene as rs4149268 and rs1883025, two other intronic variants previously associated with HDL-C in GWAS. Whether they represent independent regulatory signals or partial correlates of the same haplotype has not been fully resolved at the individual-study level. The R219K missense variant (rs2230806, ABCA1) alters protein-level cholesterol efflux capacity through a distinct mechanism from regulatory tag SNPs and represents an independent functional layer. Carriers of other low-HDL variants — such as CETP (rs708272) or LIPC variants — can compound the rs2853579 G-allele effect through additive lipid pathway disruption.
The Z Allele — Alpha-1 Antitrypsin's Most Common Deficiency Variant
Alpha-1 antitrypsin (AAT) is the body's primary defense against neutrophil
elastase, a powerful enzyme11 powerful enzyme
Neutrophil elastase is released by white blood
cells during inflammation and can break down elastin, the protein that gives
lung tissue its elasticity
that can destroy lung tissue if left unchecked. The Z allele (Glu342Lys) is
the most common genetic variant causing severe AAT deficiency, affecting
approximately 1 in 2,000 to 3,500 births22 1 in 2,000 to 3,500 births
The ZZ genotype occurs in about
1:2,000-3,500 newborns in populations of European descent, though most remain
undiagnosed. This single
amino acid change — glutamic acid to lysine at position 342 — causes the
protein to misfold and polymerize inside liver cells, leading to both lung
disease (from lack of AAT in circulation) and liver disease (from toxic
accumulation in the liver).
The Mechanism
The Z variant creates a structural instability33 structural instability
The substitution of acidic
glutamic acid with basic lysine at position 342 disrupts protein folding,
causing AAT molecules to link together (polymerize) in the endoplasmic
reticulum of liver cells
that prevents normal secretion from liver cells. Instead of being released
into the bloodstream, approximately 85% of Z variant AAT gets retained in
hepatocytes as large protein polymers. ZZ homozygotes have serum AAT levels
at only 10-20% of normal44 10-20% of normal
Normal AAT levels are approximately 20-53 µM
(150-350 mg/dL); ZZ individuals typically have <11 µM,
while MZ heterozygotes have approximately 60% of normal levels. This dual
pathology — loss of function in the lungs and toxic gain of function in the
liver — makes the Z allele unique among common genetic disorders.
The molecular consequence is a loss of protease-antiprotease balance in the
lungs. Neutrophil elastase, normally kept in check by AAT, breaks down elastin
and collagen in alveolar walls55 alveolar walls
The tiny air sacs in the lungs where oxygen
and carbon dioxide exchange occurs.
Without sufficient AAT protection, this leads to panlobular emphysema —
progressive destruction of lung tissue starting in the bases and spreading
throughout the lungs.
The Evidence
The clinical significance of the Z allele is well established through decades
of research. ZZ homozygotes face 80-100% risk of developing emphysema66 ZZ homozygotes face 80-100% risk of developing emphysema
Based
on ClinGen classification and long-term follow-up studies of diagnosed
individuals and 10-15%
risk of liver cirrhosis by adulthood. The risk is dramatically modified by
environmental factors, particularly smoking77 smoking
Smoking increases COPD risk in
ZZ individuals and accelerates disease onset by 10-15 years compared to
non-smokers.
MZ heterozygotes (carriers) were long considered "safe," but recent large
population studies have overturned this assumption. A meta-analysis of six
studies88 meta-analysis of six
studies
Dahl et al., European Respiratory Journal, 2005
found MZ smokers have 3.26-fold increased odds of COPD compared to MM
individuals (95% CI: 1.24-8.57). Non-smoking MZ carriers do not appear to have
increased lung disease risk, demonstrating a clear gene-environment
interaction99 gene-environment
interaction
The triple combination of MZ genotype, smoking, and occupational
dust/fume exposure compounds risk beyond any single factor.
For liver disease, a large cohort study1010 large cohort study
Published in Hepatology, 2018
found MZ heterozygotes have 1.53 odds ratio for cirrhosis compared to MM
individuals, with risk amplified by higher BMI. Among ZZ children, 18%
develop clinically recognized liver abnormalities and 2.4% develop cirrhosis
in childhood1111 2.4% develop cirrhosis
in childhood
Swedish newborn screening study following 200,000 children,
though most ZZ children remain clinically well.
Practical Implications
The Z allele is one of the most actionable genetic findings in genomics.
Smoking avoidance is critical — the difference between a normal lifespan and
severe disability by age 40. Augmentation therapy1212 Augmentation therapy
Intravenous infusions of
pooled human AAT, administered weekly at 60 mg/kg, raise serum levels into
the protective range is
available for ZZ individuals with established lung disease, and has been shown
to slow emphysema progression in randomized controlled trials1313 randomized controlled trials
The RAPID
trial demonstrated significant reduction in lung density loss: 1.5 g/L/year
with treatment vs 2.6 g/L/year with placebo (p=0.07).
The therapy is not curative but can meaningfully slow disease progression when
started early.
For MZ carriers, counseling about smoking and occupational exposures is
essential. Vapors, gases, dusts, and fumes1414 Vapors, gases, dusts, and fumes
Agricultural chemicals, welding
fumes, silica dust, and other occupational exposures interact with MZ genotype
to increase COPD risk
common in agriculture, welding, and industrial settings pose added risk. Air
pollution and long-term ozone exposure are also independent risk factors for
lung impairment in both ZZ and MZ individuals.
Liver monitoring is warranted for all ZZ individuals and should be considered for MZ carriers with other liver disease risk factors. The variable clinical presentation means some ZZ individuals develop life-threatening liver disease in childhood while others remain asymptomatic into adulthood. Genetic counseling and family testing is recommended — first-degree relatives of diagnosed individuals should be offered testing to enable preventive measures.
Interactions
The Z allele interacts significantly with the S allele (rs17580)1515 S allele (rs17580)
The S
allele (Glu264Val) causes milder AAT deficiency, with serum levels at 60% of
normal. SZ compound
heterozygotes have AAT levels intermediate between MZ and ZZ, with 20-50% risk
of emphysema depending on smoking exposure. The combination warrants similar
preventive counseling as for MZ carriers, particularly regarding smoking
avoidance.
Beyond SERPINA1, other genes modify lung disease risk in AAT deficiency.
Cryptic SERPINA1 haplotypes1616 Cryptic SERPINA1 haplotypes
Six haplotypes with a common backbone of five
SNPs were found to increase COPD risk 6-50 fold,
the highest risk reported for COPD genetics. Variants in SERPINE2 (encoding
another protease inhibitor) are associated with emphysema severity in
autopsy studies.
The relationship between AAT deficiency and liver disease in other chronic
conditions is complex. Z allele carriage increases liver disease risk in
cystic fibrosis1717 Z allele carriage increases liver disease risk in
cystic fibrosis
4.17-fold increased odds of CF-related liver disease
and chronic hepatitis C, suggesting that AAT deficiency exacerbates liver
injury from other causes. However, hepatocellular carcinoma risk in ZZ
cirrhosis is lower (0.88%/year) than in cirrhosis from viral hepatitis or
NASH, challenging earlier assumptions about cancer risk.
A compound implication for MZ + active smoking + occupational dust/fume exposure would be warranted given the documented three-way interaction, with recommendations for aggressive exposure reduction and earlier pulmonary function monitoring.
A Genetic Crossroads Between Gut Immunity and Inflammatory Disease
Your MST1 gene encodes macrophage-stimulating protein (MSP), a critical regulator
of innate immune responses11 innate immune responses
The first-line defense system that responds to
pathogens without prior exposure
to bacterial challenges in the gut. The rs3197999 variant causes an arginine-to-cysteine
substitution at position 689 (R689C) in the protein, located within a critical
receptor-binding domain that determines how effectively MSP activates immune cells.
This variant sits at the intersection of immune surveillance and inflammatory disease.
It's one of the most consistently replicated genetic risk factors22 It's one of the most consistently replicated genetic risk factors
Confirmed across
multiple populations and study designs
for inflammatory bowel disease (IBD) and primary sclerosing cholangitis (PSC),
appearing in genome-wide association studies with odds ratios of 1.20 for IBD
and 1.51 for PSC. Unlike many genetic variants that simply reduce protein function,
the Cys689 variant actually enhances certain aspects of macrophage activity—a
gain of function that paradoxically increases inflammation risk.
The Mechanism
MSP is secreted primarily by the liver and circulates in blood as an inactive
precursor. When cleaved by proteases at sites of tissue injury or bacterial invasion,
it becomes biologically active and binds to the RON receptor tyrosine kinase on
macrophages, epithelial cells, and other immune cells33 epithelial cells, and other immune cells
RON is expressed in
tissues throughout the body but especially abundant in gut-associated immune
cells. This triggers signaling cascades
that regulate chemotaxis (cell migration toward infection sites), phagocytosis
(engulfment of bacteria), and cytokine production.
The R689C substitution occurs in the serine protease homology domain44 occurs in the serine protease homology domain
This domain
forms the receptor-binding surface but is catalytically inactive—MSP is a
"dead" protease that retained its binding structure
of the MSP β-chain. Functional studies using macrophage-like cell lines showed
that the Cys689 variant significantly increases the stimulatory effect55 significantly increases the stimulatory effect
Enhanced chemotaxis and proliferation compared to wild-type Arg689
of MSP on chemotaxis and proliferation. This gain of function suggests the variant
creates a hyperactive immune response—macrophages migrate more aggressively and
respond more vigorously to bacterial signals.
Paradoxically, individuals with the AA genotype have approximately 10-fold lower
MSP binding affinity66 approximately 10-fold lower
MSP binding affinity
Measured in receptor-binding assays
to RON and profoundly decreased serum MSP levels. The mechanism appears to involve
altered protein stability or secretion efficiency. So the variant simultaneously
increases cellular responsiveness when MSP binds but reduces overall circulating
MSP—a complex functional profile that may dysregulate normal immune homeostasis
in the gut.
The Evidence
The initial discovery came from gene-centric mapping77 The initial discovery came from gene-centric mapping
Rather than unbiased GWAS,
this study specifically targeted chromosome 3p21 based on prior linkage
evidence of the 3p21 IBD linkage
region in 2008. Fisher and colleagues identified rs3197999 with P=3.62×10⁻⁶
in a combined screen of 1,020 IBD patients and replication in 745 additional
cases. The variant showed association with both Crohn's disease and ulcerative
colitis.
A 2008 meta-analysis88 A 2008 meta-analysis
Barrett et al., combining data from three studies with
3,230 Crohn's cases and 4,829 controls confirmed
the association with genome-wide significance (P=1.15×10⁻¹², OR=1.20). A parallel
study in 3,133 ulcerative colitis patients and 4,494 controls replicated the
finding (combined P=3.8×10⁻⁹).
The variant's role extends beyond classic IBD. A 2011 genome-wide association
study in primary sclerosing cholangitis99 A 2011 genome-wide association
study in primary sclerosing cholangitis
Melum et al., 715 Scandinavian and
German PSC cases vs 2,962 controls
identified rs3197999 as one of two non-HLA susceptibility loci with P=1.1×10⁻¹⁶
and OR=1.51. PSC is a chronic cholestatic liver disease characterized by bile
duct inflammation and frequently co-occurs with IBD. Homozygous AA carriers also
show increased risk1010 Homozygous AA carriers also
show increased risk
OR=1.97 for overall cholangiocarcinoma, OR=1.84 for
PSC-unrelated biliary tract cancer
of cholangiocarcinoma, the feared cancer complication of chronic bile duct inflammation.
Gene-gene interaction analysis1111 Gene-gene interaction analysis
In a Chinese Crohn's disease cohort of 1,590
cases and 1,478 controls
identified significant interactions between MST1 and JAK2, IL23R, and PTGER4—all
genes involved in inflammatory signaling pathways. This suggests MST1 participates
in broader networks regulating mucosal immunity rather than acting in isolation.
A 2024 pediatric study1212 A 2024 pediatric study
367 pediatric IBD patients (197 Crohn's, 170 ulcerative
colitis) found the CC
genotype was positively associated with systemic steroid use in Crohn's disease
and more common in female CD patients, suggesting the variant may influence
disease severity or treatment requirements beyond simple susceptibility.
Practical Actions
If you carry one or two copies of the A allele, your genetic profile suggests a heightened inflammatory response to gut bacterial challenges. This doesn't guarantee you'll develop IBD—most carriers remain healthy—but it warrants attention to gut barrier health and inflammatory triggers.
Dietary patterns matter significantly1313 Dietary patterns matter significantly
Multiple studies show Mediterranean
diet adherence reduces IBD risk and disease activity.
The Mediterranean diet's benefits likely operate through multiple mechanisms:
omega-3 fatty acids modulate inflammatory signaling, polyphenols reduce oxidative
stress, and fiber feeds beneficial bacteria that produce short-chain fatty acids
(SCFAs) like butyrate, which strengthen gut barrier integrity. Higher adherence
to Mediterranean diet1414 Higher adherence
to Mediterranean diet
In first-degree relatives of Crohn's patients, a
population already at higher genetic risk
was associated with reduced intestinal inflammation and lower risk of later-onset
Crohn's disease.
Monitoring inflammatory markers can catch subclinical inflammation before symptoms
appear. C-reactive protein (CRP) and fecal calprotectin1515 C-reactive protein (CRP) and fecal calprotectin
Calprotectin is more
specific for intestinal inflammation and correlates better with endoscopic
findings than CRP are the
most validated biomarkers in IBD. Fecal calprotectin <250 μg/g identifies mucosal
healing with 94% sensitivity, and rising levels can predict relapse earlier than
clinical symptoms.
Probiotics have mixed evidence in IBD. For ulcerative colitis specifically1616 For ulcerative colitis specifically
Not
for Crohn's disease, where most studies show no benefit over
placebo, certain strains
show promise: Escherichia coli Nissle 1917 for maintenance of remission, and
VSL#3 (a multi-strain probiotic) for inducing remission in mild to moderately
active UC. VSL#3 is the only probiotic with strong evidence1717 VSL#3 is the only probiotic with strong evidence
Particularly for
pouchitis, an inflammatory condition of the surgically created intestinal
pouch
in common IBD practice.
Interactions
The rs3197999 variant lies in a complex genomic region on chromosome 3p21 that
contains 10 genes within a 336 kb associated interval. Notably, it's in linkage
disequilibrium1818 Notably, it's in linkage
disequilibrium
Almost complete cosegregation of minor alleles, D'=0.60,
r²=0.35 with rs1050450 in GPX1
(glutathione peroxidase 1), which causes a Pro198Leu substitution that reduces
GPx-1 antioxidant enzyme activity. Some researchers have proposed that GPX1,
rather than MST1, might be the pathophysiologically relevant gene at this locus.
The functional distinction is important: MST1 R689C affects innate immune activation (macrophage chemotaxis and bacterial response), while GPX1 Pro198Leu affects antioxidant capacity (ability to neutralize reactive oxygen species produced during inflammation). Both mechanisms could plausibly contribute to IBD pathogenesis. It's possible that the true causal variant is neither rs3197999 nor rs1050450 but another variant in linkage disequilibrium with both, or that both variants independently contribute to disease risk through complementary pathways. Given the proximity and LD structure, individuals with the MST1 A allele often also carry the GPX1 Leu198 allele, potentially compounding inflammatory susceptibility through both enhanced immune activation and reduced antioxidant defense.
Gene-gene interaction studies have identified significant epistasis between MST1 and several inflammatory pathway genes. The MST1–JAK2 interaction was replicated across original and validation datasets, and MST1 showed consistent interactions with IL23R (interleukin-23 receptor, a validated IBD susceptibility gene targeted by biologics like ustekinumab) and PTGER4 (prostaglandin E receptor 4, involved in inflammatory signaling). These interactions suggest that MST1 genetic effects may be amplified or modified by variation in other immune genes, and that personalized risk assessment should eventually incorporate multi-locus profiles rather than single-SNP analysis.
The "Bliss Gene" — FAAH and Your Endocannabinoid System
The FAAH gene encodes fatty acid amide hydrolase11 fatty acid amide hydrolase
an enzyme that breaks down anandamide, your brain's natural "bliss molecule", which binds to the same receptors as THC from cannabis. The rs324420 variant changes a single amino acid at position 129 from proline to threonine, making the enzyme more vulnerable to breakdown. Carriers of the A allele produce less stable FAAH enzyme, resulting in elevated anandamide levels throughout the brain and body22 elevated anandamide levels throughout the brain and body.
This common variant occurs in approximately 38% of Europeans (33% heterozygous AC, 5% homozygous AA), with markedly higher frequencies in African populations (~45%) and lower in East Asians (~10%)33 markedly higher frequencies in African populations (~45%) and lower in East Asians (~10%). The geographic distribution has led researchers to investigate whether this variant influences not just individual psychology but national happiness levels across cultures44 national happiness levels across cultures.
The Mechanism
The Pro129Thr substitution doesn't change FAAH's catalytic activity — the enzyme still breaks down anandamide at normal rates when it's present. The critical difference is protein stability55 protein stability. The threonine variant (A allele) is more sensitive to proteolytic degradation, meaning cells produce approximately 50% less FAAH enzyme in AA homozygotes compared to CC individuals. This leads to chronically elevated anandamide signaling through CB1 cannabinoid receptors, primarily in the amygdala, prefrontal cortex, and hippocampus — brain regions central to fear processing, stress response, and emotional regulation66 fear processing, stress response, and emotional regulation.
Elevated anandamide acts like a natural anxiolytic, dampening the amygdala's threat response and enhancing fronto-amygdala connectivity. In neuroimaging studies, A-allele carriers show reduced amygdala activation when viewing threatening faces77 A-allele carriers show reduced amygdala activation when viewing threatening faces and faster fear extinction learning — the ability to unlearn fear associations after a threat is no longer present.
The Evidence
Fear extinction and trauma response: Multiple fMRI studies demonstrate that A-allele carriers show enhanced fear extinction recall88 Multiple fMRI studies demonstrate that A-allele carriers show enhanced fear extinction recall, the neurobiological foundation of exposure therapy for PTSD and anxiety disorders. In a study of 55 healthy adults, AC heterozygotes showed significantly greater extinction recall compared to CC individuals, with distinct neural activation patterns in the ventromedial prefrontal cortex. This suggests the A allele may confer resilience to trauma-related symptoms, though one large veteran study (N=949) found no protective effect against PTSD development after military deployment99 one large veteran study (N=949) found no protective effect against PTSD development after military deployment.
Pain sensitivity: Women with the AA genotype undergoing breast cancer surgery reported significantly less sensitivity to cold pain1010 Women with the AA genotype undergoing breast cancer surgery reported significantly less sensitivity to cold pain (β = −1.48, 95% CI −2.14 to −0.8) and required less postoperative analgesia. A rare case report documented a Scottish woman with complete pain insensitivity, anxiety immunity, and accelerated wound healing1111 a Scottish woman with complete pain insensitivity, anxiety immunity, and accelerated wound healing, who carried both the AA genotype and a rare deletion in FAAH-OUT, a regulatory pseudogene.
Mood and well-being: The relationship with happiness is complex. A longitudinal study of 2,822 individuals found that each A-allele was associated with lower subjective well-being scores1212 A longitudinal study of 2,822 individuals found that each A-allele was associated with lower subjective well-being scores at both timepoints (B: −0.52, p = 0.007). However, cross-national studies show countries with higher A-allele frequencies report greater national happiness1313 countries with higher A-allele frequencies report greater national happiness, with Ghana, Nigeria, and Mexico ranking highest. The paradox may reflect that chronically elevated anandamide leads to CB1 receptor downregulation over time, potentially disrupting normal reward processing.
Substance use: The A allele shows divergent effects across substances1414 divergent effects across substances. For cannabis, AA individuals have roughly half the dependence rate of CC carriers (11% vs 26%) and report less subjective pleasure from marijuana — likely because they already have elevated endogenous cannabinoid tone. However, for alcohol, A-allele carriers with European ancestry show more severe alcohol dependence1515 A-allele carriers with European ancestry show more severe alcohol dependence when they do develop problems (13 more binge drinking days over a 90-day period), potentially via an indirect pathway through lower subjective well-being. The A allele has also been associated with increased risk of methamphetamine dependence in Malaysian populations1616 increased risk of methamphetamine dependence in Malaysian populations (OR 2.0-3.7 depending on ethnicity).
Athletic performance: Elite volleyball and rink-hockey players with the A allele were 2-3 times more likely to achieve "super athlete" status1717 Elite volleyball and rink-hockey players with the A allele were 2-3 times more likely to achieve "super athlete" status, possibly due to enhanced stress coping and pain tolerance during training and competition. However, contradictory evidence exists, with some studies showing the AA genotype more common in sedentary individuals than elite athletes.
Practical Implications
If you carry the A allele, your endocannabinoid system is running at a higher baseline, with downstream effects on how you process fear, pain, stress, and reward. This may make you more resilient to acute stressors and physical pain, but potentially more vulnerable to mood disturbances if you experienced childhood trauma, which can interact with the A allele to increase anxiety and depression risk1818 interact with the A allele to increase anxiety and depression risk.
For mental health treatment, A-allele carriers may respond particularly well to exposure-based therapies for anxiety and PTSD, given the enhanced fear extinction consolidation. FAAH inhibitors — drugs designed to mimic the A-allele effect — are in clinical trials for anxiety disorders and showed promise in a 12-week trial for social anxiety1919 promise in a 12-week trial for social anxiety, though results have been mixed.
For pain management, AA individuals may require less analgesia for acute pain but should still discuss post-operative pain plans with providers, as individual variation is substantial. The reduced pain sensitivity doesn't eliminate pain entirely.
For substance use, recognize that if you have the A allele, cannabis will likely be less appealing and addictive, but alcohol may pose greater risk if you develop problematic use patterns, particularly if you also struggle with low mood.
Interactions
The FAAH rs324420 variant interacts with other endocannabinoid and stress-response genes. CNR1 rs2180619 (cannabinoid receptor 1) and CRHR1 rs110402 (corticotropin-releasing hormone receptor 1) have been studied for interactions with FAAH in stress response, though a large veteran study found no significant gene-gene interaction effects2020 a large veteran study found no significant gene-gene interaction effects. The COMT rs4680 variant (which affects dopamine metabolism) has been shown to interact with FAAH rs324420 in modulating placebo analgesia response2121 interact with FAAH rs324420 in modulating placebo analgesia response.
Early life stress represents a critical environmental interaction. Individuals with the A allele who experienced repetitive childhood trauma show significantly higher anxiety and depression scores2222 Individuals with the A allele who experienced repetitive childhood trauma show significantly higher anxiety and depression scores compared to CC carriers with similar trauma exposure, suggesting chronically elevated anandamide during neurodevelopment may disrupt normal CB1 receptor function and long-term stress response systems.