HLA-DQA1 rs17612852 — The Peanut Allergy Haplotype Tag That Switches Sides
Buried in the most complex region of the human genome — the HLA class II locus11 HLA class II locus
Human
Leukocyte Antigen class II genes on chromosome 6p21 encode the antigen-presenting molecules
that determine which proteins the immune system tolerates and which it attacks
— lies a variant with a paradoxical property: the same genetic signal that increases peanut
allergy risk when peanut is avoided promotes powerful immune protection when peanut is consumed
early. rs17612852 is an intronic variant in HLA-DQA1 at chromosome 6:32,652,795 (GRCh38),
in strong linkage disequilibrium (D′=0.99, r²=0.65) with the HLA-DQA1*01:02 haplotype22 HLA-DQA1*01:02 haplotype
A
specific variant of the alpha chain of the HLA-DQ antigen-presenting molecule, determined by
classical HLA typing. Because G and A are the
two alleles at this position and the G allele nearly always travels with HLA-DQA1*01:02,
rs17612852 functions as a convenient tag SNP — a genotyping proxy for a classical HLA
haplotype that is expensive and technically demanding to type directly.
The Mechanism
HLA-DQ molecules are heterodimers on the surface of antigen-presenting cells33 antigen-presenting cells
Dendritic
cells, B cells, and macrophages that process proteins into peptide fragments and display them
to naïve CD4+ T cells, shaping whether tolerance or immune activation follows.
The specific DQ alpha chain encoded by DQA1*01:02 has a binding groove that presents peanut
protein peptides — particularly from Ara h 244 Ara h 2
The dominant IgE target in clinical peanut
allergy; a storage protein in the peanut seed that drives most anaphylactic reactions
— with high efficiency to naïve T cells.
Crucially, rs17612852 is not just a passive marker of the haplotype. It is the primary
cis-eQTL55 cis-eQTL
A variant that controls expression of a nearby gene in cis — on the same
chromosome segment for HLA-DQB1 in CD4+ T cells:
each additional copy of the G minor allele drives significantly higher HLA-DQB1 RNA levels
(p=8.34×10⁻¹³), increasing the amount of HLA-DQ molecule on the antigen-presenting cell
surface. The direction of immune outcome this creates — tolerance or allergy — depends entirely
on whether peanut protein arrives via the gut route (oral, tolerogenic) or the skin/airway
route (sensitising)66 gut route (oral, tolerogenic) or the skin/airway
route (sensitising)
The route of first antigen encounter decisively shapes T cell
polarisation; gut mucosal exposure favours Treg and IgG4; cutaneous exposure favours Th2
and IgE. More HLA-DQ expression on antigen-presenting
cells in the gut amplifies the tolerogenic signal; the same upregulation in sensitised
individuals amplifies the allergic response.
The Evidence
The Hong et al. 2015 GWAS77 Hong et al. 2015 GWAS
Genome-wide association study identifies peanut allergy-specific
loci and evidence of epigenetic mediation in US children. Nature Communications, 2015;
2,197 participants of European ancestry from the Chicago Food Allergy Study
was the first genome-wide study of peanut allergy and identified the HLA-DQ/DR region at
6p21.32 as the dominant susceptibility locus, with the lead SNPs rs9275596 and rs7192 tagging
the same HLA-DQA1*01:02 haplotype that rs17612852 also marks. The Asai et al. 2018 Canadian
GWAS88 Asai et al. 2018 Canadian
GWAS
A Canadian genome-wide association study and meta-analysis confirm HLA as a risk
factor for peanut allergy independent of asthma. JACI, 2018; >7,800 subjects across 8 studies
confirmed this locus as the primary genetic signal for peanut allergy and identified
rs17612852 specifically among HLA SNPs significantly associated with peanut allergy,
particularly in individuals with mild reaction history.
The mechanistic insight came from the LEAP trial. Kanchan et al. 202699 Kanchan et al. 2026
Genetic Determinants
of Peanut-Specific IgG4 Levels in the Context of Sustained Oral Peanut Exposure in the
LEAP Study. Immunology, 2026; 267 LEAP peanut-consumption group participants
conducted a GWAS of peanut-specific IgG4 in participants who consumed peanut throughout the
trial. rs17612852 showed the strongest genome-wide signal (p=5.80×10⁻⁷) for total
peanut-specific IgG4, with a component-specific association for Ara h 2 IgG4 (p=7.28×10⁻⁶,
beta=0.324). Critically, no association was observed in the peanut-avoidance group — the
relationship between the G allele and IgG4 exists only when peanut is regularly consumed.
Dantzer et al. 20221010 Dantzer et al. 2022
HLA-associated outcomes in peanut OIT trials. Frontiers Immunology,
2022; IMPACT (ages 12–48 months) and POISED (ages 7–55 years) OIT trials
demonstrated that HLA-DQA1*01:02 carriers — a group overwhelmingly identified by their G
allele at rs17612852 — show substantially superior outcomes in peanut oral immunotherapy:
desensitization in 93% vs 78% (IMPACT) and sustained unresponsiveness in 52% vs 31%
(POISED). The protective immune benefit weakens significantly with age — younger children
show the strongest effect, consistent with the critical window for immune tolerance induction.
Practical Actions
The most important clinical application of this variant is early-life peanut introduction. For G allele carriers whose infants are at elevated risk for peanut allergy (eczema, egg allergy), early peanut introduction (LEAP protocol: before 11 months) completely overrides the genetic susceptibility and channels the DQA1*01:02 haplotype toward high IgG4 protection rather than IgE sensitisation. The G allele that raises allergy risk under avoidance conditions actively promotes the strongest protective response under consumption conditions.
For individuals who have already developed peanut allergy, G allele status predicts superior response to oral immunotherapy across multiple independent trials — making this genotype a useful predictor of OIT candidacy and expected outcomes.
Interactions
rs17612852 operates within the broader HLA class II haplotype architecture centred on chromosome 6p21. Its closest functional interaction is with rs9275596, the intergenic tag SNP between HLA-DQB1 and HLA-DQA2 that also tags HLA-DQA1*01:02 and was the top GWAS hit in the Hong 2015 study. Both SNPs are in LD with the same DQA1*01:02 haplotype and have overlapping but distinct LD relationships; carrying risk alleles at both positions provides the strongest haplotype resolution.
The interaction with rs2187668 (tagging HLA-DQ2.5, associated with celiac disease and type 1 diabetes) is architecturally important: DQA1*01:02 and DQ2.5 are generally on different haplotypes, so compound heterozygotes carrying the G allele at rs17612852 and the A allele at rs2187668 carry risk across distinct HLA-mediated immune-condition pathways simultaneously. This combination is documented in the compound actions in this database.
MAPT rs1800547 — The Molecular Switch in Tau's H1/H2 Divide
Within the microtubule-associated protein tau (MAPT) gene on chromosome 17q21,
a single nucleotide difference at rs1800547 marks one of the most consequential
forks in human neurological risk: the ancient H1/H2 haplotype boundary. While the
broader H1/H2 distinction spans a 900-kilobase chromosomal inversion, rs1800547
is the canonical SNP that directly differentiates the H1 and H2 clades11 canonical SNP that directly differentiates the H1 and H2 clades
The GenePD
Study found rs1800547 was the single most statistically significant variant in the
region for Parkinson's disease after multiple testing correction.
Unlike rs17649553, which is another H1/H2 tag in the region, rs1800547 has been
shown to have direct molecular function — not merely a passive marker of haplotype
membership.
The MAPT gene produces tau, a protein whose primary job is stabilizing the neuronal microtubule skeleton and supporting axonal transport. When tau becomes hyperphosphorylated and misfolds, it aggregates into neurofibrillary tangles, the pathological hallmark of tauopathies including Alzheimer's disease, Parkinson's disease, progressive supranuclear palsy (PSP), and corticobasal degeneration (CBD).
The Mechanism: Splice Factor Binding and Exon 3
A landmark 2017 study using whole-locus MAPT expression vectors demonstrated that
rs1800547, working together with rs17651213, directly controls haplotype-specific
inclusion of MAPT exon 322 A landmark 2017 study using whole-locus MAPT expression vectors demonstrated that
rs1800547, working together with rs17651213, directly controls haplotype-specific
inclusion of MAPT exon 3
Exon 3 encodes a region of the N-terminal projection
domain of tau that affects its interaction with membranes and the neuronal cytoskeleton.
Using electrophoretic mobility shift assays, researchers found that the H1 (A) and
H2 (G) alleles at rs1800547 create distinct RNA-protein binding patterns with
splicing factors hnRNP F and hnRNP Q — critical regulators of alternative splicing
in neurons. The H2 sequence at this position allows 1.76-fold higher exon 3 inclusion
compared to H1, altering the proportion of N1 and N2 tau isoforms.
This matters because the balance of tau isoforms — not just total tau levels — appears critical to which tauopathy, if any, develops. H1 is associated with elevated 4-repeat (4R) tau isoforms, which are the primary constituents of pathological aggregates in PSP and CBD. H2's different splicing pattern may shift the balance away from aggregation-prone 4R isoforms.
The Evidence for Parkinson's Disease
The GenePD Study genotyped 21 SNPs across the MAPT region in PD families and
controls, finding that rs1800547 emerged as the most statistically significant
variant for PD association, surviving multiple testing correction33 The GenePD Study genotyped 21 SNPs across the MAPT region in PD families and
controls, finding that rs1800547 emerged as the most statistically significant
variant for PD association, surviving multiple testing correction
The study also
found 4-repeat MAPT isoforms significantly elevated in PD brains (p=0.002),
linking the H1 splicing signature to disease pathology.
A large case-control study of 1,762 PD patients and 2,010 controls found that
H1/H1 homozygotes had an odds ratio of 1.46 (95% CI 1.25–1.69, p=8×10⁻⁷) for PD
compared to H1/H2 and H2/H2 carriers44 1,762 PD patients and 2,010 controls found that
H1/H1 homozygotes had an odds ratio of 1.46 (95% CI 1.25–1.69, p=8×10⁻⁷) for PD
compared to H1/H2 and H2/H2 carriers
The association held across familial and
sporadic disease, both sexes, and early- and late-onset subgroups.
Progressive Supranuclear Palsy: The Strongest Association
The H1/H1 genotype is found in approximately 94% of PSP patients compared to ~64%
of the general population — a striking enrichment. A JAMA Neurology study of 802
neuropathologically confirmed PSP cases identified H1 subhaplotypes with markedly
elevated risk: H1d (OR 1.86), H1g (OR 3.64), and H1o (OR 2.60)55 802
neuropathologically confirmed PSP cases identified H1 subhaplotypes with markedly
elevated risk: H1d (OR 1.86), H1g (OR 3.64), and H1o (OR 2.60)
These sub-haplotype
associations suggest that specific combinations of H1-background variants, on top of
the rs1800547 A allele, determine the magnitude of PSP risk.
The chromosome 17q21.31 region — anchored by rs1800547 — represents the single
strongest genetic risk locus for PSP identified to date.
Alzheimer's Disease: A Different Pathway
A study of 17,996 participants (8,559 AD cases, 9,437 controls) across Spanish and
international cohorts found that rs1800547 itself was associated with AD risk (OR 1.12,
p=0.0025)66 A study of 17,996 participants (8,559 AD cases, 9,437 controls) across Spanish and
international cohorts found that rs1800547 itself was associated with AD risk (OR 1.12,
p=0.0025)
The effect was strongest in APOE ε4 non-carriers — suggesting MAPT H1
represents an alternative causal pathway to AD distinct from amyloid-driven disease.
The risk was highest in individuals over age 77 without APOE ε4 (p=0.001), suggesting
a late-life tau-driven pathway independent of beta-amyloid accumulation. For people
without the APOE ε4 allele, the MAPT H1 haplotype tagged by rs1800547 becomes a
more prominent contributor to AD risk.
ALS and Frontotemporal Spectrum
Beyond the primary tauopathies, a 2023 study of Bulgarian ALS patients found the
H1b subhaplotype (containing the rs1800547 A allele) conferred a nearly 2-fold
increased risk for sporadic ALS77 a 2023 study of Bulgarian ALS patients found the
H1b subhaplotype (containing the rs1800547 A allele) conferred a nearly 2-fold
increased risk for sporadic ALS
The authors propose that fine transcriptional
regulation at the MAPT locus, including rs1800547's splice factor interactions,
may influence ALS susceptibility through shared tau biology with FTD.
ALS and FTD share genetic and pathological overlap, and MAPT variation may
contribute to the clinical spectrum between them.
Practical Actions
For H1/H1 carriers (AA genotype), the relevant clinical considerations are monitoring for motor symptoms that might indicate early parkinsonism, PSP, or CBD — conditions where early specialist evaluation matters for accurate diagnosis and prognosis. PSP in particular is frequently misdiagnosed as Parkinson's disease but responds differently to treatment. For the Alzheimer's disease risk — especially relevant for APOE ε4 non-carriers — knowing your MAPT status can help contextualize the late-life cognitive monitoring picture.
There are currently no approved pharmacological agents specifically targeting MAPT splicing or H1-driven tau isoform imbalance, though several anti-tau therapies are in clinical trials. Lifestyle factors — particularly aerobic exercise and head trauma prevention — have independent evidence for neuroprotection across multiple pathways relevant to tauopathy risk.
Interactions
rs1800547 and rs17649553 both tag the same H1/H2 haplotype and are in very strong linkage disequilibrium. If a person's genome contains both SNPs, their results should be concordant. The H1 risk at this locus compounds with rs356182 (SNCA) for Parkinson's disease risk — though interaction analyses have found these act independently rather than epistatically. In Alzheimer's disease, the H1/H2 distinction interacts with APOE genotype (rs429358), with H1 risk most pronounced in APOE ε4 non-carriers.
MTHFR C677T — The Methylation Gatekeeper
MTHFR (methylenetetrahydrofolate reductase) is arguably the most talked-about gene in nutritional genomics, and for good reason. It encodes the enzyme that converts 5,10-methylenetetrahydrofolate into 5-methyltetrahydrofolate 11 The active form of folate that enters the methylation cycle (methylfolate), the biologically active form of folate that your body actually uses. Methylfolate is essential for the methylation cycle 22 The methylation cycle adds methyl groups to DNA, proteins, and neurotransmitters — essential for hundreds of reactions, which affects DNA repair, neurotransmitter production, detoxification, and hundreds of other biochemical reactions.
The Mechanism
The C677T variant (rs1801133) causes an alanine-to-valine substitution 33 Alanine-to-valine substitution at position 222 of the enzyme (p.Ala222Val) at position 222 of the MTHFR enzyme. This makes the enzyme thermolabile 44 Thermolabile: the enzyme loses stability and function at normal body temperature — it loses activity at body temperature. The AA genotype 55 TT on the coding strand — 23andMe reports the complementary strand retains only about 30% of normal enzyme activity, while the AG genotype 66 CT on the coding strand retains about 65%. This means less dietary folate and supplemental folic acid gets converted to the methylfolate your cells need.
The Evidence
The C677T variant is one of the most extensively studied genetic variants in human
biology. A meta-analysis of over 80 studies77 meta-analysis of over 80 studies
Wen YY et al. Meta-analysis across 82 studies confirming the MTHFR-homocysteine link confirmed that the TT genotype is
associated with 25% higher homocysteine levels when folate intake is low. Elevated
homocysteine is an independent risk factor for cardiovascular disease88 cardiovascular disease
Mangoni AA & Jackson SHD. Homocysteine and cardiovascular disease. Am J Med, 2002, neural tube
defects, and possibly cognitive decline. However, the key finding is that adequate
folate intake essentially normalizes homocysteine in most TT individuals. A
large meta-analysis99 large meta-analysis
Clarke R et al. Homocysteine and coronary heart disease meta-analysis, 2012 found a 15%
excess coronary heart disease risk in TT homozygotes compared to CC homozygotes.
The Folic Acid Question
Synthetic folic acid (found in fortified foods and cheap supplements) must be
converted by MTHFR to become active methylfolate. If your MTHFR is working at
only 30% capacity, this conversion is a bottleneck. Methylfolate supplements
bypass this step entirely, which is why they are often recommended for people with
the TT genotype. Riboflavin (vitamin B2) is an essential cofactor for MTHFR and
has been shown to lower blood pressure1010 shown to lower blood pressure
McNulty H et al. showed riboflavin 1.6mg/day lowers blood pressure in MTHFR TT individuals by stabilizing the thermolabile enzyme
in TT individuals by stabilizing the thermolabile enzyme.
Practical Implications
The MTHFR C677T variant is extremely common — about 10-15% of Europeans are TT and about 40% are CT. It is not a disease-causing mutation. With adequate folate (especially as methylfolate), B12, B2, and B6 intake, most people with the TT genotype function perfectly normally. The key is knowing your status so you can optimize your B vitamin strategy.
Interactions
The C677T variant interacts importantly with the A1298C variant (rs1801131) — compound heterozygosity (one copy of each) can reduce MTHFR activity to 40-50%. It also interacts with SLC19A1 (rs1051266), which controls folate transport into cells, and COMT (rs4680), which determines tolerance for methyl donors. Methotrexate, an antifolate drug, has increased toxicity in C677T carriers.
HNF4A P2 Promoter — The Population-Specific Beta-Cell Diabetes Switch
HNF4A11 HNF4A
Hepatocyte Nuclear Factor 4 Alpha — a nuclear receptor transcription factor
that controls dozens of metabolic genes in liver, intestine, kidney, and pancreatic
beta cells is one of the master regulators
of glucose and lipid homeostasis. It has a unique dual-promoter architecture: P1 drives
the adult-liver isoform, while the P2 promoter22 P2 promoter
Located approximately 46 kb upstream
of P1, the P2 promoter is active specifically in pancreatic beta cells and the fetal
liver. Essentially all beta-cell HNF4A expression derives from P2-driven transcripts —
making this promoter region critical for pancreatic function while being largely
irrelevant to hepatic HNF4A activity. drives the "beta-cell-specific" isoform
(HNF4A7–12). rs1884613 is an intronic variant that falls within the P2 haplotype
block — a cluster of highly correlated SNPs tagging the P2 promoter risk haplotype.
Rare P2 mutations that inactivate HNF4A cause MODY133 MODY1
Maturity-Onset Diabetes of
the Young type 1 — an autosomal dominant, early-onset monogenic diabetes arising from
HNF4A haploinsufficiency in beta cells.
rs1884613 is not a MODY mutation, but it operates along the same biological axis
at vastly smaller effect sizes.
The Mechanism
The P2 promoter drives HNF4A isoforms that regulate a suite of genes essential for glucose-stimulated insulin secretion — including glucokinase (the islet glucose sensor), the Kir6.2 subunit of the ATP-sensitive potassium channel, and the insulin gene itself. The G allele at rs1884613 is in near-perfect LD (r²≈0.99) with rs2144908 and marks carriers of a P2 haplotype associated with subtly reduced P2 promoter activity and lower HNF4A expression in pancreatic beta cells. Less HNF4A means quieter insulin secretion signaling, which over years accumulates into impaired glucose tolerance and type 2 diabetes susceptibility.
In established type 2 diabetes, the HNF4A P2 isoform becomes
aberrantly re-expressed in adult liver44 aberrantly re-expressed in adult liver
Glucagon (chronically elevated in T2D)
activates TET3, which demethylates the P2 promoter. FOXA2 then drives P2 transcription
in hepatocytes, causing the fetal HNF4A isoform to stimulate excessive hepatic glucose
output — a feed-forward loop worsening hyperglycemia.
The P2 risk haplotype may predispose to this aberrant hepatic re-activation under
metabolic stress, compounding both impaired beta-cell secretion and excess hepatic
glucose production.
The Evidence
The HNF4A P2 haplotype was first linked to T2D in 2004 when
Weedon et al.55 Weedon et al.
Weedon MN et al. Common variants of the hepatocyte nuclear factor
4-alpha P2 promoter are associated with type 2 diabetes in the U.K. population.
Diabetes 2004. PMID:15504983 identified
rs1884613 among four P2 region variants associated with T2D in 5,256 UK subjects
(risk haplotype OR 1.15, 95% CI 1.02–1.33). A 2007 Scandinavian meta-analysis by
Johansson et al.66 Johansson et al.
Johansson S et al. Studies in 3,523 Norwegians and meta-analysis
in 11,571 subjects indicate that variants in the HNF4A P2 region are associated with
type 2 diabetes in Scandinavians. Diabetes 2007. PMID:17827402
confirmed the signal with OR 1.14 (95% CI 1.06–1.23, P=0.0004) across 4,000 Norwegian
cases and 7,571 controls.
The most striking finding emerged from the
Barroso et al. 2008 population comparison77 Barroso et al. 2008 population comparison
Barroso I et al. Population-specific risk
of type 2 diabetes conferred by HNF4A P2 promoter variants: a lesson for replication
studies. Diabetes 2008. PMID:18728231:
rs1884613 showed OR ~1.7 in Ashkenazi Jewish subjects (n=991; P<1.6×10⁻⁶) but
only OR 1.04 (non-significant) in UK populations (n=4,022). This dramatic difference at
nearly identical G allele frequencies (~17–23%) implies the causal variant within the
P2 haplotype block is in closer LD with rs1884613 in Ashkenazi Jewish ancestry due to
extended founder-effect haplotype structure.
The Ashkenazi signal was independently confirmed by
Neuman et al. 201088 Neuman et al. 2010
Neuman RJ et al. Gene-gene interactions lead to higher risk for
development of type 2 diabetes in an Ashkenazi Jewish population. PLoS One 2010.
PMID:20361036 in 974 cases and 896
controls: rs1884613 gave unadjusted OR 1.69 (95% CI 1.40–2.03, P<0.0001) and adjusted
OR 1.77 (95% CI 1.39–2.24). Critically, gene-gene interaction analyses revealed that
carriers of both rs1884613 G and WFS1 rs10010131 risk alleles had OR 3.0 (95% CI
1.7–5.3) for T2D, and carriers of both HNF4A and TCF7L2 rs12255372 risk alleles had
OR 2.4 (95% CI 1.7–3.4) — demonstrating that HNF4A P2 haplotype risk is
substantially amplified by co-occurring beta-cell stressors.
Practical Actions
Because the primary risk mechanism is impaired HNF4A-driven beta-cell insulin secretion, the most actionable dietary lever is reducing the insulin secretory demand placed on pancreatic beta cells — specifically choosing lower-glycemic-load carbohydrates (legumes, intact grains, non-starchy vegetables) over rapidly absorbed refined starches that demand peak insulin secretory responses. Periodic monitoring of fasting glucose and HbA1c allows early detection of declining beta-cell reserve. For Ashkenazi Jewish individuals carrying the G allele, the ~1.7-fold per-allele risk estimate is clinically meaningful and warrants more proactive surveillance than for non-Ashkenazi Europeans where the effect is modest (~OR 1.14).
Interactions
rs1884613 and rs2144908 are in near-perfect LD (r²=0.99) and tag the same P2 haplotype signal — they cannot be separated in published studies. Other P2 haplotype tags (rs4810424, rs1884614, rs6031552) probe the same signal with varying LD. rs4812829 is in a different HNF4A intronic LD block and represents an independent GWAS signal in South Asians; carrying risk alleles at both rs1884613 and rs4812829 would represent additive, not correlated, HNF4A-linked risk.
The Neuman 2010 gene-gene interaction data identifies WFS1 rs10010131 (beta-cell ER homeostasis) and TCF7L2 rs12255372 (Wnt-driven incretin signaling) as the two most important interacting variants. The WFS1 interaction (OR 3.0) is especially notable because WFS1/wolframin controls ER calcium handling in beta cells — an independent beta-cell stress mechanism that synergizes with transcriptional HNF4A deficiency.
CYP4F2*3 — Vitamin K Metabolism and Warfarin Dosing
CYP4F2 encodes a cytochrome P450 enzyme11 cytochrome P450 enzyme
The CYP4F2 enzyme is the primary hepatic vitamin K1 oxidase that metabolizes vitamin K1 to hydroxyvitamin K1, effectively removing it from the vitamin K cycle. This serves as a counterbalance to VKORC1 (vitamin K epoxide reductase), preventing excessive accumulation of vitamin K. The V433M variant, also known as CYP4F2*3, is a common missense mutation that significantly impacts warfarin dosing requirements22 warfarin dosing requirements
Warfarin is an anticoagulant that works by inhibiting VKORC1, thereby limiting vitamin K availability for clotting factor activation.
The Mechanism
The rs2108622 variant causes a valine-to-methionine substitution at position 433 in the CYP4F2 protein. Research using human liver microsomes33 human liver microsomes
Tissue samples analyzed from liver banks genotyped for this variant demonstrates that individuals carrying the T allele (433Met) have both reduced CYP4F2 protein concentrations and decreased vitamin K1 oxidation activity. The T allele is associated with approximately 40-45% reduction in enzyme activity compared to the wild-type. Because less vitamin K is being metabolized and removed, hepatic vitamin K1 levels rise, providing more substrate for VKORC1 to convert into the active form needed for clotting factor synthesis. This elevated vitamin K counteracts warfarin's anticoagulant effect, necessitating higher warfarin doses to achieve the same therapeutic response.
The Evidence
The association between CYP4F2*3 and warfarin dosing was first identified in 200844 first identified in 2008
Caldwell et al. CYP4F2 genetic variant alters required warfarin dose. Blood, 2008 through a genome-wide association study that screened over 1,200 SNPs in a cohort of warfarin patients. The discovery study found that TT homozygotes required approximately 1 mg/day more warfarin than CC homozygotes across three independent cohorts representing diverse US geographic regions.
This finding has been extensively replicated worldwide55 extensively replicated worldwide
Liang et al. Influence of CYP4F2 genotype on warfarin dose requirement: a systematic review and meta-analysis. Thrombosis Research, 2012. A 2012 meta-analysis of 30 studies involving 9,470 participants confirmed that T-allele carriers require an 8.3% higher mean daily coumarin dose than CC homozygotes (95% CI: 5.6-11.1%, P < 0.0001). The effect is consistent across European and Asian populations but appears less pronounced in individuals of African ancestry, where the T allele is also much rarer.
The Clinical Pharmacogenetics Implementation Consortium (CPIC)66 Clinical Pharmacogenetics Implementation Consortium (CPIC)
Johnson et al. CPIC Guideline for Pharmacogenetics-Guided Warfarin Dosing: 2017 Update. Clinical Pharmacology & Therapeutics, 2017 incorporated CYP4F2*3 into their 2017 warfarin dosing guideline update. While the effect size is smaller than that of CYP2C9 and VKORC1 variants (which collectively explain ~40% of dose variability), CYP4F2*3 contributes an additional 1-4% to the explained variance and improves the accuracy of pharmacogenetic dosing algorithms.
Practical Implications
If you are prescribed warfarin and carry one or two copies of the T allele, you will likely need a higher maintenance dose to reach your target INR (International Normalized Ratio, typically 2-3 for most indications). CPIC recommends an optional 5-10% dose increase for non-African American individuals with at least one T allele when using a pharmacogenetic algorithm that already accounts for CYP2C9 and VKORC1 genotypes. This translates to approximately 0.5-1 mg/day additional warfarin.
The effect appears most clinically relevant in patients who also carry VKORC1 variants associated with low warfarin requirements. In these individuals, CYP4F2*3 can explain a significant portion of the remaining dose variability. The interaction makes sense mechanistically: VKORC1 variants that increase warfarin sensitivity reduce the amount of active vitamin K, while CYP4F2*3 increases available vitamin K — the two variants work in opposite directions.
It's important to note that CYP4F2 testing is considered optional rather than essential in clinical warfarin management. The major determinants remain CYP2C9 (which metabolizes warfarin itself) and VKORC1 (warfarin's direct target). However, incorporating CYP4F2*3 into dosing algorithms does incrementally improve prediction accuracy and may be particularly valuable for patients who are difficult to stabilize or who fall outside the predicted dose range from CYP2C9/VKORC1 alone.
Interactions
CYP4F2*3 is one of four genetic variants incorporated into modern pharmacogenetic warfarin dosing algorithms, alongside VKORC1 rs9923231, CYP2C9*2 (rs1799853), and CYP2C9*3 (rs1057910). For individuals of African ancestry, the CYP2C cluster variant rs12777823 is more clinically relevant than CYP4F2*3.
The combined effect of these variants is complex but predictable. A person with VKORC1 AA genotype (high warfarin sensitivity) plus CYP2C9*1/*1 (normal metabolism) plus CYP4F2 TT (reduced vitamin K metabolism) represents competing influences: the VKORC1 variant lowers dose requirements substantially, while CYP4F2 TT modestly increases them. The net effect is still a lower-than-average dose, but not as low as VKORC1 AA alone would predict. Modern dosing algorithms such as those validated by the International Warfarin Pharmacogenetics Consortium77 such as those validated by the International Warfarin Pharmacogenetics Consortium
Available at www.warfarindosing.org incorporate all these variants simultaneously to generate personalized dose predictions.
Gene-gene interactions worth noting for compound implications include CYP4F2 TT + VKORC1 low-sensitivity genotypes (requiring careful upward dose titration) and CYP4F2 TT + CYP2C9 poor metabolizer status (where warfarin clearance is slow but more drug is needed to overcome elevated vitamin K). However, these interactions are generally handled by existing pharmacogenetic algorithms rather than requiring separate clinical decision-making.
PER3 Pro864Ala — Your Internal Clock's Tempo
The PER3 gene encodes Period Circadian Regulator 311 Period Circadian Regulator 3
One of three Period proteins (PER1, PER2, PER3) that form the negative arm of the mammalian circadian clock feedback loop,
a protein at the heart of the molecular clock that governs your
~24-hour sleep-wake cycle. Every cell in your body runs a version
of this clock, and PER3 helps set its tempo. The rs228697 variant
swaps a proline for an alanine at position 864, subtly changing how
the clock protein behaves — and, with it, whether you lean toward
being a morning lark or a night owl.
PER3 is best known for its VNTR polymorphism22 VNTR polymorphism
A variable number tandem repeat (4 or 5 copies of a 54-bp repeat) in exon 18 that strongly influences sleep timing and homeostatic sleep drive, but is not on SNP genotyping chips
(4-repeat vs 5-repeat), which strongly predicts sleep timing and
sleep need but cannot be genotyped on standard SNP chips. The
Pro864Ala missense variant (rs228697) is the best SNP-chip proxy for
PER3 circadian effects and has its own independent functional
consequences.
The Mechanism
The circadian clock runs on a transcription-translation feedback loop33 transcription-translation feedback loop
CLOCK and BMAL1 proteins activate transcription of PER and CRY genes; the PER/CRY protein complex then feeds back to repress CLOCK-BMAL1, creating a ~24-hour oscillation.
CLOCK and BMAL1 proteins bind to E-box elements44 E-box elements
Short DNA sequences (CACGTG) in gene promoters that CLOCK-BMAL1 heterodimers recognize to activate transcription of clock-controlled genes
to activate PER and CRY genes. The resulting PER and CRY proteins
accumulate, form complexes, enter the nucleus, and repress their own
transcription — completing one cycle roughly every 24 hours.
The Pro864Ala substitution sits in a region containing two potential
SH3-binding motifs55 SH3-binding motifs
Src Homology 3 domains mediate protein-protein interactions; the proline-to-alanine change disrupts these binding sites, altering how PER3 interacts with partner proteins.
Replacing proline (a rigid amino acid that enforces tight bends in protein
structure) with alanine (a flexible, small amino acid) alters the local
protein conformation. Functional experiments66 Functional experiments
Lavebratt C et al. Molecular analyses of circadian gene variants. Transl Psychiatry, 2016
showed that the variant (G allele) protein is more stable than the
wild-type — it degrades more slowly, accumulates to higher levels, and
recruits more PER2 into the transcription repression complex. The result
is a stronger repressor of CLOCK-BMAL1-driven transcription.
When the variant hPER3 was expressed in mammalian fibroblasts, it caused
a significant, dose-dependent lengthening of the circadian period. A
computational model77 computational model
Liberman AR et al. Circadian clock model supports molecular link between PER3 and human anxiety. Sci Rep, 2017
estimated this lengthening at 2-6% — enough to shift a 24-hour period
toward roughly 25 hours. People whose internal clock runs long tend to
drift toward later sleep and wake times — the hallmark of evening
chronotype.
The Evidence
The initial genetic association88 initial genetic association
Hida A et al. Screening of clock gene polymorphisms demonstrates association of a PER3 polymorphism with morningness-eveningness preference and circadian rhythm sleep disorder. Sci Rep, 2014
came from a Japanese study of 925 controls, 182 delayed sleep phase
patients, and 67 free-running type patients. The G allele was
significantly associated with eveningness preference (sex-adjusted
OR 2.48, 95% CI 1.34-4.60, corrected P = 0.012). More strikingly,
G allele frequency was doubled in free-running type patients — people
whose internal clock fails to entrain to the 24-hour day (age- and
sex-adjusted OR 2.02, 95% CI 1.16-3.52, P = 0.017).
An Italian replication study99 Italian replication study
Lazar AS et al. Diurnal preference, mood and the response to morning light in relation to polymorphisms in the human clock gene PER3. Sci Rep, 2017
of 786 Caucasian subjects confirmed the chronotype association (OR
2.10, 95% CI 1.21-3.65, P = 0.008) and found that G carriers showed
lower mood scores in the late afternoon and early evening — the time
when a longer-period clock would be most misaligned with the external
day.
Beyond chronotype, a case-control study1010 case-control study
Lavebratt C et al. Molecular analyses of circadian gene variants reveal sex-dependent links between depression and clocks. Transl Psychiatry, 2016
of 592 major depressive disorder (MDD) cases and 776 controls found
the G allele associated with MDD risk (OR 1.39, allelic P = 0.007),
with a stronger effect in women (allelic P = 0.041). Separately,
anxiety levels1111 anxiety levels
Liberman AR et al. Sci Rep, 2017
were significantly higher in G allele carriers (F(2,305) = 3.195,
P = 0.042), consistent with the broader finding that circadian
misalignment elevates anxiety and depression risk.
Practical Implications
This variant does not cause disease. It shifts your circadian tendency. If you carry the G allele and find yourself naturally gravitating toward later bedtimes, the biology supports what you already feel. The key is to work with your chronotype rather than fight it:
Morning light exposure is the most powerful tool for advancing a late-running clock. Even 20-30 minutes of outdoor light before 10 AM can shift your circadian phase earlier. Conversely, avoiding bright light (especially blue-enriched screens) in the 2-3 hours before desired bedtime prevents the clock from being pushed even later.
Meal timing also entrains peripheral clocks. Eating your last substantial meal at least 3 hours before sleep, and anchoring breakfast to a consistent time, provides a secondary timing cue that reinforces the light signal.
For the mood dimension, the association between this variant and
depression/anxiety appears to operate through circadian misalignment
rather than a direct effect on mood neurocircuitry. Maintaining
regular sleep-wake timing — even on weekends — reduces
social jetlag1212 social jetlag
The discrepancy between your biological clock and your social schedule, measured as the difference between midpoint of sleep on work days vs free days
and may mitigate the mood risk.
Interactions
PER3 Pro864Ala interacts with the PER3 VNTR (4-repeat vs 5-repeat). The G allele combined with the PER3-4 repeat haplotype shows a stronger association with morningness than either variant alone (OR 2.19 for the haplotype vs OR 2.10 for the SNP alone). However, the VNTR is not genotyped on standard SNP chips, so this interaction cannot be assessed from 23andMe data.
PER3 is part of a broader circadian gene network including CLOCK, BMAL1 (ARNTL), PER1, PER2, CRY1, and CRY2. Variants in these genes may compound or buffer PER3 effects on chronotype, but specific SNP-SNP interactions with rs228697 have not been well characterized in the published literature.
LYPLAL1 and Fat Distribution: A Sex-Dimorphic Signal at Chromosome 1q41
Where you store fat matters as much as how much fat you carry. Two people
with identical BMIs can have very different metabolic and health profiles
depending on whether fat accumulates centrally (around the abdomen) or
peripherally (in the hips, buttocks, and legs). The LYPLAL1 locus on
chromosome 1q41 is one of the earliest and most replicated genetic signals
for waist-hip ratio adjusted for BMI11 waist-hip ratio adjusted for BMI
WHR-adjBMI is a phenotype that captures fat distribution independently of total adiposity — it reflects where fat is located, not how much there is,
and its effects are dramatically stronger in women than in men.
rs2605100 sits within an intronic region of the LYPLAL1 antisense RNA 1 gene
(LYPLAL1-AS1), approximately 259 kb from the LYPLAL1 protein-coding gene
itself. LYPLAL1 (Lysophospholipase-Like 1) encodes a serine hydrolase with
unclear endogenous substrate22 serine hydrolase with
unclear endogenous substrate
Crystal structure shows LYPLAL1 is structurally
similar to acyl-protein thioesterases but with a closed hydrophobic tunnel
preferring short acyl chains; it has been proposed to act as a triglyceride
lipase in adipose tissue and to regulate protein depalmitoylation.
Despite its name, the protein is not a classical lysophospholipase. Its precise
physiological substrate and pathway remain under active investigation.
The Mechanism
The G allele at rs2605100 is the major allele (frequency ~71% in Europeans, up to ~89% in Africans) and is the variant associated with higher WHR — meaning more centrally distributed fat — in women. The A allele is the protective (favorable) form linked to a more gynoid (hip-and-thigh-predominant) fat distribution pattern. The variant likely acts as a regulatory signal rather than directly altering the LYPLAL1 protein sequence, possibly influencing the expression of LYPLAL1 or its antisense RNA in adipose tissue.
Mouse knockout studies33 Mouse knockout studies
CRISPR-Cas9 whole-body Lyplal1 KO on high-fat/high-sucrose diet; n=20 per sex per diet
demonstrate a sex-specific role: female (but not male) Lyplal1 KO mice on a
high-fat, high-sucrose diet weighed approximately 5 g less than wildtype controls,
had reduced total body fat percentage, and showed smaller white adipose tissue
depots across inguinal, gonadal, and perirenal sites. This mirrors the human
GWAS pattern where LYPLAL1 variants affect fat distribution exclusively or
predominantly in women. One proposed mechanism is that LYPLAL1 functions as
an acyl thioesterase involved in protein palmitoylation44 protein palmitoylation
Palmitoylation is the
reversible attachment of palmitate to cysteine residues, which regulates membrane
targeting and activity of proteins including estrogen receptors and metabolic
signaling intermediates, potentially
influencing estrogen receptor localization and thus coupling the gene's function
to sex hormone signaling.
The Evidence
The original discovery came from a meta-analysis of 38,580 individuals55 meta-analysis of 38,580 individuals
Lindgren et al. 2009, PLoS Genetics; 32 GWAS cohorts in stage 1, followed
by replication in 70,689 subjects
that identified rs2605100 as the sentinel variant at the LYPLAL1 locus.
The association with WHR reached genome-wide significance in women
(p = 1.3×10⁻⁸) but was completely absent in men (p = 0.50). The absolute
effect — 0.0014 units of WHR per G allele — is modest, explaining approximately
0.02% of WHR variance in women. This is typical for GWAS hits: individually
small effects that are biologically real and replicated.
A subsequent GIANT consortium analysis66 subsequent GIANT consortium analysis
Heid et al. 2010, Nature Genetics;
n=77,167 discovery, n=113,636 replication; 32 genome-wide association studies
confirmed the LYPLAL1 signal alongside 13 newly discovered loci. Seven of the
14 confirmed WHR loci showed marked sex dimorphism, all with stronger effects
in women — a pattern consistent with the well-established sex differences in
adipose tissue distribution driven by estrogen signaling.
Metabolic consequences of the G allele extend beyond fat location. In a
Danish population study77 Danish population study
Dalgaard et al. 2011, PLoS ONE; Inter99 cohort,
n=6,038 adults, each additional G
allele was associated with 3% higher fasting triglycerides (p = 0.003),
3% higher fasting insulin (p = 0.003), and 4% higher HOMA-IR (p = 0.001).
Notably, the triglyceride effect was male-restricted (6% per G allele in men,
p = 2.4×10⁻⁴; interaction p = 0.02), suggesting different metabolic routes
through which the variant affects men vs. women.
Evidence from bariatric surgery outcomes88 bariatric surgery outcomes
Lund et al. 2016, n=251 RYGB
patients (186 women); rs4846567, which is in high LD with rs2605100
shows that individuals homozygous for the favorable T allele of the correlated
SNP rs4846567 lost 7% more excess body weight after gastric bypass surgery
and reported 74% lower hunger scores and 53% lower disinhibition scores on
validated eating behavior questionnaires compared to G-allele carriers.
Overall, evidence for rs2605100 is strong: multiple large, independent GWAS in European and Asian populations, consistent replication, plausible biological mechanism supported by animal models, and metabolic downstream effects that extend beyond the index phenotype.
Practical Actions
For carriers of one or two copies of the G allele — particularly women — the key insight is a genetic tendency toward central fat redistribution. This does not override lifestyle factors but does establish a meaningful background predisposition. Two areas are specifically supported by the biological evidence:
First, triglyceride management is directly implicated. The G allele is associated with elevated fasting triglycerides regardless of fat distribution. Dietary approaches that specifically lower triglycerides — not generic "healthy eating" — are warranted: reducing refined carbohydrate and added sugar intake, increasing omega-3 fatty acids (EPA/DHA from marine sources), and limiting alcohol.
Second, body composition monitoring matters more than scale weight. Since the LYPLAL1 locus affects fat distribution rather than total fat mass, standard BMI tracking misses the relevant phenotype. Waist circumference and waist-hip ratio track the fat depot that this variant influences most directly.
Interactions
The LYPLAL1 locus signal is most commonly indexed by rs4846567, which is in
high linkage disequilibrium99 linkage disequilibrium
LD means the two variants are almost always
co-inherited; knowing your rs2605100 genotype reliably predicts your rs4846567
genotype with rs2605100.
For practical interpretation, the two variants can be treated as equivalent.
The LYPLAL1 fat distribution signal operates independently of FTO/MC4R total adiposity signals — having a high-risk FTO genotype (rs9939609 AA) drives increased overall fat mass, while LYPLAL1 determines where that fat is preferentially stored. These can compound: an individual with high-risk genotypes at both loci may gain more total fat and distribute it more centrally.
Supervisor note — candidate compound action: individuals carrying the G allele at rs2605100 (LYPLAL1, central redistribution) AND the AA genotype at rs9939609 (FTO, higher total fat mass) represent the highest-risk combination for central obesity among these two loci. A combined recommendation addressing both elevated total fat accumulation and its central distribution — specifically targeting fasting triglycerides and waist circumference rather than weight alone — would be appropriate and supported by additive effects documented in stratified GWAS analyses.
FADS2 rs2727271 — Your Delta-6 Desaturase Throttle
FADS211 FADS2
Fatty acid desaturase 2, encoding the delta-6 desaturase (D6D) enzyme — the first
and rate-limiting step in converting the plant-based omega-3 alpha-linolenic acid (ALA)
into EPA and ultimately DHA is one of the most
consequential enzymes in human lipid metabolism. It processes both the omega-3 pathway
(ALA → stearidonic acid → EPA) and the omega-6 pathway (linoleic acid → gamma-linolenic
acid → DGLA → arachidonic acid). Without adequate FADS2 activity, the plant-based omega-3
fats in flaxseed, walnuts, and chia seeds cannot be converted into the long-chain forms
that the brain, cardiovascular system, and immune cells actually use.
rs2727271 is an intronic variant within FADS2 on chromosome 11q12-13 that tags a haplotype block associated with altered FADS2 expression. The T allele appears in about 14% of Europeans but in 41% of East Asians, pointing to population-specific evolutionary pressures on fatty acid conversion capacity — likely tied to differences in dietary omega-3 sources across ancestral populations.
The Mechanism
rs2727271 sits deep within an intron of FADS2 (position c.142-1892 relative to the
coding sequence), with no direct protein change. Its functional effect is presumed to be
cis-regulatory — altering FADS2 mRNA transcription efficiency22 cis-regulatory — altering FADS2 mRNA transcription efficiency
Analogous to the
promoter-methylation mechanism of rs174537 in FADS1, where intronic/regulatory variants
in the same FADS cluster modulate expression via allele-specific DNA methylation at
enhancer sites between FADS1 and FADS2.
The T allele is associated with reduced enzyme activity, which produces a predictable
biochemical cascade: ALA and linoleic acid accumulate because the first desaturation
step is slowed, while downstream long-chain products (EPA, arachidonic acid) are
produced at lower rates.
A genome-wide association study identified rs2727271-A (the major allele) as associated with decreased circulating cis/trans-18:2 fatty acid levels (p=7×10⁻⁹), consistent with A-allele carriers more efficiently converting 18:2 precursors through the desaturation pathway. By contrast, T-allele carriers accumulate more precursor fatty acids — a biochemical signature of reduced D6D throughput.
The Evidence
The strongest evidence for the FADS2 locus comes from large genome-wide association
studies of circulating polyunsaturated fatty acids. A landmark 2009 GWAS in 1,075
InCHIANTI participants33 landmark 2009 GWAS in 1,075
InCHIANTI participants
Tanaka et al., PLoS Genetics — the FADS1/2/3 cluster on
chromosome 11 emerged as the dominant genetic determinant of plasma PUFA levels, with
rs174537 alone explaining 18.6% of additive variance in arachidonic acid (p=5.95×10⁻⁴⁶)
and associating strongly with EPA (p=1.07×10⁻¹⁴)
established that natural variation at this locus has an unusually large effect on fatty
acid status — larger than most single SNPs in metabolic genetics.
Subsequent studies refined the picture for FADS2 specifically. Zec et al. 2020 in
Nutrition Research44 Zec et al. 2020 in
Nutrition Research
Cross-sectional study of 286 Serbian adults; FADS2 rs174576 minor
allele carriers showed plasma AA β=−1.14 (95% CI: −2.25 to −0.43) and reduced estimated
desaturase-5 activity after multivariate adjustment
confirmed that FADS2 minor alleles consistently associate with lower arachidonic acid.
Schuchardt et al. 201655 Schuchardt et al. 2016
111 MCI patients; FADS2 minor allele carriers at rs3834458,
rs1535, rs174575, and rs174576 showed higher precursor PUFA levels and lower AA in
erythrocyte membranes demonstrated that
the effect on circulating PUFA composition is detectable in red blood cell membranes —
the most clinically relevant tissue compartment for omega-3 status assessment.
Isotope tracer work directly quantifying in-vivo ALA conversion found that FADS minor
allele homozygotes had lower plasma EPA and lower [13C]EPA enrichment at 24 and 48 hours
after ALA tracer dosing66 FADS minor
allele homozygotes had lower plasma EPA and lower [13C]EPA enrichment at 24 and 48 hours
after ALA tracer dosing
Gillingham et al. 2013, Am J Clin Nutr, n=103; four FADS SNPs
studied across all dietary conditions,
providing direct mechanistic confirmation that reduced FADS activity translates into
measurably less ALA-to-EPA conversion.
The East Asian frequency of the T allele (41%) versus African populations (3%) is striking. This stratification suggests that the T allele may have been neutral or mildly adaptive in populations historically relying on preformed dietary EPA and DHA from fish, while being selected against in populations that depended more on plant-based omega-3 conversion.
Practical Actions
For T-allele carriers, the core problem is that plant-based omega-3 sources (flaxseed, chia, walnuts, hemp) are unreliable. These foods supply ALA, which requires functional FADS2 to produce EPA and DHA. The T allele slows this conversion at the first and rate-limiting step, meaning even high ALA intakes may not translate to adequate EPA and DHA status.
The practical solution is to bypass the impaired conversion step by providing preformed EPA and DHA directly — from fatty fish, concentrated fish oil, or algae-based EPA/DHA supplements. AT heterozygotes benefit from 1–2 g EPA+DHA daily; TT homozygotes require 2–4 g daily to compensate for substantially reduced D6D throughput.
Omega-3 index testing (erythrocyte EPA+DHA as a percentage of total fatty acids) provides the most direct readout of whether supplementation is working. A value below 4% indicates clinically significant deficiency requiring dose adjustment.
Interactions
rs2727271 is in linkage disequilibrium with other FADS2 cluster variants, including rs3834458, rs174575, rs174576, and rs1535. Carrying multiple minor alleles across the FADS cluster (both FADS1 rs174537 T allele and FADS2 rs2727271 T allele) would compound the conversion deficit, since FADS1 (delta-5 desaturase) acts downstream of FADS2 in the same pathway. Combined FADS1+FADS2 minor allele carriers face a double bottleneck in long-chain PUFA synthesis.
FADS2 activity is also substrate-competitive: when dietary saturated fat (palmitic acid, 16:0) is high, FADS2 preferentially processes the saturated substrate rather than ALA or linoleic acid, further reducing long-chain PUFA synthesis in T-allele carriers who have less enzyme capacity to begin with.
ANRIL and Functional Aging — The 9p21.3 Longevity Locus
A single stretch of chromosome 9 — the 9p21.3 locus — is the most consistently
replicated region in genome-wide association studies for age-related disease.
Coronary artery disease, type 2 diabetes, glioma, melanoma, and now physical
aging have all been linked to variation here. At the heart of the locus sits
ANRIL (antisense non-coding RNA in the INK4 locus)11 ANRIL (antisense non-coding RNA in the INK4 locus)
Also designated CDKN2BAS or CDKN2B-AS1 — a long non-coding RNA transcribed antisense to the CDKN2A/CDKN2B tumor suppressor genes,
a molecular rheostat for cellular senescence.
rs2811712 is a tag SNP within ANRIL that was identified in a landmark study as
associated with physical function in older people22 associated with physical function in older people
Melzer D et al. A common variant of the p16INK4a genetic region is associated with physical function in older people. Mech Ageing Dev. 2007.
The G allele — carried by roughly one in five people of European ancestry — is
associated with substantially better preserved physical function in old age, while
the common A allele is the risk-conferring genotype.
The Mechanism
The 9p21.3 locus contains three protein-coding genes — CDKN2A (encoding
p16INK4a and p14ARF), CDKN2B (encoding p15INK4b), and MTAP — all flanked and
overlapped by ANRIL. p16INK4a and p15INK4b are
cyclin-dependent kinase inhibitors33 cyclin-dependent kinase inhibitors
CDK inhibitors block CDK4/6, preventing phosphorylation of Rb and halting cell-cycle progression from G1 to S phase — the canonical senescence checkpoint
that drive cellular senescence: as cells age and accumulate damage, p16 levels
rise, arresting the cell cycle and converting cells into
senescent "zombie" cells44 senescent "zombie" cells
Senescent cells stop dividing but remain metabolically active, secreting pro-inflammatory cytokines (the SASP — senescence-associated secretory phenotype) that damage neighboring tissue
that fuel the chronic inflammation underlying age-related functional decline.
ANRIL regulates CDKN2A/CDKN2B expression in cis through Polycomb group protein recruitment, particularly the PRC2 complex, which methylates histone H3K27 to silence the INK4 locus. In proliferating cells, ANRIL keeps senescence genes suppressed; as ANRIL expression shifts with aging or genetic variation, the balance tips toward elevated p16 and p15, accelerating the senescent phenotype. rs2811712 is an eQTL for CDKN2B expression: studies show that variation here correlates with altered ANRIL isoform abundance and downstream CDKN2B levels, providing a molecular pathway from genotype to functional aging phenotype.
The Evidence
The primary association was established by
Melzer et al. (2007)55 Melzer et al. (2007)
A common variant of the p16INK4a genetic region is associated with physical function in older people. Mechanisms of Ageing and Development.
across three independent European cohorts totaling 3,372 elderly individuals (EPIC-Norfolk sets 1 and 2 plus InCHIANTI). Severely limited physical function — defined by performance-based tests of walking speed, chair stand, and balance — was present in 15.0% of AA homozygotes versus 7.0% of GG homozygotes, nearly half the rate. The per-A-allele odds ratio was 1.48 (95% CI 1.17–1.88, p = 0.001), adjusted for age, sex, and study. The association held across all three cohorts tested independently, including a trend in the Iowa-EPESE cohort (n=419, p=0.079 one-sided), making this one of the first robust genetic associations for functional aging specifically.
A Han Chinese case-control study
Wu et al. (2012)66 Wu et al. (2012)
Heterozygote genotypes at rs2222823 and rs2811712 SNP loci are associated with cerebral small vessel disease in Han Chinese population
found the heterozygote genotype at rs2811712 was associated with OR 1.75 (CI 1.13–2.71, p=0.004) for cerebral small vessel disease — consistent with the broader pattern that the A (risk) allele at this locus impairs vascular and neurological aging beyond musculoskeletal function.
The broader 9p21.3 locus literature confirms that ANRIL variants influence
CDKN2B expression and cellular senescence77 CDKN2B expression and cellular senescence
SNPs here show inverse effects on ANRIL and CDKN2B expression, supporting a role of antisense transcription in regulating senescence pathways,
with the risk alleles generally increasing ANRIL expression in ways that paradoxically reduce appropriate p16/p15 upregulation in vascular cells, impairing the normal senescence response to damage. This locus is the strongest genetic signal for coronary artery disease in Europeans (OR ~1.3 per risk allele), though rs2811712 itself does not independently associate with CAD — its primary documented phenotype is functional physical aging.
Practical Actions
For carriers of two A alleles (AA), the evidence supports a targeted approach to slowing the accumulation of senescent cells and their inflammatory secretome. The most directly relevant interventions for the CDKN2B/p16 axis include:
- Senolytic compounds: Fisetin (a flavonoid concentrated in strawberries and onions) and quercetin have demonstrated senolytic activity in human studies — selectively promoting apoptosis in p16-expressing senescent cells. Intermittent high-dose fisetin (≥500 mg/day for 2–3 days per month) reduced senescent cell burden and SASP markers in published clinical data.
- Cardiovascular monitoring: The 9p21.3 region is the strongest common genetic locus for coronary artery disease. While rs2811712 itself does not independently associate with CAD, it tags a haplotype block where other risk variants reside. AA carriers should ensure thorough cardiovascular risk assessment.
- Strength and resistance training: p16-driven muscle senescence contributes directly to sarcopenia (age-related muscle loss) — the leading mediator of physical impairment in older adults. Resistance training specifically reduces senescent cell burden in skeletal muscle and preserves muscle fiber function through mechanisms that parallel the CDKN2B pathway.
AG carriers carry one protective G allele: physical function risk is intermediate, and the same principles apply with moderated urgency.
Interactions
rs10757278 and rs1333049 (9p21.3 haplotype): These are the primary CAD-associated SNPs at the 9p21 locus, in moderate-to-high LD with rs2811712. If you carry the risk haplotype at rs10757278 (G allele) in addition to AA at rs2811712, the combined burden on ANRIL/CDKN2B regulation is likely greater than either marker alone. The 9p21.3 block spans ~58 kb and multiple SNPs contribute to its overall phenotypic effect.
rs564398 (upstream of CDKN2A): Another 9p21 tag SNP associated with type 2 diabetes and glioma risk, which shares the ANRIL regulatory context. Combined risk at rs2811712 and rs564398 may implicate broader ANRIL dysregulation beyond the physical aging phenotype.
The Anabolic Switch — IGF-1 and Your Muscle-Building Potential
Insulin-like growth factor 1 (IGF-1) is one of the most powerful anabolic hormones
in the human body. It activates the PI3K/Akt/mTOR pathway11 activates the PI3K/Akt/mTOR pathway
The canonical growth
signaling cascade that controls muscle protein synthesis and hypertrophy,
stimulates satellite cell activation22 satellite cell activation
Muscle stem cells that divide and fuse to
repair damage and create new muscle tissue,
and drives skeletal muscle hypertrophy in response to training. The rs35767
polymorphism sits in the promoter region of the IGF1 gene, 1,245 base pairs
upstream of the transcription start site, where it regulates how much IGF-1 your
body produces.
The A allele is associated with higher circulating IGF-1 levels compared to the C
allele, and AA carriers tend to have greater muscle mass and superior athletic
performance33 greater muscle mass and superior athletic
performance
Particularly in power and combined power-endurance sports like
decathlon. This variant has emerged as
one of the most replicated genetic markers for elite athletic performance.
The Mechanism
rs35767 is a regulatory variant located in the promoter region of the IGF1 gene on
chromosome 12. The T-to-C substitution at position -1245 affects transcription
factor binding and gene expression. Studies show the A allele leads to higher IGF-1
production, though the exact transcription factor interactions remain under
investigation. Some research suggests the G allele may allow binding of C/EBPD
transcription activator44 G allele may allow binding of C/EBPD
transcription activator
A DNA-binding protein that regulates gene expression,
while other evidence indicates the A allele results in higher circulating levels
through mechanisms that may involve altered promoter activity.
Once IGF-1 is secreted (primarily by the liver in response to growth hormone), it
binds to IGF-1 receptors on muscle cells. This triggers a signaling cascade:
PI3K converts PIP2 to PIP3, activating PDK1 and Akt. Akt then phosphorylates
mTORC1, which activates ribosomal protein S6 and translation initiation factor
eIF4E, ramping up protein synthesis. Simultaneously, Akt inhibits FoxO
transcription factors, blocking the expression of muscle atrophy genes55 blocking the expression of muscle atrophy genes
E3
ubiquitin ligases like atrogin-1 and MuRF1 that tag muscle proteins for
degradation.
IGF-1 also activates muscle satellite cells—the stem cells responsible for muscle
repair and growth. After intense exercise or muscle damage, satellite cells
proliferate and differentiate into new myonuclei, contributing approximately
half of the muscle mass gained during hypertrophy66 half of the muscle mass gained during hypertrophy
Based on studies using viral
IGF-1 delivery in animal models.
The Evidence
The rs35767 variant has been studied extensively in athletic populations. In a
2013 Israeli study of 87 power athletes and 78 endurance athletes77 2013 Israeli study of 87 power athletes and 78 endurance athletes
Including
international and Olympic-level competitors,
the A allele was significantly more frequent in top-level power athletes compared
to national-level athletes. Among the elite power cohort, 4.8% carried the TT
genotype versus 0% in non-athletic controls—a striking overrepresentation.
A 2022 study of decathlon athletes88 2022 study of decathlon athletes
Decathlon demands both power and endurance
across 10 events found the AA genotype
was significantly more prevalent among decathletes compared to other athlete groups,
and AA carriers demonstrated superior speed performance. These findings align with
the physiological role of IGF-1 in fast-twitch muscle fiber development and
force production.
A 2024 meta-analysis99 2024 meta-analysis
Pooling data across multiple cohorts to increase
statistical power
confirmed the A allele as a favorable genetic marker for both power and endurance
athletic performance, supporting the variant's role across multiple training
modalities.
At the molecular level, a 2014 study of European adults1010 2014 study of European adults
n=569 in discovery
cohort measured circulating IGF-1 and
found that carriers of the GG genotype (equivalent to TT on the minus strand)
had significantly higher IGF-1 levels (218 ng/ml) compared to AA carriers (190
ng/ml, p=0.007). The higher IGF-1 group also showed better insulin sensitivity,
suggesting metabolic benefits beyond muscle growth.
However, not all effects are beneficial. A Japanese longitudinal cohort of 1,506
individuals1111 Japanese longitudinal cohort of 1,506
individuals
Followed for long-term health outcomes
found that AA carriers experienced faster decline in renal function over time
compared to GG carriers, suggesting chronically elevated IGF-1 may have tradeoffs
for kidney health.
Practical Actions
If you carry one or two A alleles, you have a genetic advantage for building muscle and responding to strength training. To capitalize on this:
Prioritize resistance training. Your elevated IGF-1 levels mean you're biochemically primed for hypertrophy. Focus on progressive overload—gradually increasing weight, volume, or intensity over time. AA carriers may see faster strength gains and better recovery from high-volume training compared to CC carriers.
Consume adequate protein. IGF-1 activates mTOR, the master regulator of protein
synthesis, but mTOR needs amino acid availability to function. Aim for 1.6-2.2
g/kg body weight daily, with post-workout protein intake1212 post-workout protein intake
20-40g within 2 hours
of training to maximize the
anabolic window when IGF-1 signaling is elevated.
Optimize sleep and recovery. Growth hormone (the primary driver of hepatic IGF-1 production) peaks during deep sleep. AA carriers producing more IGF-1 may benefit even more from adequate sleep (7-9 hours) for muscle repair and satellite cell activation.
Consider monitoring kidney function if you're TT. While the athletic benefits are clear, the Japanese cohort data suggests potential long-term renal effects. If you're a TT carrier pursuing intense athletic training, periodic monitoring of eGFR and creatinine may be prudent, especially as you age or if you have other kidney risk factors.
Interactions
rs35767 interacts with other variants in the IGF axis. rs71364461313 rs7136446
An intronic
IGF1 variant has also been associated
with athletic performance and may compound with rs35767 to influence IGF-1 levels
and muscle phenotype. Similarly, rs9729361414 rs972936
Another IGF1 intronic variant
affects IGF-1 expression and has been linked to neurological outcomes and muscle
force production.
Beyond the IGF1 gene, interactions with the growth hormone receptor and myostatin
pathway are likely. Carriers of both the IGF1 A allele and myostatin rare R
allele1515 myostatin rare R
allele
Loss-of-function variants in MSTN that reduce this muscle growth inhibitor
show even greater muscle mass and performance, suggesting an additive or synergistic
effect.
For power athletes, the combination of rs35767 TT and ACTN3 RR1616 ACTN3 RR
Alpha-actinin-3,
the "gene for speed" may represent an
elite genetic profile for explosive strength and sprint performance.